SAS Regression 1

MIS 6324
Business Intelligence

3. Classification using SAS Enterprise Miner
In this question you will analyze the JUNKMAIL dataset found in the SASHELP library. Follow the procedure we used for analyzing the HMEQ dataset. Detailed instructions for the HMEQ analysis are given in the emcs.pdf document.

You will need to create and execute the process flow diagram shown above. Further requirements for analyzing JUNKMAIL are as given below:
This data will be used to classify emails as junk mail or not. Create the data source and set the role for all variables, including the target variable, appropriately.
You can use the default values for everything else when creating the Data Source.
Partition the data into a 60/40 split between Training and Validation, with no data used for Testing.
Follow the steps shown in the process diagram.
You will try out four different models as described below:
Regression: This is the default regression model using the original data.
Regression – No Model Selection: This is the default regression model using the transformed data, as described below.
Regression – Stepwise: This is the regression model using stepwise selection and the transformed data.
Decision Tree: This is the default decision tree model using the transformed data.
Transform Variables:
Transform all input variables using the log transformation.
Model Comparison: Run with Selection Statistic set to Misclassification Rate.
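
The assignment itself is carried out in the Enterprise Miner interface, but the ideas behind three of the requirements above (the 60/40 partition, the log transformation, and choosing a model by misclassification rate) can be sketched in a few lines of Python. This is only an illustration, not what Enterprise Miner runs internally. The function names, the simple random split, the seed, and the +1 offset inside the logarithm are all assumptions made for the sketch, and the model names and rates in the demo are made-up placeholders.

```python
import math
import random


def partition(rows, train_fraction=0.6, seed=12345):
    """Shuffle the rows and split them into training and validation sets.

    Assumption: a plain random split; no rows are held out for testing.
    """
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def log_transform(value):
    """Return log(value + 1).

    Assumption: the +1 offset keeps zero-valued inputs defined.
    """
    return math.log(value + 1.0)


def misclassification_rate(actual, predicted):
    """Fraction of cases whose predicted class differs from the actual class."""
    wrong = sum(1 for a, p in zip(actual, predicted) if a != p)
    return wrong / len(actual)


def select_best(validation_rates):
    """Pick the model name with the lowest validation misclassification rate."""
    return min(validation_rates, key=validation_rates.get)


if __name__ == "__main__":
    train, valid = partition(range(10))
    print(len(train), len(valid))                                   # 6 4
    print(misclassification_rate([1, 0, 1, 1], [1, 1, 1, 0]))       # 0.5
    # Placeholder names and rates, for illustration only.
    print(select_best({"model_a": 0.25, "model_b": 0.125}))         # model_b
```

Selecting on the validation rate rather than the training rate rewards the model that generalizes best to data it was not fitted on.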
Now answer the following questions:
1. Which model is selected as the best one by the Model Comparison Node?
Regression on the original data.
2. What is the training misclassification rate for this model? What is the validation misclassification rate?
Training Misclassification rate : 0.064879
Validation Misclassification Rate : 0.077090
3. What are the four most important variables used?
Exclamation
CapAvg
Remove
HP
4. What is the
