Western Connecticut State University
CS 450 Data Mining, Fall 2014
Take-Home Test N#1
Date: September 22nd, 2014
Final deadline for submission September 29th, 2014
Weighting: 5%
Total number of points: 100
Instructions:
1. Attempt all questions.
2. This is an individual test. No collaboration is permitted for assessment items. All submitted materials must be a result of your own work.
Part I
Question 1 [20 points]
Discuss whether or not each of the following activities is a data mining task.
• Dividing the customers of the company according to their gender
No. This is an accounting calculation, followed by the application of a threshold. However, predicting the profitability of a new customer would be data mining.
• Dividing the customers of a company according to their profitability.
Yes, this is a data mining task because it requires data analysis to determine who the costumers are that brings more business to the company.
• Computing the total sales of the company.
No, this is not a data mining task because there is not analysis involve, this information can be pull out of any booking program.
• Sorting a student database based on student ID numbers.
No, this is not a data mining activity because sorting by ID numbers doesn’t involved any data mining task. This is a simple database query
• Predicting the future stock price of a company using historical records.
Yes. We would attempt to create a model that can predict the continuous value of the stock price. This is an example of the area of data mining known as predictive modelling. We could use regression for this modelling, although researchers in many fields have developed a wide variety of techniques for predicting time series.
• Monitoring the heart rate of a patient for abnormalities.
Yes. We would build a model of the normal behavior of heart rate and raise an alarm when an unusual heart behavior occurred. This would