Preview

data mining hw 3

Satisfactory Essays
Open Document
Open Document
505 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
data mining hw 3
Introduction to Data Mining
Summer, 2012
Homework 3
Due Monday June.11, 11:59pm
May 22, 2012

In homework 3, you are asked to compare four methods on three different data sets. The four methods are:

• Indicator Response Matrix
Linear Regression to the Indicator Response Matrix. You need to implement the ridge regression and tune the regularization parameter.
The material of this algorithm can be found in Page 103 to Page 106 in the book ”The Elements of Statistical Learning”
(http://www-stat.stanford.edu/~tibs/ElemStatLearn/).
• Na¨ Bayes ive You need to try Naive Bayes without smoothing and use smoothing.
• k -Nearest Neighbor for kNN, k is a parameter. You need to report two result, k =1 and k =p. you can choose an appropriate p for different datasets.
• Support Vector Machine
Use both LibSVM (http://www.csie.ntu.edu.tw/~cjlin/libsvm/) and LibLinear (http://www.csie.ntu.edu.tw/~cjlin/liblinear/)
Use LibSVM with linear kernel and Gaussian Kernel (tune the parameters)
LibLinear is always linear, you need to compare the different speed of
LibSVM and LibLinear.

The test datasets are as follow:
1

• ORL database
Ten different images of each of 40 distinct subjects. For some subjects, the images were taken at different times, varying the lighting, facial expressions (open / closed eyes, smiling / not smiling) and facial details (glasses / no glasses). All the images were taken against a dark homogeneous background with the subjects in an upright, frontal position (with tolerance for some side movement).
A random subset with 7 images per individual was taken with labels to form the training set, and the rest of the database was considered to be the test set.
You will be given ORL train.mat and ORL test.mat.
• USPS database
The USPS handwritten digit database. We provide here a popular subset contains 9298 16x16 handwritten digit images in total, which is then split into 7291 training images and 2007 test images.
You will be given

You May Also Find These Documents Helpful

  • Satisfactory Essays

    Exercise3statistics

    • 657 Words
    • 2 Pages

    a. The stronger the data analysis technique that can be used to analyze the data…

    • 657 Words
    • 2 Pages
    Satisfactory Essays
  • Satisfactory Essays

    • Describe the way in which your selected method can be used to analyze data.…

    • 108 Words
    • 1 Page
    Satisfactory Essays
  • Powerful Essays

    Econ450 Syllabus.

    • 765 Words
    • 5 Pages

    of the term whichever method gives you the higher grade. Method 1 is designed to…

    • 765 Words
    • 5 Pages
    Powerful Essays
  • Good Essays

    In “A Failure in Generalship”, LTC Paul Yingling assigns blame for the failure of the military in the Vietnam War and the dire and deteriorating situation in Iraq at the beginning of 2007, placing it on America’s generals, then and now. Though fearless in its attempt, the essay presents a weak academic argument to back up this claim due to a string of fallacies, statements and arguments based on false or invalid inference. Most notable in his essay is “hasty generalization”, “missing the point”, and the “false dichotomy”. The initial fallacy that undermines the argument is that of “hasty generalization”. A “hasty generalization” is a broad sweeping statement placed on a group of people without a sufficient…

    • 613 Words
    • 3 Pages
    Good Essays
  • Powerful Essays

    Bsbwor501 Final Exam

    • 794 Words
    • 4 Pages

    4. Use exhibit 6.5 to help you create a chart to show the main advantages and disadvantages of each of the four methods.…

    • 794 Words
    • 4 Pages
    Powerful Essays
  • Satisfactory Essays

    Cool people

    • 472 Words
    • 2 Pages

    2 Look at Table 1.1. List the techniques from the table that are based on the:…

    • 472 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    Unit 19 P1

    • 2223 Words
    • 9 Pages

    | The child will be capable of imitating facial expressions.They will begin to be able to differentiate between familiar and unfamiliar faces.They are capable of demonstrating certain types of…

    • 2223 Words
    • 9 Pages
    Good Essays
  • Satisfactory Essays

    HW 2

    • 577 Words
    • 3 Pages

    (c) What potential problems are there for the method proposed in (b)? How can you improve it?…

    • 577 Words
    • 3 Pages
    Satisfactory Essays
  • Satisfactory Essays

    Unit 4 Assessment

    • 464 Words
    • 2 Pages

    4. Using one (or more) of the methods you outlined in Question 1, provide a wide range of examples (at…

    • 464 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    Your assignment will be to compare/contrast the methods the two authors use, their basic arguments, and their effectiveness. Topics 2 and 3 will be similar.…

    • 589 Words
    • 3 Pages
    Good Essays
  • Good Essays

    Gazes at faces and copies facial movements. e.g. sticking out tongue, opening mouth and widening eyes.…

    • 1063 Words
    • 5 Pages
    Good Essays
  • Good Essays

    Assignment 302

    • 1215 Words
    • 5 Pages

    d) How standards can be used to help a social care worker reflect on their practice.…

    • 1215 Words
    • 5 Pages
    Good Essays
  • Good Essays

    Everyone’s life begins with birth and ends with death. It’s the nature rule that no one could offense it. From ancient to modern times, there always somebody want to escape from death, but no one success. Death could also bring fear to people. One of the characters created by Edgar Allen Poe, Prince Prospero took some measures to avoid death, because of the fear of death. Edgar Allen Poe uses symbolism in “The Masque of the Red Death” to illustrate that death is inevitable and undefeated.…

    • 1013 Words
    • 5 Pages
    Good Essays
  • Good Essays

    Care Assistant

    • 859 Words
    • 4 Pages

    Face could show wide range of emotion from happy, optimistic, joyful to aggressive, anxious, sad and negative.…

    • 859 Words
    • 4 Pages
    Good Essays
  • Good Essays

    Data Mining: Introduction Lecture Notes for Chapter 1 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 1 Why Mine Data? Commercial Viewpoint O Lots of data is being collected and warehoused – Web data, e-commerce – purchases at department/ grocery stores – Bank/Credit Card transactions…

    • 2304 Words
    • 32 Pages
    Good Essays