Preview

Chapter 7 - K neighbours

Satisfactory Essays
Open Document
Open Document
520 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Chapter 7 - K neighbours
7.1
a. How would this customer be classified?
A. This customer would be classified as not accepting the personal loan offer. According to the KNN_Output there appears to be overfitting due to the discrepancies in the classification matrix for training (Class 0 = 0% error, Class 1 = 0% error, Overall = 0% error), and validation error (Class 0 = 4.2% error, Class 1 = 55.85% error, and Overall = 9.1% error).

b. What is a choice of k that balances between overfitting and ignoring the predictor information?
A. A choice of k that balances between overfitting and ignoring the predictor would be k = 6. The value is chosen because it minimizes the % validation error. After testing various k levels. According to the validation error log for different k the best k points to 6, where %error training is 7.4% and validation % error is 8.75%.

c. Show the classification matrix for the validation data that results from using the best k.

d. Classify the customer using the best k
A. According to the best k the customer would not be inclined to accept the personal loan.
e. Re-partition the data, this time into training, validation, and test sets (50%: 30%: 20%). Apply the k-NN method with the k chosen above, compare the classification matrix of the test set with that of the training and validation sets. Comment on the differences and their reason.
A. Based on the training, validation, and test matrices we can see a steady increase in the percentage errors. There does not appear to be overfitting due to the minimal error discrepancies among all three matrices, from the training to the validation error there is a 5.69% difference, and from validation to test error there is a 14.05% error difference. Based on the lift chart, the model appears to make a difference even though the loan acceptance has a 82% error rate for the test classification matrix.
9.3
i. Compare the tree generated by the CT with the one generated by the RT. Are they

You May Also Find These Documents Helpful

  • Satisfactory Essays

    3505 M2 Fall 2014 Soltn

    • 3355 Words
    • 15 Pages

    e. Apply RAROC to the data on the above loan. Calculate each component of RAROC.…

    • 3355 Words
    • 15 Pages
    Satisfactory Essays
  • Satisfactory Essays

    U Decide

    • 361 Words
    • 1 Page

    According to the bank’s lending policy Daniel cannot be approved (His final two payments due not arrive at all). He asked for a loan that is not too big and has an acceptable FICO score of 490 according to the bank’s policy. I would ask if Eric wants to put more down for down-payment. Because if he pays more down-payment the less chance he will default the loan. I will ask if Daniel is currently employed and what his income and debt status are. I will consider approve the loan if everything meet the expectation he can pay back on time the loan and ask the Bank president’ approval. The bank can charge higher interest rate of 4.5% above prime rate which is still risky.…

    • 361 Words
    • 1 Page
    Satisfactory Essays
  • Good Essays

    c. Many creditors are currently taking advantage of vulnerable consumers in financial crises by offering credit with extremely high interest rates and additional upfront costs. As a potential first time homebuyer there are many things that need to be made clear prior to selecting a loan. With little knowledge, as much information as possible will be beneficial to ensuring that my loan options are within a reasonable mean to repay. Many consumers find themselves swarming in debt simply because they were misinformed and…

    • 640 Words
    • 3 Pages
    Good Essays
  • Good Essays

    Based on your findings in 1-5, what is your opinion about using SIZE to predict CREDIT BALANCE? Explain. We can expect the model to prediction of credit balance to be within 260.162 x2 (520.32)…

    • 633 Words
    • 3 Pages
    Good Essays
  • Good Essays

    Lawson Case

    • 878 Words
    • 4 Pages

    There are two chief participants in this case study, Paul Mackay and Jackie Patrick. Mackay, a sole proprietor of Lawsons (a general merchandising retail site in Riverdale, Ontario), has approached the Commercial Bank of Ontario in order to acquire an additional $194, 000 bank loan and a $26,000 line of Credit. Patrick, a first time loans officer, has been appointed to Mackay’s request. As such although apprehensive to finish her first loan, she must take into consideration the difficulties of this particular case.…

    • 878 Words
    • 4 Pages
    Good Essays
  • Good Essays

    There are 50 credit customers who were selected for the data collection on five variables such as location, income, size, years, and credit balance. In order to understand more about their customer, AJ DAVIS must use graphical, numerical summary to be able to interpret and better expand their business in the future.…

    • 1166 Words
    • 5 Pages
    Good Essays
  • Satisfactory Essays

    d. Choose one of the variables in your dataset and classify it according to the levels of measurement. Explain how you know.…

    • 343 Words
    • 2 Pages
    Satisfactory Essays
  • Powerful Essays

    Understanding Fico Scores

    • 2191 Words
    • 9 Pages

    The research in this report was taken from a few different sources. The primary research was conducted by distributing a survey to the general public. The survey was designed to help us understand how much people actually know about their score. However, due to limited time and resources the survey was completed by only 20 people. The information provided by the survey was still useful despite the limitation on sample size. The secondary research was taken from websites, books, and training materials from the lending industry.…

    • 2191 Words
    • 9 Pages
    Powerful Essays
  • Satisfactory Essays

    ece 6001

    • 509 Words
    • 5 Pages

    A photon counter connected to the output of a fiber detects the number of photons,…

    • 509 Words
    • 5 Pages
    Satisfactory Essays
  • Satisfactory Essays

    HW 2

    • 577 Words
    • 3 Pages

    (c) What potential problems are there for the method proposed in (b)? How can you improve it?…

    • 577 Words
    • 3 Pages
    Satisfactory Essays
  • Powerful Essays

    6. Complete the exercise as directed, recording any data or information needed in your Data Table below.…

    • 1524 Words
    • 7 Pages
    Powerful Essays
  • Good Essays

    f. Explain whether you believe the information in requirement d or e provides the most useful data for evaluating the potential for misstatements. Explain why.…

    • 265 Words
    • 2 Pages
    Good Essays
  • Good Essays

    d) Using the above function to predict the maximum number of heart beats for ages of 25, 55, 65 and 80.…

    • 706 Words
    • 3 Pages
    Good Essays
  • Satisfactory Essays

    Table 5.43 what obstacles does MSMEs face in getting loans (Cross Tabs) Obstacles does MSMEs face in Lending Total Obstacles Most often Often Always Never Lack of quality information 27 (28.7) 10 (10.6) 15 (16.0) 42 (44.7) 94 (100%) Inadequate credit scoring 50 (49.0) 15 (14.7) 12 (11.8) 25 (24.5) 102 (100%) Behaviour cannot evaluate 20 (18.0) 29 (26.1) 30 (27.0) 32 (28.8) 111 (100%)…

    • 86 Words
    • 1 Page
    Satisfactory Essays
  • Satisfactory Essays

    d. Compare the results in parts a, b, and c. What conclusion, if any, can you make from these tests?…

    • 416 Words
    • 2 Pages
    Satisfactory Essays