
Web and Data Mining Introduction

Data Mining: Introduction

Lecture Notes for Chapter 1
Introduction to Data Mining by Tan, Steinbach, Kumar

© Tan, Steinbach, Kumar, Introduction to Data Mining, 4/18/2004

Why Mine Data? Commercial Viewpoint

• Lots of data is being collected and warehoused
  – Web data, e-commerce
  – Purchases at department/grocery stores
  – Bank/credit card transactions
• Computers have become cheaper and more powerful
• Competitive pressure is strong
  – Provide better, customized services for an edge (e.g. in Customer Relationship Management)


Why Mine Data? Scientific Viewpoint

• Data collected and stored at enormous speeds (GB/hour)
  – Remote sensors on a satellite
  – Telescopes scanning the skies
  – Microarrays generating gene expression data
  – Scientific simulations generating terabytes of data
• Traditional techniques infeasible for raw data
• Data mining may help scientists
  – In classifying and segmenting data
  – In hypothesis formation

Mining Large Data Sets - Motivation

• There is often information “hidden” in the data that is not readily evident
• Human analysts may take weeks to discover useful information
• Much of the data is never analyzed at all
[Figure: The Data Gap. Total new disk (TB) since 1995 compared with the number of analysts, 1995 to 1999; vertical axis from 0 to 4,000,000.]
From: R. Grossman, C. Kamath, V. Kumar, “Data Mining for Scientific and Engineering Applications”

What is Data Mining?

• Many definitions
  – Non-trivial extraction of implicit, previously unknown and potentially useful information from data
  – Exploration & analysis, by automatic or semi-automatic means, of large quantities of data in order to discover meaningful patterns
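
As a rough illustration of the second definition (automatic exploration of data to discover patterns), the sketch below counts which items are bought together in a handful of shopping baskets. It is only a minimal example under stated assumptions: the baskets are made-up toy data, and the function name frequent_pairs and its min_support parameter are hypothetical names chosen for this example, not something taken from the lecture notes.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy transactions, invented for illustration only.
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]


def frequent_pairs(baskets, min_support):
    """Return item pairs that occur together in at least min_support baskets."""
    counts = Counter()
    for basket in baskets:
        # Sort the items so each pair is always counted in the same (a, b) order.
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    return {pair: n for pair, n in counts.items() if n >= min_support}


if __name__ == "__main__":
    pairs = frequent_pairs(transactions, min_support=3)
    # Print the most frequent pairs first, ties broken alphabetically.
    for pair, n in sorted(pairs.items(), key=lambda kv: (-kv[1], kv[0])):
        print(pair, n)
```

With only five baskets a person could spot these pairs by eye. The point of the definitions above is that the same counting has to be automated once the data reaches warehouse scale.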


What is (not) Data Mining?

• What is not Data Mining?
• What is Data Mining?