Preview

The Sudden Surge of Use of Big Data

Good Essays
Open Document
Open Document
904 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
The Sudden Surge of Use of Big Data
1

First two lectures about big data. So why this surge?
Isn’t about data at all
Systems being able to process data
Tools exploit and derive valuable nuggets
NOT NEW: Walmart Wallstreet decades
Why not out? Competition
Teradata 20 years, contest MR patent
Digital exhaust incr
Know your Stats, gaga tweet sentiment analysis
Correlation causation

2

Old: Centralized systems (data came from humans)
Sun hardware
Oracle software
Moore’s law -> data grew. Data exhausts growing larger. Can’t keep up.
RDBMS is not going away. Predictable queries on tabular data
Unstructured data doesn’t fit in tables nicely. Complicated data.
Even if it did
Sentiment analysis of the natural language captured in all those tweets
Mapreduce Joke!
Data processing and not transaction processing
Complex data at volume
Terabytes hard not to get
Actively throw data away data due to lack of storage

3

DJ Patil is Data Sci at in residence at Greylock
Made a quote based on his experience of being a data sci and manager @ LinkedIn poor use of scarce labour very few people who can do the interesting parts of this job spending 80% of time cleaning boring bits

4

Survey by Ventana Research
Determine how important were these metrics
Evaluating their large-scale data tech projects

5

Economically finish
Moving from storage to compute costly, pressure N/W infrastructure
Move code to data
Archiving
ONCE Data old, send to tapes or disks due to economics of storage
Many sci argue that data stored, never looked back, Costly to retrieve from archives
Justify economics of storing ROI or return on bytes
Asking questions not from ETL tools but raw data
Data loss due to aggregation ETL
You want to ask questions, build schema for that? With such scale of data?
No schema! Data not transformed
Copied
Not pulling data but pushing work to store

Hadoop by Apache
Open source
Used to organize huge amount of unstructured data
Used in

You May Also Find These Documents Helpful

  • Good Essays

    "By 2018, the United States alone could face a shortage of 140,000 to 190,000 people with deep analytical skills as well as 1.5 million managers and analysts with the know- how to use the analysis of bigdata to make effective decisions"…

    • 496 Words
    • 2 Pages
    Good Essays
  • Powerful Essays

    Rlht2 Task 3

    • 1508 Words
    • 7 Pages

    Datanal, Inc., was established by five IT entrepreneur colleagues in 2002. It enjoys a reputation for outstanding performance and presently employs some 350 IT specialists, most with proven skill in analyzing, organizing, and managing large, diversified streams of data and databases in logical, systematic form, transparently and effectively bridging present artificial separations. By enabling customers to assimilate a consistently large influx of new data while simultaneously drawing from previously unrealized complementary database…

    • 1508 Words
    • 7 Pages
    Powerful Essays
  • Powerful Essays

    Final project

    • 2132 Words
    • 9 Pages

    over the past few years. The huge data that needs to be stored, managed and…

    • 2132 Words
    • 9 Pages
    Powerful Essays
  • Good Essays

    Not including alphabetic characters in a Social Security Number field is an example of _____.…

    • 2865 Words
    • 12 Pages
    Good Essays
  • Powerful Essays

    away from her family, and she still had packing to do at home. Just a few more items to go:…

    • 6279 Words
    • 26 Pages
    Powerful Essays
  • Best Essays

    Big data is the latest buzzword in the tech industry, but what exactly makes it different from traditional BI or data analysis? According to MIT Sloan Management Review, big data is described as “data that is either too voluminous or too unstructured to be managed and analyzed through traditional means” (Davenport, Thomas, Barth, & Bean, 2012). Big data is unlike conventional mathematical intelligence, where a simple sum of…

    • 2200 Words
    • 9 Pages
    Best Essays
  • Better Essays

    Social media and networks allow for individuals to easily find other individuals who share similar beliefs, whether these beliefs are about politics, social issues, sports, or popular culture. In his book “The Internet of Us: Knowing More and Understanding Less in the Age of Big Data”, professor Michael Patrick Lynch examines the impact that the Digital Age and the Internet have had on our ability to acquire knowledge. When examining the argument that the Internet is allowing for “group polarization” to flourish, Lynch quotes Cass Sunstein, a legal scholar. Sunstein has argued that “’repeated exposure to an extreme position, with the suggestion that many people hold that position, will predictably move those exposed, and likely predisposed,…

    • 1139 Words
    • 5 Pages
    Better Essays
  • Good Essays

    Diamond in the Data Mine

    • 877 Words
    • 4 Pages

    The approach that Loveman used was highly effective outlining the importance of providing an exceptional customer service in today’s service industry through deep data mining.…

    • 877 Words
    • 4 Pages
    Good Essays
  • Better Essays

    It Trends

    • 15239 Words
    • 61 Pages

    This report is based on evidence from inspections of information and communication technology (ICT) between September 2005 and July 2008 in 177 maintained schools in England, as well as other visits to schools where good practice was identified.…

    • 15239 Words
    • 61 Pages
    Better Essays
  • Powerful Essays

    Nokia has been in business for more than 150 years, starting with the production of paper in the 1800s and evolving into a leader in mobile and location services that connects more than 1.3 billion people today. Nokia has always transformed resources into useful products – from rubber and paper, to electronics and mobile devices – and today’s resource is data.…

    • 1403 Words
    • 6 Pages
    Powerful Essays
  • Powerful Essays

    Why You Need a Data Warehouse Joseph Guerra, SVP, CTO & Chief Architect David Andrews, Founder Introduction Chances are that you have heard of data warehousing but are a little fuzzy on exactly how it works and whether your organization needs it. It is also highly likely that once you fully understand exactly what a data warehouse can do, you will decide that one is needed. 700 West Johnson Avenue Cheshire, CT 06410 800.775.4261 www.rapiddecision.net © Copyright RapidDecision 2013 All trademarks are the property of their respective owners. Data warehouses are widely used within the largest and most complex businesses in the world.…

    • 3611 Words
    • 20 Pages
    Powerful Essays
  • Satisfactory Essays

    Your company just hired a new CEO, and you figure that a reorganization – maybe even a few terminations – could be on the…

    • 70809 Words
    • 343 Pages
    Satisfactory Essays
  • Good Essays

    This content was originally published on the CIO Update, IT Business Edge and Enterprise Apps…

    • 4919 Words
    • 25 Pages
    Good Essays
  • Satisfactory Essays

    Based on a research carried out by a famous professor last month, Internet causes a lot of effect on human beings. The Internet is a global system of interconnected computer networks hat use the standard Internet protocol suite to serve billions of users worldwide. The explosion of Internet will increase the crime rate, cause low performance in academic and causes chronic health problems.…

    • 409 Words
    • 2 Pages
    Satisfactory Essays
  • Satisfactory Essays

    At TSSM we cover VCE IT Applications in more detail than any other education provider in Victoria.…

    • 952 Words
    • 4 Pages
    Satisfactory Essays

Related Topics