Preview

Bigdata

Better Essays
Open Document
Open Document
3484 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Bigdata
Addressing the Challenge of Big Data & MDM in the Large Enterprise

Presented by:

Manish Sood, Founder & CEO, Reltio, Inc. manish@reltio.com October, 2012

Image: "Data Deluge," Brett Ryder, The Economist, Feb. 2010

Agenda 1. What is Big Data? 2. What is NoSQL vs. Relational DBs? 3. What is Hadoop (HDFS and MapReduce)? 4. MDM and Big Data – a Case Study

Confidential and Proprietary – please do not distribute without prior permission

2

Trend – Growing data sets
DATA VOLUME
Zettabyte

1.4 Zettabytes in Enterprise Data

2011

Machine To Machine

Exabyte

Petabyte

Interactions
Terabyte

Transactions
Mainframe PC Internet Mobile Machine

Time

Zettabyte = 1,000,000,000,000,000,000,000 Bytes Graph based on IDC and UC Berkeley Data Growth Estimates, Source: IDC & CosmoBC.com: http://techblog.cosmobc.com/2011/08/26/data‐storage‐ infographic/

Confidential and Proprietary – please do not distribute without prior permission

3

Trend – Information Connectivity

Information Connectivity

Internet of Things

Semantic Web Tagging Social Networks Text Files RDBMS Hypertext Blogs RDF Folksonomies User generated content

Web 1.0

Web 2.0

Web 3.0

1990

2000

2010

2020

Confidential and Proprietary – please do not distribute without prior permission

4

Trend – Data Complexity
Text files and Lists Majority of Webpages

Relational Databases

Performance

Social Networks

Internet of Things

Custom work

Data Complexity
Confidential and Proprietary – please do not distribute without prior permission 5

Characteristics of Big Data Velocity
Volume Variety Value

$
10’s of Billions of Daily Records From Terabytes to Petabytes Multi‐ Structured Business Insights

Big data is where the data volume, acquisition velocity, or data representation limits the ability to perform effective analysis using traditional relational approaches or requires the use of significant



Links: Inderpal Bhandari, VP & Chief Data Officer, Express Scripts October, 2012

You May Also Find These Documents Helpful

  • Satisfactory Essays

    All computers today have GB or TB. I was just in Wal-Mart today and saw a removable hard-drive with a 2TB capacity for only $150, and there was even a 3TB hard-drive. When I went over to staples and looked at all the computers I didn’t see any fewer than 750 GB of ROM and 4 GB of RAM. With technology in the world today expanding so quickly it is not farfetched to see hard-drives with 100 TB capacities in the near future. If you went by Moore’s Law, which I know is for transistors but I think goes along with many other things, I…

    • 420 Words
    • 2 Pages
    Satisfactory Essays
  • Powerful Essays

    References: Brown, B., Chiu, M., Manyika, J. (2011), Are you ready for the era of big data? Retrieved…

    • 1755 Words
    • 6 Pages
    Powerful Essays
  • Powerful Essays

    As you recall, data is a collection of facts (numbers, text, even audio and video files) that is processed into usable information. Much like a spreadsheet, a database is a collection of such facts that you can then slice and dice in various ways to extract information or make decisions. However, the advantage and primary use of a database over a spreadsheet is its ability to handle a large volume of data and yet allow for quick access to the information that is desired.…

    • 1190 Words
    • 5 Pages
    Powerful Essays
  • Good Essays

    Week 6 Discussion 2

    • 582 Words
    • 3 Pages

    Any organization wishing to maintain a competitive advantage can benefit from big data management and analytical tools. When properly utilized, big data can increase efficiency, productivity, and predict future market conditions (Laudon, p. 231). As processors become faster and more affordable, big data management will become a necessary component of all organizations. The actual benefit from big data will lie in the ability to analyze and apply the vast amounts of information that are flooding databases at all times.…

    • 582 Words
    • 3 Pages
    Good Essays
  • Powerful Essays

    Big data needn 't be a big headache: How to tackle mind-blowing amounts of information.…

    • 1730 Words
    • 7 Pages
    Powerful Essays
  • Powerful Essays

    You probably heard the term Big Data -- it is one of the most hyped terms now. But what exactly is big data?…

    • 3076 Words
    • 13 Pages
    Powerful Essays
  • Powerful Essays

    Data

    • 1644 Words
    • 7 Pages

    The purpose of the report is to assist Aircraft Solutions (AS) in indentifying the most significant Information Technology (IT) security vulnerabilities. AS products and services are at the forefront of the industry and the protection of such is very important as they are an industry leader. The vulnerabilities that will be discussed are the firewall configuration, virtualization of their hardware assets and defining security policy regarding the timeliness of firewall configuration and updates.…

    • 1644 Words
    • 7 Pages
    Powerful Essays
  • Good Essays

    Demchenko, Zhao, Grosso, Wibisono, & Laat (2012), have described the five primary characteristics of health care big data as five V’s: Volume, Velocity, Variety, Veracity, and Value. Volume refers to vast amounts of health-related data created and accumulated continuously. In 2011 alone, the U.S. healthcare system has reached 150 exabytes, and soon will reach the zettabyte (1021 gigabytes) scale and, not long after, the yottabyte (1024 gigabytes) (Raghupathi & Raghupathi, 2014). Velocity applies to the constant flow of new data accumulating at unprecedented rate, variety pertains to the level of complexity of the data, veracity measures includes questions of trust and uncertainty with regards to data and the outcome of analysis of that data, and value evaluate show how good the quality of the data is in reference to the intended results. (Herland, Khoshgoftaar, & Wald,…

    • 648 Words
    • 3 Pages
    Good Essays
  • Satisfactory Essays

    With that in mind, organizations should always cease to ensure that their data is eagerly managed. With the market changing, the process of data management is becoming more complex and the capacity of data to be managed is steadily increasing, this is sometimes referred to as “big data”. Big data is used in understanding organizations and their decision making process; when decisions are made, they are based on complex data transactions which have become difficult to the system that are using basic database and warehouse management systems (Vael, 2013). This causes many data management difficulties such as an increase in data, immature decision making, legal issues and data securing and integrity to name a few, but they can easily be reduced or resolved by the use of the following:…

    • 707 Words
    • 3 Pages
    Satisfactory Essays
  • Best Essays

    In this fast paced information age, there are many different sources on corporate networks and internet is collecting massive amounts of data, but there is a significant difference in this data compared to the conventional data, much of this data is semi-structured or unstructured and not residing in conventional databases. “Big data” is essentially a huge data set that scales to multiple petabytes of capacity; it can be created, collected, collaborated, and stored in real-time or any other way. However, the challenge with big data is that it is not easily handled using traditional database management tools. It typically consists of unstructured data, which includes text, audio and video files, photographs and other data (Kovar, 2012). The aim of this paper is to examine the concepts associated with the big data architecture, as well as how to handle, process, and effectively utilize big data internally and externally to obtain meaningful and actionable insights.…

    • 2200 Words
    • 9 Pages
    Best Essays
  • Better Essays

    Throughout most of the twenty first century, technology has boomed and many companies are now able to store large quantities of data in a small space, compared to previous years. Big Data is the process of collecting information based on structured data and unstructured data. Big Data is something that companies collect to try and provide the best customer experience, however this mass collection has its setbacks.…

    • 1115 Words
    • 5 Pages
    Better Essays
  • Better Essays

    Cloud Bi

    • 1361 Words
    • 6 Pages

    Now, data can be in vast amounts, of which some might be useful and some might not be useful.…

    • 1361 Words
    • 6 Pages
    Better Essays
  • Best Essays

    Data Warehousing and Olap

    • 2507 Words
    • 11 Pages

    A data warehouse is a “subject-oriented, integrated, time varying, non-volatile collection of data that is used primarily in organizational…

    • 2507 Words
    • 11 Pages
    Best Essays
  • Powerful Essays

    Open-Data

    • 10067 Words
    • 41 Pages

    1. HM Revenue & Customs (HMRC) are fully committed to the transparency agenda, and transparency is a key principle for the Department.…

    • 10067 Words
    • 41 Pages
    Powerful Essays
  • Good Essays

    Data in itself can be powerful, but also has many pitfalls if left to disparate databases and data collection routines. A collection of spreadsheets with account numbers entered into them can be view as a business liability. This same information in a database that can be queried, secured, organized and related to other data for analytical purposes becomes a power business tool. It takes “big data” and makes it business intelligence.…

    • 853 Words
    • 4 Pages
    Good Essays

Related Topics