Preview

Battle Between Hadoop And Data Warehouse Case Study

Powerful Essays
Open Document
Open Document
1206 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Battle Between Hadoop And Data Warehouse Case Study
Battle between Hadoop and Data Warehouse

#1 - Introduction

Once or twice every decade, the IT marketplace experiences a major innovation that shakes the entire IT industry. In recent years, Apache Hadoop has done the same thing by infusing data centres with new infrastructure
By giving the power of parallel processing to the programmer Hadoop is on such an exponential rise in adoption and its ecosystem is expanding in both depth and breadth, it is natural to ask whether Hadoop’s is going to replace traditional Data Warehouse.

Let’s see what Alasdair Anderson (Executive Vice President at Nordea ) said at a Hadoop Summit about this hot topic in the town.

"There's no relationship between the EDW and Hadoop right now — they are going to be complementary. It's NOT about rip and replaces:
…show more content…
A data warehouse is a relational database that is designed for query and analysis data. It usually contains historical data derived from transaction data, but it can include data from other sources as well. It separates analysis workload from transaction workload and enables an organization to consolidate data from several sources.
In addition to a relational database, a data warehouse environment includes an extraction, transportation, transformation, and loading (ETL) solution, an online analytical processing (OLAP) engine, client analysis tools, and other applications that manage the process of gathering data and delivering it to business users.
Let’s summarize what data warehouse is -
1. Subject-oriented
A data warehouse can be used to analyse a particular subject area like sales, finance, and inventory. Each subject area contains detailed data.
2. Integrated
A data warehouse integrates data from multiple data sources. For example, dates are in the same format, male/female codes are consistent. In a data warehouse, there will be only a single way of identifying a product and they use the same customer record, not copies
3.

You May Also Find These Documents Helpful

  • Powerful Essays

    [4] Storage Conference. The Hadoop Distributed File System http://storageconference.org/ 2010/ Papers/ MSST/Shvachko.pdf [5] A Tutorial on Clustering Algorithms. K-Means Clustering http://home.dei.polimi.it/matteucc/ Clustering/ tutorial_html/kmeans.html [6] International Journal of Computer Science Issues. Setting up of an Open Source based Private Cloud http://ijcsi.org/papers/IJCSI-8-3-1-354-359.pdf [7] Eucalyptus. Modifying a prepackaged image http://open.eucalyptus.com/participate/wiki/modifyi ng-prepackaged-image [8] Michael G. Noll. Running Hadoop On Ubuntu Linux (Single-Node Cluster) http://www.michaelnoll.com/tutorials/running-hadoop-on-ubuntu-linuxsingle-node-cluster/ [9] 8K Miles Cloud Solutions. Hadoop: CDH3 – Cluster (Fully-Distributed) Setup http://cloudblog.8kmiles.com/2011/12/08/hadoopcdh3-cluster-fully-distributed-setup/ [10] Apache Mahout. Creating Vectors from Text https://cwiki.apache.org/MAHOUT/creatingvectors-from-text.html…

    • 3006 Words
    • 13 Pages
    Powerful Essays
  • Powerful Essays

    MIS 563 COURSE PROJECT

    • 2795 Words
    • 12 Pages

    This finding has ABC University seeking a streamlined way to manage their data and for users to access the data that is clean. With this, the University has proposed the creation and implementation of a data warehouse to house all the data from each one of these operational databases into one central location where all students, staff and faculty can access the data using a self service tool such as a report or a data connection to Microsoft Excel to pull data into pivot tables.…

    • 2795 Words
    • 12 Pages
    Powerful Essays
  • Satisfactory Essays

    UNIT 8 Manifest Destiny

    • 578 Words
    • 4 Pages

    Target 4 – Analyze the factors that led to Texas becoming independent from Mexico and how and why the Texas issue became so controversial in the United States.…

    • 578 Words
    • 4 Pages
    Satisfactory Essays
  • Good Essays

    Cis 515week 3

    • 1024 Words
    • 4 Pages

    Barron, K. (2013, May 13). Columbus state university implements data warehouse and business intelligence environment to improve student retention and graduation rates. Retrieved from http://www.oracle.com/us/corporate/press/1946491…

    • 1024 Words
    • 4 Pages
    Good Essays
  • Satisfactory Essays

    Answer: A data warehouse contain pool of data both current and of the past/historical which in turn are used to support decision making by the managers., Without it, GeoStor would lack the variety of data it needs to be able to perform different tasks for different functions.…

    • 617 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    True False

    • 378 Words
    • 2 Pages

    5. Data mining uses business intelligence tools and techniques on a variety of data sources brought together in a data warehouse.…

    • 378 Words
    • 2 Pages
    Good Essays
  • Powerful Essays

    Research questions in the three articles were presented by the authors. The first article questioned the reported practice of transformational leadership behavior being high or low depending on the support of higher levels of transformational leadership in those organizations. Which differs from another article on transformational leadership by Emery and Barker(2007) in that it emphasizes transformational leadership 's goals are to align the goals of the workers, who have direct contact with customers, to management. As these values align with management, greater…

    • 2522 Words
    • 11 Pages
    Powerful Essays
  • Good Essays

    | * The data warehouse of St George bank supports the integrated data among different departments * Data from different departments can be accessed freely * Integrated data from the data warehouse is more beneficial and creates more opportunities and BI for all departments (1+1=3) * “Most departments extract what they need from the warehouse using customer relationship management and BI applications without intervention.” * “They have access to all the data, can create their own filters, their own campaigns.”…

    • 341 Words
    • 1 Page
    Good Essays
  • Good Essays

    If I were to design Ben & Jerry's data warehouse I would use several dimensions of information. The first dimension would consist of the company's products; ice cream, frozen yogurt or merchandise. The marketing department has to know which products are selling, if Ben & Jerry's didn't know that their T-shirts are selling out as soon as they hit the stores, then they wouldn't be able to take advantage of the opportunity to sell the shirts. The second dimension would consist of the different areas of sales; US, Canada, Mexico, or Europe. I am not sure if they sell their ice cream in Mexico, but with data collection they can find out if their ice cream would be a better seller in the hot climate, rather than pushing for greater distribution in Canada. The third dimension would consist of the "specifics"; where the sale was made, when the sale was made, and who purchased the product. This information can help in the design of the product to focus on the buyer; it can tailor flavors to seasons, and packaging to buyer who looks for the better-looking product. If Ben & Jerry's could know when a season was coming to an end in a specific area, then they could forecast the need or the decline in need and speed up, or slow down distribution to those areas. The focus of the information is that it needs to be useful, and almost any information is useful.…

    • 605 Words
    • 3 Pages
    Good Essays
  • Good Essays

    Bis Midterm Sheet

    • 1467 Words
    • 6 Pages

    A data warehouse is to extract and clean data from operational systems and other sources to store and catalog that data for processing by BI tools. Data warehouses can include external data purchased from outside sources. Meta data is kept in the data warehouse. Physically, a data warehouse consists of a few fast computers with very large storage devices.…

    • 1467 Words
    • 6 Pages
    Good Essays
  • Powerful Essays

    Du Preez, D. (2012a). Big data: hands on or hands off? 21 Feb 2012. Computing Feature, (n.d.). Retrieved from http://www.computing.co.uk/ctg/feature/2153789/-hands-hands/page/1…

    • 1730 Words
    • 7 Pages
    Powerful Essays
  • Powerful Essays

    Sysco

    • 906 Words
    • 4 Pages

    1. Utilization: Massive amount of data in the data warehouse are stored, but cannot be analyzed.…

    • 906 Words
    • 4 Pages
    Powerful Essays
  • Best Essays

    Raden, N. (2012). Big data analytics architecture: Putting all your eggs in three baskets. Hired…

    • 2200 Words
    • 9 Pages
    Best Essays
  • Powerful Essays

    Data Warehousing Failures

    • 5385 Words
    • 22 Pages

    At first, the planner for the data warehouse wanted to use a dimensional model for tabular information. But political pressure forced the system’s early use. Consequently, mainframe data was largely replicated and these tables did not work well with the managed query environment tools that were acquired. The number of tables and joins, and subsequent catalog growth, prevented Auto Guys from using data as it was intended in a concise and coherent business format.…

    • 5385 Words
    • 22 Pages
    Powerful Essays
  • Best Essays

    Data Warehousing and Olap

    • 2507 Words
    • 11 Pages

    A data warehouse is a “subject-oriented, integrated, time varying, non-volatile collection of data that is used primarily in organizational…

    • 2507 Words
    • 11 Pages
    Best Essays