Preview

High Performance Computing: Linux Cluster

Powerful Essays
Open Document
Open Document
4104 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
High Performance Computing: Linux Cluster
HIGH PERFORMANCE COMPUTING: DESIGN, BUILD AND BENCHMARK A LINUX CLUSTER by GOH YOKE LOONG KEM 050024 Semester 1 Session 2008/09

Final year research project report submitted to the Faculty of Engineering University of Malaya in partial fulfillment of the requirement to be Bachelor’s degree of Engineering

DEPARTMENT OF MECHANICAL ENGINEERING ENGINEERING FACULTY UNIVERSITY OF MALAYA October 2008

ABSTRACT Nowadays almost every industry needs fast processing power especially in engineering field. Manipulating high-resolution interactive graphics in engineering, such as in aircraft engine design, has always been a challenge in terms of performance and scalability because of the sheer volume of data involved.
[1]

Linux clustering is

popular in many industries these days. With the advent of clustering technology and the growing acceptance of open source software, supercomputers can now be created for cheaper cost of traditional high-performance machines. Due to these conditions compromise, the number, variety, and specialized configurations of these machines are increasing dramatically with 32 – 128 node clusters being commonplace in science labs.
[2]

As a result, the intent of this research is to use the latest open source

software and computers available in the computer laboratory of Mechanical Engineering Department, University of Malaya to design and build a High Performance Linux Cluster. This paper will present the clustering fundamentals and details of how to setup the cluster. High performance cluster is mostly parallel programming. This paper shows how to run parallel programming with Message Passing Interface (MPI). High-quality implementations for MPI are freely available for FORTRAN, C, and C++ for Linux. In this research, MPICH 2 is used as MPI implementation. Extensive research will be carried out by benchmarking the performance of the cluster with the standard test codes. The results gained will use to compare with the existing

You May Also Find These Documents Helpful

  • Powerful Essays

    [4] Storage Conference. The Hadoop Distributed File System http://storageconference.org/ 2010/ Papers/ MSST/Shvachko.pdf [5] A Tutorial on Clustering Algorithms. K-Means Clustering http://home.dei.polimi.it/matteucc/ Clustering/ tutorial_html/kmeans.html [6] International Journal of Computer Science Issues. Setting up of an Open Source based Private Cloud http://ijcsi.org/papers/IJCSI-8-3-1-354-359.pdf [7] Eucalyptus. Modifying a prepackaged image http://open.eucalyptus.com/participate/wiki/modifyi ng-prepackaged-image [8] Michael G. Noll. Running Hadoop On Ubuntu Linux (Single-Node Cluster) http://www.michaelnoll.com/tutorials/running-hadoop-on-ubuntu-linuxsingle-node-cluster/ [9] 8K Miles Cloud Solutions. Hadoop: CDH3 – Cluster (Fully-Distributed) Setup http://cloudblog.8kmiles.com/2011/12/08/hadoopcdh3-cluster-fully-distributed-setup/ [10] Apache Mahout. Creating Vectors from Text https://cwiki.apache.org/MAHOUT/creatingvectors-from-text.html…

    • 3006 Words
    • 13 Pages
    Powerful Essays
  • Satisfactory Essays

    “This article describes programmability studies undertaken by Sun Microsystems in its HPCS participation.” FORTRAN has been the preferred language among the High Performance Computing (HPC) community, it is less likely that every programmer would have ended up preferring FORTRAN and more likely that the problems with the original source codes have more to do with reasons than the limitations of existing programming languages. The need for HPC benchmarks in the new languages could reduce source-code volume substantially, but rewriting these benchmarks even in FORTRAN could achieve similar source-code reductions and corresponding improvements in expressivity. This article mentions that the programmer rewrote many of HPC benchmarks and applications using modern FORTRAN taking into account the human costs of software development: programmability and associated characteristics such as readability, verifiability, and maintainability. “These are important considerations; although copy-and-paste is a fast way of writing lines of code, it degrades readability and increases maintenance costs.”…

    • 630 Words
    • 2 Pages
    Satisfactory Essays
  • Best Essays

    IBM SUPERCOMPUTER, WATSON

    • 2209 Words
    • 9 Pages

    The ability to coordinate all of these processors into one functioning logarithmic unit required a group of engineers from IBM to develop a specialized kernel-based virtual machine implementation with the ability to process eighty Tera-flops per seconds . The software that allowed all of this to occur is called Apache Hadoop. Hadoop is an open source framework software that is used to organize and manage grid computing environments. Since the theoretical limit of processors with current technology is set at a central processing unit (CPU) clock speed of three giga-hertz, a software model to enhance parallel processing for supercomputers had to be developed. With the use of Hadoop the programmers at IBM were able to more easily write applications for Watson that benefitted and took advantage of parallel processing to increase the speed at which problems could be solved and questions could be answered. The main reason why this makes things faster is the fact that one question can be researched in multiple paths at one time using parallel processing paths…

    • 2209 Words
    • 9 Pages
    Best Essays
  • Satisfactory Essays

    Guide to UNIX Using Linux Fourth Edition Chapter 4 SolutionsAnswers to the Chapter 4 Review Questions…

    • 1020 Words
    • 5 Pages
    Satisfactory Essays
  • Powerful Essays

    Gm Bankruptcy

    • 11457 Words
    • 46 Pages

    This project work is partial fulfillment of the requirement for the degree of B.Com (H)from Amity University Lucknow Campus, Uttar Pradesh.…

    • 11457 Words
    • 46 Pages
    Powerful Essays
  • Best Essays

    Open Flow

    • 2946 Words
    • 12 Pages

    [6] T. Benson, A. Anand, A. Akella, and M. Zhang. Understanding Datacenter Traffic Characteristics. SIGCOMM WREN workshop, 2009.…

    • 2946 Words
    • 12 Pages
    Best Essays
  • Satisfactory Essays

    I think that our society should keep stirrings because we need to have feelings.It is also a gift from God. If we didn’t have it we wouldn’t have crushes.If we never had crushes then we would never have stirrings, or we would never get married, and we could never have sex.If we never had that it would be bad and we couldn't have or own kids.If we couldn’t have our own kids then we wouldn't value them or love them as much as we would if they were our own biological kid.We would also have to have someone else’s kid not our own.So in my opinion I think that we should have stirrings…

    • 113 Words
    • 1 Page
    Satisfactory Essays
  • Satisfactory Essays

    To keep up with the growing demand for new software and product solutions it is imperative that IBM embrace and support open solution software products like Linux. As the saying goes, “If you can’t beat them, join them.” Linux is growing by leaps and bound and is quickly surpassing the software maker like Microsoft who at this point does not specialize in open source software. If they do not adapt to the new way that software is being used they will be just another software company that has been left behind.…

    • 347 Words
    • 2 Pages
    Satisfactory Essays
  • Powerful Essays

    Generic Benchmarking

    • 6770 Words
    • 28 Pages

    M-Core is on the brink of creating one of the most innovative technological advances of the 21st century. The super processing chip will enhance the planet with the ability to have faster computers and decrease the energy used to make computers function. On a worldwide scale, this advancement is remarkable.…

    • 6770 Words
    • 28 Pages
    Powerful Essays
  • Powerful Essays

    Yahoo, an American internet corporation currently utilizes a global cloud computing infrastructure that relies heavily on a technology called Apache Hadoop. Yahoo’s ability to crunch unimaginable amounts of data for the purpose of creating increasingly relevant experiences for its users is based on this Apache Hadoop technology. Apache Hadoop works with the cloud to process and analyze all of the data Yahoo collects efficiently. Because Yahoo places a high importance on its ability to customize results and stories for users, Hadoop is crucial, specifically by enabling previously unattainable feats of efficiency and speed. One area of IT infrastructure that partners really well with the capabilities of cloud computing that Yahoo uses is green computing, utilizing green server farms. I’m going to recommend that Yahoo increase its utilization of these new green server farms to maximize operational efficiency.…

    • 1177 Words
    • 5 Pages
    Powerful Essays
  • Good Essays

    WILEY SERIES ON PARALLEL AND DISTRIBUTED COMPUTING SERIES EDITOR: Albert Y. Zomaya Parallel & Distributed Simulation Systems / Richard Fujimoto Surviving the Design of Microprocessor and Multimicroprocessor Systems: Lessons Learned / Veljko Milutinovic Mobile Processing in Distributed and Open Environments / Peter Sapaty Introduction to Parallel Algorithms / C. Xavier and S.S. Iyengar Solutions to Parallel and Distributed Computing Problems: Lessons from Biological Sciences / Albert Y. Zomaya, Fikret Ercal, and Stephan Olariu (Editors) New Parallel Algorithms for Direct Solution of Linear Equations / C. Siva Ram Murthy, K.N. Balasubramanya Murthy, and Srinivas Aluru Practical PRAM Programming / Joerg Keller, Christoph Kessler, and Jesper Larsson Traeff Computational Collective Intelligence / Tadeusz M. Szuba Parallel & Distributed Computing: A Survey of Models, Paradigms, and Approaches / Claudia Leopold Fundamentals of Distributed Object Systems: A CORBA Perspective / Zahir Tari and Omran Bukhres Pipelined Processor Farms: Structured Design for Embedded Parallel Systems / Martin Fleury and Andrew Downton Handbook of Wireless Networks and Mobile Computing / Ivan Stojmenoviic (Editor) Internet-Based Workflow Management: Toward a Semantic Web / Dan C. Marinescu Parallel Computing on Heterogeneous Networks / Alexey L. Lastovetsky Tools and Environments for Parallel and Distributed Computing Tools / Salim Hariri and Manish Parashar Distributed Computing: Fundamentals, Simulations and Advanced Topics, Second Edition / Hagit Attiya and Jennifer Welch Smart Environments: Technology, Protocols and Applications / Diane J. Cook and Sajal K. Das (Editors) Fundamentals of Computer Organization…

    • 97856 Words
    • 392 Pages
    Good Essays
  • Powerful Essays

    How to get free access

    • 771 Words
    • 4 Pages

    It is based on a hierarchical design targeted at federations of clusters. It relies on a multicast-based listen/announce protocol to monitor state within clusters and uses a tree of point-to-point connections amongst representative cluster nodes to federate clusters and aggregate their state. It leverages widely used technologies such as XML for data representation, XDR for compact, portable data transport, and RRDtool for data storage and visualization. It uses carefully engineered data structures and algorithms to achieve very low per-node overheads and high concurrency. The implementation is robust, has been ported to an extensive set of operating systems and processor architectures, and is currently in use on over 500 clusters around the world. It has been used to link clusters across university campuses and around the world and can scale to handle clusters with 2000 nodes.[2]…

    • 771 Words
    • 4 Pages
    Powerful Essays
  • Good Essays

    Virtual Computing

    • 573 Words
    • 2 Pages

    In the world of virtualization there are two kingpins; VMware and Microsoft. As I researched these two vendors it has become apparent that comparing these two giants would be like comparing apples and oranges; like apples and oranges are fruit, and VMware and Microsoft make virtualization software, they are different altogether. This happens to be the scenario that I have seen, not only between the two companies, but in blog posts and forums.…

    • 573 Words
    • 2 Pages
    Good Essays
  • Good Essays

    Types of Computers

    • 418 Words
    • 2 Pages

    The types of computers range from the Hybrid to the Analog types. The computers you come across in the daily course of your day range from laptops, palmtops and towers, to desktop computers, to name a few. But the very word “computers” reminds one of the desktop computers used in offices or homes. Different categories of computes have been devised in keeping with our varied needs.…

    • 418 Words
    • 2 Pages
    Good Essays
  • Satisfactory Essays

    Recent, revolutionary integrated circuit technology improvement coupled with computer architecture innovations have resulted to a speed of CPU performance development unprecedented since the middle of the 20th century. Computer architecture innovations has continuously improve Microprocessor-based machines at a speed range of 1.5 and 2 times per year for the last seven years. Improvement speed for minicomputers or mainframes computers are approximately 25% every year. Seemingly, recent…

    • 399 Words
    • 2 Pages
    Satisfactory Essays