Preview

Play for Rangmanch

Better Essays
Open Document
Open Document
2435 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Play for Rangmanch
Intro to CUDA Programming http://www.oit.duke.edu/scsc http://wiki.duke.edu/display/SCSC scsc@duke.edu hpc-support@duke.edu

John Pormann, Ph.D. jbp1@duke.edu

Overview
Basic Introduction Intro to the Operational Model Simple Example ! Memory Allocation and Transfer ! GPU-Function Launch Grids of Blocks of Threads GPU Programming Issues Performance Issues/Hints

CUDA and NVIDIA
CUDA is an NVIDIA product and only runs on NVIDIA GPUs ! AMD/ATI graphics chips will NOT run CUDA ! Older NVIDIA GPUs may not run CUDA either
! *Some* laptops may be capable of running CUDA
"

Not sure what this will do to battery life

! All current and future display drivers from NVIDIA will include

support for CUDA
"

You don’t need to download anything else to run a CUDA program

! To see if your GPU is CUDA-enabled, go to:
"

http://www.nvidia.com/object/cuda_learn_products.html

Why GPU programming?
Parallelism ! CPUs recently moved to dual- and quad-core chips ! The current G100 GPU has 240 cores Memory bandwidth ! CPU (DDR-400) memory can go 3.2GB/sec ! GPU memory system can go 141.7GB/sec Speed ! CPUs can reach 20GFLOPS (per core) ! GPUs can reach 933GFLOPS (single-precision or integer) ! ... 78GFLOPS (double-precision) Cost ... $400-1000

Yesterday’s Announcement
NVIDIA recently held their annual developer conference and released info on the next generation of GPUs ... “Fermi”
3B transistors, 40nm 512 compute elements 8x increase in DP performance (~700GFLOPS) GDDR5 memory (230GB/sec) ECC memory L1 and L2 Cache memory (“configurable”?)

Operational Model
CUDA assumes a heterogeneous architecture -- both CPUs and GPUs -- with separate memory pools
! CPUs are “masters” and GPUs are the “workers”
" " "

CPUs launch computations onto the GPU CPUs can be used for other computations as well GPUs have limited communication back to CPU

! CPU must initiate data transfers to the GPU memory
" "

Synchronous Xfer -- CPU waits for xfer to complete

You May Also Find These Documents Helpful

  • Satisfactory Essays

    CPU is the Central Processing Unit. This is the key component or even the “brain” of the computer. The CPU carries out all the instructions and every single instruction must go through the CPU prior to being performed. It executes arithmetic, logical and input/output operations.…

    • 204 Words
    • 1 Page
    Satisfactory Essays
  • Satisfactory Essays

    Bus 210 - Appendix E

    • 346 Words
    • 2 Pages

    |Microprocessors |This is describes as a “computer on a chip” It rivaled the mainframe all though it was|…

    • 346 Words
    • 2 Pages
    Satisfactory Essays
  • Satisfactory Essays

    flash memory, which is a form of EEPROM instead of chips. The size of the CMOS…

    • 281 Words
    • 2 Pages
    Satisfactory Essays
  • Better Essays

    It190 Final Project

    • 1561 Words
    • 6 Pages

    | Because Accounts do not need that much memory and the computer comes with 4GB and a free upgrade to 6GB…

    • 1561 Words
    • 6 Pages
    Better Essays
  • Powerful Essays

    Chapters 1-4

    • 1910 Words
    • 8 Pages

    9. What term does AMD use to describe the processor technology processor that uses a 64-bit instruction set with 32-bit internal core processing?…

    • 1910 Words
    • 8 Pages
    Powerful Essays
  • Satisfactory Essays

    X86 and Internet Explorer

    • 269 Words
    • 2 Pages

    The Intel H55 Chipset utilizes the 1st gen. Intel i core series CPUs integrated graphics capability.…

    • 269 Words
    • 2 Pages
    Satisfactory Essays
  • Satisfactory Essays

    Opp

    • 422 Words
    • 2 Pages

    NVIDIA GeForce Go 7300: NVIDIA GeForce Go 7300 is a slightly enhanced GeForce Go 6200 with a performance a bit below average. Some not demanding current games can be played with low details fluently. Otherwise severe restrictions of the game settings must be tolerated. The performance is sufficient for office applications and surfing.…

    • 422 Words
    • 2 Pages
    Satisfactory Essays
  • Satisfactory Essays

    Processor transistors have roughly double in counts every 18 to 24 months. Each new chip contained roughly two times as much size as its predecessor. Moore’s Law described the growth trend of processor transistor that has continued to this day, and it has become the basis for many industry performance forecasts. In terms of size, cost, density, and speed, the growth in the number of transistors used in integrated circuits is reasonable. Compared to the increase in growth over the last decades to now, the growth of the number of transistors used in integrated circuits doesn’t look surprisingly fast or slow at all. Accordingly to Moore’s Law, one can predict that somewhere between 2018 and 2020 100 billion (or even a trillion) transistors may fit on a single chip. Although this trend has continued for…

    • 250 Words
    • 1 Page
    Satisfactory Essays
  • Satisfactory Essays

    Intel case

    • 867 Words
    • 4 Pages

    In 1970, dynamic random access memory (DRAM) had many competitive advantages over magnetic core memory. DRAM was not new technology, but no one had figured out how to mass produce it at a low cost. Intel had three different process technologies it was working on to develop DRAM but needed to determine which would give them the best competitive advantage. Multichip assembly was one approach but it was difficult to mass produce while the three-prong circuit version was too simple and could easily be copied by competitors. By creating the metal-oxide-semiconductor (MOS) transistor, Intel had a smaller product that was complex enough to prevent duplication, increase performance, and was more cost effective to build. Another advantage of the MOS technology was that it consumed lower power allowing it to reduce the risk of substitution. The MOS manufacturing process was prone to impurities, but as the process improved and yield rates rose, the MOS semiconductor grew to dominate the IC market. With Intel’s creation of the first one-kilobit DRAM, the 1103 chip, they were able to maintain dominance in the market place for two years.…

    • 867 Words
    • 4 Pages
    Satisfactory Essays
  • Good Essays

    Supercomputers that are capable of 1 trillion calculations per second, which will allow advances such as simulations of the human body for testing new drugs and computers that respond easily to spoken commands.…

    • 784 Words
    • 4 Pages
    Good Essays
  • Powerful Essays

    Bibliography: NECG 2001, GPU GasNet - Asset, equity and debt beta, viewed 30 October 2010…

    • 3525 Words
    • 15 Pages
    Powerful Essays
  • Satisfactory Essays

    Gpu

    • 314 Words
    • 2 Pages

    Now video cards built with a programmable architecture. Now we can alter the way vertices and triangles are processed.…

    • 314 Words
    • 2 Pages
    Satisfactory Essays
  • Better Essays

    Example Of Persuasive Memo

    • 1220 Words
    • 5 Pages

    • It must have at least 256mb of memory to efficiently run IE® and Microsoft Office® applications simultaneously…

    • 1220 Words
    • 5 Pages
    Better Essays
  • Satisfactory Essays

    Powerful central processing units (CPUs) demand large amounts of data, and they need that data fast. Otherwise, CPU performance is hindered by the lack of data storage support; therefore, computers have been designed to supply data through tiering memory and caches, which use the hard drive in combination with random access memory (RAM) to deliver massive amounts of data to the CPU. The speed of the RAM is crucial to the performance of data delivery; you want the most RAM at the fastest speed possible so the CPU is not waiting for information.…

    • 396 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    History of Intel

    • 842 Words
    • 4 Pages

    Until 1979, Intel‘s strategy appeared to work well. Across four generations of DRAMS, Intel succeeded in introducing devices and process technologies that were ahead of the competition and in commanding significant price premiums. But then Japanese competitors began to introduce new products more rapidly:…

    • 842 Words
    • 4 Pages
    Good Essays