Scientific Computing Goals Past progress Future. Goals Numerical algorithms & computational strategies Solve specific set of problems associated with.

Slides:



Advertisements
Similar presentations
The first-generation Cell Broadband Engine (BE) processor is a multi-core chip comprised of a 64-bit Power Architecture processor core and eight synergistic.
Advertisements

GPU Programming using BU Shared Computing Cluster
The Big Picture Scientific disciplines have developed a computational branch Models without closed form solutions solved numerically This has lead to.
COMPUTER GRAPHICS CS 482 – FALL 2014 NOVEMBER 10, 2014 GRAPHICS HARDWARE GRAPHICS PROCESSING UNITS PARALLELISM.
Prepared 5/24/2011 by T. O’Neil for 3460:677, Fall 2011, The University of Akron.
GRAPHICS AND COMPUTING GPUS Jehan-François Pâris
Development of a track trigger based on parallel architectures Felice Pantaleo PH-CMG-CO (University of Hamburg) Felice Pantaleo PH-CMG-CO (University.
IMGD 4000: Computer Graphics in Games Emmanuel Agu.
OpenFOAM on a GPU-based Heterogeneous Cluster
A many-core GPU architecture.. Price, performance, and evolution.
Parallel Programming Henri Bal Rob van Nieuwpoort Vrije Universiteit Amsterdam Faculty of Sciences.
Interactive Deformation and Visualization of Level-Set Surfaces Using Graphics Hardware Aaron Lefohn Joe Kniss Charles Hansen Ross Whitaker Aaron Lefohn.
L13: Review for Midterm. Administrative Project proposals due Friday at 5PM (hard deadline) No makeup class Friday! March 23, Guest Lecture Austin Robison,
Reconfigurable Application Specific Computers RASCs Advanced Architectures with Multiple Processors and Field Programmable Gate Arrays FPGAs Computational.
1 ITCS 6/8010 CUDA Programming, UNC-Charlotte, B. Wilkinson, Jan 19, 2011 Emergence of GPU systems and clusters for general purpose High Performance Computing.
3D Graphics Processor Architecture Victor Moya. PhD Project Research on architecture improvements for future Graphic Processor Units (GPUs). Research.
Interactive, GPU-Based Level Sets for 3D Segmentation Aaron Lefohn Joshua Cates Ross Whitaker University of Utah Aaron Lefohn Joshua Cates Ross Whitaker.
IN4151 Introduction 3D graphics 1 Introduction to 3D computer graphics part 2 Viewing pipeline Multi-processor implementation GPU architecture GPU algorithms.
Computer Graphics & Scientific Computing
Scientific Visualization Data and Geometry Processing for Scientific Visualization CS 5630 / 6630 August 28, 2007.
Algorithms in a Multiprocessor Environment Kevin Frandsen ENCM 515.
University of Michigan Electrical Engineering and Computer Science Amir Hormati, Mehrzad Samadi, Mark Woh, Trevor Mudge, and Scott Mahlke Sponge: Portable.
Fluid Simulation using CUDA Thomas Wambold CS680: GPU Program Optimization August 31, 2011.
GPGPU overview. Graphics Processing Unit (GPU) GPU is the chip in computer video cards, PS3, Xbox, etc – Designed to realize the 3D graphics pipeline.
CSE 690: GPGPU Lecture 11: Projects, Papers Klaus Mueller Computer Science, Stony Brook University.
Interactive Visualization of Volumetric Data on Consumer PC Hardware: Introduction Daniel Weiskopf Graphics Hardware Trends Faster development than Moore’s.
CSE 690 General-Purpose Computation on Graphics Hardware (GPGPU) Courtesy David Luebke, University of Virginia.
Motivation “Every three minutes a woman is diagnosed with Breast cancer” (American Cancer Society, “Detailed Guide: Breast Cancer,” 2006) Explore the use.
Chapter 2 Computer Clusters Lecture 2.3 GPU Clusters for Massive Paralelism.
Computationally Efficient Histopathological Image Analysis: Use of GPUs for Classification of Stromal Development Olcay Sertel 1,2, Antonio Ruiz 3, Umit.
Computer Graphics Graphics Hardware
Roger Jones, Lancaster University1 Experiment Requirements from Evolving Architectures RWL Jones, Lancaster University Ambleside 26 August 2010.
General Purpose Computing on Graphics Processing Units: Optimization Strategy Henry Au Space and Naval Warfare Center Pacific 09/12/12.
GPU in HPC Scott A. Friedman ATS Research Computing Technologies.
GPU-Accelerated Surface Denoising and Morphing with LBM Scheme Ye Zhao Kent State University, Ohio.
A High-Performance Scalable Graphics Architecture Daniel R. McLachlan Director, Advanced Graphics Engineering SGI.
Stream Processing Main References: “Comparing Reyes and OpenGL on a Stream Architecture”, 2002 “Polygon Rendering on a Stream Architecture”, 2000 Department.
1 Multi-valued Geodesic based Fiber Tracking for Diffusion Tensor Imaging Neda Sepasian Supervised by Prof. Bart ter Haar Romeny, Dr. Anna Vilanova Bartoli.
Summary Background –Why do we need parallel processing? Moore’s law. Applications. Introduction in algorithms and applications –Methodology to develop.
1 1 What does Performance Across the Software Stack mean?  High level view: Providing performance for physics simulations meaningful to applications 
Some key aspects of NVIDIA GPUs and CUDA. Silicon Usage.
Introduction: Lattice Boltzmann Method for Non-fluid Applications Ye Zhao.
May 8, 2007Farid Harhad and Alaa Shams CS7080 Overview of the GPU Architecture CS7080 Final Class Project Supervised by: Dr. Elias Khalaf By: Farid Harhad.
GPU Accelerated MRI Reconstruction Professor Kevin Skadron Computer Science, School of Engineering and Applied Science University of Virginia, Charlottesville,
Havok FX Physics on NVIDIA GPUs. Copyright © NVIDIA Corporation 2004 What is Effects Physics? Physics-based effects on a massive scale 10,000s of objects.
Large Scale Time-Varying Data Visualization Han-Wei Shen Department of Computer and Information Science The Ohio State University.
MSIM 842 VISUALIZATION II INSTRUCTOR: JESSICA R. CROUCH 1 A Particle System for Interactive Visualization of 3D Flows Jens Krüger Peter Kipfer.
NA-MIC National Alliance for Medical Image Computing Core 1b – Engineering Computational Platform Jim Miller GE Research.
Debunking the 100X GPU vs. CPU Myth An Evaluation of Throughput Computing on CPU and GPU Present by Chunyi Victor W Lee, Changkyu Kim, Jatin Chhugani,
Linear Algebra Operators for GPU Implementation of Numerical Algorithms J. Krüger R. Westermann computer graphics & visualization Technical University.
From Turing Machine to Global Illumination Chun-Fa Chang National Taiwan Normal University.
© David Kirk/NVIDIA and Wen-mei W. Hwu, ECE408/CS483, University of Illinois, Urbana-Champaign 1 Graphic Processing Processors (GPUs) Parallel.
IWarp Based Remote Interactive Scientific Visualization CENIC 2008 Oakland, CA Scott A. Friedman UCLA Academic Technology Services Research Computing Technologies.
Collaboration with Craig Henriquez’ laboratory at Duke University Multi-scale Electro- physiological Modeling.
Computer Architecture Lecture 24 Parallel Processing Ralph Grishman November 2015 NYU.
Tackling I/O Issues 1 David Race 16 March 2010.
GPGPU introduction. Why is GPU in the picture Seeking exa-scale computing platform Minimize power per operation. – Power is directly correlated to the.
3/12/2013Computer Engg, IIT(BHU)1 CUDA-3. GPGPU ● General Purpose computation using GPU in applications other than 3D graphics – GPU accelerates critical.
Parallel Computers Today Oak Ridge / Cray Jaguar > 1.75 PFLOPS Two Nvidia 8800 GPUs > 1 TFLOPS Intel 80- core chip > 1 TFLOPS  TFLOPS = floating.
Fast and parallel implementation of Image Processing Algorithm using CUDA Technology On GPU Hardware Neha Patil Badrinath Roysam Department of Electrical.
Fermi National Accelerator Laboratory & Thomas Jefferson National Accelerator Facility SciDAC LQCD Software The Department of Energy (DOE) Office of Science.
Martin Kruliš by Martin Kruliš (v1.0)1.
Computer Graphics Graphics Hardware
VisIt Project Overview
Spatial Analysis With Big Data
Artificial Intelligence
Parallel I/O System for Massively Parallel Processors
Computer Graphics Graphics Hardware
MPM Particle Sim + Galaxy
CIS 6930: Chip Multiprocessor: GPU Architecture and Programming
Presentation transcript:

Scientific Computing Goals Past progress Future

Goals Numerical algorithms & computational strategies Solve specific set of problems associated with center – – simulation, image processing, meshing Parallel architectures Clusters/MPI Multiprocessors Streaming architectures (GPUs) Deliver new technologies – software infrastructure APIs and end-user applications

Activities/Results Shared memory architectures General multithreading capability Linear solvers for PDEs Scriptable capability for exploring parameters spaces GPUs Visualization Volume rendering

Graphics Processing Units (GPUs) “Graphics boards” Built primarily for 3D computer games What they do Geometry (polygons) Textures (images) Trends Increased power – – Data types, speed, memory Increased flexibility – – Arithmetic, data types Inexpensive (e.g. $400)

Other GPU Activities in SCI Interactive 3D level-set segmentation (NSF) Interactive image segmentation (Exxon Mobil) Diffusion tensor image analysis (NIH) Tomographic/MRI reconstruction (GE, NIH) Sorting algorithms with CUDA (NSF, DOE) Unbiased atlas construction (NSF, NIH) Streaming tetrahedra simplification (NSF, DOE) Unstructured grid volume rendering (DOE) Material point method (MPM) simulations (DOE) Dye-advection for flow visualization (KAUST) Schlieren imaging for flow visualization (DOE) Computing on GPU clusters (NVIDIA,NSF)

Planned Activities Simulation Multithreaded work for PDE solvers Clusters/MPI Extend to GPUs Images-to-models N-body interactions – – Particle systems – multiprocessors and GPUs Image analysis – – PDEs, filtering, shortest paths on GPUs Visualization GPUs for rendering, analysis

Challenges Shared memory Load balancing Cache performance Limited parallism Clusters Load balancing Communication Shared datastructures (programming models)

Challenges GPUs Computational model Efficient memory access (RAM model not efficient) Limited memory and GPU-CPU bandwidth Software Portability, reliability are challenges CUDA could help (NVIDIA only)