Interactive Supercomputing Update IDC HPC User’s Forum, September 2008.

Slides:

Advertisements

Similar presentations

E-Science Data Information and Knowledge Transformation Thoughts on Education and Training for E-Science Based on edikt project experience Dr. Denise Ecklund.

Advertisements

Master/Slave Architecture Pattern Source: Pattern-Oriented Software Architecture, Vol. 1, Buschmann, et al.

Priority Research Direction (I/O Models, Abstractions and Software) Key challenges What will you do to address the challenges? – Develop newer I/O models.

Teaching Courses in Scientific Computing 30 September 2010 Roger Bielefeld Director, Advanced Research Computing.

IDC HPC User Forum Conference Appro Product Update Anthony Kenisky, VP of Sales.

Information Technology Center Introduction to High Performance Computing at KFUPM.

Using R as enterprise-wide data analysis platform Zivan Karaman.

Microsoft Technical Computing Modeling the world with greater fidelity Wolfgang Dreyer, TC - Microsoft Germany.

Parallel Programming Henri Bal Rob van Nieuwpoort Vrije Universiteit Amsterdam Faculty of Sciences.

FPGA chips and DSP Algorithms By Emily Fabes. 2 Agenda FPGA Background Reasons to use FPGA’s Advantages and disadvantages of using FPGA’s Sample VHDL.

Star-P and the Knowledge Discovery Suite Steve Reinhardt, Viral Shah, John Gilbert,

1 New Architectures Need New Languages A triumph of optimism over experience! Ian Watson 3 rd July 2009.

FLANN Fast Library for Approximate Nearest Neighbors

Copyright © 2014 Pearson Education, Inc. 1 It's what you learn after you know it all that counts. John Wooden Key Terms and Review (Chapter 6) Enhancing.

1 Down Place Hammersmith London UK 530 Lytton Ave. Palo Alto CA USA.

A Top Level Overview of Parallelism from Microsoft's Point of View in 15 minutes IDC HPC User’s Forum April 2010 David Rich Director Strategic Business.

1 Using R for consumer psychological research Research Analytics | Strategy & Insight September 2014.

Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.

GPU-accelerated Evaluation Platform for High Fidelity Networking Modeling 11 December 2007 Alex Donkers Joost Schutte.

“SEMI-AUTOMATED PARALLELISM USING STAR-P " “SEMI-AUTOMATED PARALLELISM USING STAR-P " Dana Schaa 1, David Kaeli 1 and Alan Edelman 2 2 Interactive Supercomputing.

© 2008 The MathWorks, Inc. ® ® Parallel Computing with MATLAB ® Silvina Grad-Freilich Manager, Parallel Computing Marketing

Company Overview for GDF Suez December 29, Enthought’s Business Enthought provides products and consulting services for scientific software solutions.

Results Matter. Trust NAG. Numerical Algorithms Group Mathematics and technology for optimized performance Alternative Processors Panel IDC, Tucson, Sept.

1 Down Place Hammersmith London UK 530 Lytton Ave. Palo Alto CA USA.

IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.

Uncovering the Multicore Processor Bottlenecks Server Design Summit Shay Gal-On Director of Technology, EEMBC.

If Exascale by 2018, Really? Yes, if we want it, and here is how Laxmikant Kale.

Panel on Training and Developing HPC People HPC User Forum Dearborn MI April 13, 2010 Paul Buerger Avetec/DICE program Jim Kasdorf.

Taking the Complexity out of Cluster Computing Vendor Update HPC User Forum Arend Dittmer Director Product Management HPC April,

High Performance Embedded Computing © 2007 Elsevier Lecture 3: Design Methodologies Embedded Computing Systems Mikko Lipasti, adapted from M. Schulte Based.

IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.

4.2.1 Programming Models Technology drivers – Node count, scale of parallelism within the node – Heterogeneity – Complex memory hierarchies – Failure rates.

SJSU SPRING 2011 PARALLEL COMPUTING Parallel Computing CS 147: Computer Architecture Instructor: Professor Sin-Min Lee Spring 2011 By: Alice Cotti.

Reminder Lab 0 Xilinx ISE tutorial Research Send me an if interested Looking for those interested in RC with skills in compilers/languages/synthesis,

MATRIX MULTIPLY WITH DRYAD B649 Course Project Introduction.

Center for Component Technology for Terascale Simulation Software CCA is about: Enhancing Programmer Productivity without sacrificing performance. Supporting.

Introduction to Reconfigurable Computing Greg Stitt ECE Department University of Florida.

HPC User Forum Back End Compiler Panel SiCortex Perspective Kevin Harris Compiler Manager April 2009.

Numerical Libraries Project Microsoft Incubation Group Mary Beth Hribar Microsoft Corporation CSCAPES Workshop June 10, 2008 Copyright Microsoft Corporation,

Distributed Information Systems. Motivation ● To understand the problems that Web services try to solve it is helpful to understand how distributed information.

Experts in numerical algorithms and HPC services Compiler Requirements and Directions Rob Meyer September 10, 2009.

BOĞAZİÇİ UNIVERSITY DEPARTMENT OF MANAGEMENT INFORMATION SYSTEMS MATLAB AS A DATA MINING ENVIRONMENT.

Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.

Highest performance parallel storage for HPC environments Garth Gibson CTO & Founder IDC HPC User Forum, I/O and Storage Panel April 21, 2009.

Lecture 4 Page 1 CS 111 Online Modularity and Virtualization CS 111 On-Line MS Program Operating Systems Peter Reiher.

Non-negative Matrix Factorization

An Interactive Environment for Combinatorial Scientific Computing Viral B. Shah John R. Gilbert Steve Reinhardt With thanks to: Brad McRae, Stefan Karpinski,

Workshop on Advanced Computing for Accelerators Day 3 Roger Barlow.

1 Circuitscape Capstone Presentation Team Circuitscape Katie Rankin Mike Schulte Carl Reniker Sean Collins.

Abdul Rahim Ahmad MITM 613 Intelligent System Chapter 10: Tools.

GPU Computing for GIS James Mower Department of Geography and Planning University at Albany.

A Connectivity-Based Popularity Prediction Approach for Social Networks Huangmao Quan, Ana Milicic, Slobodan Vucetic, and Jie Wu Department of Computer.

Software Design and Architecture

Introduction. News you can use Hardware –Multicore chips (2009: mostly 2 cores and 4 cores, but doubling) (cores=processors) –Servers (often.

Evolution at CERN E. Da Riva1 CFD team supports CERN development 19 May 2011.

Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.

Productive Performance Tools for Heterogeneous Parallel Computing

Code Optimization.

Introduction to Parallel Computing: MPI, OpenMP and Hybrid Programming

Introduction Super-computing Tuesday

Learn about MATLAB Engineers – not sales!

CS 179 Project Intro.

Restructuring the multi-resolution approximation for spatial data to reduce the memory footprint and to facilitate scalability Vinay Ramakrishnaiah Mentors:

GENERAL VIEW OF KRATOS MULTIPHYSICS

Computer Services Business challenge

MORE ON ARCHITECTURES The main reasons for using an architecture are maintainability and performance. We want to structure the software into reasonably.

Vrije Universiteit Amsterdam

Panel on Training and Developing HPC People

Presentation transcript:

Interactive Supercomputing Update IDC HPC User’s Forum, September 2008

Agenda Why am I here? Some trends… What does Interactive Supercomputing do? What’s new? (and app examples if there is time) 2

Why I’m here (at least partly) At the April User’s Forum meeting, somebody on a panel said something like; ‘I don’t want to learn MPI, I wish computer scientists would build tools to make my life easier.’ At that very moment, I was interviewing with Interactive Supercomputing… 3 

HPC Conventional Wisdom Includes; Computing cost continues to decline while reality cost continues to rise – creating pull for “in silico” techniques More compute power is needed for multiple reasons; More fidelity; multi-physics; data explosion… Increasing complexity in the compute engine More cores, not faster cores; Potentially less capability / core; Multi-threading HW; The usual pain points are only getting worse. E.g. memory and i/o BW/FLOP, latencies… Creating a more difficult strategy choice for development; multicore, manycore, gpu, thin, thick or fat nodes… 4 There is a strong need for new development tools -- even for experienced parallel programmers. But in the meantime…

The Domain Expert View Swamped by the velocity of their own domain Long ago moved from 3GL’s to VHLL’s E.g. from FORTRAN to some variant of the M language (most likely Matlab®) … and don’t want to move back Now have enough data and math to need more than one desktop worth of compute Our surveys show as many as 40% of users are performance limited for some applications 5

What we do: Make high performance computing accessible to the widest possible range of users; enable domain experts to develop and deploy high performance parallel applications easily 6 Note: “server” includes “cluster”

Star-P Value Proposition Higher Productivity, Quicker Results, No complex programming

Star-P Open Software Platform

What’s New? (courtesy PNNL) 9 We call this step Knowledge Discovery

Why Star-P for Knowledge Discovery? Need to match algorithm to data means users need to experiment with multiple algorithms VHLL makes code changes easy Note, we see this requirement often – e.g. in finance and intelligence where codes must be continually adapted Size of data means HPC is required for experiments With Star-P, good enough speed-up is achieved quickly Star-P includes KD functions which run in parallel and Parallel I/O to remove that potential bottleneck 10

Factoring network flow behavior [Karpinski, Almeroth, Belding]

Algorithmic exploration Many NMF variants exist in the literature –Not clear how useful on large data –Not clear how to calibrate (i.e., number of iterations to converge) NMF algorithms combine linear algebra and optimization methods Basic and “improved” NMF factorization algorithms implemented: –euclidean (Lee & Seung 2000) –K-L divergence (Lee & Seung 2000) –semi-nonnegative (Ding et al. 2006) –left/right-orthogonal (Ding et al. 2006) –bi-orthogonal tri-factorization (Ding et al. 2006) –sparse euclidean (Hoyer et al. 2002) –sparse divergence (Liu et al. 2003) –non-smooth (Pascual-Montano et al. 2006)

NMF traffic analysis results NMF identifies essential components of the traffic Analyst labels different types of external behavior

Computational Ecology Modeling dispersal of species within a habitat (to maximize range) Large geographic areas, linked with GIS data Blend of numerical and combinatorial algorithms Brad McRae and Paul Beier, “Circuit theory predicts gene flow in plant and animal populations”, PNAS, Vol. 104, no. 50, December 11, 2007

Results Solution time reduced from 3 days (desktop) to 5 minutes (14p) for typical problems Aiming for much larger problems: Yellowstone-to-Yukon (Y2Y)

16 Thank You! David Rich VP Marketing