Presentation is loading. Please wait.

Presentation is loading. Please wait.

HPC Challenge Benchmark Suite 2006 SPEC Workshop January 23, 2006 Austin, TX Jack Dongarra Piotr Łuszczek

Similar presentations


Presentation on theme: "HPC Challenge Benchmark Suite 2006 SPEC Workshop January 23, 2006 Austin, TX Jack Dongarra Piotr Łuszczek"— Presentation transcript:

1 HPC Challenge Benchmark Suite 2006 SPEC Workshop January 23, 2006 Austin, TX Jack Dongarra Piotr Łuszczek

2 Jan 26, SPEC Workshop, Austin, TX2/20 High Productivity Computing Systems Impact: Performance (time-to-solution): speed up critical national security applications by a factor 10X to 40X Programmability (idea-to-first-solution): reduce cost and time of developing application solutions Portability (transparency): insulate research and operational application software from system Robustness (reliability): apply all known techniques to protect against outside attacks, hardware faults, and programming errors Goal: Provide a generation of economically viable high productivity computing systems for the national security and industrial user community (2010) Fill the Critical Technology and Capability Gap Today (late 80's HPC Technology)... to... Future (Quantum/Bio Computing) Applications: Intelligence/surveillance, reconnaissance, cryptanalysis, weapons analysis, airborne contaminant modeling and biotechnology HPCS Program Focus Areas

3 Jan 26, SPEC Workshop, Austin, TX3/20 HPCC Motivation and Design 1.Augment TOP500 Not use single number Provide detailed system description 2.Span locality space 3.Test various hardware components Spatial Locality Temporal Locality DGEMM HPL PTRANS STREAM FFT RandomAccess Mission Partner Applications Low High CPU MemoryInterconnect Computational resources

4 Jan 26, SPEC Workshop, Austin, TX4/20 HPCC Components 1.HPL (Hi-Perf LINPACK) 2.STREAM 3.PTRANS (A A T +B) 4.RandomAccess 5.FFT 6.Matrix-matrix multiply 7.b_eff (effective bandwidth/latency) name kernel bytes/iter FLOPS/iter COPY: a(i) = b(i) 16 0 SCALE: a(i) = q*b(i) 16 1 SUM: a(i) = b(i) + c(i) 24 1 TRIAD: a(i) = b(i) + q*c(i) T: T[k] (+) a i 64 bits E R = E E π C R

5 Jan 26, SPEC Workshop, Austin, TX5/20 HPCC Test Variants 1.Local 2.Embarrassingly parallel 3.Global 4.Network only M M P P P P M M P P P P M M P P P P M M P P P P Network M M P P P P M M P P P P M M P P P P M M P P P P M M P P P P M M P P P P M M P P P P M M P P P P

6 Jan 26, SPEC Workshop, Austin, TX6/20 Official HPCC Submission Process 1.Download 2.Install 3.Run 4.Upload results 5.Confirm via 6.Tune 7.Run 8.Upload results 9.Confirm via Only some routines can be replaced Data layout needs to be preserved Multiple languages can be used Only some routines can be replaced Data layout needs to be preserved Multiple languages can be used Provide detailed installation and execution environment Provide detailed installation and execution environment Results are immediately available on the web site: Interactive HTML XML MS Excel Kiviat charts (radar plots) Results are immediately available on the web site: Interactive HTML XML MS Excel Kiviat charts (radar plots) Optional Prequesites: C compiler BLAS MPI Prequesites: C compiler BLAS MPI

7 Jan 26, SPEC Workshop, Austin, TX7/20 Measuring Locality in Code HPC Challenge Benchmarks Select Applications Spatial and temporal data locality here is for one node/processor i.e., locally or in the small Generated by SDSC

8 Jan 26, SPEC Workshop, Austin, TX8/20 HPCC Awards: SC|05 BOF Class 1: Best Performance Figure of merit: raw system performance Submission must be valid HPCC database entry Side effect: populate HPCC database 4 categories: HPCC components HPL STREAM RandomAccess FFT Award certificates 4x $500 from HPCwire Class 2: Most Productivity Figure of merit: performance and elegance Highly subjective Based on committee vote Submission must implement at least 2 out of 4 Class 1 tests The more tests the better Performance numbers are a plus The submission process: Source code Marketing brochure SC|05 BOF presentation Award certificate $1500 from HPCwire HPCwire contribution: press coverage $3500 awards HPCwire contribution: press coverage $3500 awards

9 Jan 26, SPEC Workshop, Austin, TX9/20 HPCC Awards Class 2 Detailed Results

10 Jan 26, SPEC Workshop, Austin, TX10/20 Time line: HPL Submission Stats 259 Tflop/s 110 Gflop/s HPCS goal: 2000 Tflop/s SC04 SC|05 1.IBM BG/L259(LLNL) 2.IBM BG/L 67(Watson) 3.IBM Power5 58(LLNL) x7 TOP500: 280 Tflop/s TOP500 Systems in HPCC database: #1, #2, #3, #4, #10, #14, #17, #35, #37, #71, #80 TOP500 Systems in HPCC database: #1, #2, #3, #4, #10, #14, #17, #35, #37, #71, #80

11 Jan 26, SPEC Workshop, Austin, TX11/20 Time line: STREAM Submission Stats 160 TB/s 27 GB/s HPCS goal: 6500 TB/s SC04 SC|05 1.IBM BG/L160(LLNL) 2.IBM Power5 55(LLNL) 3.IBM BG/L 40(Watson) x40

12 Jan 26, SPEC Workshop, Austin, TX12/20 Time line: FFT Submission Stats 2311 Gflop/s 4 Gflop/s HPCS goal: 500 Tflop/s SC|05 SC04 1.IBM BG/L 2.3(LLNL) 2.IBM BG/L 1.1(Watson) 3.IBM Power5 1.0(LLNL) x200

13 Jan 26, SPEC Workshop, Austin, TX13/20 Timeline: RandomAccess Submission Stats 35 GUPS 0.01 GUPS HPCS goal: GUPS SC04 SC|05 1.IBM BG/L35(LLNL) 2.IBM BG/L17 (Watson) 3.Cray X1E 8 (ORNL) x1800

14 Jan 26, SPEC Workshop, Austin, TX14/20 Kiviat Charts: Multi-network Example AMD Opteron clusters 2.2 GHz 64-processor cluster Interconnects 1.GigE 2.Commodity 3.Vendor Cannot be differentiated based on: HPL Matrix-matrix multiply Available on HPCC website Kiviat chart (radar plot)

15 Jan 26, SPEC Workshop, Austin, TX15/20 HPCC Data Analysis: Normalize Example: divide by peak flop/s

16 Jan 26, SPEC Workshop, Austin, TX16/20 HPCC Data Analysis: Correlate Cray XT3 NEC SX-8 SGI Altix Is HPL an effective peak or just a peak?

17 Jan 26, SPEC Workshop, Austin, TX17/20 HPCC Data Analysis: Correlate More Can I just run DGEMM (local matrix-matrix multiply) instead of HPL? DGEMM alone overestimates HPL performance Note the 1000x difference in scales: Tera vs. Giga Cray XT3 NEC SX-8 SGI Altix

18 Jan 26, SPEC Workshop, Austin, TX18/20 HPCC Data Analysis: Correlate Yet More Cray XT3 NEC SX-8 SGI Altix Cray X1E/opt IBM BG/L Rackable

19 Jan 26, SPEC Workshop, Austin, TX19/20 Future Directions Reduce execution time Preserve relevance of existing results Add new tests but not duplicate effort Sparse matrix operations I/O Smith-Waterman (sequence alignment) Porting Cell/PS3 Languages Co-Array Fortran HPCS languages: Chapel, Fortress, X10 Environments Paradigms

20 Jan 26, SPEC Workshop, Austin, TX20/20 Collaborators David Bailey NERSC/LBL Jeremy Kepner MIT Lincoln Lab David Koester MITRE Bob Lucas ISI/USC Rusty Lusk ANL John McCalpin IBM Austin «» AMD Rolf Rabenseifner HLRS Stuttgart Daisuke Takahashi Tsukuba, Japan


Download ppt "HPC Challenge Benchmark Suite 2006 SPEC Workshop January 23, 2006 Austin, TX Jack Dongarra Piotr Łuszczek"

Similar presentations


Ads by Google