Jay Lofstead Input/Output APIs and Data Organization for High Performance Scientific Computing November.


1 Input/Output APIs and Data Organization for High Performance Scientific Computing
Jay Lofstead (lofstead@cc.gatech.edu), http://www.adiosapi.org
November 17, 2008
Petascale Data Storage Institute Workshop at Supercomputing 2008
Jay Lofstead (GT), Fang Zheng (GT), Scott Klasky (ORNL), Karsten Schwan (GT)

2 Overview
- Motivation
- Architecture
- Performance

3 Motivation
- Many codes write lots of data, but rarely read
  - TBs of data
  - different types and sizes
- HDF-5 and pNetCDF used
  - convenient
  - tool integration
  - portable format

4 Performance/Resilience Challenges
- pNetCDF
  - “right sized” header
  - coordination for each data declaration
  - data stored as logically described
- HDF-5
  - b-tree format
  - coordination for each data declaration
  - single metadata store vulnerable to corruption

5 Architecture (ADIOS)
- Change IO method by changing XML file
- Switch between synchronous and asynchronous
- Hook into other systems like visualization and workflow
[Diagram: External Metadata (XML file); Scientific Codes; ADIOS API with buffering, schedule, feedback; methods DART, LIVE/DataTap, MPI-IO, POSIX IO, HDF-5, pnetCDF, Viz Engines, Others (plug-in)]
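
The idea of changing the IO method by editing an XML file, without touching the scientific code, can be sketched roughly as follows. This is a minimal illustration, not the ADIOS API: the XML element and attribute names, the `METHODS` table, and the two stand-in writer functions are all hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical external metadata; element and attribute names are
# illustrative only and do not follow the real ADIOS XML schema.
CONFIG_XML = """
<io-config>
  <group name="restart" method="POSIX"/>
  <group name="diagnostics" method="NULL"/>
</io-config>
"""

def write_posix(name, data):
    # Stand-in for a real POSIX transport.
    return f"POSIX wrote {len(data)} bytes for {name}"

def write_null(name, data):
    # Stand-in for a transport that discards output.
    return f"NULL discarded {len(data)} bytes for {name}"

# Plug-in table: method name in the XML -> writer function.
METHODS = {"POSIX": write_posix, "NULL": write_null}

def load_methods(xml_text):
    """Map each output group to the writer named in the XML."""
    root = ET.fromstring(xml_text)
    return {g.get("name"): METHODS[g.get("method")] for g in root.findall("group")}

def write(group, data, table):
    """The only call the application makes; the transport comes from the table."""
    return table[group](group, data)

table = load_methods(CONFIG_XML)
print(write("restart", b"abcd", table))
```

Switching `method="POSIX"` to `method="NULL"` in the XML changes which function runs while the `write` call in the application stays the same.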

6 Architecture (BP)
- Individual outputs into “process group” segments
- Metadata indices next
- Index offsets and version flag at end
[File layout: Process Group 1 | Process Group 2 | ... | Process Group n | Process Group Index | Vars Index | Attributes Index | Index Offsets and Version #]
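
The layout described on this slide (data segments first, indices next, a fixed-size footer last) can be sketched as below. This is a toy under stated assumptions, not the actual BP byte format: the JSON index, the 12-byte footer layout, and the function names are all made up for illustration, and only a single index is modeled.

```python
import io
import json
import struct

# Assumed footer layout: 8-byte index offset, then 4-byte version.
FOOTER = struct.Struct("<QI")

def write_file(groups, version=1):
    """Append each process group's payload, then the index, then the footer."""
    buf = io.BytesIO()
    index = []
    for rank, payload in enumerate(groups):
        index.append({"rank": rank, "offset": buf.tell(), "size": len(payload)})
        buf.write(payload)
    index_offset = buf.tell()
    buf.write(json.dumps(index).encode())
    buf.write(FOOTER.pack(index_offset, version))
    return buf.getvalue()

def read_group(blob, rank):
    """Read the footer from the end, locate the index, then seek to one group."""
    index_offset, _version = FOOTER.unpack(blob[-FOOTER.size:])
    index = json.loads(blob[index_offset:-FOOTER.size])
    entry = index[rank]
    return blob[entry["offset"]:entry["offset"] + entry["size"]]

blob = write_file([b"aaa", b"bbbbb"])
print(read_group(blob, 1))
```

Because the footer sits at a known position relative to the end of the file, a reader can find the index without scanning the data segments.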

7 Resilience Features
- Random node failure
  - timeouts
  - mark index entries as suspect
- Root node failure
  - scan file to rebuild index
  - use local size values to find offsets
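
Rebuilding an index by scanning the file and using local size values to find offsets might look roughly like this. The sketch assumes each segment starts with an 8-byte length prefix; that prefix, and the names `pack_groups` and `rebuild_index`, are assumptions for illustration and not the real BP encoding.

```python
import struct

# Assumed per-segment header: payload length as an 8-byte little-endian integer.
SIZE = struct.Struct("<Q")

def pack_groups(payloads):
    """Write each payload preceded by its own local size."""
    return b"".join(SIZE.pack(len(p)) + p for p in payloads)

def rebuild_index(blob):
    """Walk the file front to back, recovering (offset, size) for each segment."""
    index, pos = [], 0
    while pos + SIZE.size <= len(blob):
        (n,) = SIZE.unpack_from(blob, pos)
        start = pos + SIZE.size
        if start + n > len(blob):
            break  # truncated segment: stop rather than index partial data
        index.append((start, n))
        pos = start + n
    return index

blob = pack_groups([b"abc", b"", b"hello"])
print(rebuild_index(blob))  # [(8, 3), (19, 0), (27, 5)]
```

The scan needs nothing from a central index, which is why losing the node that writes the index does not have to mean losing the data.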

8 Data Characteristics
- Identify file contents efficiently
  - min/max
  - local array sizes
- Local-only makes it “free”
  - no communication
- Indices for summaries/direct access
  - copies for resilience
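
A rough sketch of local-only characteristics: each process summarizes its own array with no communication, and a reader uses the collected summaries to decide which segments are worth opening. The function names and the dictionary layout are hypothetical.

```python
def characterize(values):
    """Summarize one process's local array; needs no communication."""
    return {"min": min(values), "max": max(values), "count": len(values)}

def candidates(index, lo, hi):
    """Ranks whose [min, max] range overlaps the query range [lo, hi]."""
    return [r for r, c in enumerate(index) if c["min"] <= hi and c["max"] >= lo]

# Three pretend processes, each holding a local array.
local_arrays = [[1.0, 2.5, 3.0], [10.0, 12.0], [2.0, 11.0]]
index = [characterize(a) for a in local_arrays]
print(candidates(index, 9.0, 10.5))  # [1, 2]
```

The query touches only the small summaries, so segments that cannot contain matching values are skipped without being read.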

9 Architecture (Strategy)
- ADIOS API for flexibility
- Use PHDF-5/PNetCDF during development for “correctness”
- Use POSIX/MPI-IO methods (BP output format) during production runs for performance

10 Performance Overview
- Chimera (supernova) (8192 processes)
  - relatively small writes (~1 MB per process)
  - 1400 seconds pHDF-5 vs. 1.4 seconds POSIX (or 10 seconds MPI-IO independent)
- GTC (fusion) (29,000 processes)
  - 25 GB/sec (out of 40 GB/sec theoretical max) writing restarts
  - 3% of wall clock time spent on IO
  - > 60 TB of total output
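
The ratios implied by these figures can be worked out directly. The numbers below are the ones quoted on the slide; the variable and function names are only for illustration.

```python
# Figures quoted on the slide.
phdf5_s, posix_s, mpiio_s = 1400.0, 1.4, 10.0   # Chimera write times, seconds
gtc_gbps, peak_gbps = 25.0, 40.0                # GTC achieved vs. theoretical max

def speedup(baseline_s, new_s):
    """How many times faster new_s is than baseline_s."""
    return baseline_s / new_s

print(f"POSIX vs pHDF-5: {speedup(phdf5_s, posix_s):.0f}x")    # 1000x
print(f"MPI-IO vs pHDF-5: {speedup(phdf5_s, mpiio_s):.0f}x")   # 140x
print(f"GTC fraction of peak: {gtc_gbps / peak_gbps:.1%}")     # 62.5%
```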

11 Performance Overview
- Collecting characteristics: overhead unmeasurable
  - 10, 50, 100 million entry arrays per process
  - 128-2048 processes
  - weak scaling

12 Performance Overview
- Data conversion
  - Output of the Chimera 8192 process run took 117 seconds on a single process to convert to HDF-5 (compare 1400 seconds to write HDF-5 directly)
  - Other tests have shown conversion time scaling linearly with size
  - Parallel conversion will be faster...
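
As a rough comparison, the two-step path (write with POSIX, convert later) can be set against writing HDF-5 directly. This assumes the 1.4 second POSIX write quoted earlier for the Chimera 8192 process run is the write that precedes the 117 second conversion; the variable names are illustrative.

```python
# Figures quoted in the slides for the Chimera 8192 process run.
write_posix_s = 1.4     # POSIX write during the run (assumed to apply here)
convert_s = 117.0       # single-process conversion to HDF-5 afterwards
direct_hdf5_s = 1400.0  # writing pHDF-5 directly during the run

two_step_s = write_posix_s + convert_s
ratio = direct_hdf5_s / two_step_s

print(f"two-step total: {two_step_s:.1f} s")   # 118.4 s
print(f"direct / two-step: {ratio:.1f}x")      # 11.8x
```

Only the 1.4 second write occupies the compute processes; the conversion runs afterwards on a single process.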

13 Summary
- Use ADIOS API
  - selectively choose consistency
- BP intermediate format
  - performance
  - resilience
  - later convert to HDF-5/NetCDF
- Questions?

