Presentation is loading. Please wait.

Presentation is loading. Please wait.

Petascale I/O Impacts on Visualization Hank Childs Lawrence Berkeley National Laboratory & UC Davis March 24, 2010 27B element Rayleigh-Taylor Instability.

Similar presentations


Presentation on theme: "Petascale I/O Impacts on Visualization Hank Childs Lawrence Berkeley National Laboratory & UC Davis March 24, 2010 27B element Rayleigh-Taylor Instability."— Presentation transcript:

1 Petascale I/O Impacts on Visualization Hank Childs Lawrence Berkeley National Laboratory & UC Davis March 24, 2010 27B element Rayleigh-Taylor Instability (MIRANDA, BG/L) 2 trillion element mesh 2006 2012

2 How does the {peta-, exa-} scale affect visualization? Large # of time steps Large ensembles High-res meshes Large # of variables Your mileage may vary -Are you running full machine? -How much data do you output? Your mileage may vary -Are you running full machine? -How much data do you output?

3 The soon-to-be “good ole days” … how visualization is done right now P0 P1 P3 P2 P8 P7P6 P5 P4 P9 Pieces of data (on disk) ReadProcessRender Processor 0 ReadProcessRender Processor 1 ReadProcessRender Processor 2 Parallelized visualization data flow network P0 P3 P2 P5 P4 P7 P6 P9 P8 P1 Parallel Simulation Code This technique is called “pure parallelism”

4 Pure parallelism performance is based on # bytes to process and I/O rates. Vis is almost always >50% I/O and sometimes 98% I/O Amount of data to visualize is typically O(total mem) FLOPs MemoryI/O Terascale machine “Petascale machine” Two big factors: ① how much data you have to read ② how fast you can read it  Relative I/O (ratio of total memory and I/O) is key

5 Anedoctal evidence: relative I/O really is getting slower. Machine nameMain memoryI/O rate ASC purple49.0TB140GB/s5.8min BGL-init32.0TB24GB/s22.2min BGL-cur69.0TB30GB/s38.3min Petascale machine ?? >40min Time to write memory to disk

6 Why is relative I/O getting slower? “I/O doesn’t pay the bills” —And I/O is becoming a dominant cost in the overall supercomputer procurement. Simulation codes aren’t as exposed. —And will be less exposed with proposed future architectures.

7 Recent runs of trillion cell data sets provide further evidence that I/O dominates 7 ● Weak scaling study: ~62.5M cells/core 7 #coresProblem Size TypeMachine 8K0.5TZ AIX Purple 16K1TZSun LinuxRanger 16K1TZLinuxJuno 32K2TZCray XT5JaguarPF 64K4TZBG/PDawn 16K, 32K1TZ, 2TZCray XT4Franklin 2T cells, 32K procs on Jaguar 2T cells, 32K procs on Franklin -Approx I/O time: 2-5 minutes -Approx processing time: 10 seconds -Approx I/O time: 2-5 minutes -Approx processing time: 10 seconds

8 Summary: what are the challenges? Scale —We can’t read all of the data at full resolution any more… Q: What can we do? A: We need algorithmic change. Insight —How are we going to understand it? (There is a lot more data than pixels!)

9 Multi-resolution techniques use coarse representations then refine. P0 P1 P3 P2 P8 P7P6 P5 P4 P9 Pieces of data (on disk) ReadProcessRender Processor 0 ReadProcessRender Processor 1 ReadProcessRender Processor 2 Parallelized visualization data flow network P0 P3 P2 P5 P4 P7 P6 P9 P8 P1 Parallel Simulation Code P2 P4

10 Multi-resolution: pros and cons Pros —Drastically reduce I/O & memory requirements Cons —Is it meaningful to process simplified version of the data? —How do we generate hierarchical representations? What costs do they incur?

11 In situ processing does visualization as part of the simulation. P0 P1 P3 P2 P8 P7P6 P5 P4 P9 Pieces of data (on disk) ReadProcessRender Processor 0 ReadProcessRender Processor 1 ReadProcessRender Processor 2 P0 P3 P2 P5 P4 P7 P6 P9 P8 P1 Parallel Simulation Code

12 In situ processing does visualization as part of the simulation. P0 P1 P3 P2 P8 P7P6 P5 P4 P9 GetAccess ToData ProcessRender Processor 0 Parallelized visualization data flow network Parallel Simulation Code GetAccess ToData ProcessRender Processor 1 GetAccess ToData ProcessRender Processor 2 GetAccess ToData ProcessRender Processor 9 …………

13 In situ: pros and cons Pros: —No I/O! —Lots of compute power available Cons: —Very memory constrained —Many operations not possible Once the simulation has advanced, you cannot go back and analyze it —User must know what to look a priori Expensive resource to hold hostage!

14 Now we know the tools … what problem are we trying to solve? Three primary use cases: —Exploration —Confirmation —Communication Examples: Scientific discovery Debugging Examples: Scientific discovery Debugging Examples: Data analysis Images / movies Comparison Examples: Data analysis Images / movies Comparison Examples: Data analysis Images / movies Examples: Data analysis Images / movies Multi-res In situ Will still need pure parallelism and possibly other techniques such as data subsetting and streaming.

15 Prepare for difficult conversations in the future. Multi-resolution: —Do you understand what a multi-resolution hierarchy should look like for your data? —Who do you trust to generate it? —Are you comfortable with your I/O routines generating these hierarchies while they write? —How much overhead are you willing to tolerate on your dumps? 33+%? —Willing to accept that your visualizations are not the “real” data?

16 Prepare for difficult conversations in the future. In situ: —How much memory are you willing to give up for visualization? —Will you be angry if the vis algorithms crash? —Do you know what you want to generate a priori? Can you re-run simulations if necessary?

17 Visualization on BlueWaters: Two Scenarios 1)Pure parallelism continues (no SW change): —Visualization and analysis will be done using large portions of BW, users charging against science allocations Lots of time spent doing I/O This increases overall I/O contention on BW —Vis & analysis is slow  people do less (  insights will be lost (?)) 2)Smart techniques deployed: —Allocations are used for simulation, not vis & analysis —Less artificial I/O contention introduced —Ability to explore / interact with data

18 18 18 4 VisIt is a richly featured, turnkey application VisIt is an open source, end user visualization and analysis tool for simulated and experimental data – Used by: physicists, engineers, code developers, vis experts – >100K downloads on web R&D 100 award in 2005 Used “heavily to exclusively” on 8 of world’s top 12 supercomputers 217 pin reactor cooling simulation. Run on ¼ of Argonne BG/P. 1 billion grid points

19 19 19 Terribly Named!! Intended for more than just visualization ! Data Exploration Visual Debugging Quantitative Analysis Presentations = ? Comparative Analysis

20 20 20 VisIt has a rich feature set that can impact many science areas. Meshes: rectilinear, curvilinear, unstructured, point, AMR Data: scalar, vector, tensor, material, species Dimension: 1D, 2D, 3D, time varying Rendering (~15): pseudocolor, volume rendering, hedgehogs, glyphs, mesh lines, etc… Data manipulation (~40): slicing, contouring, clipping, thresholding, restrict to box, reflect, project, revolve, … File formats (~85) Derived quantities: >100 interoperable building blocks +,-,*,/, gradient, mesh quality, if-then-else, and, or, not Many general features: position lights, make movie, etc Queries (~50): ways to pull out quantitative information, debugging, comparative analysis

21 21 21 VisIt employs a parallelized client-server architecture. Client-server observations: —Good for remote visualization —Leverages available resources —Scales well —No need to move data  Additional design considerations: Plugins Multiple UIs: GUI (Qt), CLI (Python), more… remote machine Parallel vis resources User data localhost – Linux, Windows, Mac Graphics Hardware

22 22 22 The VisIt team focuses on making a robust, usable product for end users. Manuals —300 page user manual —200 page command line interface manual —“Getting your data into VisIt” manual Wiki for users (and developers) Revision control, nightly regression testing, etc Executables for all major platforms Day long class, complete with exercises Slides from the VisIt class

23 23 23 VisIt is a vibrant project with many participants. Over 50 person-years of effort Over one million lines of code Partnership between: Department of Energy’s Office of Nuclear Energy, Office of Science, and National Nuclear Security Agency, and among others —NSF XD centers both expected to make large contributions. 2004-6 User community grows, including AWE & ASC Alliance schools Fall ‘06 VACET is funded Spring ‘08 AWE enters repo 2003 LLNL user community transitioned to VisIt 2005 2005 R&D100 2007 SciDAC Outreach Center enables Public SW repo 2007 Saudi Aramco funds LLNL to support VisIt Spring ‘07 GNEP funds LLNL to support GNEP codes at Argonne Summer‘07 Developers from LLNL, LBL, & ORNL Start dev in repo ‘07-’08 UC Davis & UUtah research done in VisIt repo 2000 Project started ‘07-’08 Partnership with CEA is developed 2008 Institutional support leverages effort from many labs Spring ‘09 More developers Entering repo all the time

24 24 VisIt: What’s the Big Deal? Everything works at scale Robust, usable tool Vis to code development to scientific insight 24

25 VisIt and the smart data techiques Full pure parallelism implementation Data subsetting well integrated —(if you set up your data properly) In situ: yes, but … it’s a memory hog Multi-res: emerging effort in this space

26 26 Three Ways To Get Data Into VisIt (1) Write to a known output format (2) Write a plugin file format reader (3) Integrate VisIt “in situ” —“lib-VisIt” is linked into simulation code (Note: Memory footprint issues with implementation!) —Use model: simulation code advances at some time interval (e.g. end of cycle), hands control to lib-VisIt. lib-VisIt performs vis & analysis tasks, then hands control back to simulation code repeat 26

27 Summary Massive data will force algorithmic change in visualization tools. The VisIt project is making progress on bringing these algorithmic changes to the user community. Contact info: —Hank Childs, LBL & UC Davis —hchilds@lbl.gov / childs@cs.ucdavis.eduhchilds@lbl.govchilds@cs.ucdavis.edu —http://vis.lbl.gov/~hrchilds Questions??


Download ppt "Petascale I/O Impacts on Visualization Hank Childs Lawrence Berkeley National Laboratory & UC Davis March 24, 2010 27B element Rayleigh-Taylor Instability."

Similar presentations


Ads by Google