Ultra-Scale Visualization Workshop November 13, 2006 VAPOR Visualization and Analysis Platform for Ocean, atmosphere, and solar Research.

Slides:



Advertisements
Similar presentations
Supercomputing Challenges at the National Center for Atmospheric Research Dr. Richard Loft Computational Science Section Scientific Computing Division.
Advertisements

National Center for Atmospheric Research John Clyne 4/27/11 4/26/20111.
GPGPU Introduction Alan Gray EPCC The University of Edinburgh.
SC05 November, 2005 Desktop techniques for the exploration of terascale size, time-varying data sets John Clyne & Alan Norton Scientific.
January 5, Feature Tracking in VR for Cumulus Cloud Life-Cycle Studies E. J. Griffith, F. H. Post, M. Koutek, T. Heus and H. J. J. Jonker 11 th.
Modeling the Magnetic Field Evolution of the December Eruptive Flare Yuhong Fan High Altitude Observatory, National Center for Atmospheric Research.
1 Wavelets and compression Dr Mike Spann. 2 Contents Scale and image compression Signal (image) approximation/prediction – simple wavelet construction.
Computational Astrophysics: Methodology 1.Identify astrophysical problem 2.Write down corresponding equations 3.Identify numerical algorithm 4.Find a computer.
SSL (UC Berkeley): Prospective Codes to Transfer to the CCMC Developers: W.P. Abbett, D.J. Bercik, G.H. Fisher, B.T. Welsch, and Y. Fan (HAO/NCAR)
Why does the temperature of the Sun’s atmosphere increase with height? Evidence strongly suggests that magnetic waves carry energy into the chromosphere.
1 Presenter: Chien-Chih Chen Proceedings of the 2002 workshop on Memory system performance.
A Multiresolution Volume Rendering Framework for Large-Scale Time- Varying Data Visualization Chaoli Wang 1, Jinzhu Gao 2, Liya Li 1, Han-Wei Shen 1 1.
RACE: Time Series Compression with Rate Adaptivity and Error Bound for Sensor Networks Huamin Chen, Jian Li, and Prasant Mohapatra Presenter: Jian Li.
1 Motivation Video Communication over Heterogeneous Networks –Diverse client devices –Various network connection bandwidths Limitations of Scalable Video.
Real Parallel Computers. Modular data centers Background Information Recent trends in the marketplace of high performance computing Strohmaier, Dongarra,
Digital Graphics and Computers. Hardware and Software Working with graphic images requires suitable hardware and software to produce the best results.
SC05 November, 2005 Desktop Techniques for the Exploration of Terascale Sized Turbulence Data Sets John Clyne Scientific Computing.
Interactive Visualization of Volumetric Data on Consumer PC Hardware: Introduction Daniel Weiskopf Graphics Hardware Trends Faster development than Moore’s.
Lossy compression of structured scientific data sets
Alan Norton Multiresolution Visualization and Analysis of Turbulence using VAPOR Alan Norton NCAR/CISL Boulder, CO USA Turbulent Theory and Modeling: GTP.
Compression is the reduction in size of data in order to save space or transmission time. And its used just about everywhere. All the images you get on.
1 Chapter No 3 ICT IN Science,Maths,Modeling, Simulation.
Alok 1Northwestern University Access Patterns, Metadata, and Performance Alok Choudhary and Wei-Keng Liao Department of ECE,
Towards a Comprehensive Environment for Data Analysis and Visualization Pamela Gillman, John Clyne Users Forum May 19, 2005.
Alan Norton Software Engineering and VAPOR Alan Norton National Center for Atmospheric Research Boulder, CO USA SEA Seminar June 30, 2011.
SCD User Forum 2005 Dynamically adaptive geophysical fluid dynamics simulation using GASpAR: Geophysics/Astrophysics Spectral-element Adaptive Refinement.
HPDC 2014 Supporting Correlation Analysis on Scientific Datasets in Parallel and Distributed Settings Yu Su*, Gagan Agrawal*, Jonathan Woodring # Ayan.
A Survey of Wavelet Algorithms and Applications, Part 2 M. Victor Wickerhauser Department of Mathematics Washington University St. Louis, Missouri
CCGrid 2014 Improving I/O Throughput of Scientific Applications using Transparent Parallel Compression Tekin Bicer, Jian Yin and Gagan Agrawal Ohio State.
High Performance Computing Processors Felix Noble Mirayma V. Rodriguez Agnes Velez Electric and Computer Engineer Department August 25, 2004.
1 Computer Programming (ECGD2102 ) Using MATLAB Instructor: Eng. Eman Al.Swaity Lecture (1): Introduction.
MATLAB
1 © 2012 The MathWorks, Inc. Parallel computing with MATLAB.
A study of relations between activity centers of the climatic system and high-risk regions Vladimir Penenko & Elena Tsvetova.
Lifting Part 1: Introduction Ref: SIGGRAPH96. Outline Introduction to wavelets and lifting scheme Basic Ideas –Split, Predict, Update –In-place computation.
3SAQS Technical Workshop February 25 th, 2015 Data Warehouse Current and Future Development Intermountain West Data Warehouse.
ARGONNE NATIONAL LABORATORY Climate Modeling on the Jazz Linux Cluster at ANL John Taylor Mathematics and Computer Science & Environmental Research Divisions.
Lecture 16: Reconfigurable Computing Applications November 3, 2004 ECE 697F Reconfigurable Computing Lecture 16 Reconfigurable Computing Applications.
Module 2 : Part 1 INTRODUCTION TO HARDWARE & SOFTWARE INTRODUCTION TO HARDWARE & SOFTWARE.
Mark Rast Laboratory for Atmospheric and Space Physics Department of Astrophysical and Planetary Sciences University of Colorado, Boulder Kiepenheuer-Institut.
Investigating Adaptive Compilation using the MIPSpro Compiler Keith D. Cooper Todd Waterman Department of Computer Science Rice University Houston, TX.
CCGrid 2014 Improving I/O Throughput of Scientific Applications using Transparent Parallel Compression Tekin Bicer, Jian Yin and Gagan Agrawal Ohio State.
Performance evaluation on grid Zsolt Németh MTA SZTAKI Computer and Automation Research Institute.
High performance parallel computing of climate models towards the Earth Simulator --- computing science activities at CRIEPI --- Yoshikatsu Yoshida and.
SAGA: Array Storage as a DB with Support for Structural Aggregations SSDBM 2014 June 30 th, Aalborg, Denmark 1 Yi Wang, Arnab Nandi, Gagan Agrawal The.
© 2009 IBM Corporation Motivation for HPC Innovation in the Coming Decade Dave Turek VP Deep Computing, IBM.
Computing Environment The computing environment rapidly evolving ‑ you need to know not only the methods, but also How and when to apply them, Which computers.
VAPoR: A Discovery Environment for Terascale Scientific Data Sets Alan Norton & John Clyne National Center for Atmospheric Research Scientific Computing.
Marwan Al-Namari 1 Digital Representations. Bits and Bytes Devices can only be in one of two states 0 or 1, yes or no, on or off, … Bit: a unit of data.
D. Heynderickx DH Consultancy, Leuven, Belgium 22 April 2010EuroPlanet, London, UK.
STATISTICS AND OPTIMIZATION Dr. Asawer A. Alwasiti.
Havok FX Physics on NVIDIA GPUs. Copyright © NVIDIA Corporation 2004 What is Effects Physics? Physics-based effects on a massive scale 10,000s of objects.
Large Scale Time-Varying Data Visualization Han-Wei Shen Department of Computer and Information Science The Ohio State University.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
S.Klimenko, LSC March, 2001 Update on Wavelet Compression Presented by S.Klimenko University of Florida l Outline Ø Wavelet compression concept E2 data.
National Center for Supercomputing Applications University of Illinois at Urbana–Champaign Visualization Support for XSEDE and Blue Waters DOE Graphics.
LCSE – NCSA Partnership Accomplishments, FY01 Paul R. Woodward Laboratory for Computational Science & Engineering University of Minnesota October 17, 2001.
Xin Tong, Teng-Yok Lee, Han-Wei Shen The Ohio State University
1 Rocket Science using Charm++ at CSAR Orion Sky Lawlor 2003/10/21.
Tackling I/O Issues 1 David Race 16 March 2010.
SIMD Implementation of Discrete Wavelet Transform Jake Adriaens Diana Palsetia.
Visualization in Scientific Computing (or Scientific Visualization) Multiresolution,...
Data Summit 2016 H104: Building Hadoop Applications Abhik Roy Database Technologies - Experian LinkedIn Profile:
TEMPLATE DESIGN © H. Che 2, E. D’Azevedo 1, M. Sekachev 3, K. Wong 3 1 Oak Ridge National Laboratory, 2 Chinese University.
VisIt Project Overview
HDF5 October 8, 2017 Elena Pourmal Copyright 2016, The HDF Group.
Yu Su, Yi Wang, Gagan Agrawal The Ohio State University
with Computational Scientists
TeraScale Supernova Initiative
The C&C Center Three Major Missions: In This Presentation:
Presentation transcript:

Ultra-Scale Visualization Workshop November 13, 2006 VAPOR Visualization and Analysis Platform for Ocean, atmosphere, and solar Research SC06 Ultra-Scale Visualization Workshop John Clyne & Alan Norton Scientific Computing Division National Center for Atmospheric Research Boulder, CO USA This work is funded in part through a U.S. National Science Foundation, Information Technology Research program grant

Ultra-Scale Visualization Workshop November 13, 2006 [Numerical] models that can currently be run on typical supercomputing platforms produce data in amounts that make storage expensive, movement cumbersome, visualization difficult, and detailed analysis impossible. The result is a significantly reduced scientific return from the nation's largest computational efforts. Mark Rast University of Colorado, LASP Problem Numerical simulations in the earth sciences have reached such extraordinary sizes that researchers can no longer effectively extract insight from their simulation outputs. Result: loss of scientific productivity!!!

Ultra-Scale Visualization Workshop November 13, 2006 Dichotomy of simulation and analysis needs and resources in today’s HPC environments SimulationAnalysis Large systems (O(1000) processors)Small systems (O(10) processors) Batch computing modelInteractive computing model Modest on-line storage requirements Large on-line storage requirements CPU/interconnect boundIO bound Highly tuned, custom, parallel codes COTS serial software applications Historical focus of centersEmerging focus of centers

Ultra-Scale Visualization Workshop November 13, 2006 A sampling of various technology performance curves Not all technologies advance at same rate Impact of parallelization not shown

Ultra-Scale Visualization Workshop November 13, 2006 Communication limits for volume rendering assuming theoretical peak performance GFX Memory 76 GB/sec (GeForce 7900 GX2) CPU Memory 10 GB/sec (AMD Opteron 1000 series) PCI Express 16x 4 GB/sec SATA GB/sec Table shows limits expressed as frames per second imposed by communication alone Assumes only 8-bit data quantities

Ultra-Scale Visualization Workshop November 13, 2006 Visualization and Analysis Platform for Ocean, atmosphere, and solar Research (VAPOR) Key components 1.Domain specific application focus: simulated earth sciences fluid flow 2.Coupled Visualization and quantitative data interrogation and manipulation capabilities 3.Multiresolution enabled terascale data exploration on the desktop Combination of visualization with multiresolution data representation that provide sufficient data reduction to enable interactive work on terascale data from a desktop Visual data browsing Data manipulation Quantitative analysis Refine Coarsen

Ultra-Scale Visualization Workshop November 13, 2006 Fluid flow in the geosciences E.g. Numerically simulated turbulence –Cartesian grids (usually) to Up to “hero” calculations –5 to 8 variables Temperature & Pressure Velocity field components Magnetic field components (MHD calculations) –Hundreds of time steps saved Terabytes of data per experiment –Numerical “experiments” Substantial analysis requirements Yannick Ponty, CNRS 2006

Ultra-Scale Visualization Workshop November 13, 2006 Key Component (1) : Domain specific support Only limited support for: –Grid & data types Cartesian grids, stretched and uniform sampling AMR grids Scalar and vector quantities –Visualization algorithms Volume rendering, flow visualization, cutting planes/probe –Misc. Publication quality graphics Filters File formats (one!) Extensive support for: –Time varying data Uniform as well as non-uniform sampling Missing time steps –Quantitative investigation Mathematical operators and data manipulators –Science driven specialized features Keep it simple! Keep it focused! Make it scientist friendly!

Ultra-Scale Visualization Workshop November 13, 2006 Interactive exploration of time varying data Reduce bandwidth requirements –Regions of interest –Multiresolution –Caching GFX Memory 76 GB/sec (GeForce 7900 GX2) CPU Memory 10 GB/sec (AMD Opteron 1000 series) PCI Express 16x 4 GB/sec SATA GB/sec

Ultra-Scale Visualization Workshop November 13, 2006 Future??? VAPOR Interactive visual browsing IDL Data manipulation & analysis VAPOR Data Collection Disk Array Multi-resolution access and rapid sub-region extraction Key Component (2) : Coupled visualization, quantitative analysis and manipulation capabilities IDL - array based 4GL for scientific data processing –Thousands of mathematical functions –Basic 2D plotting –Array manipulation

Ultra-Scale Visualization Workshop November 13, 2006 Key component (3) : Multiresolution data access Wavelet transformed data –Two parameter linear function decomposition –Hierarchical data representation –Invertible and lossless –Numerically efficient (O(n)) forward and inverse transform –No additional storage cost Enable speed/quality tradeoffs 504x504x2048 Full 252x252x1024 1/8 126x126x512 1/64 63x63x256 1/512

Ultra-Scale Visualization Workshop November 13, 2006 Visual comparison of a compressible convection simulation coarsened M. Rast, native

Ultra-Scale Visualization Workshop November 13, 2006 Performance of forward and inverse Haar wavelet transform System Linux RHEL x Intel 3.4 GHz Xeon EMT64 8 GBs RAM 1Gb/sec Fibre Channel storage Data Scalar Single precision Gains in microprocessor technology enable transforms at very low cost

Ultra-Scale Visualization Workshop November 13, 2006 VAPOR Demo

Ultra-Scale Visualization Workshop November 13, 2006 Summary VAPOR is a domain-specific platform for analysis, not a general purpose visualization tool Target users: fluid flow researchers in earth sciences –Limited value for medical, oil & gas, aerospace, etc. Desktop data exploration of terabyte data possible –Visualization enables rapid ROI identification –Multiresolution enables speed/quality tradeoffs

Ultra-Scale Visualization Workshop November 13, 2006 Acknowledgements Steering Committee –Nic Brummell - CU –Yuhong Fan - NCAR, HAO –Aimé Fournier – NCAR, IMAGe –Pablo Mininni, NCAR, IMAGe –Aake Nordlund, University of Copenhagen –Helene Politano - Observatoire de la Cote d'Azur –Yannick Ponty - Observatoire de la Cote d'Azur –Annick Pouquet - NCAR, ESSL –Mark Rast - CU –Duane Rosenberg - NCAR, IMAGe –Matthias Rempel - NCAR, HAO –Geoff Vasil, CU Developers –Alan Norton – NCAR, SCD –John Clyne – NCAR, SCD –Kenny Gruchalla - CU Research Collaborators –Kwan-Liu Ma, U.C. Davis –Hiroshi Akiba, U.C. Davis –Han-Wei Shen, Ohio State –Liya Li, Ohio State Systems Support –Joey Mendoza, NCAR, SCD

Ultra-Scale Visualization Workshop November 13, 2006 Questions???

Ultra-Scale Visualization Workshop November 13, 2006 Inverse Haar transform with 1/8 th volume subregion extraction System Linux RHEL x Intel 3.4 GHz Xeon EM64 8 GBs RAM 1Gb/sec Fibre Channel storage Data Scalar Single precision Data blocking permits rapid subregion extraction

Ultra-Scale Visualization Workshop November 13, 2006 The Lifting Method of wavelet construction in the spatial domain [Sweldens, 95] Split Predict Update Transform 1 Transform 2Transform j 1) Split: 2) Predict: 3) Update: Split signal into even (λ) and odd (γ) coefficients. λ will contain low frequency information, γ will contain high frequency information. Local correlation permits prediction of odd samples by even using a prediction operator, P. Capture difference between prediction and actual coefficient value. Update λ coefficients to preserve a property (e.g. mean) of original signal. A signal λ j consisting of 2 j samples

Ultra-Scale Visualization Workshop November 13, 2006 Example: Lifting Method with the Haar Wavelet λ 3,k λ 2,k λ 1,k λ 0,k γ 2,k γ 0,k γ 1,k Haar operators

Ultra-Scale Visualization Workshop November 13, 2006 NCAR Historical Estimated Sustained GFLOPS (Batch Production Systems)

Ultra-Scale Visualization Workshop November 13, 2006 NCAR Historical Estimated Sustained GFLOPS (Interactive Production Systems) Current NCAR visualization and analysis resources –~32 processors 8 nodes (6 with gfx) –~100 TB on-line storage –~800 MBs/sec aggregate storage bandwidth –~100 users (99 of which will not leave office)