VAPoR: A Discovery Environment for Terascale Scientific Data Sets Alan Norton & John Clyne National Center for Atmospheric Research Scientific Computing.

Slides:



Advertisements
Similar presentations
Designing Services for Grid-based Knowledge Discovery A. Congiusta, A. Pugliese, Domenico Talia, P. Trunfio DEIS University of Calabria ITALY
Advertisements

Earth System Curator Spanning the Gap Between Models and Datasets.
Metadata Development in the Earth System Curator Spanning the Gap Between Models and Datasets Rocky Dunlap, Georgia Tech.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
SC05 November, 2005 Desktop techniques for the exploration of terascale size, time-varying data sets John Clyne & Alan Norton Scientific.
SDSC Computing the 21st Century Talk Given to the NSF Sugar Panel May 27, 1998.
CPSC 695 Future of GIS Marina L. Gavrilova. The future of GIS.
UCB-SSL Progress Report for the Joint CCHM/CWMM Workshop W.P. Abbett, G.H. Fisher, and B.T. Welsch.
/department of mathematics and computer science Visualization of Transition Systems Hannes Pretorius Visualization Group
E. WES BETHEL (LBNL), CHRIS JOHNSON (UTAH), KEN JOY (UC DAVIS), SEAN AHERN (ORNL), VALERIO PASCUCCI (LLNL), JONATHAN COHEN (LLNL), MARK DUCHAINEAU.
SpaceGRID and EGSO Satu Keski-Jaskari Maria Vappula Parallal Computing – Seminar
Ultra-Scale Visualization Workshop November 13, 2006 VAPOR Visualization and Analysis Platform for Ocean, atmosphere, and solar Research.
IS&T Scientific Visualization Tutorial - Summer 2010 Scientific Visualization Tutorial.
fluidyn – PANAIR Fluidyn-PANAIR
Current Visualization Software NCL, Amira, and OpenDX By Drew Brumm.
UNIVERSITY of MARYLAND GLOBAL LAND COVER FACILITY High Performance Computing in Support of Geospatial Information Discovery and Mining Joseph JaJa Institute.
SC05 November, 2005 Desktop Techniques for the Exploration of Terascale Sized Turbulence Data Sets John Clyne Scientific Computing.
Computing in Atmospheric Sciences Workshop: 2003 Challenges of Cyberinfrastructure Alan Blatecky Executive Director San Diego Supercomputer Center.
Lossy compression of structured scientific data sets
Alan Norton Multiresolution Visualization and Analysis of Turbulence using VAPOR Alan Norton NCAR/CISL Boulder, CO USA Turbulent Theory and Modeling: GTP.
ROOT: A Data Mining Tool from CERN Arun Tripathi and Ravi Kumar 2008 CAS Ratemaking Seminar on Ratemaking 17 March 2008 Cambridge, Massachusetts.
An Extensible Python User Environment Jeff Daily Karen Schuchardt, PI Todd Elsethagen Jared Chase H41G-0956 Website Acknowledgements.
Towards a Comprehensive Environment for Data Analysis and Visualization Pamela Gillman, John Clyne Users Forum May 19, 2005.
DEVSView: A DEVS Visualization Tool Wilson Venhola.
Institute for Mathematical Modeling RAS 1 Visualization in distributed systems. Overview. Remote visualization means interactive viewing of three dimensional.
Updates from EOSDIS -- as they relate to LANCE Kevin Murphy LANCE UWG, 23rd September
NE II NOAA Environmental Software Infrastructure and Interoperability Program Cecelia DeLuca Sylvia Murphy V. Balaji GO-ESSP August 13, 2009 Germany NE.
Rendering Adaptive Resolution Data Models Daniel Bolan Abstract For the past several years, a model for large datasets has been developed and extended.
1 Computer Programming (ECGD2102 ) Using MATLAB Instructor: Eng. Eman Al.Swaity Lecture (1): Introduction.
Where are the Academic Jobs ? Interactive Exploration of Job Advertisements in Geospatial and Topical Space Angela M. Zoss 1, Michael Conover 2 and Katy.
14 Aug 08DOE Review John Huth ATLAS Computing at Harvard John Huth.
Visualization Workshop David Bock Visualization Research Programmer National Center for Supercomputing Applications - NCSA University of Illinois at Urbana-Champaign.
Tone Mapping on GPUs Cliff Woolley University of Virginia Slides courtesy Nolan Goodnight.
ARGONNE NATIONAL LABORATORY Climate Modeling on the Jazz Linux Cluster at ANL John Taylor Mathematics and Computer Science & Environmental Research Divisions.
NIH NCRR Overview The SCIRun and BioPSE Problem Solving Environments Chris Johnson, Rob MacLeod, and David Weinstein Scientific Computing and Imaging Institute.
NIH Resource for Biomolecular Modeling and Bioinformatics Beckman Institute, UIUC NAMD Development Goals L.V. (Sanjay) Kale Professor.
Mark Rast Laboratory for Atmospheric and Space Physics Department of Astrophysical and Planetary Sciences University of Colorado, Boulder Kiepenheuer-Institut.
A High performance I/O Module: the HDF5 WRF I/O module Muqun Yang, Robert E. McGrath, Mike Folk National Center for Supercomputing Applications University.
Common Archive Observation Model (CAOM) What is it and why does JWST care?
Milestones, Feedback, Action Items Power Aware Distributed Systems Kickoff August 23, 2000.
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
Efficient Local Statistical Analysis via Integral Histograms with Discrete Wavelet Transform Teng-Yok Lee & Han-Wei Shen IEEE SciVis ’13Uncertainty & Multivariate.
NACP A High-Resolution Daily Surface Weather Database for NACP Investigations Peter E. Thornton 1, Robert B. Cook 2, W. Mac Post 2, Bruce E. Wilson 2,
Volume Visualization Presented by Zhao, hai. What’ volume visualization Volume visualization is the creation of graphical representations of data sets.
GEOSCIENCE NEEDS & CHALLENGES Dogan Seber San Diego Supercomputer Center University of California, San Diego, USA.
Center for Computational Visualization University of Texas, Austin Visualization and Graphics Research Group University of California, Davis Molecular.
Computational Science & Engineering meeting national needs Steven F. Ashby SIAG-CSE Chair March 24, 2003.
Visualization in Problem Solving Environments Amit Goel Department of Computer Science Virginia Tech June 14, 1999.
Mantid Stakeholder Review Nick Draper 01/11/2007.
Visualization with ParaView. Before we begin… Make sure you have ParaView 3.14 installed so you can follow along in the lab section –
Large Scale Time-Varying Data Visualization Han-Wei Shen Department of Computer and Information Science The Ohio State University.
Earth System Curator and Model Metadata Discovery and Display for CMIP5 Sylvia Murphy and Cecelia Deluca (NOAA/CIRES) Hannah Wilcox (NCAR/CISL) Metafor.
Presented by Visualization at the Leadership Computing Facility Sean Ahern Scientific Computing Center for Computational Sciences.
Power and Cooling at Texas Advanced Computing Center Tommy Minyard, Ph.D. Director of Advanced Computing Systems 42 nd HPC User Forum September 8, 2011.
Visualizing the Cosmos: Smoke or Mirrors? Designing Visualization Imagery David Bock National Center for Supercomputing Applications University of Illinois,
O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Data Requirements for Climate and Carbon Research John Drake, Climate Dynamics Group Computer.
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
Scientific Computing Goals Past progress Future. Goals Numerical algorithms & computational strategies Solve specific set of problems associated with.
Visualization in Scientific Computing (or Scientific Visualization) Multiresolution,...
67 x 89 = ? 67 x
Evaluating Climate Visualization: An Information Visualization Approach -By Mridul Sen 1.
Using Cyberinfrastructure to Study the Earth’s Climate and Air Quality Don Wuebbles Department of Atmospheric Sciences University of Illinois, Urbana-Champaign.
VisIt Project Overview
Document Visualization at UMBC
with Computational Scientists
The SCIRun and BioPSE Problem Solving Environments
Metadata Development in the Earth System Curator
Data Management Components for a Research Data Archive
Scientific Visualization
Introduction to Scientific Visualization
Presentation transcript:

VAPoR: A Discovery Environment for Terascale Scientific Data Sets Alan Norton & John Clyne National Center for Atmospheric Research Scientific Computing Division

Alan Norton VAPoR project overview l GOAL: Improve ability of earth sciences researchers to analyze and explore complex dynamics found in vast data sets resulting from high-resolution, 4D (3D space x time) numerical simulations of fluid flow in the oceans, atmosphere and sun l Open Source software development effort aimed at bridging gap between visualization research and application tool availability l Funded by the NSF ITR program, VAPoR is a collaborative effort between NCAR’s Visualization and Enabling Technology Section, U.C. Davis’ Institute for Data Analysis and Visualization, and Ohio State University’s, Department of Computer and Information Sciences

Alan Norton The numbers – an example from solar compressible convection l 512x512x2048 l 5 variables (u,v,w,rho,temp) l ~500 time steps saved l 9TBs storage l Six months compute time required on 112 IBM SP RS/6000 processors (blackforest) l Three months for post- processing l Data may be analyzed for several years Image courtesy of Joseph Mendoza, NCAR/SCD

Alan Norton Our approach l Exploit power of multiresolution data representation combined with progressive data access and efficient region sub-setting to enable user to make speed/quality tradeoffs [ Pascucci and Frank, SC2001; Clyne, VIIP2003] l Tightly couple traditional, non-visual (MatLab, IDL) analysis with highly interactive, advanced visualization [Clyne and Rast, VDA2005] l Develop domain-specific application for numerical simulation of fluid flow l Emphasize usability for scientists

Alan Norton A visual comparison between coarsened and original data

Alan Norton Combining visual and non-visual analysis tools l When visualization and quantitative tools are appropriately combined, result is a powerful environment for scientific discovery l Couple existing quantitative tools (e.g. IDL, Matlab, NCL) with VAPoR’s advanced visualization capabilities l Benefits of integration with existing analysis tools –Avoids duplication of effort –Lower development time –Leverage what scientists already know

Alan Norton Integrated analysis and visualization system Vtk High quality rendering VAPoR Interactive visual browsing IDL Data manipulation & analysis Multiresolution data access Disk Array

Alan Norton Software Design Priorities l Functional and usability requirements prioritized by scientific users l Cross-platform (Linux, Irix, Windows, Mac,…) l Enable side-by-side comparison of multiple time-varying visualizations –Unique approach to management of multiple parameter sets l Exploit high-performance graphics cards –GPU programming l Incorporate recent advances in data visualization –Volume rendering –Flow visualization –Isosurface generation –Color/transparency mapping l XML data description –Encourage wider usage of multi-resolution data representation

Alan Norton Demonstration l Load multiresolution data into VAPoR application l Combine global and local parameter settings l Edit transfer function(s) l Interactive region selection l Animation l Export to IDL l Use IDL for analysis l Visualize analysis results

Alan Norton Future Plans l Incorporate visualization techniques based on scientists’ needs –Nonuniform grids –Vector field visualization –Isosurfaces –Contour planes l Understand effect of lossy data compression –Error analysis and error visualization –Obtain bounds on degradation of analysis results –Measure the effect of performing analysis on lower-resolution data l Improve access to terabyte datasets –Multiresolution data output as a byproduct of the simulation

Alan Norton Summary l NSF funded collaboration with Davis and Ohio state l Domain specific application aimed at improving productivity of turbulence researchers (though other groups may benefit) l Scientists are target user group (not visualization specialists) l Multiresolution data representation exploited for handling large data l Quantitative and advanced visualization methods combined l Usability is key to adoption by scientists I can directly testify to the crucial importance of this [effort] to our numerical work... to develop interactive tools that can access and analyze the flow in its entirety, and visualize it in complex ways including parallelized algorithms of perspective volume rendering, superposition of scalar and vector fields, and multi-resolution access to the flows. Annick Pouquet, Director of ESSL