Presentation is loading. Please wait.

Presentation is loading. Please wait.

Unidata 2008: Shaping the Future of Data Use in the Geosciences Expanding Horizons: Using Environmental Data for Education, Research, and Decision Making.

Similar presentations


Presentation on theme: "Unidata 2008: Shaping the Future of Data Use in the Geosciences Expanding Horizons: Using Environmental Data for Education, Research, and Decision Making."— Presentation transcript:

1

2 Unidata 2008: Shaping the Future of Data Use in the Geosciences Expanding Horizons: Using Environmental Data for Education, Research, and Decision Making 23 June 2003 Boulder, CO Mohan Ramamurthy Unidata Program Center UCAR Office of Programs Boulder, CO

3 Thank you, Dave I wish to take this opportunity to extend my sincerest gratitude to Dave Fulker, the founding director of Unidata, for his distinguished service to the Unidata Community for nearly 20 years. Unidata would not be what it is today without his vision, leadership, energy and his many extraordinary qualities. And thank you to Ben Domenico for his excellent stewardship during the transition.

4 The Word of the Day for Jun 23 rd is The Word of the Day for Jun 23 is: bloviate \BLOH-vee-ayt\ verb : to speak or write verbosely and windily [Courtesy: Jo Hansen, Unidata Program Center] Example sentence: Mohan can bloviate on a par with the windiest of professors, but he's also capable of being concise and getting right to the point. (yeah, right)

5 Expanding Horizons New Strategic Plan New Director New 5-year proposal Many new and exciting initiatives New logo!

6 Unidata Mission Statement: Mission Statement: Provide data, tools, and community leadership for enhanced Earth-system education and research. At the Unidata Program Center, we Facilitate Data Access Facilitate Data Access Provide Tools Provide Tools Support Faculty and Staff Support Faculty and Staff Build and Advocate for a Community  user workshops come under this activity Build and Advocate for a Community  user workshops come under this activity

7 Technology Portfolio 1) McIDAS: A client/server analysis and display package, originally developed by U. Wisconsin/SSEC, that emphasizes image processing of data from satellite-borne sensors; 2) GEMPAK: An analysis, display, and product generation package for meteorological data; 3) Local Data Manager: Software for capturing, disseminating, and organizing data in near-real time; It is the h eart of the Internet Data Distribution (IDD) system; 4) NetCDF: A software interface for platform- independent access to self describing datasets; 5) Integrated Data Viewer: Java-based, platform- independent data analysis and 3D visualization tools; 6) THREDDS: A project to facilitate remote access to thematic, distributed, interdisciplinary data servers;

8 Unidata as a Diverse Community About 150+ sites are participating in Unidata Internet Data Distribution (IDD) system About 150+ sites are participating in Unidata Internet Data Distribution (IDD) system 120 or so of those sites are in academia and the rest in government and research labs120 or so of those sites are in academia and the rest in government and research labs User community is interdisciplinary - 2/3rd of sites have users outside atmospheric sciences

9 Internet Data Distribution Radar Model Satellite Approximately 2 GB of data injected/hour from distributed sources; Unidata IDD/LDM uses more of the Internet2 than any other advanced application; Approx. 5 Terabytes of data transmitted each week. (Amount varies with weather) By design, the system has no data center.

10 Proposed WSR-88D Data Flow (NWS Plans)

11 Education Drivers (a.k.a. A Community-Articulated Need) Active, student- centered learning Earth-system science or “holistic” approach to education Learning science by doing science Observations (data) Tools (models, visualization) Discovery

12 NSF Director Rita Colwell, 1998: "Interdisciplinary connections are absolutely fundamental. They are synapses in this new capability to look over and beyond the horizon. Interfaces of the sciences are where the excitement will be the most intense...." Science Drivers Grand Challenges in Environmental Sciences National Research Council

13 Multidisciplinary Problems Fire Danger determination requires taking into account past, present and future weather, fuel types, and the state of both live and dead fuel moisture. Dead Fuel Moisture Live fuel moisture (NDVI) Drought conditions Atmospheric stability Lightning maps Lightning ignition efficiency Airflow Recent rainfall Rainfall forecast

14 Dual-Polarization Radar use in Fire Weather Management The differential reflectivity (ZDR) values are noteworthy in the smoke signal The differential reflectivity (ZDR) values are noteworthy in the smoke signal Many regions show ZDR >+6 dB. Many regions show ZDR >+6 dB. Suggests flattened ash particles (like corn flakes) Suggests flattened ash particles (like corn flakes) Source: CHILL Radar Group, CSU

15 Flooding due to Tropical Storms Tropical Storm Allison Research studies and emergency management of hurricane-induced flooding involve integrating data from atmospheric sciences, oceanography, hydrology, geology, geography, and social sciences.

16 Multidisciplinary Synthesis Requires integration of disparate datasets and databases from diverse sources that are distributed geographically and disciplinarily; Requires integration of disparate datasets and databases from diverse sources that are distributed geographically and disciplinarily; Needs integration of Scientific Information Systems with Geographic Information Systems Needs integration of Scientific Information Systems with Geographic Information Systems The integration poses numerous challenges; The integration poses numerous challenges; However, such integration is critical to solving societal problems and advancing science. However, such integration is critical to solving societal problems and advancing science. Metadata is crucial to achieving integration Metadata is crucial to achieving integration

17 Remote Sensing & Data Explosion In the next 10 years, about 100 new satellite instruments will be launched to monitor the environment Five-order magnitude increase in satellite data is expected during that period GIFTS (Geostationary Imaging Fourier Transform Spectrometer) will have about 1700 channels and a resolution of 4 km Each NPOESS satellite will generate one terabyte of data each day Advances in Radar technology 28 fold increase in WSR-88D data volume in 5 years Phased-array radars will generate 100 fold increase data By 2004, NOAA will ingest more data in one year than was contained in the total archive in 1998.

18 Advances in Modeling Shift from a purely deterministic to a more probabilistic approach, requiring the use of ensemble modeling techniques. Growing emphasis on multidisciplinary studies, requiring coupled models: e.g., Hurricane landfall flooding problem: Atmospheric model (WRF/MM5), Ocean model (ROMS), Hydrologic model (HMS)

19 Local Modeling: A Notable Trend Over 30 universities are now running mesoscale models locally. One can think of this aggregation as a national forecasting instrument However, only one or two groups initializing their model runs with local observations As the scale of these local model runs becomes finer, there is a natural desire to integrate their output with information from other sources (e.g., hydrology, infrastructure, societal datasets in GIS form) Iowa St. Linux Cluster

20 Technology Drivers Object-oriented programming Object-oriented programming Open Standards, Interoperability and Open Source Movement Open Standards, Interoperability and Open Source Movement.Metcalfe's Law: the usefulness, or utility, of a network increases as the square of the number of users. Web services (HTTP, Java, XML, SOAP, UDDI, …) Web services (HTTP, Java, XML, SOAP, UDDI, …) Digital libraries (Metadata, discovery, information services…) Digital libraries (Metadata, discovery, information services…) Grid environments and distributed computing Grid environments and distributed computing Commodity microprocessors Commodity microprocessors Cluster computing Cluster computing High bandwidth networks: 10GigE, Fast IP, … High bandwidth networks: 10GigE, Fast IP, … Broadband access Broadband access Wireless networks: 802.11 networks, GPRS, 3G Wireless networks: 802.11 networks, GPRS, 3G IPv6: Next-generation internet protocol IPv6: Next-generation internet protocol Collaborative computing Collaborative computing Scientific data mining and knowledge discovery Scientific data mining and knowledge discovery

21 Users Collections (Data, tools, educational materials) Metadata repository Services Users Collections (Data, tools, educational materials) Metadata repository Services Web services is a technology and process for discovery and connection. The eXtended Markup Language, XML, is accepted as THE emerging standard for data interchange on the Web. XML allows authors to create their own markup, which has led to the proliferation of “MyOwn Markup Language” Web Services and the Wild and Wooly World of Markup Languages

22 Title: Unidata 2008: Shaping the Future of Data Use in the Geosciences Six endeavors are proposed, focusing on Community and Support Services and Data Services, Systems, and Tools The proposed endeavors will enable the community to advance scientific exploration, education, and decision- making. We are moving from an era of data provision to one in which data- and related web-services are emphasized Five-year Core Funding NSF Proposal “The unanimous finding of the panel is that the Unidata Program Center program be supported as fully as possible by NSF for the years 2003-2008.”

23 Proposed Endeavors Endeavor 1. Endeavor 1. Responding to a broader and more diverse community. Respond to increased emphasis on Earth-system science (e.g., bring new data sets to the community) Establish new partnerships with related communities (e.g. with Hydrology via CUAHSI) Support new tools in technically less-sophisticated institutions (e.g., community colleges) Endeavor 2. Endeavor 2. Comprehensive support services Deploy web-based training modulesDeploy web-based training modules Simplify installation and maintenance for all supported packagesSimplify installation and maintenance for all supported packages Explore new technologies (e.g., Access Grid) to facilitate remote collaborationExplore new technologies (e.g., Access Grid) to facilitate remote collaboration

24 Endeavor 3: Real-time, self-managing data flows More flexibility and control Many more feed types for finer control over routing and subsetting Configurable product priorities Self-managing data flows (automatic dynamic routing) Application-level multicast looks promising for hundreds of sites (IP multicast not suitable due to limitations) NLDM: data flooding via Usenet protocols may provide practical routing solution (needs more testing) Support for new standards Use of IP version 6 protocols Internet2, Grid and e-services standards (authentication, resource use,...) Location-transparency for data

25 LDM-5 Vs. LDM-6 Latencies CONDUIT Experience Average delivery time : ~20 seconds to top-tier sites

26 Endeavor 4. Software to analyze and visualize geoscience data Integrate diverse datasets Support analysis and visualization of local and climate modeling efforts climate Develop collaborative tools to make effective use of shared visualizations Allow customized user experiences Adapt to GIS frameworks –Cloud water isosurface from COMMAS storm model data (courtesy Adam Houston and Dan Bramer, NCSA/UIUC)

27 Endeavor 5: Distributed, organized collections of digital material Thematic Real-time Environmental Distributed Data Servers (THREDDS) User applications: e.g., McIDAS, IDV, LAS, IDL, MatLab... DLESE Digital Library for Earth-System Education Hydrology Data, e.g. Geophysical Data, e.g. Satellite Images, e.g. Satellite Imagery... OpenDAP, ADDE, & FTP protocols IDD DL interchange protocol IDD Discovery IDD

28 PeopleDocumentsData Catalog Generation Tools Analysis and Visualization Tools Data Services Discovery and Publication Tools Discovery and Publication Services Data Catalog Services THREDDSMiddleware

29 THREDDS, GIS, DL Interoperability GIS Client Applications THREDDS Client Applications OpenGIS Protocols: WMS, WFS, WCS OGC or proprietary GIS protocols OGC or OPeNDAP ADDE. FTP… protocols GIS Server GIS Servers Demographic, infrastructure, societal impacts, … datasets THREDDS Server THREDDS Servers Satellite, radar, forecast model output, … datasets Digital Library Discovery Systems Metadata crosswalk Open Archives Initiative (OAI) Metadata Harvesting Metadata crosswalk

30 Current Implementation netCDF Application Parallel file system Proposed Implementation HDF5 (serial and/or parallel) netCDF Application Network or to/from another application Strea m POSIX I/O File Meta- data Raw data User- defined device Custo m MPI- I/O Split files POSIX I/O File NetCDF-HDF Integration Extend netCDF to high-performance computing environment Implement parallel I/O, large grids, etc. Work will directly benefit WRF and CCSM communities Endeavor 6: Improved data access infrastructure

31 The Visual Geophysical Exploration Environment (VGEE) The VGEE is an integrated framework in which students use visualization tools, data, and curricular materials to learn basic physical principles of atmospheric science It includes: A learner interface to the IDVA learner interface to the IDV Java-based concept models to support physical insightJava-based concept models to support physical insight A curriculum to guide inquiryA curriculum to guide inquiry A catalog of data (THREDDS)A catalog of data (THREDDS)

32 Students notice that the Western Pacific is considerably warmer than the East. Identify Relate Explain Integrate VGEE: An Integrated Framework Concept Models, which are used to explore relations in an idealized context.

33 Concluding Remarks We live in an exciting moment in the history of the Earth sciences. Workshops like this and the diversity of representation from academia are testimony to the vibrancy of the community and the program. The portfolio of tools and technologies within Unidata, coupled with the energies of a creative and collaborative community, puts us in an ideal position to meet the important challenges facing the education and research communities in the atmospheric and related sciences.


Download ppt "Unidata 2008: Shaping the Future of Data Use in the Geosciences Expanding Horizons: Using Environmental Data for Education, Research, and Decision Making."

Similar presentations


Ads by Google