In Situ Data Access Some reasons for success or failure Nancy N. Soreide, Donald W. Denbo NOAA Pacific Marine Environmental Laboratory IIPS Session 3B.

Slides:



Advertisements
Similar presentations
ECOOP Data Management System (T2.2/WP2) Declan Dunne 13 th February 2008, Athens.
Advertisements

Integrating NOAA’s Unified Access Framework in GEOSS: Making Earth Observation data easier to access and use Matt Austin NOAA Technology Planning and Integration.
DapperM: A Matlab Interface to Dapper — An OPeNDAP In Situ Data Service D.W. Denbo 1, J. Sirott 2, W.H. Zhu 1 1 UW/JISAO-NOAA/PMEL 2 NOAA/PMEL IIPS Session.
Summary previous session 1 3 D:\ tools models add meta information netCDF on web server transform to netCDF netCDF on OPeNDAP server data.
Adding scalability to legacy PHP web applications Overview Mario A. Valdez-Ramirez.
Objektorienteret Middleware Presentation 2: Distributed Systems – A brush up, and relations to Middleware, Heterogeneity & Transparency.
Distributed components
City University London
Computer Engineering 203 R Smith Agile Development 1/ Agile Methods What are Agile Methods? – Extreme Programming is the best known example – SCRUM.
Integration of Applications MIS3502: Application Integration and Evaluation Paul Weinberg Adapted from material by Arnold Kurtz, David.
Damian Gordon.  Summary and Relevance of topic paper  Definition of Usability Testing ◦ Formal vs. Informal methods of testing  Testing Basics ◦ Five.
Information Technology for Ocean Observations and Climate Research TYKKI Workshop, December 9-11, 1998, Tokyo, Japan Nancy N. Soreide NOAA Pacific Marine.
Tools for accessing distributed in-situ data collections Donald W. Denbo, NOAA/PMEL-JISAO Jason E. Fabritz, NOAA/PMEL-JISAO Bernard J. Kilonsky, Sea Level.
5/5/2005Toni Räikkönen Internet based data collection from enterprises using XML questionnaires and XCola engine CoRD Meeting May 11th 2005.
U.S. Department of the Interior U.S. Geological Survey Data Integration Progress and Guiding Principles Disciplines, generalization, and open-access. David.
Research Data at NCAR 1 August, 2002 Steven Worley Scientific Computing Division Data Support Section.
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
THROUGH DIVERSITY EFFECTIVENESS AIR Forum 2006 May 18, 2006 Dynamic Charts: An approach to making institutional data available through graphical means.
Distributed Systems: Concepts and Design Chapter 1 Pages
1 AJAX and Dapper: The Good, the Bad, and the Ugly Joe Sirott PMEL/NOAA.
© 2001 Business & Information Systems 2/e1 Chapter 8 Personal Productivity and Problem Solving.
Lead Black Slide Powered by DeSiaMore1. 2 Chapter 8 Personal Productivity and Problem Solving.
Loosely Coupled Parallelism: Clusters. Context We have studied older archictures for loosely coupled parallelism, such as mesh’s, hypercubes etc, which.
NcBrowse A Graphical netCDF/OPeNDAP Browser Donald Denbo 1 & John Osborne 2 1 UW/JISAO-NOAA/PMEL, 2 OceanAtlas Software
Fisheries Oceanography Collaboration Software Donald Denbo NOAA/PMEL-UW/JISAO Presented by Nancy Soreide NOAA/PMEL AMS 2002/IIPS 10.3.
1 Dapper and Argo Joe Sirott PMEL/NOAA. 2 What is Dapper? Web server that provides distributed access to in-situ data via OPeNDAP protocol Clients include.
Web-based Interaction with 3D Environmental Data Sets using Virtual Reality Modeling Language Nancy Soreide, NOAA/PMEL Christopher Moore, NOAA/JISAO Cathy.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Web Services. Abstract  Web Services is a technology applicable for computationally distributed problems, including access to large databases What other.
NOAA Research Hot Items! Nancy Soreide and Eugene Burger NOAA/OAR/PMEL American Meteorological Society / IIPS January 13-17, 2002, Orlando, FL Web-based.
240-Current Research Easily Extensible Systems, Octave, Input Formats, SOA.
1 DAPPER: An OPENDAP Server for In-Situ Data Joe Sirott Donald W. Denbo, Willa H Zhu University of Washington PMEL/NOAA.
GO-ESSP Workshop, LLNL, Livermore, CA, Jun 19-21, 2006, Center for ATmosphere sciences and Earthquake Researches Construction of e-science Environment.
GIS System Design for the Coastal Storms Initiative Nazila Merati OAR/PMEL & OAR Representative to NOAA Enterprise GIS Chris Moore – OAR/PMEL Tiffany C.
UAF/OSMC Presenters: Kevin O’Brien and Eugene Burger Abstract: Kevin O’Brien and Eugene Burger are from NOAA’s Pacific Marine Environmental Laboratory.
NQuery: A Network-enabled Data-based Query Tool for Multi-disciplinary Earth-science Datasets John R. Osborne.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
November 16, 2009 Page 1 of 28 Data and Data Management: Introduction to the BCO-DMO Presented to Professor Keiichi Uchida November 16, 2009 Robert C.
The Unified Access Framework for Gridded Data … the 1 st year focus of NOAA’s Global Earth Observation Integrated Data Environment (GEO-IDE) Steve Hankin,
Recent developments and trends in Network Access to Oceanographic In-situ Data Nancy Soreide, NOAA/PMEL John “Oz” Osborne, NOAA/PMEL - OceanAtlas Software.
NdEdit for Interactive in-situ Data Selection John R. Osborne and Donald W. Denbo NOAA/PMEL American Meteorological Society / IIPS January 13-17, 2002,
NOAAServer: Unified access to distributed NOAA data Ernest Daddio, NOAA/ESDIM Steve Hankin, NOAA/PMEL Donald Denbo, NOAA/PMEL/JISAO Nancy Soreide, NOAA/PMEL.
A Climate Data Portal Focused on realtime and retrospective in situ data Nancy Soreide, Don Denbo, Willa Zhu, NOAA/PMEL Charles Sun, NOAA/NODC Bernie Kilonsky,
EPIC: Providing World Wide Web Access to Oceanographic Observations D.W. Denbo 1, N.N. Soreide 2, M.C. Spillane 1, and W.H. Zhu 1 1 Joint Institute for.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
Chapter 4- Part3. 2 Implementing User Profiles A local user profile is automatically created at the local computer when you log on with an account for.
Interacting with consumer Software Engineering. So far… What is Software Engineering? Different software process models waterfall, incremental, spiral.
1 OceanShare: Interactive Access to Distributed In Situ Data in a Collaborative Tool Environment D.W. Denbo C.R. Windsor NOAA/Pacific Marine Environmental.
March 23, 1999GOIN 99, Honolulu, HI1 Network Collaborative Tool for Integrated Browsing D.W. Denbo 1 C.R. Windsor 2 NOAA/Pacific Marine Environmental Laboratory.
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
A Climate Data Portal An FY2000 HPCC Proposal Nancy Soreide, Don Denbo, Willa Zhu, PMEL Charles Sun, NODC Bernie Kilonsky, U of Hawaii HPCC Project Review.
Distributed Data Servers and Web Interface in the Climate Data Portal Willa H. Zhu Joint Institute for the Study of Ocean and Atmosphere University of.
A Climate Data Portal Focused on realtime and retrospective in situ data Nancy Soreide, Don Denbo, Willa Zhu, PMEL Charles Sun, NODC Bernie Kilonsky, U.
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
EPIC Tools for in-situ data collections Donald W. Denbo, NOAA/PMEL Willa H. Zhu, NOAA/PMEL/JISAO John Osborne, OceanAtlas Software Christopher Moore, NOAA/PMEL/JISAO.
1 Virtual Collaboratory: How Climate Research can be done Collaboratively using the Internet U.S. – China Symposium and Workshop on Climate Variability,
A Collaboration Tool to Support Modeling Groups Donald W. Denbo JISAO/UW-NOAA/PMEL 20 th IIPS/AMS, 12 – 15 January, 2004,
Using Java, XML and XSLT to create secure internal access to local NOAA Research Science project and budget information Jason E. Fabritz, NOAA/PMEL-JISAO/UW.
NcBrowse: A Graphical netCDF File Browser Donald Denbo NOAA-PMEL/UW-JISAO
Mind Mapping Software: Uses and Benefits for Education.
NQuery: A Network-enabled Data-based Query Tool for Multi-disciplinary Earth-science Datasets John R. Osborne 1, Kevin T. McHugh 2, and Donald W. Denbo.
NcBrowse: OPeNDAP Server Access and 3-D Graphics Presented by Nancy N. Soreide NOAA/PMEL Donald W. Denbo UW/JISAO-NOAA/PMEL.
Data Browsing/Mining/Metadata
Principles of Information Systems Eighth Edition
Global Precipitation Data Access, Value-added Services and Scientific Exploration Tools at NASA GES DISC Zhong Liu1,4, D. Ostrenga1,2, G. Leptoukh4, S.
Data and Data Management: Introduction to the BCO-DMO
Recap: introduction to e-science
Integrating Data and Information Across Observing System
Planning and Storyboarding a Web Site
Tutorial 7 – Integrating Access With the Web and With Other Programs
Presentation transcript:

In Situ Data Access Some reasons for success or failure Nancy N. Soreide, Donald W. Denbo NOAA Pacific Marine Environmental Laboratory IIPS Session 3B. Challenges in Data Access American Meteorological Society January 14-18, 2007, San Antonio TX

This paper is an overview of the state of the art in providing access to in- situ data from multiple observing systems over the internet. Describes reasons for success or failure of technologies, in the spirit of Dr. Richard Feynman: "It's a kind of scientific integrity, a principle of scientific thought that corresponds to a kind of utter honesty--a kind of leaning over backwards. For example, if you're doing an experiment, you should report everything that you think might make it invalid--not only what you think is right about it: other causes that could possibly explain your results; and things you thought of that you've eliminated by some other experiment, and how they worked--to make sure the other fellow can tell they have been eliminated.“ -- Richard Feynman, From a Caltech commencement address given in 1974 Also in Surely You're Joking, Mr. Feynman!

PMEL projects have tried many ways to access data & PMEL has participated in efforts to find a paradigm for delivering NOAA Data Internet (pre-Web) Web pages –static graphics & dynamically created graphics and products Web services CORBA OPeNDAP Dapper (OPeNDAP implementation for in situ data) This presentation uses examples drawn from PMEL experience with local and NOAA projects

How and why projects failed topics to be considered Problem too heterogeneous for a single solution Generalized vs Specific project objectives Large project teams vs Small project teams Over-hyped claims for new technologies Why good ideas sometimes fail Failure of funding Some characteristics of successful projects

Trying to find a paradigm for delivering NOAA Data Problem too heterogeneous for a single solution - Recent example: –XML standard for ocean data –Lots of interest over past several years –Still no universally accepted solution –Problem seems to be too big to generalize

Generalized vs Specialized Example: OPeNDAP as a universal solution Worked for gridded data –though handicapped by non-uniformity in netCDF implementation Much harder to make it work for in situ data (i.e., station data) –Still handicapped by non-uniformity in netCDF implementation –No one wants to see a list of several thousand station data file names Lead to development of the OPeNDAP Dapper server, which is customized for, and becoming the standard for, in situ data –Aided by development of clients which made accessing, graphing and using the data easier (e.g., web client, Matlab, GrADS, etc.) OPeNDAP - Dapper - DChart -

Big software projects which fail In comp.risks, big failed projects are often those which are outsourced and which try to solve the whole problem. Problem is too big Requirements are changed during development Features are added during development

Big vs Small Small projects seem far more likely to succeed than large ones The most successful web pages delivering ocean data have always been those which are focused on a specific data type or dataset Examples: –TAO El Nino buoy data, Argo data, LAS for large gridded datasets, EPIC Web and DChart for in situ data TAO - Argo - LAS - EPIC Web - DChart -

New technology over hyped as a solution for everything Technology may be quite sound,but the problem may be too big to generalize easily Example #1: –XML as a formatting standard is powerful and has limitations –XML wrapping does help with parsing and presentation of information. –XML wrapping does not solve problems of different disciplines and sub-disciplines.

New technology over hyped as a solution for everything Technology may be quite sound,but the problem may be too big to generalize easily Example #2: –netCDF is an incredibly useful data format –There have been many attempts to standardize the implementation of netCDF for different types of ocean data and model output. –There are still numerous problems I.e., although there are some de facto standards for implementation of netCDF format for some classes of data, There is still no widely accepted standard for in situ data

Good Ideas Many instances of good ideas which did not work out, due to no fault of the developers Reasons –Users didn’t understand technology –Technology was perceived as too difficult –Availability of off-the-shelf products overtook the developers of a home-grown product –Funding failed or was not consistently sustained so project could not move forward

Failure of funding A successful project which failed due to failure of funding (historical example): –NOAAServer project brought together all five NOAA Line Offices –First phase created database of metadata with pointers to web pages delivering data –Second phase used CORBA for an on-line system for networked access to centralized services for locating, selecting, graphing and downloading distributed in situ data –Despite successful, working prototypes –Funding was redirected, project stopped The Phase One database appears to be no longer updated, nor does it seem to have been replaced. There is still no NOAA data portal like the successful Phase Two prototype

Successful Solutions Success has often been achieved by breaking the problem into smaller problems which can then be integrated into a larger scale.

In summary, successful projects Often start with a prototype Grow incrementally and have constant interaction with a user base Require consistent, sustained funding over many years or even decades Some well known, well established examples: –Ferret, GrADS, EPIC, netCDF, OPeNDAP Ferret - GrADS - EPIC - netCDF - OPeNDAP -