Suggestion for Construction of an Earth Science Collaboratory.

Slides:



Advertisements
Similar presentations
Suggestion for Construction of an Earth Science Collaboratory.
Advertisements

Earth System Curator Spanning the Gap Between Models and Datasets.
Joint CASC/CCI Workshop Report Strategic and Tactical Recommendations EDUCAUSE Campus Cyberinfrastructure Working Group Coalition for Academic Scientific.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Planning for Flexible Integration via Service-Oriented Architecture (SOA) APSR Forum – The Well-Integrated Repository Sydney, Australia February 2006 Sandy.
0 General information Rate of acceptance 37% Papers from 15 Countries and 5 Geographical Areas –North America 5 –South America 2 –Europe 20 –Asia 2 –Australia.
© , Michael Aivazis DANSE Software Issues Michael Aivazis California Institute of Technology DANSE Software Workshop September 3-8, 2003.
CLIMATE SCIENTISTS’ BIG CHALLENGE: REPRODUCIBILITY USING BIG DATA Kyo Lee, Chris Mattmann, and RCMES team Jet Propulsion Laboratory (JPL), Caltech.
TeraGrid Gateway User Concept – Supporting Users V. E. Lynch, M. L. Chen, J. W. Cobb, J. A. Kohl, S. D. Miller, S. S. Vazhkudai Oak Ridge National Laboratory.
January, 23, 2006 Ilkay Altintas
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
The Natural Resources Digital Library Needs, Partners, and Challenges Bonnie Avery, Janine Salwasser, & Janet Webster Oregon State University.
1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.
A PROPOSED EARTH SCIENCE COLLABORATORY K-S Kuo 1,2, Chris Lynnes 1, Rahul Ramachandran 3 1 NASA Goddard Space Flight Center, USA 2 Caelum Research Corporation,
Metadata Creation with the Earth System Modeling Framework Ryan O’Kuinghttons – NESII/CIRES/NOAA Kathy Saint – NESII/CSG July 22, 2014.
Updates from EOSDIS -- as they relate to LANCE Kevin Murphy LANCE UWG, 23rd September
Geospatial Systems Architecture Todd Bacastow. GIS Evolution
DM_PPT_NP_v01 SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF.
GCMD/IDN STATUS AND PLANS Stephen Wharton CWIC Meeting February19, 2015.
Ohio State University Department of Computer Science and Engineering 1 Cyberinfrastructure for Coastal Forecasting and Change Analysis Gagan Agrawal Hakan.
Sept 19,  Provides a common set of terminology and definitions  A framework for describing resources and processes  Enables computer based interoperability.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Page 1 Informatics Pilot Project EDRN Knowledge System Working Group San Antonio, Texas January 21, 2001 Steve Hughes Thuy Tran Dan Crichton Jet Propulsion.
Mid-Course Review: NetCDF in the Current Proposal Period Russ Rew
Peter Bajcsy, Rob Kooper, Luigi Marini, Barbara Minsker and Jim Myers National Center for Supercomputing Applications (NCSA) University of Illinois at.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
ESIP Federation: Connecting Communities for Advancing Data, Systems, Human & Organizational Interoperability November 22, 2013 Carol Meyer Executive Director.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Architecture Overview Michael Meisinger Life Cycle Architecture Review.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Interoperability Grids, Clouds and Collaboratories Ruth Pordes Executive Director Open Science Grid, Fermilab.
Geosciences - Observations (Bob Wilhelmson) The geosciences in NSF’s world consists of atmospheric science, ocean science, and earth science Many of the.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
National Center for Supercomputing Applications Barbara S. Minsker, Ph.D. Associate Professor National Center for Supercomputing Applications and Department.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
The Earth System Grid (ESG) Computer Science and Technologies DOE SciDAC ESG Project Review Argonne National Laboratory, Illinois May 8-9, 2003.
User Working Group 2013 Data Access Mechanisms – Status 12 March 2013
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
Evolving toward a Coherent, Collaborative Framework for Earth Science Data, Tools and Services Christopher Lynnes, Kwo-Sen Kuo and Kevin Murphy Earth Science.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
A Practical Approach to Metadata Management Mark Jessop Prof. Jim Austin University of York.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
ESIP AQ Cluster Community Components for the Air Quality SBA in AIP-2.
Securing the Grid & other Middleware Challenges Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Partnerships in Innovation: Serving a Networked Nation Grid Technologies: Foundations for Preservation Environments Portals for managing user interactions.
End-to-End Data Services A Few Personal Thoughts Unidata Staff Meeting 2 September 2009.
Brian Matthews, euroCRIS, 18/09/03 CRIS architecture to support an ERA Brian Matthews.
Event-Based Model for Reconciling Digital Entities Ahmet Fatih Mustacoglu Ahmet E. Topcu Aurel Cami Geoffrey C. Fox Indiana University Computer Science.
The Federated Data System DataFed R. Husar, K. Hoijarvi, S. Falke, DaFed Community EPA Data Summit, Feb. 12, 2008, RTP Non-intrusive data integration infrastructure.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
All Hands Meeting 2005 BIRN-CC: Building, Maintaining and Maturing a National Information Infrastructure to Enable and Advance Biomedical Research.
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
The Earth System Curator Metadata Infrastructure for Climate Modeling Rocky Dunlap Georgia Tech.
ESIP Vision: “Achieve a sustainable world” by Serving as facilitator and advisor for the Earth science information community Promoting efficient flow of.
Joslynn Lee – Data Science Educator
VOA3R Virtual Open Access Agriculture & Aquaculture Repository: A platform for sharing scientific and scholarly research related to agriculture, aquaculture.
An Overview of Data-PASS Shared Catalog
Workshop on Cyberinfrastructure National Science Foundation
OGCE Portal Applications for Grid Computing
Metadata Development in the Earth System Curator
Data Management Components for a Research Data Archive
Presentation transcript:

Suggestion for Construction of an Earth Science Collaboratory

The Situation Today Earth Science Stuff is (still) hard to use... data science tools / svcs analysis results knowledge about data tools analysis methods find share reuse put together data + data data + tool tool + tool desktop + online svc

The Situation Today Islands of data and services with selective connectivity Provenance SciFlow ESG ECHO Data Center Giovanni GCMD

Proposed: Convergent Evolution to an Earth Science Collaboratory (ESC) ESC

What Is An Earth Science Collaboratory? A rich data analysis environment that: – Provides access across a wide spectrum of Earth Science data – Provides a diverse set of science analysis services and tools – Supports the application of services and tools to data – Supports collaboration on data analysis – Supports sharing of data, tools, results and knowledge

Earth Science Collaboratory Cyberinfrastructure Tool Library Data Library Laboratory Notebook Data Centers Workflow Mediator

Tool Library (tool) Contributed (tool) (Tool) coincidence feature det. quality filters Community visualizations event service IDL cdat matlab GrADS Provisioned nco ncl Packager autoconf RPM Web wrapper Packager autoconf RPM Web wrapper Social: sharing, tagging, discussion Discovery Configuration Mgmt: testing, versioning (tool) (Tool) Personal “What tools work with MODIS Level 2 Aerosols data?” “I have a reader for radiosonde data; how can I make it available to the rest of the community?”

Data Library Contributed ACCESS MEaSUREs Validation Community Field Exper. Provisioned Packager data probe format check metadata wizard Packager data probe format check metadata wizard Social: sharing, tagging, discussion Discovery Configuration Mgmt: testing, versioning Personal EOSDIS et al. Cache “It looks like there might be an artifact in the data ”

Workflow Library Contributed Giovanni Data Mining SciFlow Community GeoBrain Provisioned Packager workflow editor Packager workflow editor Social: sharing, tagging, discussion Discovery Configuration Mgmt: testing, versioning Personal Processing Algorithms “Here are the steps I used to detect temperature inversions in AIRS Standard Retrievals...” “I wonder how Prof. Taylor deals with quality filtering of AIRS Standard Retrievals ”

Laboratory Notebook Project Education Pkgs Example Uses Community Science Stories Education Pkgs Example Uses Science Stories Provisioned Packager Notebook editor Experiment manager Packager Notebook editor Experiment manager Social: sharing, tagging, discussion Discovery Configuration Mgmt: versioning Personal Project Results “Here is the full set of analyses used in my recent JGR paper: ” “This set of workflows and data makes a good laboratory exercise for undergraduate meteorology students.”

Key Advantages of the Earth Science Collaboratory over the Situation Today Tool availability is a force multiplier – More tools will be usable with more datasets – More tools will be easier to find and more available to more users Knowledge sharing will evolve from text on paper to a rich mixture of data, tools, workflows and articles A “wikihow” for Earth Science data analysis will emerge – Incorporating live data, services and workflows ESC will maintain a record of the analysis process – Share, repeat, build upon analysis techniques – Transparency of the process is built in

Why now? Because we can do it (finally)! – Advances in standards acceptance and implementation (e.g., OPeNDAP) – A consistent, coherent, loosely coupled architecture encapsulates complexity and maximizes flexibility – Social networking has reached the mainstream – Key lessons can be learned from prior efforts – Other fields are doing it (e.g., Canadian Space Science Data Portal) The need is growing – Interest in working with multiple datasets is growing – Calls for transparency and reproducibility are growing

Key Challenges Community buy-in – Developers – User adoption Sweeping scope is scary =8-0 Learning lessons from prior efforts

How to move forward? EOSDIS Coherent Web – Will pull together EOSDIS tools and services – Working with Kevin Murphy of ESDIS Community – Prototypes? Narrow end-to-end prototypes, followed by refactoring, broadening and convergence Bite off chunks as ACCESS or AIST projects – ESIP Earth Science Collaboratory Cluster

The Long View Get community consensus on goal Expand ESC architectural concept to full architecture Build initial framework (critical services) Establish incentives to fit into ESC architecture Generalize proofs of concept to robust versatile services Add missing pieces to architecture Integrate provisioned and community stuff 7+ yrs

Want to Help? Join the Earth Science Collaboratory Cluster – Collaboratory – Mail list: earthcollaboratory

Backup Slides

Mediator Mediates combinations – tool with data – data with data – tool with tool – tool with workflow – data with workflow Based on – data access standards – common data model – semantic + syntactic matching of tools, data and workflows

Cyberinfrastructure Services used by all other components Security – authentication – authorization – code audit/padded cell – integrity checking Social – tagging – sharing – discussions – groups – reputation Cloud – elastic provisioned storage and computing Discovery – data, tools, workflows, experiments – search by keyword, variable, time, author Information Mgmt – provenance – identifiers – archive Semantic Web – data ontology – tools ontology

What’s New? Macro View (forest-level) – Systematic approach to making data available to services and vice versa – Integration of all major analysis components – Seamless integration of desktop with remote services – Consistent view of all architectural components – Cyberinfrastructure services for all architectural components Micro View (tree-level): Nothing! – Prototypes and proofs of concept exist for each piece

Prior Art Talkoot, myexperiment.org – workflow sharing, virtual notebooks Earth System Grid – provisioned tools, format standards/checkers Land Information System – OPeNDAP as access infrastructure Earth Science Modeling Framework – programmatic approach to integration Giovanni, LAS – community services/tools Nebula – cloud provisioning RAMADDA – management of diverse information objects NASA Earth Exchange – collaborative framework for NASA Earth Science projects HUBzero, Zooniverse – science collaboration frameworks EOSDIS – Federated data centers, federated discovery Canadian Space Science Data Portal – collaborative analysis/workflows, scientist-federated search/access, provenance... NCSA Cyberintegrator – user-contributed tools, annotated data, tools and workflows