E ARTHCUBE C ONCEPTUAL D ESIGN A Scalable Community Driven Architecture Overview PI:

Slides:



Advertisements
Similar presentations
NASA Earth Science Data Preservation Content Specification H. K. (Rama) Ramapriyan John Moses 10 th ESDSWG Meeting – November 2, 2011 Newport News, VA.
Advertisements

Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
Connecting People With Information DoD Net-Centric Services Strategy Frank Petroski October 31, 2006.
May 17, Capabilities Description of a Rapid Prototyping Capability for Earth-Sun System Sciences RPC Project Team Mississippi State University.
The "Earth Cube” Towards a National Data Infrastructure for Earth System Science Presentation at WebEx Meeting July 11, 2011.
Review Concept of Operations for an Enterprise Architecture Intelligence Center Haiping Luo Note: This presentation is my own thinking, with valuable input.
The Use of Zachman Framework Primitives for Enterprise Modeling
PV2013 Summary Results Data Stewardship Interest Group WGISS-37 Meeting Cocoa Beach (Florida-US) - April 14-18, 2014.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21,
Developing Enterprise Architecture
V. Chandrasekar (CSU), Mike Daniels (NCAR), Sara Graves (UAH), Branko Kerkez (Michigan), Frank Vernon (USCD) Integrating Real-time Data into the EarthCube.
SCIENCE-DRIVEN INFORMATICS FOR PCORI PPRN Kristen Anton UNC Chapel Hill/ White River Computing Dan Crichton White River Computing February 3, 2014.
1 CCSDS Information Architecture Working Group SEA Plenary Daniel J. Crichton, Chair NASA/JPL 12 September 2005.
The Marine Metadata Interoperability Project A Model for Community Collaboration September 23, 2010 Nan Galbraith WHOI.
U.S. Department of the Interior U.S. Geological Survey Next Generation Data Integration Challenges National Workshop on Large Landscape Conservation Sean.
Concept Award PI Meeting Outcomes: Virtual Presentation to the EarthCube Community September 13, 2012.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Sept. 5, 2012 Kevin T. Gallagher and Linda C. Gundersen September 5, 2012 CDI Science.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
Mehdi Ghayoumi Kent State University Computer Science Department Summer 2015 Exposition on Cyber Infrastructure and Big Data.
Semantic Cyberinfrastructure for Knowledge and Information Discovery (SCiKID) Proposal Principle Investigator: Eric Rozell Tetherless World Constellation.
1 A National Virtual Specimen Database for Early Cancer Detection June 26, 2003 Daniel Crichton NASA Jet Propulsion Laboratory Sean Kelly NASA Jet Propulsion.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Architecture Overview Michael Meisinger Life Cycle Architecture Review.
Ocean Observatories Initiative OOI Cyberinfrastructure Architecture Overview Michael Meisinger September 29, 2009.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
Enterprise Architecture, Enterprise Data Management, and Data Standardization Efforts at the U.S. Department of Education May 2006 Joe Rose, Chief Architect.
Information Systems Engineering. Lecture Outline Information Systems Architecture Information System Architecture components Information Engineering Phases.
Unified Modeling Language* Keng Siau University of Nebraska-Lincoln *Adapted from “Software Architecture and the UML” by Grady Booch.
FEA DRM Management Strategy Presented by : Mary McCaffery, US EPA.
EPA Geospatial Segment United States Environmental Protection Agency Office of Environmental Information Enterprise Architecture Program Segment Architecture.
NASA Perspectives on Data Quality July Overall Goal To answer the common user question, “Which product is better for me?”
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
The Long Tail of Sample-based Data in the Next Decade FROM DARKNESS TO LIGHT Kerstin Lehnert
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Science Data in the Science Mission Directorate (SMD) Jeffrey J.E. Hayes Program Executive for MO & DA, Heliophysics Division August 17, 2011.
Seeking SC Feedback on Draft Technology Strategy and Roadmap for EarthCube Draft of 3 November 2015 The Technology and Architecture Committee (TAC) Chairs:
1 Registry Services Overview J. Steven Hughes (Deputy Chair) Principal Computer Scientist NASA/JPL 17 December 2015.
Towards an European Network of Earth Observation Networks (ENEON): Addressing Challenges and Facilitating Collaboration for non-space based Earth Observations.
Applications and Requirements for Scientific Workflow May NSF Geoffrey Fox Indiana University.
The OAIS Reference Model Michael Day, Digital Curation Centre UKOLN, University of Bath Reference Models meeting,
Djc -1 Daniel J. Crichton NASA/JPL 9 May 2006 CCSDS Information Architecture Working Group.
A Reference Model for RDA & Global Data Science Yin ChenWouter Los Cardiff University University of Amsterdam 1.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
1 Steve Hughes Daniel J. Crichton NASA/JPL January 16, 2007 CCSDS Information Architecture Working.
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
E ARTH C UBE C ONCEPTUAL D ESIGN A Scalable Community Driven Architecture Setting The.
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
Connecting Users, Data & Data Repositories Simon J. Goring ORCID: John W. Williams doi: /m9.figshare Distinguished Lecture.
ISWG / SIF / GEOSS OOSSIW - November, 2008 GEOSS “Interoperability” Steven F. Browdy (ISWG, SIF, SCC)
Enterprise Architectures Course Code : CPIS-352 King Abdul Aziz University, Jeddah Saudi Arabia.
GEOSPATIAL CYBERINFRASTRUCTURE. WHAT IS CYBERINFRASTRUCTURE(CI)?  A combination of data resources, network protocols, computing platforms, and computational.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
NOAA EDMC Ocean Observatories Initiative Cyberinfrastructure Karen Stocks OOI CI Data Curator University of California, San Diego Ocean Observatories.
ISWG / SIF / GEOSS OOS - August, 2008 GEOSS Interoperability Steven F. Browdy (ISWG, SIF, SCC)
IPDA Architecture Project International Planetary Data Alliance IPDA Architecture Project Report.
F UNDED - R ESEARCH C OORDINATION N ETWORKS iSamples Best practices and standards for physical samples, including preservation, access and curation of.
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
National Aeronautics and Space Administration 1 CCSDS Information Architecture Working Group Daniel J. Crichton NASA/JPL 24 March 2005.
CEOS Working Group on Information System and Services (WGISS) Data Access Infrastructure and Interoperability Standards Andrew Mitchell - NASA Goddard.
Managing Enterprise Architecture
EarthCube Sustaining the Geosciences for 21 st Century Challenges Credits: from top to bottom: NOAA Okeanos Explorer Program (CC BY-SA 2.0), NASA/Kathryn.
CESSDA SaW Training on Trust, Identifying Demand & Networking
Earth Science Data Infrastructure
NOAA Data Management Perspective & Plans NSF RDMI Workshop
INTAROS WP5 Data integration and management
Identifiers Answer Questions
Bird of Feather Session
VIFI : Virtual Information Fabric for Data-Driven Discovery from Distributed Fragmented Repositories PI: Dr. Ashit Talukder Bank of America Endowed Chair.
Presentation transcript:

E ARTHCUBE C ONCEPTUAL D ESIGN A Scalable Community Driven Architecture Overview PI: G. Djorgovski (Caltech) CO-I: D. Pilone, T. Pilone (Element 84), D. Crichton, E. Law (JPL) Other key personnel: S. Caltagirone (E84), S. Hughes (JPL), T. Huang (JPL), A. Mahabal (Caltech) 1/7/ ESIP Winter Meeting

A high level system blueprint for the definition, construction, and deployment of both existing and new components to ensure that they can be unified and integrated into an evolutionary national infrastructure for EarthCube 1/7/16 2

Methodology Identification of stakeholders, concerns and requirements Identification of architectural use cases and drivers Selection of an architectural framework Development of the architectural principles Development of the architectural models Capture of the architecture artifacts in a consolidated report Generation of recommendations for adopting the architecture for the EarthCube program 1/7/16 3

4

Stakeholders Stakeholder/ActorConcerns NSF Program Managers Make decision and provide guidance at the EarthCube program level. Provide sufficient funding to support the EarthCube mission. EarthCube Scientists Use EarthCube resources and services to conduct scientific research. Publish scientific results & curate data as needed. EarthCube DevelopersDevelop technologies and services that can be integrated into EarthCube. EarthCube Architects Establish EarthCube requirements, framework and operational concept. Develop information model (vocabulary, ontology). Establish standards guidelines. Ensure interoperability between EarthCube Building Blocks. External Data Users Use EarthCube resources and services for research, education, and decision-making. Curator Ensure data is properly captured in EarthCube compliant data repositories. Data Owner Responsible for producing the data. Concerned about its distribution and use. External Data Facility Responsible for archiving data at other agencies (NASA, NOAA, USGS, etc); interoperability with the EarthCube Cyberinfrastructure. EarthCube Governance Committees Responsible for generating and monitoring the governance for the system including data curation, access, use case priority, interoperability standards, etc. EarthCube Office Staff Responsible for maintaining the community involvement within EarthCube and communicating changes and how to use the system. 1/7/16 5

Use Cases Big Science – Discovery, Comparison, Provenance, Model & visualization Collaborative Science Dark Data Contribution Tools Contribution Data Documentation Models Sharing High Performance Computing and Storage Resources Real Time Data Physical Sample Curation 1/7/16 6

Drivers Transform and accelerate research and discovery by turning data into knowledge and enabling interdisciplinary data integration. Provide critically needed data, tools, and computational resources and frameworks for cross-domain scientific collaboration, analysis and with long-term geoscience software and data preservation, discovery and use. Provide a geosicences cyberinfrastructu re and architecture that is scalable, extensible and sustainable. 1/7/16 7

Frameworks Zachman Framework - For organizing stakeholder concerns and perspectives. ISO/IEC/IEEE 42010:2011- For architectural description guidelines. Reference Model for Open Distributed Processing (RM-ODP) – For architectural patterns for distributed systems. Open Group Architecture Framework (TOGAF) – For managing the architecture. Federal Enterprise Architecture Framework (FEAF) – For classifying the architecture into architectural elements and viewpoints. ISO 14721: Open Archival Information System (OAIS) Reference Model - Provides a standard for information objects. ISO/IEC 11179:3 Registry Metamodel and Basic Attributes specification - Provides a schema for a metadata registry. 1/7/16 8

Scalability Community Driven Open Science Interoperability Sustainability Distributed Data Model Driven 1/7/16 9

Science Data Manage Satellite Instrument Data Systems Science Data Manage Airborne Data Science Data Manage Agency Earth Data Archives Data Provider EarthCube CI EarthCube Discovery 1/7/16 10

Science Data Manage Satellite Instrument Data Systems Science Data Manage Airborne Data Science Data Manage Agency Earth Data Archives Data Provider EarthCube CI Other Data Systems (e.g. NOAA) Other Data Systems (In-Situ, University) EarthCube RepositoryEarthCube Discovery 1/7/16 11

Science Data Manage Satellite Instrument Data Systems Science Data Manage Airborne Data Science Data Manage Agency Earth Data Archives Data Provider EarthCube CI Other Data Systems (e.g. NOAA) Other Data Systems (In-Situ, University) EarthCube Repository Data Science Infrastructure (Data, Algorithms, Machines) Science Teams EarthCube Discovery 1/7/16 12

Applications Decision Support Science Data Manage Satellite Instrument Data Systems Science Data Manage Airborne Data Science Data Manage Agency Earth Data Archives Research Data Provider Data Analysis EarthCube CI Other Data Systems (e.g. NOAA) Other Data Systems (In-Situ, University) EarthCube Repository Data Science Infrastructure (Data, Algorithms, Machines) Earthcube Data Analytics Centers Science Teams EarthCube Discovery 1/7/16 13

Benchmark Earth System Grid Federation (ESGF) Early Detection Research Network (EDRN) NASA’s Earth Observing System Data and Information System (EOSDIS) 1/7/16 14

Architecture Elements 1/7/16 15

Data Lifecycle 1/7/16 16

Information Model Context 1/7/16 17

Framework 1/7/16 18

Example Instantiation 1/7/16 19

Thank You Questions? 1/7/16 20

EarthCube Conceptual Architecture Discussion The controversial bits… 1/7/16 21

THIS IS A DISCUSSION. Please Talk. 1/7/16 22

EarthCube Architect EarthCube Developer EarthCube Scientist Curator 1/7/16 23 NSF Program Manager External Data Users External Data Facility Earthcube Staff Governance Committee

Stakeholders Do we have the right stakeholders? Do they overlap at all? Too much? Are they useful to provide use cases and personas that help drive the system? Are we missing key stakeholders? 1/7/16 24

Stakeholders NSF Program ManagersEarthCube Scientists EarthCube DevelopersEarthCube Architects External Data UsersCurators Data OwnerExternal Data Facility EarthCube Governance Committees EarthCube Office Staff 1/7/16 25

Architectural Principles FederationSustainability Standards(Data) Model-Driven ExtensibilityScalability ProvenanceSecurity 1/7/16 26

Standards… We do not advocate a particular standard… Our Conceptual Architecture emphasizes fully defined and self contained data rather than prescribing standard(s). EarthCube’s heterogenous data, applications, and systems appear to justify possible increase in complexity. Common models and representations should be used. 1/7/16 27

EarthCube Software Lifecycle Processes 1/7/16 28 Technology Planning Software Development Release

Research Software Lifecycle Processes 1/7/16 29 Technology Planning Software Development Software Versioning Software Search and Distribution Algorithm Search and Distribution

Software Lifecycle Processes We place an emphasis on software versioning, discovery, etc. for Research Software. Should we treat “EarthCube proper” processes the same way? What about discovery and distribution? 1/7/16 30

Metrics Use Examples: Product Searches Products Downloaded Services Accessed Publications Cited Quality Examples: Ingestion speed Search Response Time User “conversions” 1/7/16 31

Metrics & Conceptual Architectures Is this the right place to advise / mandate metrics? (e.g. we’re not doing this for standards) Should we be specific or just provide categories? Do we go so far as to ”mandate” it for EarthCube components / building blocks / etc? 1/7/16 32

Applications Decision Support Science Data Manage Satellite Instrument Data Systems Science Data Manage Airborne Data Science Data Manage Agency Earth Data Archives Research Data Provider Data Analysis EarthCube CI Other Data Systems (e.g. NOAA) Other Data Systems (In-Situ, University) EarthCube Repository Data Science Infrastructure (Data, Algorithms, Machines) Earthcube Data Analytics Centers Science Teams EarthCube Discovery 1/7/16 33

Places we haven’t expressed an opinion Cloud vs. on-premises hosting Data location (hosted vs. distributed) Compute location Should we? 1/7/16 34

Best Practices Common Software StackCommon Data Model Standard InterfacesService-Oriented Architecture Decoupled Storage, Compute, and Data Management Federated Search Analytic ServicesVisualization 1/7/16 35

Misc Questions How do we make this real? What’s the next thing you need to make EarthCube more valuable to you? How can the Conceptual Architecture effort help you get there? 1/7/16 36

Our Next Steps 1. Solicit Reviewers for Conceptual Architecture Document (NOW!) 2. Incorporate feedback and review comments 3. Write actionable recommendations and incorporate into final Conceptual Architecture 4. Prioritize and Deploy Key Architectural Components 1/7/16 37

We need reviewers! Please contact Emily Law if you’re interested. Thank you! 1/7/16 38