Cyndy Chandler 22 July 2011 Biological and Chemical Oceanography Data Management Office (BCO-DMO) SOST IWG-OP Biodiversity Ad Hoc Committee ~ July 2011.

Presentation transcript:

Cyndy Chandler 22 July 2011 Biological and Chemical Oceanography Data Management Office (BCO-DMO) SOST IWG-OP Biodiversity Ad Hoc Committee ~ July 2011 Quarterly Meeting ~ Washington, DC

bco-dmo.org Biological and Chemical Oceanography Data Management Office (BCO-DMO)
Outline
- What is BCO-DMO?
- Who is BCO-DMO?
- Why is BCO-DMO different?
- How do we accomplish our task?
- Discussion: Data Management for Biodiversity Research

What is BCO-DMO?
BCO-DMO staff provide data management support for investigators and projects funded by the NSF Ocean Sciences Biological and Chemical Oceanography Sections or the NSF OPP ANT Organisms & Ecosystems Program:
- partner with individual investigators and those associated with collaborative research projects
- data management support throughout the project
- capture and record documentation (metadata) sufficient to support data reuse and re-purposing
- load data and metadata into a relational database and ensure their availability online
- ensure final archive in an appropriate data center (e.g. NODC); contribute to special repositories (e.g. CDIAC, OBIS, GenBank)
- ‘proposal to preservation’

Who is BCO-DMO?
BCO-DMO Staff
Biology Department
- Peter Wiebe (Lead Investigator)
- Robert Groman (co-PI)
- Dicky Allison (Data Specialist)
- Tobias Work (Programmer)
Marine Chemistry and Geochemistry
- David Glover (co-PI)
- Cyndy Chandler (co-PI)
- Stephen Gegg (Data Specialist)
Additional data specialists, consultants and collaborators as needed

Why is BCO-DMO different?
BCO-DMO staff are funded to:
- support NSF OCE and OPP funded researchers
- ensure that data are available to the research community in a timely manner and sufficiently documented to facilitate reuse and re-purposing
- work with investigators during all phases of research:
  - data management planning and stewardship
  - proposal writing
  - cruise preparation
  - cruise and data documentation
  - effective organization of data in the BCO-DMO data system
  - permanent archive of data at NODC

How do we accomplish our task?
BCO-DMO staff work in partnership with PIs to create well-documented data sets from research programs involving a wide variety of sampling gear.

How do we accomplish our task?
Data Discovery and Availability
Our primary task is to ensure that data from NSF OCE funded awards are freely available online. The BCO-DMO data system and interfaces facilitate:
- data discovery (text and map-based browse systems)
- data access to assess fitness-for-purpose
- data export and download
- data preservation in a permanent archive, the National Oceanographic Data Center (NODC)

How do we accomplish our task?
Field Data to Database
In situ data from research cruises are documented, contributed to the online data system, and made discoverable through a variety of user interfaces.
Figure: original data from Bongo net tows and CTD/Niskin Rosette

MOCNESS data – paper to digital
“Data Management in the Wild” ~ MOCNESS Data
Hauled in by people, the samples are processed by people, observations recorded by people, and digital data sets created by people.
Figure labels: MOCNESS Sampling; raw biology data; raw physical data; digital biology data; digital physical data; CTD sensor data

MapServer Starting Screen
BCO-DMO Geospatial MapServer interface showing all available data.

MapServer with selections: access to data

How do we accomplish our task?
BCO-DMO staff work in partnership with PIs to create well-documented data sets:
- to enable reuse and re-purposing of data
- to support US contributions to large coordinated research programs and global ocean research themes

How do we accomplish our task?
BCO-DMO and Other Data Repositories
BCO-DMO is part of a network of distributed data repositories working to support the research community and to ensure that data are available in the public domain:
- Carbon Dioxide Information Analysis Center
- North American Carbon Program
- Long Term Ecological Research Network
- National Center for Biotechnology Information: GenBank
- Rolling Deck to Repository (R2R)

“A scholar’s positive contribution is measured by the sum of the original data that he contributes. Hypotheses come and go but data remain.”
In: Advice to a Young Investigator (Santiago Ramón y Cajal, 1897)
Thank you. Questions?
Photo by Chris Linder (WHOI)

What else is needed to support biodiversity research?
What additional cyber-infrastructure is needed to support biodiversity research?
The remaining slides are a supplement to the talk that may be useful during the data management discussion.

What else is needed to support biodiversity research?
NSF Dimensions of Biodiversity Program
Data from 9 awards to be managed by BCO-DMO:
- NSF OCE # Dimensions: The Role of Viruses in Structuring Biodiversity in Methanotrophic Marine Ecosystems
- NSF OCE # and OCE # Dimensions: Significance of nitrification in shaping planktonic biodiversity in the ocean
- NSF OCE # , , and Dimensions: Biological controls on the ocean C:N:P ratios
- NSF OCE # and Dimensions: Uncovering the novel diversity of the copepod microbiome and its effect on habitat invasions by the copepod host

Marine Biodiversity Operation Network
Extended research network being considered

What else is needed to support biodiversity research?
Infrastructure Options
Challenge: there are currently many sources with overlapping and/or incomplete information
- researchers must locate resources, resolve conflicts/duplicates, review and ‘repair’ retrieved data
Strategies and Solutions:
- data warehousing: extract, transfer, load data
- data federation: network of distributed repositories; data remain at the source and are retrieved on demand
- data aggregation: central catalog (e.g. EOL)
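The federation strategy on this slide (data remain at the source and are retrieved on demand, with duplicates resolved by the consumer) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the repository names, the `Record` fields, the sample rows, and the rule that two records are duplicates when dataset identifier, taxon and year all match. It does not describe how BCO-DMO or any other repository actually works.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class Record:
    """One observation as returned by a (hypothetical) repository."""
    source: str       # repository that served the record
    dataset_id: str   # identifier assigned to the dataset
    taxon: str        # organism name as reported by the source
    year: int


# A "repository" is modelled as any callable that answers a taxon query.
Repository = Callable[[str], Iterable[Record]]


def make_repository(name: str, rows: List[Tuple[str, str, int]]) -> Repository:
    """Build an in-memory stand-in for a remote repository."""
    def query(taxon: str) -> List[Record]:
        return [Record(name, ds, tx, yr) for ds, tx, yr in rows
                if tx.lower() == taxon.lower()]
    return query


def federated_query(repos: Dict[str, Repository], taxon: str) -> List[Record]:
    """Ask every repository at query time and drop duplicate observations."""
    seen = set()
    merged: List[Record] = []
    for name in sorted(repos):                 # deterministic source order
        for rec in repos[name](taxon):
            key = (rec.dataset_id, rec.taxon.lower(), rec.year)
            if key not in seen:                # same dataset held by two sources
                seen.add(key)
                merged.append(rec)
    return merged


# Invented sample holdings; ds-1 is deliberately present in both sources.
repo_a = make_repository("repo_a", [("ds-1", "Calanus finmarchicus", 1997),
                                    ("ds-2", "Centropages typicus", 1998)])
repo_b = make_repository("repo_b", [("ds-1", "Calanus finmarchicus", 1997),
                                    ("ds-3", "Calanus finmarchicus", 1999)])

results = federated_query({"repo_a": repo_a, "repo_b": repo_b},
                          "Calanus finmarchicus")
for rec in results:
    print(rec.source, rec.dataset_id, rec.year)
```

Nothing is copied into a central store here; a warehouse would instead load all rows once and query the copy, which is where the transfer loss noted in the talk can occur.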

What else is needed to support biodiversity research?
Advantages and Disadvantages
data warehousing: one central repository for all data
- one system ‘one stop shop’ is rarely appropriate for all data types
- data and information loss during transfer
data federation: network of distributed repositories
- data remain closer to the ‘source of origin’ and local expertise
- data and information loss is limited
- requires negotiated arrangements (standards) to support interoperability of distributed systems
- long-term preservation must be considered
data aggregation (e.g. EOL)

What else is needed to support biodiversity research?
Interoperability
The ability of different data repository systems to exchange and integrate data and information and present a unified view to the user
- requires syntactic (format) compatibility to retrieve data and information, e.g. access/security, file formats, transfer protocols
- requires semantic (language) compatibility to understand data and information, e.g. metadata standards, controlled vocabularies, ontologies
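The two kinds of compatibility named on this slide can be separated in a short Python sketch. The syntactic step reads two different file formats (CSV and JSON) into the same row structure; the semantic step renames local parameter names to shared terms. The `VOCABULARY` mapping, the column names and the sample values are hypothetical placeholders, not an actual controlled vocabulary used by any repository mentioned in the talk.

```python
import csv
import io
import json

# Hypothetical controlled vocabulary: local parameter names -> shared terms.
VOCABULARY = {
    "temp": "sea_water_temperature",
    "temperature": "sea_water_temperature",
    "sal": "sea_water_salinity",
    "salinity": "sea_water_salinity",
}


def parse_csv(text):
    """Syntactic step for CSV: one dict per data row, keyed by column name."""
    return list(csv.DictReader(io.StringIO(text)))


def parse_json(text):
    """Syntactic step for JSON: the payload is assumed to be a list of objects."""
    return json.loads(text)


def harmonize(rows):
    """Semantic step: rename known parameters; unknown names pass through."""
    harmonized = []
    for row in rows:
        harmonized.append({
            VOCABULARY.get(name.strip().lower(), name): float(value)
            for name, value in row.items()
        })
    return harmonized


# Two invented payloads that describe the same kind of measurement
# in different formats and with different local names.
csv_payload = "temp,sal\n12.5,35.1\n"
json_payload = '[{"Temperature": 11.9, "Salinity": 34.8}]'

unified = harmonize(parse_csv(csv_payload)) + harmonize(parse_json(json_payload))
for row in unified:
    print(row)
```

Both rows end up with the same keys, so they can be presented to the user as one table. Real systems would also need the units and methods behind each term to agree, which a name mapping alone does not guarantee.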

What else is needed to support biodiversity research?
Trans-disciplinary, cross-agency collaboration and cooperation
Geo-Data Informatics 2011 Workshop: Exploring the Life Cycle, Citation and Integration of Geo-Data
- a workshop of 100 invited participants held in Broomfield, Colorado in March 2011
- NSF sponsored with support from USGS
- primary objective: “to substantially advance discussions and directions of data life cycle, data integration and data citation, with strong emphasis on end-use, and to provide a state-of-the-field report to NSF and the USGS of the geoinformatics community’s capabilities and needs...”
- final report (in progress)

What else is needed to support biodiversity research?
Some thoughts...
Integration of distributed, loosely federated data repositories designed to foster biodiversity research and assessment:
- Microbes to Mammals
- Habitat to Health
- Taxonomy to Tipping Points

What else is needed to support biodiversity research?
Data repositories for biodiversity research?
- BCO-DMO
- LTER sites
- NCBI GenBank
- OBIS
- MICROBIS: ICoMM Marine Microbes Database
- EOL
- Protein Data Bank (3D structures of DNA, RNA)
- Cell Image Library (cellimagelibrary.org)
- NOAA, NASA, EPA and USGS sites
- Literature (some are proprietary)

What else is needed to support biodiversity research?
Coordinating groups for biodiversity research?
- NSF, NOAA, NASA, EPA and USGS agency program managers, representatives, committees
- Interagency Working Groups and Advisory Committees
- Scientific Steering Committees
- Interagency Working Group on Ocean Observations (IWGOO) Support Office hosted at the Consortium for Ocean Leadership

What else is needed to support biodiversity research?
Other considerations:
- What are the connection axes (geospatial, temporal, organism/taxon/species name)?
- PI name (e.g. Web of Science ResearcherID, or ORCID, the Open Researcher and Contributor ID)
- Data provenance is very important
- Persistent identifiers (DOIs?)
- References (reciprocal links) to published literature
- Access to proprietary information
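The three connection axes raised here (geospatial, temporal, taxon name) can be read as three independent filters over observation records. The Python sketch below is one hypothetical way to express that; the `Observation` fields, the bounding-box convention, the coordinates and the sample rows are all invented for illustration and do not reflect any repository's schema.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional, Tuple


@dataclass
class Observation:
    taxon: str
    lat: float    # decimal degrees, north positive
    lon: float    # decimal degrees, east positive
    day: date


def select(observations: List[Observation],
           bbox: Optional[Tuple[float, float, float, float]] = None,
           period: Optional[Tuple[date, date]] = None,
           taxon: Optional[str] = None) -> List[Observation]:
    """Filter along any combination of the three axes.

    bbox is (south, west, north, east); period is (start, end), inclusive.
    Simplification: boxes that cross the antimeridian are not handled.
    """
    hits = []
    for obs in observations:
        if bbox is not None:
            south, west, north, east = bbox
            if not (south <= obs.lat <= north and west <= obs.lon <= east):
                continue
        if period is not None and not (period[0] <= obs.day <= period[1]):
            continue
        if taxon is not None and obs.taxon.lower() != taxon.lower():
            continue
        hits.append(obs)
    return hits


# Invented sample observations.
sample = [
    Observation("Calanus finmarchicus", 41.5, -67.0, date(1997, 5, 10)),
    Observation("Calanus finmarchicus", 60.0, -20.0, date(1997, 5, 12)),
    Observation("Centropages typicus", 41.0, -68.0, date(1998, 6, 1)),
]

study_box = (40.0, -70.0, 43.0, -65.0)   # hypothetical study region
hits = select(sample,
              bbox=study_box,
              period=(date(1997, 1, 1), date(1997, 12, 31)),
              taxon="calanus finmarchicus")
print(len(hits), hits[0].day)
```

Exact string matching on taxon names is the weakest of the three filters, since synonyms and spelling variants would be missed; that gap is one reason the talk raises controlled vocabularies and persistent identifiers.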

Existing Repositories
Other considerations: Long tail or ‘dark data’ (Heidorn 2008)

Other considerations: What are the use cases? (Benedict et al. 2007)

Final Slide
What additional cyber-infrastructure is needed to support biodiversity research? What else is needed to support biodiversity research?
- Additional repositories? What about the *omics data?
- Connections between repositories?
- Standards (semantic and syntactic)
- Advisory groups, workshops and governance systems

Existing Repositories

CDIAC: Carbon Dioxide Information Analysis Center, Ocean CO2
TCO2 (DIC), TALK, pH, pCO2, CFCs, SF6, CCl4, CaCO3, DOC, TOC, TDN, dC14

OBIS-USA: Ocean Biodiversity Information System (OBIS) - USA

GenBank SUBMIT DATA SEARCH for DATA