Jennie Larkin, PhD Senior Advisor


1 Biomedical Data Stewardship: technologies for making biomedical resources FAIR
Jennie Larkin, PhD, Senior Advisor, Office of the Associate Director for Data Science (ADDS)
SciDataCon, September 12, 2016

2 Biomedical Data Stewardship: technologies for making biomedical resources FAIR
PART 1: Overview and Data Discovery (14:00-15:30)
  BD2K
  ELIXIR
  Data findability approaches and tools
  RDA IG New Paradigms for Data Discovery. Thursday 11:30
PART 2: Accessibility, Interoperability, & Reusability. FAIR in practice in biomedical research (16:00-17:30)
  Findability of interoperability standards
  Enabling Accessibility and Reusability
  FAIR Use Cases
  RDA 8th Plenary Joint meeting: IG ELIXIR Bridging Force, WG BioSharing Registry, WG Data Type Registries, WG Metadata Standards Catalog. Breakout 2, Thursday 14:00
  Joint meeting of IG ELIXIR Bridging Force, WG BioSharing Registry: The FAIR Data Principles - the concepts and exemplars implementations. Breakout 4, Friday 11:00

3 Science
A systematically organized body of knowledge (facts) on a particular subject.
A process of discovery that allows us to link isolated facts into a coherent and comprehensive understanding of the natural world.

4 Process: Rigor & Reproducibility
Cornerstones of Scientific Advancement
  Rigor in designing and performing scientific research
  Ability to reproduce biomedical research findings

5 Scientific output: theories, data, tools…
Cell 166, 1397–1410, September 8, 2016

6 Why Data Stewardship?
If we cannot get a handle on the scientific body of knowledge, how can we do science?
There is so much data, so many articles, so many theories, so many databases, so many computer programs and tools (Big Data)
that we are losing the “systematic organization of knowledge” that is the foundation of the scientific enterprise. (The problem that Big Data causes)
So we need help turning the morass of data back into a systematic organization of knowledge: Data Science & Data Stewardship. (How to address the problem that Big Data causes)

7 Data Stewardship Life Cycle
Life cycle stages: Data Generation, Data Publication and Sharing, Data Sustainability
  Data generated, cleaned, and analyzed
  Data annotated, curated, and formatted. Made FAIR
  Publication of papers and release of associated data sets
  FAIR data can be cited and easily accessed and re-used, as part of a data commons
  Data housed for long-term storage
  Ensuring high value data sets are maintained for future possible need
FAIR: Findable, Accessible, Interoperable, Reusable

8 Data Generation & Curation
High quality data arises from high quality experimental design and lab practices.
Need strong data curation practices and robust analyses.
BD2K supports training for data management and data science
BD2K supports tools/research for better metadata
[Life cycle graphic: Data Generation, Data Publication and Sharing, Data Sustainability]

9 Data Sharing & the Commons
Commons = a shared virtual space
  Contains digital research objects (data, software, methods, papers, etc.)
  Co-locates data and compute
  Conforms to FAIR principles:
    Findable
    Accessible (and usable)
    Interoperable
    Reusable
    And citable!
Vivien Bonazzi
[Life cycle graphic: Data Generation, Data Publication and Sharing, Data Sustainability]

10 The Commons Framework
Compute Platform: Cloud or HPC
Services: APIs, Containers, Indexing
Software: Services & Tools
  Scientific analysis tools/workflows
Data
  “Reference” Data Sets
  User defined data
Digital Object Compliance
App store/User Interface
Ensuring that high value data resources can be accessed, used, and cited

11 Mapping BD2K Activities to the Commons
Commons Framework layers:
  Compute Platform: Cloud or HPC
  Services: APIs, Containers, Indexing
  Software: Services & Tools
    Scientific analysis tools/workflows
  Data
    “Reference” Data Sets
    User defined data
  Digital Object Compliance
  App store/User Interface
Mapped activities: Indexing; NIH and community defined data sets; HMP; MODS; GDC; Cloud Credits

12 Sustaining the Big Data Ecosystem
Address how important resources should be identified and maintained in the long term.
Governance, review, and funding options
Allen Dearry
Breakout session 5: Sustainable Business Models for Data Repositories
[Life cycle graphic: Data Generation, Data Publication and Sharing, Data Sustainability]

13 Current BD2K Opportunities
Opportunity | RFA # | Closes
Big Data to Knowledge (BD2K) Community-Based Data and Metadata Standards Efforts (R24) | ES | 10/19/2016
Big Data to Knowledge (BD2K) Enhancing the Efficiency and Effectiveness of Digital Curation for Biomedical Big Data (U01) | LM | 12/15/2016
NIH Big Data to Knowledge (BD2K) Enhancing Diversity in Biomedical Data Science (R25) | MD | 11/14/2016
BD2K Research Education Curriculum Development: Data Science Overview for Biomedical Scientists (R25) | ES | 12/07/2016
BD2K Open Educational Resources for Skills Development in Biomedical Big Data Science (R25) | HG | 08/03/2017
NIH/BD2K Participation in the Joint NSF/NIH Initiative on Quantitative Approaches to Biomedical Big Data (QuBBD) | NOT-EB | 09/28/2016

14 Data Science at NIH
Jennie.Larkin@nih.gov
#BD2K

