Presentation is loading. Please wait.

Presentation is loading. Please wait.

Data, Data Everywhere, But Not a Byte to Eat Michael F. Huerta, Ph.D. Associate Director, National Library of Medicine Director, Office of Health Information.

Similar presentations


Presentation on theme: "Data, Data Everywhere, But Not a Byte to Eat Michael F. Huerta, Ph.D. Associate Director, National Library of Medicine Director, Office of Health Information."— Presentation transcript:

1 Data, Data Everywhere, But Not a Byte to Eat Michael F. Huerta, Ph.D. Associate Director, National Library of Medicine Director, Office of Health Information Programs Development BRDI/NAS 2/26/13

2 Biomedical Research Enterprise - Today Lots and lots of data – in individual labs Lots and lots of data – in individual labs

3 Biomedical Research Enterprise - Today Lots and lots of data – in individual labs Few data broadly available to research community Few data broadly available to research community  Exceptions: genomic, human subject autism, particular research initiatives (e.g., ADNI, Human Connectome Project)

4 Biomedical Research Enterprise - Today Lots and lots of data – in individual labs Few data broadly available to research community   Exceptions: genomic, human subject autism, particular research initiatives (e.g., ADNI, Human Connectome Project) For much of biomedical research enterprise For much of biomedical research enterprise  Major public products: concepts in scientific papers, not data  Biomedical research is concept-centric, not data-centric

5 Biomedical Research Enterprise - Tomorrow Liberated data - increase data sharing Liberated data - increase data sharing

6 Biomedical Research Enterprise - Tomorrow Liberated data - increase data sharing Advances in relevant data science and data tools Advances in relevant data science and data tools

7 Biomedical Research Enterprise - Tomorrow Liberated data - increase data sharing Advances in relevant data science and data tools Ways to make data Ways to make data  Discoverable  Useful to others  Citable  Linked to scientific literature

8 Biomedical Research Enterprise - Tomorrow Liberated data - increase data sharing Advances in relevant data science and data tools Ways to make data   Discoverable   Useful to others   Citable   Linked to scientific literature Greater prominence of data in science & scholarship Greater prominence of data in science & scholarship

9 TodayTomorrow

10 TodayTomorrow NIH Big Data to Knowledge Initiative for Research Data

11 NIH Big Data to Knowledge (BD2K) Data and Informatics Working Group (DIWG) of the Advisory Committee to the Director of NIH Data and Informatics Working Group (DIWG) of the Advisory Committee to the Director of NIH  D DeMets & L Tabak

12 NIH Big Data to Knowledge (BD2K) Data and Informatics Working Group (DIWG) of the Advisory Committee to the Director of NIH Data and Informatics Working Group (DIWG) of the Advisory Committee to the Director of NIH  D DeMets & L Tabak  Recommendations for NIH Research Data:

13 NIH Big Data to Knowledge (BD2K) Data and Informatics Working Group (DIWG) of the Advisory Committee to the Director of NIH Data and Informatics Working Group (DIWG) of the Advisory Committee to the Director of NIH  D DeMets & L Tabak  Recommendations for NIH Research Data:  Sharing & Standards  Tools  Workforce

14 NIH Big Data to Knowledge (BD2K) Data and Informatics Working Group (DIWG) of the Advisory Committee to the Director of NIH Data and Informatics Working Group (DIWG) of the Advisory Committee to the Director of NIH  D DeMets & L Tabak  Recommendations for NIH Research Data:  Sharing & Standards  Tools  Workforce Implementation Groups Implementation Groups  Eric Green (Acting Assoc Dir of NIH for Data Science)  Sharing & Standards M Huerta & J Larkin  Tools (Software Development) V Bonazzi & J Couch  Tools+ (Centers) L Brooks, M Huerta, P Lyster & B Seto  Workforce M Dunn

15 Sharing & Standards Policies to increase data sharing and change the culture Policies to increase data sharing and change the culture  Changes will liberate data

16 Sharing & Standards Policies to increase data sharing and change the culture   Changes will liberate data Frameworks for community-based standards efforts Frameworks for community-based standards efforts  Standards make data useful  Community-base promotes their use

17 Sharing & Standards Policies to increase data sharing and change the culture   Changes will liberate data Frameworks for community-based standards efforts   Standards make data useful   Community-base promotes their use Catalog of data set information  research ecosystem Catalog of data set information  research ecosystem  Discoverable, citable, and linked to the literature

18 Sharing & Standards Policies to increase data sharing and change the culture Policies to increase data sharing and change the culture  Changes will liberate data Frameworks for community-based standards efforts Frameworks for community-based standards efforts  Standards make data useful  Community-base promotes their use Catalog of data set information  research ecosystem Catalog of data set information  research ecosystem  Discoverable, citable, and linked to the literature Each adds value

19 Sharing & Standards Policies to increase data sharing and change the culture Policies to increase data sharing and change the culture  Changes will liberate data Frameworks for community-based standards efforts Frameworks for community-based standards efforts  Standards make data useful  Community-base promotes their use Catalog of data set information  research ecosystem Catalog of data set information  research ecosystem  Discoverable, citable, and linked to the literature Each adds value Synergy together

20 NIH Data Catalog: A Use Case

21 An NIH-funded investigator NIH Data Catalog: A Use Case

22 NIH Data Catalog Just before submitting a scientific paper to a journal, investigator uploads minimal info about the data set to the NIH Data Catalog

23 NIH Data Catalog Minimal info includes: Minimal info includes: -Authors proper credit for data -Data descriptors (controlled) efficient search -Data locus, availability & way to access sharing way to access sharing

24 NIH Data Catalog Minimal info includes: Minimal info includes: -Authors proper credit for data -Data descriptors (controlled) efficient search -Data locus, availability & way to access sharing way to access sharing Upload generates: Upload generates: -Data publication citation -Data Unique IDentifier (DUID)

25 NIH Data Catalog Data Unique IDentifier (DUID) is sent to the investigator

26 NIH Data Catalog Investigator submits manuscript to the scientific journal - with DUID in abstract & data publication cited in manuscript

27 NIH Data Catalog Journal paper is published & indexed in PubMed

28 NIH Data Catalog PubMed pulls DUID from abstract as a separate data element in the PubMed citation

29 NIH Data Catalog Data publication is also sent to PubMed for indexing

30 NIH Data Catalog PubMed also pulls DUID from data publication as an element of PubMed citation

31 NIH Data Catalog DUID now in PubMed citations of both the scientific publication & the data publication forming a 2-way link

32 NIH Data Catalog PubMed uses same data descriptors as data publication for indexing data publication

33 NIH Data Catalog Use of same controlled Terms in catalog and PubMed provides discoverability of info about data sets

34 NIH Data Catalog DUIDs, citations of data publications & scientific publications can be used in NIH administrative systems

35 Bringing Data into the Research Ecosystem

36 Data more available (policies) & useful (standards) Data more available (policies) & useful (standards)

37 Bringing Data into the Research Ecosystem Data more available (policies) & useful (standards) Data sets are discoverable: Data sets are discoverable:  Same descriptors of data sets used in data catalog are used as index and search terms in PubMed

38 Bringing Data into the Research Ecosystem Data more available (policies) & useful (standards) Data sets are discoverable:   Same descriptors of data sets used in data catalog are used as index and search terms in PubMed Data sets are citable: Data sets are citable:  NIH Data Catalog produces citable data publications  Citability + proper credit  incentives related to data

39 Bringing Data into the Research Ecosystem Data more available (policies) & useful (standards) Data sets are discoverable:   Same descriptors of data sets used in data catalog are used as index and search terms in PubMed Data sets are citable:   NIH Data Catalog produces citable data publications   Citability + proper credit  incentives related to data Data sets are linked with the literature Data sets are linked with the literature  Common search & retrieval approach for scientific publications and data publications through PubMed  Use of DUID for direct, two-way linkage

40 Bringing Data into the Research Ecosystem Data more available (policies) & useful (standards) Data sets are discoverable:   Same descriptors of data sets used in data catalog are used as index and search terms in PubMed Data sets are citable:   NIH Data Catalog produces citable data publications   Citability + proper credit  incentives related to data Data sets are linked with the literature   Common search & retrieval approach for scientific publications and data publications through PubMed   Use of DUID for direct, two-way linkage Information in ecosystem - use by NIH and 3 rd parties Information in ecosystem - use by NIH and 3 rd parties  Trend analysis, etc.

41

42


Download ppt "Data, Data Everywhere, But Not a Byte to Eat Michael F. Huerta, Ph.D. Associate Director, National Library of Medicine Director, Office of Health Information."

Similar presentations


Ads by Google