The NIH Data Commons: A Cloud-based Training Environment Philip E. Bourne, Ph.D. FACMI Associate Director for Data Science National Institutes of Health.

Presentation transcript:

The NIH Data Commons: A Cloud-based Training Environment Philip E. Bourne, Ph.D. FACMI Associate Director for Data Science National Institutes of Health Slides adapted from Vivien Bonazzi

Agenda
Why cloud-based training is important to the NIH
What the NIH is doing to support it

The Data Commons is an NIH endorsed platform that fosters the development of a digital ecosystem

That digital ecosystem allows transactions to occur on FAIR data at scale

Data Commons is a platform that fosters development of a digital ecosystem
Treats products of research (data, software, methods, papers, training materials, etc.) as digital assets (objects)
Digital objects need to conform to FAIR principles: Findable, Accessible, Interoperable, Reproducible
Digital objects exist in a shared virtual space (initial): Find, Deposit, Manage, Share and Reuse digital assets
Enables interactions between producers and consumers of digital assets
Gives currency to digital assets and the people who develop and support them
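One way to picture "digital objects in a shared virtual space" is a small in-memory registry. The sketch below is illustrative only and is not the NIH Data Commons implementation or API: the names `DigitalObject`, `Commons`, `deposit`, and `find`, the metadata fields, and the rule for which fields are required are all assumptions made for this example.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class DigitalObject:
    """Hypothetical record for one research product (data, software, paper, ...)."""
    identifier: str      # persistent ID, so the object can be found
    kind: str            # e.g. "dataset", "software", "training-material"
    access_url: str      # where the object can be retrieved
    media_format: str    # declared format, so tools can interoperate
    license: str         # terms under which the object may be reused
    keywords: List[str] = field(default_factory=list)


class Commons:
    """Toy in-memory stand-in for the shared virtual space (assumption)."""

    # Fields this sketch treats as mandatory before an object is accepted.
    REQUIRED = ("identifier", "access_url", "media_format", "license")

    def __init__(self) -> None:
        self._objects: Dict[str, DigitalObject] = {}

    def deposit(self, obj: DigitalObject) -> None:
        """Store an object, rejecting it if required metadata is empty."""
        missing = [name for name in self.REQUIRED if not getattr(obj, name)]
        if missing:
            raise ValueError("missing metadata: " + ", ".join(missing))
        self._objects[obj.identifier] = obj

    def find(self, keyword: str) -> List[DigitalObject]:
        """Return deposited objects tagged with the keyword (case-insensitive)."""
        wanted = keyword.lower()
        return [
            obj for obj in self._objects.values()
            if wanted in (k.lower() for k in obj.keywords)
        ]
```

In this sketch a producer calls `deposit` and a consumer calls `find`; a real platform would add persistent storage, access control, and richer metadata standards.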

To understand the Data Commons Platform (and how it works for biomedical data) we need to use a Platform stack to help visualize the concept

NIH Data Commons - Platform Stack

Digital Market Place, Bazaar, Community (Sangeet Paul Choudary, Platform Scale)
Stack layers: Network/Community, Market Place, Technology, Data

NIH Data Commons Pilots

Current Data Commons Pilots
Reference Data Sets: Provide data objects to populate the Commons; make large and/or high-value NIH-funded data sets and tools accessible in the cloud
Commons Stack Pilots: Explore feasibility of the Commons Platform (FW); facilitate collaboration and interoperability
Cloud Credit Model: Provide access to cloud (IaaS) and PaaS/SaaS via credits; connect credits to NIH grants
Resource Search & Index: Develop data and software indexing methods; leverage BD2K efforts (bioCADDIE et al.); collaborate with external groups

Data Commons Pilot: connecting the pieces
Co-location of large and/or highly utilized NIH-funded data on the cloud, plus commonly used tools for analyzing and sharing digital objects, to create an interoperable resource for the research community. Investigators will be able to collaborate and share digital objects within this environment and connect with others.

Educational Opportunities

Strengthening a diverse biomedical workforce to utilize data science: BD2K funding of Short Courses and Open Educational Resources
Building a diverse workforce in biomedical data science: BD2K Training programs and Individual Career Awards
Fostering Collaborations: BD2K Training Coordination Center, NSF/NIH IDEAs Lab
Expanding NIH Data Science Workforce Development Center: Local courses, e.g. Software Carpentry
Discovery of Educational Resources: BD2K Training Coordination Center
Goal: To strengthen the ability of a diverse biomedical workforce to develop and benefit from data science

Thank you
ADDS Office: Vivien Bonazzi, Michelle Dunn, Jennie Larkin, Mark Guyer, Sonynka Ngosso
NCBI: George Komatsoulis
NHGRI: Valentina di Francesco
NIGMS: Susan Gregurick
CIT: Andrea Norris, Debbie Sinmao
NCI: Warren Kibbe, Tony Kerlavage, Tanja Davidsen, Ian Fore
NIAID: JJ McGowan, Nick Weber, Darrell Hurt, Maria Giovanni, Alison Yao
The NIH Common Fund: Betsy Wilder, Jim Anderson, Leslie Derr
Trans-NIH BD2K Executive Committee & Working Groups
Many biomedical researchers, cloud providers, IT professionals