1 LS DAM Overview January 31, 2012 Core Team: Ian Fore, D.Phil., NCI CBIIT, Robert Freimuth, Ph.D., Mayo Clinic, Elaine Freund, Ph.D., 3 rd Millennium,

Slides:



Advertisements
Similar presentations
Omics Modeling 10/18/2011. Classes reviewed until last meeting – Person – BiologicEntity – Subject – SubjectIdentifier – Performer – ExperimentalStudy.
Advertisements

Clinical Genomics Domain Analysis Model Joyce Hernandez Mukesh Sharma.
HL7 Clinical Genomics and Structured Documents Work Groups CDA Implementation Guide: Genetic Testing Report DRAFT PROPOSAL Amnon Shabo (Shvo), PhD
Looking ahead: caGrid community requirements in the context of caGrid 2.0 Lawrence Brem 7 February 2011.
Systems Engineering in a System of Systems Context
 Goals Unambiguous description of how the investigation was performed Consistent annotation, powerful queries and data integration  Details NOT model.
Object-Oriented Analysis and Design
FuGO: Development of a Functional Genomics Ontology (FuGO) Patricia L. Whetzel 1, Helen Parkinson 2, Assunta-Susanna Sansone 2,Chris Taylor 2, and Christian.
Overview of Biomedical Informatics Rakesh Nagarajan.
L4-1-S1 UML Overview © M.E. Fayad SJSU -- CmpE Software Architectures Dr. M.E. Fayad, Professor Computer Engineering Department, Room #283I.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
1 ECCF Training 2.0 Introduction ECCF Training Working Group January 2011.
Overview Nano WG The National Cancer Institute (NCI) caBIG® Nanotechnology Working Group (Nano WG) Jessica M. Adamick 1, Nathan A. Baker 2, Alan R. Chappell.
1 FACS Data Management Workshop The Immunology Database and Analysis Portal (ImmPort) Perspective Bioinformatics Integration Support Contract (BISC) N01AI40076.
RDA Data Foundation and Terminology (DFT) IG: Introduction Prepared for RDA Plenary San Diego, March 9, 2015 Gary Berg-Cross, Raphael Ritz, Co-Chairs DFT.
Specimen-Related Classes in BRIDG BRIDG Overview for HL7 O&O WG Conference Call July 1, 2015 Wendy Ver Hoef NCI Contractor.
SC32 WG2 Metadata Standards Tutorial Metadata Registries and Big Data WG2 N1945 June 9, 2014 Beijing, China.
RDA Data Foundation and Terminology (DFT) IG: Introduction Prepared for RDA Plenary San Diego, March 9, 2015 Gary Berg-Cross, Raphael Ritz, Co-Chairs DFT.
© 2007 Open Grid Forum OGF Modeling Activities DMTF Alliance Partner Symposium Portland, 2007 July 18 Ellen Stokes
Support for MAGE-TAB in caArray 2.0 Overview and feedback MAGE-TAB Workshop January 24, 2008.
Managing Information Quality in e-Science using Semantic Web technology Alun Preece, Binling Jin, Edoardo Pignotti Department of Computing Science, University.
1 Common Challenges Across Scientific Disciplines Laurence Field CERN 18 th November 2013.
Department of Biomedical Informatics Service Oriented Bioscience Cluster at OSC Umit V. Catalyurek Associate Professor Dept. of Biomedical Informatics.
 BRIDG R3.0.2 was released in August 2010  The BRIDG Model passed the initial ISO Joint Initiative Council ballot as a Draft International Standard (DIS)
RDA Data Foundation and Terminology (DFT) IG: Introduction Prepared for RDA 6 th Plenary Paris, Sept. 25, 2015 Gary Berg-Cross, Raphael Ritz Co-Chairs.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Standards Analysis Summary vMR – Pros Designed for computability Compact Wire Format Aligned with HeD Efforts – Cons Limited Vendor Adoption thus far Represents.
Clinical Genomics Work Group (HL7) Mukesh Sharma Washington University in St. Louis June 16, 2011.
NIEM Domain Awareness June 2011 Establishing a Domain within NIEM.
Community Ontology Development Lessons from the Gene Ontology.
Integrated Biomedical Information for Better Health Workprogramme Call 4 IST Conference- Networking Session.
Wheat Data Interoperability. 2  Endorsed in March 2014  Focus:  Improve/reach semantic interoperability of Wheat data  The WG will focus first on.
Interoperability Framework Overview Health Information Technology (HIT) Standards Committee June 24, 2010 Presented by: Douglas Fridsma, MD, PhD Acting.
The Cancer Systems Biology Consortium (CSBC)
Clinical Genomics Joint with RCRIM Amnon Shabo Joyce Hernandez Mukesh Sharma.
DONNA MAGLOTT, PH.D. PRO AND MEDICAL GENETICS RESOURCES AT NCBI.
10/27/111 Longitudinal Care Work Group (LCWG) Proposal to Re-Scope and Re-Name the LTPAC WG.
Value Set Resolution: Build generalizable data normalization pipeline using LexEVS infrastructure resources Explore UIMA framework for implementing semantic.
CaNanoLab Users Group February 2012 Use of Informatics to Expedite and Validate the Application of Nanotechnology in Biomedicine.
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
1 LS DAM Overview and the Specimen Core February 16, 2012 Core Team: Ian Fore, D.Phil., NCI CBIIT, Robert Freimuth, Ph.D., Mayo Clinic, Elaine Freund,
1 ECCF Training Computationally Independent Model (CIM) ECCF Training Working Group March 2011.
The Functional Genomics Experiment Object Model (FuGE) Andrew Jones, School of Computer Science, University of Manchester MGED Society.
L6-S1 UML Overview 2003 SJSU -- CmpE Advanced Object-Oriented Analysis & Design Dr. M.E. Fayad, Professor Computer Engineering Department, Room #283I College.
Networking and Health Information Exchange Unit 5b Health Data Interchange Standards.
S&I Standards Organization Engagement & Communication Plan DRAFT Standards Support Team 1 September 2011.
1 SAIF-Effects on Data Service Specifications Baris Suzek Georgetown University Architecture/VCDE Joint Face-to-Face June,3, 2010 St. Louis, Missouri.
1 ECCF Training 2.0 Introduction ECCF Training Working Group January 2011.
- EVS Overview - Biomedical Terminology and Ontology Resources Frank Hartel, Ph.D. Director, Enterprise Vocabulary Services NCI Center for Bioinformatics.
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
12/7/2015Page 1 Service-enabling Biomedical Research Enterprise Chapter 5 B. Ramamurthy.
S&I Public Health Education Series: Data Provenance July 9th, 2014 Johnathan Coleman Initiative Coordinator – Data Provenance ONC/OCPO/OST (CTR)
BRIDG Imaging Project Nov. 25th, Agenda Project Goals & Objectives Imaging Projects of interest Rationale for aligning with BRIDG Principles on.
1 ICR IRWG Update Feb 16, Information Representation Working Group (IRWG) The caBIG Integrative Cancer Research (ICR) Information Representation.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
1 ECCF Training Computationally Independent Model (CIM) ECCF Training Working Group January 2011.
Standards Analysis Summary vMR – Pros Designed for computability Compact Wire Format Aligned with HeD Efforts – Cons Limited Vendor Adoption thus far Represents.
CaNanoLab Data Curation Overview NCI Nano WG June 6, 2013.
High throughput biology data management and data intensive computing drivers George Michaels.
Semantic Media Wiki Open Terminology Development - Initial Steps - Frank Hartel, Ph.D. Associate Director, Enterprise Vocabulary Services National Cancer.
Welcome to the caBIG Community! The cancer Biomedical Informatics Grid (caBIG ® ) offers more than 120 open source tools, technologies and infrastructure.
1 LS DAM Overview August 7, 2012 Current Core Team: Ian Fore, D.Phil., NCI CBIIT, Robert Freimuth, Ph.D., Mayo Clinic, Mervi Heiskanen, NCI-CBIIT, Joyce.
1 HL7 SAIF Enterprise Conformance and Compliance Framework (ECCF) Overview Baris E. Suzek Bob Freimuth VCDE Monthly Meeting December, 2010.
Information Representation Working Group WG Meeting September 5, 2008.
The Agricultural Ontology Server (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Food and Agriculture Organization.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
NCI CBIIT LIMS ISIG Meeting– July 2007 NCI CBIIT LIMS Consortium Interface SIG Mission: focus on an overall goal of providing a library of interfaces/adapters.
Semantic Web - caBIG Abstract: 21st century biomedical research is driven by massive amounts of data: automated technologies generate hundreds of.
OBI – Standard Semantic
Bird of Feather Session
Presentation transcript:

1 LS DAM Overview January 31, 2012 Core Team: Ian Fore, D.Phil., NCI CBIIT, Robert Freimuth, Ph.D., Mayo Clinic, Elaine Freund, Ph.D., 3 rd Millennium, Joyce Hernandez, Merck, Jason Hipp, M.D./Ph.D., University of Michigan, Jenny Kelley, M.A., NCI LPG, Juli Klemm, Ph.D., NCI-CBIIT, Jim McKusker, Yale, Lisa Schick, ScenPro, Inc., Mukesh Sharma, Ph.D., Washington University in St Louis, Grace Stafford, Ph.D., The Jackson Laboratory, Baris Suzek, M.S., Georgetown University

2 Agenda What is it? Informing resources and alignment with BRIDG LS DAM Core Components Examples of the LS DAM in use Publishing the LS DAM Life Sciences Ontology Pilot Subgroup (LS OPS) 2

3 Life Sciences Domain Analysis Model (LS DAM) – What is it? The LS DAM Project grew out of the desire to support interoperability across several NCI cancer Bioinformatics Grid (caBIG®) life sciences applications The project was initiated in February 2009 with NCI caBIG life sciences community as the initial stakeholder Collaboration with the HL7 Clinical Genomics Work Group (HL7 CG WG) began in June 2010 Project Goals To produce a shared view, in UML, of the static semantics of the domain of hypothesis-driven and discovery research at the organism, cell, and molecular level in order to facilitate human and computable semantic interoperability in that domain Maintain alignment of the LS DAM with BRIDG as it also evolves, in order to facilitate human and computable semantic interoperability across the translational research continuum 3

4 Life Sciences DAM Scope Statement Life Sciences research includes (i) in vivo, experiments (ii) ex vivo, in vitro or in situ experiments, and (iii) in silico experiments modeling and analyzing processes or other biological phenomena. The Life Sciences Domain Analysis Model (LS DAM) focuses on concepts important for conducting hypothesis driven and discovery science at the organismal, cellular and molecular level. The LS DAM includes and defines relationships between concepts that are central to specimen collection, processing and banking (human, model organism, cell lines, etc.), in vitro imaging, and molecular biology.

5 Informing Resources Many concepts drawn from well-established, community-vetted caBIG projects E.g., tissue banking related (caTissue), microarray data management (caArray), laboratory information management system (caLIMS), nanomaterial data sharing portal (caNanoLab) Standard reference models and data exchange formats have informed modeling decisions ISA-TAB, FuGE, MAGE, BRIDG Bound to the ISO data type standard Aligned with the Biomedical Research Integrated Domain Group (BRIDG) model on touchpoints Alignment is ongoing as each model continues to evolve Re-using concepts “owned” by BRIDG that apply to our domain 53 of the 129 LS DAM classes originate from BRIDG E.g., Person, Organization, Activity (and much of the hierarchy), Material

6 Alignment with BRIDG to support translational research Subject touch point supports traversing from LS DAM to BRIDG to support use cases as needed LS DAM includes distinct specialization of BRIDG.Subject to support the specific requirements around SpecimenCollectionPr otocolSubject Touch points can be leveraged in support of the translational research continuum

7 Example of coordination with the BRIDG SCC LS DAM has requirement to support concept of “equipment” Seemed aligned to BRIDG Device “Device” as class name didn’t suit domain well Definition and examples seemed to convey a more clinical nature Worked with BRIDG SCC to modify the definition and add additional examples so the single concept would suit both BRIDG and LS DAM needs LS DAM class name is Equipment, but, it is the BRIDG Device (some attributes constrained and added) 7 DEFINITION: An object intended for use whether alone or in combination for diagnostic, prevention, monitoring, therapeutic, scientific, and/or experimental purposes. EXAMPLE(S): tongue depressor, pacemaker, insulin pump, EKG machine, x-ray machine, mass spectrometer, polymerase chain reaction (PCR) machine, microscope, pH meter

8 Person, Organization, and their Roles Document Experiment Core Specimen Core Molecular Biology Core Molecular Databases Core Performed Activities (imaging, observations, etc) Material, Equipment LS DAM 2.2.2

9 LS DAM Experiment Core Component Conceptual representation of the process of experimentation in both hypothesis driven and discovery research in the Life Sciences domain Collaborative effort with HL7 CG WG to model common representation that can support interoperability around expanding array of ‘omics technologies Heavily informed by ISA-TAB and MAGE Includes concepts such as: ExperimentalStudy, Experiment, ExperimentalParameter, ExperimentalFactor

10 LS DAM Experiment Core 10

11 LS DAM Experiment Core - ExperimentalItem 11

12 LS DAM Molecular Biology Core Component Modeling of the molecular components of cells and organisms that will enable linking to genomic and proteomic annotations and experimental findings Heavily informed by ISA-TAB, MAGE-TAB, CGOMS, FuGE, dbSNP, EndNote, various public molecular databases Includes concepts such as Gene, Protein, NucleicAcidSequence, AminoAcidSequence, GeneticVariation

13 LS DAM Molecular Biology Core 13

14 LS DAM Molecular Databases Core Component Supports linking identifying information held in databases (such as Ensembl, UniProt, GenBank) to the molecular components defined in the molecular biology core Includes concepts such as GeneIdentifier, ProteinIdentifier, SNPIdentifier

15 LS DAM Molecular Databases Core 15

16 LS DAM Specimen Core Component Modeling of the collection, processing and storage of physical substances originally obtained from a biological entity Heavily informed by caBIG caTissue, caNanoLab Includes concepts such as Material, BiologicSpecimen, PeformedSpecimenCollection, PerformedSpecimenProcessing, SpecimenCollectionProtocol

17 LS DAM Specimen Core Component 17

18 Examples of LS DAM in use caBIG® Molecular Annotation Service (MAS) A resource for accessing molecular annotations from curated data sources Information model supporting this service derived from the LS DAM Requested support for annotation of genes and gene products in LS DAM HL7 CG WG Omics DAM A DAM to support genetic testing scenarios from contact with a participant through the collection of specimens, the molecular assays performed on the specimen or a derivative, and the lab reports generated as a result LS DAM leveraged as the backbone of the effort Several requests submitted, many addressed in next LS DAM release caBIG® caLIMS2 A laboratory information management system Leveraged several LS DAM concepts that are relevant to a laboratory environment while developing the project’s information model Requested support for several identified gap concepts in LS DAM nano-TAB An ISA-TAB based standard format for describing data related to investigations, nanomaterials, specimens, and assays in nanomedicine Has been mapped to LS DAM in an effort to align the two standards

19 LS DAM – Publishing Needs of the community drive each cycle LS DAM 1.0 – July 2009 LS DAM – May 2011 LS DAM – January 2012 Robust documentation set made publicly available on the LS DAM wiki ( with each iterationhttps://wiki.nci.nih.gov/x/cxRlAQ Release Summary, Change List, Mapping Document, etc. LS DAM User Guide Experiment Implementation Guide (IG) Provides guidance and an illustration of how the model can be instantiated to support a sample scenario Anticipate writing additional IGs as model evolves and user base grows Results of pilot investigation exploring the possibility of an ontological representation made available in January 2012 (LS OPS)

20 Life Sciences Ontology Pilot Subgroup (LS OPS) The LS DAM was developed and is published in UML as per CBIIT practices Interaction with external communities revealed that ontologies are frequently used to represent information (vs. UML) in the life sciences domain An ontological representation of the LS DAM would not only enable the LS DAM to leverage semantics from existing ontologies that are widely used within the life sciences community in general, but would also facilitate the adoption of the LS DAM by members of those ontology- based communities LS OPS was formed to explore the feasibility of creating an ontological representation from the LS DAM

21 LS OPS: Goals To explore the feasibility of expressing a portion of the LS DAM as an ontology that uses components from existing ontologies developed by other groups To continue engaging a variety of groups from the broader scientific community with the intent to form collaborations, exchange information regarding domain information models/ontologies, and identify areas of overlap and gaps among the various communities To identify opportunities to reuse components from existing ontologies to express and extend the semantics of the LS DAM

22 LS DAM: Ontological Representation Information Sources Dublin Core Ontology NCI Thesaurus NCBI Taxonomy Phenotypic Quality Ontology (PATO) W3C Provenance Ontology (PROV) Relations Ontology (RO) Open Biological and Biomedical Ontologies (OBO) Experimental Factors Ontology (EFO) Cell Type Ontology (CL) vcard Friend of a Friend (FOAF) 18 of 20 classes in the constrained model were represented by components from existing ontologies

23 LS OPS Conclusions Producing an ontological representation of the LS DAM would be a worthwhile effort By aligning ourselves with projects on the national level we are facilitating interoperability not only within the caBIG program but also across the entire research community There are a wealth of existing ontologies that are widely used in the life sciences community that could be leveraged to support the semantics represented in the LS DAM

24 LS DAM Constrained for LS OPS