You say potato, I say potahto: Ontological engineering applied within the Biomedical Informatics Research Network J. A. Turner1, C. Fennema-Notestine2,

Slides:



Advertisements
Similar presentations
XCEDE Use Cases: QueryAtlas & other scenarios 2008 fBIRN AHM.
Advertisements

Data mining and neuroinformatics Finn Årup Nielsen Technical University of Denmark Department of Informatics and Mathematical Modelling Lyngby, Denmark.
2005 All Hands Meeting Multi-Site Alzheimer’s Disease Project a.k.a. “MAD” Project Leaders: C. Fennema-Notestine, R. Gollub, B. Dickerson.
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
SCHOOL OF INFORMATION UNIVERSITY OF MICHIGAN Biomedical Informatics Research Network.
Organizing Data & Information
Michael Marron, Ph.D., Director Division of Biomedical Technology National Center for Research Resources National Institutes of Health Department of Health.
Information Retrieval in Distributed Environments Based on Context- Aware, Proactive Documents Current Research Information Systems (CRIS 2002) August.
Image Query (IQ) Project Update Building queries one question mark at a time March, 2009.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
10 December, 2013 Katrin Heinze, Bundesbank CEN/WS XBRL CWA1: DPM Meta model CWA1Page 1.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Database Environment 1.  Purpose of three-level database architecture.  Contents of external, conceptual, and internal levels.  Purpose of external/conceptual.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
All Hands Meeting 2005 Human Morphometry and Function BIRN Testbeds Christine Fennema-Notestine, Ph.D. Jessica Turner, Ph.D. CBiO/BIRN Workshop 2006.
BIRN Update Carl Kesselman Professor of Industrial and Systems Engineering Information Sciences Institute Fellow Viterbi School of Engineering University.
Morphometry BIRN Bruce Rosen, M.D. Ph.D.. Scientific Goal Methods –Multi-site MRI calibration, acquisition –Integrate advanced image analysis and visualization.
Ongoing BIRN-GCRC Collaborations Medical College Wisconsin (non BIRN site) –Functional MRI acquisition calibration University of Texas (non BIRN site)
Data Management David Nathan & Peter Austin & Robert Munro.
Configuration Management (CM)
Page 1 Informatics Pilot Project EDRN Knowledge System Working Group San Antonio, Texas January 21, 2001 Steve Hughes Thuy Tran Dan Crichton Jet Propulsion.
Biomedical Informatics Research Network BIRN Overview: Building the Biomedical Informatics Research Network October 9, nd Annual All Hands Meeting.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 5-1 Chapter 5 Business Intelligence: Data.
Mark Ellisman, Ph.D. Professor of Neurosciences and Bioengineering Director, BIRN Coordinating Center Center for Research on Biological Systems University.
Lecture2: Database Environment Prepared by L. Nouf Almujally & Aisha AlArfaj 1 Ref. Chapter2 College of Computer and Information Sciences - Information.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Atlas Interoperablity I & II: progress to date, requirements gathering Session I: 8:30 – 10am Session II: 10:15 – 12pm.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
BIOMEDICAL INFORMATICS RESEARCH NETWORK: FUNCTIONAL IMAGING RESEARCH IN SCHIZOPHRENIA TESTBED S.G. Potkin 1 ; J.A. Turner 1 *; G.G. Brown 2 ; G.H. Glover.
BIRN Advantages in Morphometry  Standards for Data Management / Curation File Formats, Database Interfaces, User Interfaces  Uniform Acquisition and.
Clinical Measures Genotype Local Storage BIRN Rack SRB MCAT HID/ XNAT/ LONI DUP Calibration & Analysis Tools GRID Portal Mediator Institution A BIRN Rack.
Databases Shortfalls of file management systems Structure of a database Database administration Database Management system Hierarchical Databases Network.
Standardization Challenges Neuroimaging datasets: FBIRN Human Imaging Database You say potato, I say potahto: Ontological engineering applied within the.
All Hands Meeting 2005 Morphometry BIRN - Overview - Scientific Achievements.
All Hands Meeting 2004, Boston Informatics Workshop.
2004 All Hands Meeting FBIRN 2005 – Database and Informatics Working Group David Keator.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Data resource management
Information Integration BIRN supports integration across complex data sources – Can process wide variety of structured & semi-structured sources (DBMS,
All Hands Meeting 2005 FBIRN Tools: 2005 Subtitle added here.
WELCOME FBIRN All Hands Meeting October 26 and 27, 2009 FBIRN AHM.
Neuroinformatics Working Group Update 10/26/2009 H Jeremy Bockholt.
2004 FBIRN Meeting FIRST BIRN: Challenges of Collaborative Multi- site, Multi-discipline Research Steven Potkin, MD Gary Glover, PhD Gregory McCarthy,
Jorge Jovicich, Ph.D. Massachusetts General Hospital - Harvard Medical School Biomedical Informatics Research Network Overview Testbeds Morphometry BIRN.
FBIRN Use Case: Data Storage and Retrieval. User Query Results with standard descriptions in HIDB Results Images in SRB FIPS Result s FMRI Images Automated.
1 Chapter 12 Configuration management This chapter is extracted from Sommerville’s slides. Text book chapter 29 1.
Atlas Interoperablity I & II: progress to date, requirements gathering Session I: 8:30 – 10am Session II: 10:15 – 12pm.
By 5 pm Tuesday  Each group delivers their action plan/milestones to achieve current goals Phase I publication plan and public access Phase II—Intersite.
Biomedical Informatics Research Network The BIRN Architecture: An Overview Jeffrey S. Grethe, BIRN-CC 10/9/02 BIRN All Hands Meeting 2002.
Function BIRN The ability to find a subject who may have participated in multiple experiments and had multiple assessments done is a critical component.
Contributions to mouse BIRN tools and resources Maryann Martone and Mark Ellisman University of California, San Diego 2008.
2005 All Hands Meeting Data & Data Integration Working Group Summary.
All Hands Meeting 2004 Clinician’s Requirements for HID Query and Statistics Interface Christine Fennema-Notestine, Ph.D. David Kennedy, Ph.D.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
2003 All Hands Meeting FBIRN Complete the Data Analysis  Calibration and statistics issues Assess the variability Global measures ROI measures.
© 2017 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner.
All Hands Meeting 2004 Ontologies for Data Mediation Christine Fennema-Notestine, Ph.D.
Virtual Laboratory Amsterdam L.O. (Bob) Hertzberger Computer Architecture and Parallel Systems Group Department of Computer Science Universiteit van Amsterdam.
NeurOn: Modeling Ontology for Neurosurgery
UCSD Neuron-Centered Database
Data challenges in the pharmaceutical industry
Chapter 2 Database Environment Pearson Education © 2009.
Chapter 2 Database Environment.
Data Base System Lecture : Database Environment
2. An overview of SDMX (What is SDMX? Part I)
Chaitali Gupta, Madhusudhan Govindaraju
Presentation transcript:

You say potato, I say potahto: Ontological engineering applied within the Biomedical Informatics Research Network J. A. Turner1, C. Fennema-Notestine2, M. E. Martone3, A. R. Laird4, J. S. Grethe3, W. Bug5, A. Gupta6, C. Bean7; 1Psychiatry and Human Behavior, Univ California, Irvine, Irvine, CA, 2Psychiatry, UCSD, La Jolla, CA, 3Neuroscience, UCSD, La Jolla, CA, 4Health Sciences Center, Univ Texas, San Antonio, TX, 5Anatomy, Drexel Univ. Coll of Med., Philadelphia, PA, 6Super Comp. Ctr., UCSD, La Jolla, CA, 7NCRR, NIH, Bethesda, MD The Biomedical Informatics Research Network (BIRN) is developing a federated infrastructure linking human and mouse imaging experiment databases. BrainMap is a repository of published human neuroimaging studies. Both efforts aim to enable the research community to formulate questions (e.g., “What are the cortical and subcortical dysfunctions related to negative symptom severity in schizophrenia?”) and retrieve relevant results from independent laboratories and networked data resources for further analysis. Figure. (left) The distributed nature of BIRN sites developing the data sharing and analysis infrastructure. Standardized terminologies and a shared semantic framework, which together define an ontology, are required to integrate and retrieve data from such distinct resources. Within BIRN we have identified several domains requiring standardized ontologies: 1) Assigning standardized geometric locations and relationships to neuroanatomical structures across both temporal and spatial scales (e.g., dendrite to whole brain images across disease progression and development) drives the need for a normalized neuroanatomical nomenclature and a mereotopological ontology integrated into a brain atlasing environment. 2) Translation of findings in animal disease models into a human clinical context requires capturing detailed imaging and experimental parameters; particularly in human studies, formal annotations of cognitive function measured in the clinic (e.g., measures of memory loss) and experimental measures of cognitive processes (e.g., working memory fMRI paradigms) must link to a common ontology. This presentation looks at #2 in more detail. Distributed data in the Biomedical Informatics Research Network Integration of different methods and measures Data Integration A user’s query to a federated database infrastructure requires integrating different kinds of data (left). The software architecture of such integration systems consists of a two-part middleware, called the wrapper and the mediator, that are between the information sources and the user. The wrapper converts the data from the respective information source to a form that the mediator can accept and manipulate. The mediator converts a user’s query into smaller sub-queries that are sent to each source, and integrates the results returned from each source. Example Query of Federated Database PET & fMRI Are chronic, but not first-onset patients, associated with superior temporal gyrus dysfunction? Integrated View Receptor Density BrainMap Web PubMed, Expasy, fMRIDC Wrapper Structure Wrapper Clinical Wrapper Mediator Figure (above) Understanding both the form and content of the different databases motivates the development of the ontologies. Examples and their implications are presented here. Neuroimaging Results: BrainMap BrainMap ( is an online database of published functional neuroimaging experiments in the form of stereotactic (x,y,z) coordinates of results. It is a tool to rapidly retrieve studies in specific research domains, such as language, memory, and attention. BrainMap also archives each paper’s associated meta-data: information on subjects, conditions, experimental paradigms, etc. BrainMap can be used to rapidly retrieve published studies in specific behavioral domains, such as language, memory, and attention (see Figure below). BrainMap has been in step-wise development at the Research Imaging Center in San Antonio since 1988 and in use since The structure of BrainMap data entry involves three levels of information: paper level (authorship, etc.), experiment level, and locations (coordinate) level. For the purposes of the BrainMap database, an experiment refers only to the comparison of two (or more) imaged conditions that result in a statistical parametric image (SPI). The locations level of information is the Talairach (x,y,z) coordinates (e.g., centers-of-mass of sites of activation) extracted from the SPIs and are entered explicitly into the database. Left: The current behavioral domains for BrainMap. Experiments are presented in the results of a behavioral domain, context, experimental paradigm, subject group, instructions, stimulus, etc. Right: Example results of a BrainMap query, showing the location of significant activation loci from many published experiments. Neuroimaging datasets: FBIRN Human Imaging Database The BIRN XCEDE schema was developed to share imaging data of diverse formats from different scanning sites. It provides an extensive metadata hierarchy for describing and documenting the technical details of human imaging studies. The XML schema was organized in correspondence with the BIRN Human Imaging Database schema (HID). This allows for an interchangeable source-sink relationship between the database and the XML files, which live with the actual data files on the Shared Resource Broker (SRB). Meta-data about the images and experimental paradigms MUST be included, and transparent to the general research community. The FBIRN informatics infrastructure is shown below: in 1), imaging data is collected, described and uploaded. In 2), clinical information regarding the subject is included. In 3), users query the system to find datasets around the country; and in 4), standardized analyses are run on the selected data and results are stored in the SRB/HID system. The FBIRN Human Imaging Database is built to store information regarding each subject’s data from many different studies. It includes demographic information (age, gender, handedness), clinical measures, and visit information, as well as links to the functional and structural imaging data on a separate system (Storage Resource Broker, SRB). This has led to the identification of the following concept domains which require standardized terminologies and must be represented in an ontology to include relationships between concepts: 1.Clinical measures 2.Cognitive Taxonomies 3.Cognitive task descriptions 4.Scanning parameters 5.Neuroanatomical nomenclature HID SRB fMRI Scanner Results with standard descriptions in HID Results Images in SRB FIPS Results FSL AutomatedFSL Automated Analysis Pipeline (FIPS) Query Clinical/Other Data entry Other users querying the federated database system Automated Image Upload to SRB/HID for sharing QC/QA Automatic Analysis XML Wrapped Images Scan Metadata/descriptions A potential user query for the federated HID and BrainMap system is: “Find all the right handed, ‘positive’ symptom schizophrenic subjects with fMRI data from a working memory task.” This involves determining Clinical aspects: What clinical assessments measure positive schizophrenia symptoms? Cognitive taxonomies: Which tasks are ‘working memory tasks’? Demographics: We need to be able to find the tables in the database which contain: - Age - Gender - Handedness - Diagnosis, etc. Scanning parameters: - Type of scan: structural, functional - Other imaging parameters, e.g.: TR, TE, Number of slices? (whole brain or single-slab?), Slice thickness/gap thickness, Slice acquisition order (interleaved or serial), for analysis and understanding the results. Brain Cerebrum Temporal Mesial temporal Hippocampus Cerebral cortex CVLT Task and score description Frontal Cognitive impairment Cognition Assessment Neuropsychology Amnesia Memory Learning Figure (left) shows an example of a potential clinical assessment concept hierarchy with a link to the relevant neuroanatomical concept hierarchy. The California Verbal Learning Test (CVLT) is an assessment of cognitive impairment in learning and memory, which is associated with hippocampal function. Concept Domains involved in a Use Case Standardization Challenges Data Integration Progress Find all the right-handed, ‘positive’ symptom schizophrenic subjects with 3T fMRI data from a working memory task. Challenge 0: Different information in different databases. This query when sent to the HID will return individual subjects’ datasets, with sufficient meta-data to be able to combine the datasets into an analysis. When sent to BrainMap, it will return results of studies that fit these criteria. BrainMap results HID results Challenge 1: Identifying subject characteristics in a standard way. Diagnosis may be the result of a SCID measurement or stored in a “diagnosis” field. Each database has to identify where this information is. This query could just as well be “negative symptom” or “depressed subjects” or any subset of the clinical measures. Which clinical scales measure which aspects of the clinical symptoms must be explicitly identified. This also requires updating, as new concepts regarding the disease are developed. A specific example: “Handedness” in BrainMap refers to the group of subjects in a study— “right handed” means only right handed subjects were included, “mixed” means the subject group included both left and right handed subjects. In the HID datasets, “handedness” refers to the individual subject’s characteristic— “mixed” means the subject showed neither left nor right-handed dominance. Challenge 2: Identifying experimental equipment and collection parameters. A key distinction is between the abstract description of the data collection method and the instantiation of it. The HID is built to have “lab book” characteristics, including timing mistakes and information regarding missing data, etc. Challenge 3: Standardized experimental paradigm descriptions. Integrated View Designer Integrating views of results require addressing the challenges noted above across different database schema. Figure (left): Successful query for data from several projects regarding the neostriatum, from the Cell Centered Database (ccdb.uscd.edu). Categorizations such as “working memory paradigm” or “attention paradigm”, while likely to be the first thing users want in their queries, are likely to be neither consistent nor constant over time. What was considered a simple attention paradigm two years ago may now be understood to be far more complex, for example, as the theories of cognitive processes evolve. Therefore, a much less hierarchical approach is being taken, in which paradigms are defined by their physical attributes. Links from paradigms to behavioral domains will need to be more flexible, so that “working memory paradigms” can be redefined as needed. Figure (right above): An example working memory task, the Serial Item Paradigm, is a behavioral paradigm with a particular definition that distinguishes from the N-back or other paradigms. 123