RightField The Semantic Annotation of Experimental Data using Spreadsheets, The Semantic Annotation of Experimental Data using Spreadsheets, Katy Wolstencroft,

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

1 ICS-FORTH & Univ. of Crete SeLene November 15, 2002 A View Definition Language for the Semantic Web Maganaraki Aimilia.
The use of Ontology in Organising and Managing Protein Family Resources Katy Wolstencroft, University Of Manchester.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
An Introduction to RDF(S) and a Quick Tour of OWL
Microsoft Excel 2003 Illustrated Complete Excel Files and Incorporating Web Information Sharing.
SysMO-DB: Towards “just enough” data exchange for the SysMO Consortium Katy Wolstencroft, University of Manchester, UK.
SysMO-DB: Towards “just enough” data exchange for the SysMO Consortium Stuart Owen, University of Manchester.
GOAT: The Gene Ontology Annotation Tool Dr. Mike Bada Department of Computer Science University of Manchester
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Mapping Physical Formats to Logical Models to Extract Data and Metadata Tara Talbott IPAW ‘06.
Tutorial 11: Connecting to External Data
RightField Rich Annotation of Experimental Biology through Stealth Using Spreadsheets Katy Wolstencroft, Stuart Owen, Matthew Horridge, Olga Krebs, Wolfgang.
COMPREHENSIVE Excel Tutorial 8 Developing an Excel Application.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall.1 Exploring Microsoft Office Excel Copyright © 2008 Prentice-Hall. All rights.
Provenance in my Grid Jun Zhao School of Computer Science The University of Manchester, U.K. 21 October, 2004.
Linking Disparate Datasets of the Earth Sciences with the SemantEco Annotator Session: Managing Ecological Data for Effective Use and Reuse Patrice Seyed.
Cytoscape A powerful bioinformatic tool Mathieu Michaud
Computational Biology and Informatics Laboratory Development of an Application Ontology for Beta Cell Genomics Based On the Ontology for Biomedical Investigations.
Copyright © 2008 Pearson Prentice Hall. All rights reserved. 1 1 Copyright © 2008 Prentice-Hall. All rights reserved. Committed to Shaping the Next Generation.
Taverna in e-Lico  e-Lico is an EU Project ( ) to create a virtual laboratory for data mining and data-intensive sciences  Main partners: –University.
Managing Information Quality in e-Science using Semantic Web technology Alun Preece, Binling Jin, Edoardo Pignotti Department of Computing Science, University.
The MGED Society Facilitating Data Sharing and Integration with Standards CTSA Omics Data Standards Working Group Chris Stoeckert Dept. of Genetics and.
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
Designing, Executing, Reusing and Sharing Workflows: Taverna and myExperiment Supporting the in silico Experiment Life Cycle Katy Wolstencroft Paul Fisher.
SysMO-DB: Just Enough Exchange for Systems Biology Data and Models Carole Goble, Katy Wolstencroft, Stuart Owen, Sergejs Aleksejevs - University of Manchester.
RightField: Semantic Enrichment of Systems Biology Data using Spreadsheets Katy Wolstencroft myGrid, SysMO-DB University of Manchester.
ONTOLOGY ENGINEERING Lab #1 - August 25, Lab Syllabus 2  Lab 1 – 8/25: Introduction and Overview of Protégé  Lab 2 – 9/8: Building an ontology.
SysMo-DB: Towards “just enough” data exchange for the SysMO Consortium Carole Goble, Uni of Manchester, UK Jacky Snoep, Uni of Manchester, UK / Stellenbosch,
Data-driven research with e-Laboratories Stuart Owen University of Manchester
Spreadsheets to OWL with Populous 8/12/2011 Mikel Egaña Aranguren 3205 School of Computer Science Universidad Politécnica de Madrid (UPM) Boadilla.
Teranode Tools and Platform for Pathway Analysis Michael Kellen, Solution Manager June 16, 2006.
1 CA202 Spreadsheet Application Publishing Information on the Web Lecture # 15 Dammam Community College.
EADGENE and SABRE Post-Analyses Workshop 12-14th November 2008, Lelystad, Netherlands 1 François Moreews SIGENAE, INRA, Rennes Cytoscape.
Copyright OpenHelix. No use or reproduction without express written consent1.
SysMO-DB: Sharing and Exchanging Data and Models in Systems Biology Katy Wolstencroft University of Manchester.
Taverna Workflows for Systems Biology Katy Wolstencroft School of Computer Science University of Manchester.
The Functional Genomics Experiment Object Model (FuGE) Andrew Jones, School of Computer Science, University of Manchester MGED Society.
Semantic Technologies and Application to Climate Data M. Benno Blumenthal IRI/Columbia University CDW /04-01.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
PROGNOCHIP-BASE, FORTH-ICS 1 PrognoChip-BASE: An Information System for the Management of Spotted DNA MicroArray Experiments Extension of BASE v
WP2: ONTOLOGY ENRICHMENT METHODOLOGIES Carole Goble (IMG) Robert Stevens (BHIG) Mikel Egaña Aranguren (BHIG) Manchester University Computer Science: IMG:
Johannes Griss PSI Meeting Heidelberg, April 2011 EBI is an Outstation of the European Molecular Biology Laboratory. mzTab Proposal for.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
SysMO-DB and ISA Katy Wolstencroft, University of Manchester, UK.
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
MyGrid/Taverna Provenance Daniele Turi University of Manchester OMII f2f Meeting, London, 19-20/4/06.
E-LICO An e-Laboratory for Interdisciplinary Collaborative research in data mining and data intensive sciences October 12 th, 2010 Delivering data mining.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Linking Models & Data within the ISA structure Stuart Owen (based upon notes by Olga Krebs).
Chapter 2: Excel Basics and Formatting Spreadsheet-Based Decision Support Systems Prof. Name Position (123) University Name.
Office 2003 Introductory Concepts and Techniques M i c r o s o f t Excel Project 1 Creating a Worksheet and an Embedded Chart.
Workshop: Linking Models and Data in SysMO Katy Wolstencroft, SysMO-DB University of Manchester, UK.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
Lessons learned from Semantic Wiki Jie Bao and Li Ding June 19, 2008.
Microsoft Excel Prepared by the Academic Faculty Members of IT.
Ontology Engineering Ron Rudnicki Lab #1 - August 26, 2013.
BioUML – integrated platform for building virtual cell and virtual physiological human Fedor Kolpakov 1,2, Nikita Tolstykh 1,2, Elena Kutumova 1,2, Ilya.
High throughput biology data management and data intensive computing drivers George Michaels.
Describing and Annotating Experimental Data: Hands On.
EBI is an Outstation of the European Molecular Biology Laboratory. Semantic Interoperability Framework Sarala M. Wimalaratne (RICORDO project)
Excel Tutorial 8 Developing an Excel Application
Scientific Reproducibility using the Provenance for Healthcare and Clinical Research Framework Satya S. Sahoo Collaborators/Co-Authors: Joshua Valdez,
OBO Foundry Principles
Overview Gene Ontology Introduction Biological network data
Let’s Learn About Spreadsheets
Taxonomy of public services
Presentation transcript:

RightField The Semantic Annotation of Experimental Data using Spreadsheets, The Semantic Annotation of Experimental Data using Spreadsheets, Katy Wolstencroft, Stuart Owen, Matthew Horridge, Olga Krebs, Wolfgang Mueller Carole Goble

RightField A tool for embedding ranges of ontology terms into spreadsheets to allow the users of those spreadsheets to add semantic annotations from simple drop-down lists

RightField A tool for embedding ranges of ontology terms into spreadsheets to allow the users of those spreadsheets to add semantic annotations from simple drop-down lists Why? Makes annotation quicker and more efficient Standardises annotation Hides the ontology complexity from the users

Describe experiments and results of experiments Minimal Information Models Guidelines, Checklists, vocabularies Managing Biological Data Necessary for publication, submission to public databases and sharing

Describe experiments and results of experiments Minimal Information Models Guidelines, Checklists, Managing Biological Data MIACAMIACA Minimal Information About a Cellular Assay MIAMEMIAME Minimum Information About a Microarray Experiment MIAPEMIAPE Minimum Information About a Proteomics Experiment MIAREMIARE Minimum Information About a RNAi Experiment MIASEMIASE Minimum Information About a Simulation Experiment MIBBI >30

Describe experiments and results of experiments Ontologies and Vocabularies for Annotation Managing Biological Data Gene Ontology ChEBI MGED SBO BioPortal >270 biomedical ontologies

Data MIBBI ModelOntologies Microarray MIAME:Minimum Information about a Microarray Experiment MGED Proteomics MIAPE: Minimum Information about a Proteomics Experiment PSI-MI, PSI-MS, PSI-MOD Interaction experiments MIMIX:Minimum Information about a Molecular Interaction Experiment PSI-MI Protein-Protein Interaction Systems Biology Models MIRIAM:Minimal Information Required In the Annotation of biochemical Models SBO: Systems Biology Ontology Systems Biology Model Simulation MIASE:Minimum Information About a Simulation Experiment KISAO:Kinetic Simulation Algorithm Ontology

SysMO: Systems Biology of Micro- Organisms SysMO Consortium Pan-European consortium > 100 research groups > 320 scientists Distributed, interdisciplinary projects Expected to pool data and results and disseminate Microbiologists, molecular biologists, biochemists, mathematicians....not many informaticians SysMO-DB SysMO-SEEK – a platform for systems biology data sharing Web based environment for sharing in the consortium and disseminating to the community Used in other consortia: Virtual Liver, EraSysBio+, UNICELLSYS and more....

SOP Associating Experiments InvestigationStudyAssay Construction Validation SOP

SOP Data Templates and Vocabularies Construction Validation SOP Metabolomics Mass Spec Transcriptomics Proteomics Fluxomics

Fitting in with Laboratory practices Scientists can continue to do what they have always done Embedding semantics into the tools already in use Excel, excel, excel.....

Ontology terms for marked- up cells in drop-down boxes The End Result

Excel Workbook Ontology “Portion” of ontology terms Terms Embedded into Excel Workbook RightField Client How it Works Marked-up workbook Saved in plain Excel Informaticians/ontologists End Users

RightField Application

Loading Ontologies Published ontologies Multiple versions You can also load local ontologies from file or URL

Loading Ontologies

Excel workbook loaded into RightField with multiple worksheets

Class hierarchies of loaded ontologies

Term lists for selected cells Methods for specifying ontology terms Selected parent term from the ontology

Excel workbook with marked-up cells

Marking-up Columns or Rows

Ontology terms for marked- up cells in drop-down boxes The User View

Ontology Information Ontologies encapsulated Scientists can work offline Ensures same versions of ontologies used for a series of experiments No special macros or plugins required, just Excel or Open Office Versions and URIs captured in hidden worksheets Provenance Comparisons between sheets Linking back to the vocabularies

Provenance Term Label The human readable term label Term IRI The (unique) term identifier Ontology IRI Ontology Version The ontology that defines the term The version of the ontology Physical Location The (web) location of the ontology

RightField Technologies OWL API Loading ontologies and reasoning Apache POI HSSF libraries Loading and saving of Excel Spreadsheets Java Platform Independent

Ontology Languages RDFS - RDF Schema OBO - Open Biomedical Ontologies OWL - Web Ontology Language

RightField in Use SysMO – Systems Biology of MicroOrganisms E-Lico - a virtual laboratory for interdisciplinary collaborative research in data mining and data-intensive sciences. Case Studies in kidney research BioBanking in the Netherlands Outside Biology Oil and Gas industry Egyptology specimen classification

Populate Store / Reuse Extract RDF Graph Using RightField Spreadsheets

Future Developments Auto-complete Validation of annotation Populating ontology content - Populous

Populous Generic tool for populating ontology templates Supports validation at the point of data entry Expressive Pattern language for OWL Ontology generation Helps biologists with ontology design patterns Simon Jupp, Robert Stevens, University of Manchester

Availability Open source

Acknowledgements Stuart OwenKaty WolstencroftCarole Goble Wolfgang MuellerOlga Krebs Matthew Horridge