Presentation is loading. Please wait.

Presentation is loading. Please wait.

The Gene Ontology Project: Content for the Semantic Web.

Similar presentations

Presentation on theme: "The Gene Ontology Project: Content for the Semantic Web."— Presentation transcript:

1 The Gene Ontology Project: Content for the Semantic Web

2 Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary terms (annotation) Develop tools: to query and modify the vocabularies and annotations annotation tools for curators GO Project Goals

3 GO provides two bodies of data: Terms with definitions and cross- references Gene product annotations with supporting data GO Data

4 Molecular Function elemental activity or task nuclease, DNA binding, transcription factor Biological Process broad objective or goal mitosis, signal transduction, metabolism Cellular Component location or complex nucleus, ribosome, origin recognition complex The Three Ontologies

5 DAG Structure Directed acyclic graph: each child may have one or more parents

6 is-a subclass; a is a type of b part-of physical part of (component) subprocess of (process) Relationship Types

7 Every path from a node back to the root must be biologically accurate The True Path Rule

8 ID Text string Definition with source Synonyms (optional) Cross-references (optional) GO Terms: Associated Data

9 Enzyme Commission (EC) Transport Commission (TC) University of Minnesota Biocatalysis/ Biodegradation Database (UM-BBD) MetaCyc GO Terms: Cross-References

10 Association between gene product and applicable GO terms Provided by member databases Made by manual or automated methods GO Annotation

11 Database object: gene or gene product GO term ID Reference publication or computational method Evidence supporting annotation GO Annotation: Data

12 DAG Structure Annotate to any level within DAG

13 Improve coverage: Developmental processes Physiological processes Relational database Support ontology development for additional domains of biology The Future of GO:

14 Names of gene products Protein domains Protein sequence features Phenotypes; diseases Anatomical terms (except as part of terms generated by cross-products) Terms outside the Scope of GO

15 Global Open Biology Ontologies Umbrella site for shared genomics and proteomics vocabularies Present incarnation: subdirectory within GO repository: The GOBO Proposal

16 FlyBase & Berkeley Drosophila Genome Project WormBase Saccharomyces Genome Database DictyBase Mouse Genome Informatics Compugen, Inc The Arabidopsis Information Resource Swiss-Prot/TrEMBL/InterPro Pathogen Sequencing Unit (Sanger Institute) PomBase (Sanger Institute) Rat Genome Database Genome Knowledge Base (CSHL) The Institute for Genomic Research The Gene Ontology Consortium is supported by NHGRI grant HG02273 (R01). The Gene Ontology project thanks AstraZeneca for financial support. The Stanford group acknowledges a gift from Incyte Genomics.

17 Conference: Standards and Ontologies for Functional Genomics (SOFG) Towards unified ontologies for describing biology and biomedicine 17 – 20 November 2002 Hinxton Hall Conference Centre Hinxton, Cambridge, UK

18 First Standards and Ontologies for Functional Genomics (SOFG) Keynote Speakers Ken Buetow, NCI, USA Win Hide, SANBI, South Africa Peter Karp, SRI International, USA 17-20 November 2002, Hinxton, UK

19 Aims and Objectives Bring together scientists developing standards and ontologies, both biologists, bioinformaticians and computer scientists

20 Topics Introduction to Ontologies Tools for building ontologies Go and related ontologies Species specific ontologies Implementation Inter-ontology mapping Ontologies for pathology, toxicology Chemical ontologies

21 Structure 3 keynote speakers ~20 invited talks 10 short talks selected from poster abstracts Panel discussion Parallel working groups/tutorials

22 Programme Committee Michael Ashburner, University of Cambridge, UK (Chair) Cathy Ball, Stanford University, USA Mike Bittner, NHGRI, USA Alvis Brazma, EMBL-EBI, UK Catherine Brooksbank, EMBL-EBI, UK Duncan Davidson, MRC HGU, Edinburgh, UK Liz Ford, EMBL-EBI, UK Midori Harris, EMBL-EBI, UK Victor Markowitz, Gene Logic, USA Helen Parkinson, EMBL-EBI, UK John Quackenbush, TIGR, USA Martin Ringwald, The Jackson Laboratories, USA Steffen Schulze-Kremer, RZPD, Germany Paul Spellman, U.C. Berkeley, USA Robert Stevens, University of Manchester, UK Chris Stoeckert, University of Pennsylvania, USA

23 URL

24 chitin metabolism: before revision The True Path Rule chitin biosynthesis cuticle synthesis chitin catabolism cell wall biosynthesis chitin metabolism

25 chitin metabolism: after revision The True Path Rule

26 chitin metabolism: after revision The True Path Rule cuticle synthesischitin metabolism cuticle chitin biosynthesis chitin biosynthesiscuticle chitin metabolism

27 Open source Can be instantiated in DAML+OIL or GO syntax Orthogonal Shared ID space Defined terms GOBO Criteria

28 hexose glucose fructose DAG Cross-Products metabolism biosynthesis catabolism hexose metabolism hexose biosynthesis glucose biosynthesis fructose biosynthesis hexose catabolism glucose catabolism fructose catabolism glucose metabolism... etc.

29 gene gene_attribute gene_structureSO gene_variationME gene_product gene_product_attribute molecular_functionGO protein_familyINTERPRO phenotype mutant phenotype anatomy For complete current draft see Some GOBO Ontologies

30 Not a way to unify biological databases Not a dictated standard Does not define evolutionary relationships Additional ontologies needed to model biology and experimentation What GO is NOT:

31 DAG Structure Annotate to any level within DAG mitosis S.c. NNF1 mitotic chromosome condensation S.c. BRN1, D.m. barren

32 Using GO Annotation: Example Workflow text

33 ID definition cross-reference synonyms

34 Using GO Annotation: Example Workflow

Download ppt "The Gene Ontology Project: Content for the Semantic Web."

Similar presentations

Ads by Google