Presentation on theme: "Confessions/Disclaimers Ontologies and REDfly CARO SO OBO Foundry."— Presentation transcript:
Confessions/Disclaimers Ontologies and REDfly CARO SO OBO Foundry
Database of known Drosophila cis-regulatory elements over 675 CRMs so far from >150 genes includes sequence and expression pattern includes links to other relevant databases soon will include transcription factor binding site data (FlyReg) soon will include images of expression patterns data exchange with ORegAnno in development REDfly Regulatory Element Database for Drosophila green fluorescent proteinminimal promoterCRM reporter construct make transgenic animal
Ontologies and REDfly
Our overall goal is to create a comprehensive source of sequence and expression pattern data for Drosophila transcriptional cis-regulatory modules within a completely interoperable, ontology-compliant database framework. This framework can then be used as a model for managing CRM data from any other organism.
The OBO Foundry Barry Smith University at Buffalo Ontology: A Vision for the Future and Its Realization
new undergoing rigorous reform GO Gene Ontology ChEBI Chemical Ontology CL Cell Ontology FMA Foundational Model of Anatomy PaTO Phenotype Quality Ontology SO Sequence Ontology CARO Common Anatomy Reference Ontology CTO Clinical Trial Ontology FuGO OBI Ontology of Biomedical Investigation PrO Protein Ontology RnaO RNA Ontology RO Relation Ontology
CARO/Drosophila anatomy ontology OBIGO organism/species ontology CRM type (promoter, silencer, etc.) SO Ontologies and REDfly
Ontologies and REDfly: anatomy
Ontologies and REDfly: GO
Ontologies and REDfly: complex queries mus musculus evx1 central nervous system
CARO: Common Anatomy Reference Ontology
The main focus of this workshop is to pave the way for interoperability between the anatomical ontologies developed for various organisms (including human) by agreeing on shared methodologies for building our respective ontologies. 1. a list of relations (especially part_of) used within anatomical anatomies, including definitions and rules for consistent use within anatomy ontologies; 2. a list of major organizational units of biological organisms at all levels of granular partitions (e.g. biological macromolecule, cell, organ); 3. a representation of developmental stages of organisms; are anatomy and development two separate or one single integrated ontology? If separate what are the relations between them and how should they be applied; 4. a method that allows automated reasoners to recognize homologous anatomical structures of different species. In order to meet our objectives, we need to create a common anatomy reference ontology (CARO) designed to ensure interoperability of the anatomy ontologies developed for specific organisms. This common ontology will comprehend both top-level categories and a common set of relations to be used within anatomical ontologies; CARO will be embedded in a set of principles for constructing anatomy ontologies for different organisms at different developmental stages.
CARO: Common Anatomy Reference Ontology
CARO: Common Anatomy Reference Ontology Committed or likely to commit: zebrafish (ZFIN) Drosophila (FlyBase) human (FMA) amphibians cyprinoforme fishes hymenoptera (bees, wasps, ants) mouse (MGI) Dictyostelium (dictyBase) C. elegans (WormBase)
SO: Sequence Ontology
The Sequence Ontology is a set of terms and relationships used to describe the features and attributes of biological sequence. It encompasses both "raw" features, such as nucleotide similarity hits, and interpretations such as gene models. It also provides a rich set of attributes to describe these features such as "polycistronic" and "maternally imprinted". REDfly : CRM, TFBS ORegAnno: Regulatory haplotype, regulatory polymorphism, regulatory region, TFBS
SO: Sequence Ontology REDfly : CRM, TFBS ORegAnno: Regulatory haplotype, regulatory polymorphism, regulatory region, TFBS Obi asks: Discussion item: Should further sub- categorization of regulatory regions be allowed (e.g. Silencer, enhancer, locus-control region, etc) I say: not only “should” but need to be present
SO: Some problems A cis-acting sequence that increases the utilization of (some) eukaryotic promoters, and can function in either orientation and in any location (upstream or downstream) relative to the promoter. A regulatory_region where more than 1 TF_binding_site together are regulatorily active. Synonyms: CRM A region of a molecule that binds to a transcription factor.
A regulatory_region where more than 1 TF_binding_site together are regulatorily active. Synonyms: CRM is_a TF_module enhancer SO: Some problems
SO: More problems? The region on a DNA molecule involved in RNA polymerase binding to initiate transcription. A region of sequence which is part of a promotor. partial listing of known core promoter motifs
SO: More problems? Combination of short DNA sequence elements which suppress the transcription of an adjacent gene or genes. sequence elements short DNA “sequence elements” not defined in ontology A regulatory_region where more than 1 TF_binding_site together are regulatorily active. Synonyms: CRM is_a TF_module? silencer
SO: Some problems A DNA region that includes DNAse hypersensitive sites located 5' to a gene that confers the high-level, position-independent, and copy number-dependent expression to that gene. is_a TF_module LCR A TF_module that regulates the activity of more than one gene within a defined locus
What terms are required to adequately capture cis-regulatory information? Yuh and Davidson (1998) “Composite_CRM” ? Please provide suggestions/examples/cases to me the RegCreative Wiki, or via the SO mailing list CRM_part?
What terms are required to adequately capture cis-regulatory information? What additional terms & definitions do we need to add and what needs to be changed? Please provide suggestions/examples/cases to me the RegCreative Wiki, or the SO mailing list