Generating Useful Information in Toxicogenomics: Focused Efforts: Microarray Standards Feb. 6, 2003, The National Academies Chris Stoeckert, Ph.D. Center.

Slides:



Advertisements
Similar presentations
Misha Kapushesky November 28, 2003 Expression Profiler: Next Generation.
Advertisements

The ArrayExpress Gene Expression Database: a Software Engineering and Implementation Perspective Ugis Sarkans European Bioinformatics Institute.
The MGED Ontology: Providing Descriptors for Microarray Data Trish Whetzel Department of Genetics Center for Bioinformatics University of Pennsylvania.
The MGED Ontology Workshop MGED 7 September 8, 2004 Chris Stoeckert Center for Bioinformatics & Dept. of Genetics University of Pennsylvania.
Visualisationmodule Catherine Leroy, Pierre Marguerite, Bhuwan Tiwari, Niran Abeygunawardena, Sergio Contrino, Anna Farne, Ele Holloway, Gaurab Mukherjee,
Presented by Amr Ali AL-Hossary (M.B.,B.Ch)
1 MAGE: Revised submission against LSR RFP-007 "Gene Expression" Ugis Sarkans, EBI Michael Miller, Rosetta Inpharmatics.
Basic Genomic Characteristic  AIM: to collect as much general information as possible about your gene: Nucleotide sequence Databases ○ NCBI GenBank ○
Time line and procedures for datasets BCBC Pre-retreat Workshop Tyson’s Corner, VA May 11, 2011.
Welcome to mini-symposium on ontologies for biological sample description EMBL-EBI Wellcome Trust Genome Campus Deceber 5, 2001.
The European Bioinformatics Institute ArrayExpress – a public database for microarray gene expression data Helen Parkinson Microarray Informatics Team.
 Goals Unambiguous description of how the investigation was performed Consistent annotation, powerful queries and data integration  Details NOT model.
FuGO: Development of a Functional Genomics Ontology (FuGO) Patricia L. Whetzel 1, Helen Parkinson 2, Assunta-Susanna Sansone 2,Chris Taylor 2, and Christian.
The MGED Ontology Is An Experimental Ontology Bio-Ontologies Aug 8, 2002 Chris Stoeckert, Helen Parkinson and the MGED Ontology Working Group.
MGED Ontology: An Ontology of Biomaterial Descriptions for Microarrays Microarray Data Analysis and Management: Bio-ontologies for Microarrays EMBL-EBI,
MIAME and Data Standards Phillip Lord. Why Standards? "However, there is a subtle implication that standardization (fixation) is a good thing". An anonymous.
The MGED Ontology: A framework for describing functional genomics experiments SOFG Nov. 19, 2002 Chris Stoeckert, Ph.D. Dept. of Genetics & Center for.
Data, data standards and sharing Dr Daniel Swan Bioinformatics Support Unit
Using ArrayExpress. ArrayExpress is an international public repository for well-annotated microarray data, including gene expression, comparative genomic.
GCB/CIS 535 Microarray Topics John Tobias November 15 th, 2004.
MARS: Microarray analysis, retrieval, and storage system Albert F. Cervantes.
OntologyEntry in MAGE Chris Stoeckert, Helen Parkinson Trish Whetzel, Joe White Gilberto Fragoso, Liju Fan, Mervi Heiskanen, Angel Pizarro Ontology Working.
1 ArrayExpress and MAGE Jamboree II Ugis Sarkans, EBI.
EMBL Outstation — The European Bioinformatics Institute MIAME and ArrayExpress - a standard for microarray data annotation and a database to store it Helen.
Gene expression services: ArrayExpress and the Gene Expression Atlas Contact: Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
The importance of meta data capture – problems and solutions Helen Parkinson Microarray Informatics Team European Bioinformatics Institute NERC Meta Data.
Publishing expression data from the SMD Catherine Ball Tuesday, May 30, 2006
Microrray Data Standardisation Microarray Gene Expression Database group -- MGED December, 2000.
The European Bioinformatics Institute MIAME and Ontologies for Sample Description Helen Parkinson Microarray Informatics Team European Bioinformatics Institute.
1 MAGE-OM and ArrayExpress database model Ugis Sarkans, EBI.
1 Update on ArrayExpress & standards Ugis Sarkans, EBI.
European Bioinformatics Institute MGED Society Establishing the infrastructure for sharing microarray data Alvis Brazma European Bioinformatics Institute.
The MGED Society Facilitating Data Sharing and Integration with Standards CTSA Omics Data Standards Working Group Chris Stoeckert Dept. of Genetics and.
Susanna-Assunta Sansone (Toxicogenomics project coordinator) Microarray Informatics Team EMBL- EBI (European Bioinformatics Institute) Transcriptome Symposium,
ILSI-HESI agreement with EBI: ArrayExpress, public repository for toxicogenomics data Susanna Assunta Sansone Microarray Informatics.
Test1 April 2004 Microarray Data Management Jianwei (Jerry) Li.
The Functional Genomics Experiment Model (FuGE) Andy Jones School of Computer Science and Faculty of Life Sciences, University of Manchester.
The European Bioinformatics Institute MGED ontology for consistent annotation of microarray experiments Manchester Bioinformatics Week Ontologies Workshop1.
1 MIAME The MIAME website: © 2002 Norman Morrison for Manchester Bioinformatics.
ArrayExpress – a public database for microarray gene expression data Helen Parkinson Microarray Informatics Team European Bioinformatics Institute MGED.
DESPRAD subproject Alvis Brazma EMBL-EBI Hinxton, October 20, 2003.
From MIAME to MAML: Microarray Gene Expression Database (MGED) Chris Stoeckert Center for Bioinformatics University of Pennsylvania Sept. 19, 2001 GE ^
1 maxdLoad The maxd website: © 2002 Norman Morrison for Manchester Bioinformatics.
MGED Ontology Working Group MGED4 Boston, MA Feb. 15, 2002 Chris Stoeckert, Center for Bioinformatics, U. Penn Helen Parkinson, EBI.
Content, Format, and Standards in Genomics Scale Data The ILSI – EBI Collaboration Wm. B. Mattes, PhD, DABT.
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
Agenda Intro: Information management in Biology Information management engineering Formats and standards XML MAGE example Perspectives: the Semantic Web.
The European Bioinformatics Institute MAGE-OM and ArrayExpress a brief introduction to the database model Helen Parkinson European Bioinformatics Institute.
EMBL-EBI EMBL-EBI EMBL-EBI What is the EBI's particular niche? Provides Core Biomolecular Resources in Europe –Nucleotide; genome, protein sequences,
ArrayExpress – a public database for microarray gene expression data Helen Parkinson Microarray Informatics Team European Bioinformatics Institute MGED.
MIAMExpress and the development of annotation ontologies for gene expression experiments Ele Holloway Microarray Informatics European Bioinformatics Institute.
The Functional Genomics Experiment Object Model (FuGE) Andrew Jones, School of Computer Science, University of Manchester MGED Society.
A plant-specific annotation and submission tool for the incorporation of Arabidopsis gene expression data into ArrayExpress, the EBI’s public DNA microarray.
RADical microarray data: standards, databases, and analysis Chris Stoeckert, Ph.D. University of Pennsylvania Yale Microarray Data Analysis Workshop December.
FuGE: A framework for developing standards for functional genomics Angel Pizarro Univesrity of Pennsylvania Andrew Jones University of Manchester.
Alvis Brazma, Johan Rung, Ugis Sarkans, Thomas Schlitt, Jaak Vilo European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge,
XML Standards for Proteomics Data Andrew Jones, Dr Jonathan Wastling and Dr Ela Hunt Department of Computing Science and the Institute of Biomedical and.
FuGE: A framework for developing standards for functional genomics Andrew Jones School of Computer Science, University of Manchester Metabomeeting 2.0.
TEMBLOR review meeting - EMBL-EBI, Hinxton, October 20 th 2003 Integration of J-Express with ArrayExpress Partner 20 University of Bergen Inge Jonassen.
1 Outline Standardization - necessary components –what information should be exchanged –how the information should be exchanged –common terms (ontologies)
The MGED Ontology W3C Workshop on Semantic Web for life Sciences October 27, 2004 Presented by Liju Fan MGED Ontology Working Group Senior Scientist, KEVRIC.
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
1 ArrayExpress Ugis Sarkans, EBI. 2 Overview Underlying standards –MIAME –MAGE* Data submission Data access –annotations –actual data –array design descriptions.
The European Bioinformatics Institute ArrayExpress – a public database for microarray gene expression data Helen Parkinson Microarray Informatics Team.
Introduction and Applications of Microarray Databases Chen-hsiung Chan Department of Computer Science and Information Engineering National Taiwan University.
EMBL- EBI Wellcome Trust Genome Campus Hinxton, Cambridge CB10 1SD, UK The BioInvestigation Index – Standards and Infrastructure for Omics Data Philippe.
ArrayExpress Ugis Sarkans EMBL - EBI
Using ArrayExpress.
From MIAME to MAML: Microarray Gene Expression Database (MGED)
Functional Genomics Consortium: NIDDK (Kaestner) and (Permutt)
Presentation transcript:

Generating Useful Information in Toxicogenomics: Focused Efforts: Microarray Standards Feb. 6, 2003, The National Academies Chris Stoeckert, Ph.D. Center for Bioinformatics University of Pennsylvania

Microarray Standards Are Needed To Facilitate Moving Data Around Stoeckert, Causton, & Ball Nature Genetics 2002

Standards will Aid Data Integration in Toxicogenomics Efforts EHP Toxicogenomics 2003

Microarray Standards Exist... …And You Are Being Asked To Use Them. … also being adopted by Cell and The Lancet - others to follow…

Microarray Standards Are A Good Thing Because… You won’t have to take people’s results on faith anymore. Get the full dataset and annotations in a form you can use. You will be able to get useful data from public repositories You won’t have to worry about some supplementary web site going away …but it is going to take some effort on your part to comply! Need to learn the standards and use them!

MGED Board of Directors Alvis Brazma - EBI Michael Miller - Rosetta Inpharmatics Paul Spellman - LBL Charles Troup - Agilent Technologies Barbara Jasny - Science Magazine Chris Stoeckert - University of Pennsylvania Gavin Sherlock - Stanford University Helen Causton - MRC Helen Parkinson - EBI Jason Goncalves - Iobion Jason Stewart - Open Informatics John Quackenbush - TIGR Martin Ringwald - MGI Pascal Hingamp - University of Marseilles Ronald Taylor - UCHSC Terry Gaasterland - Rockefeller Yoshio Tateno - NIG Catherine Ball - Stanford University

Goals of MGED Efforts Elucidate information needed for experiments MIAME Provide a means to share this information MAGE Provide a common language for experiments MGED Ontology Provide standard operating procedures for analysis Data transformation

MGED Working Groups MIAME: Defining data standards Minimal Information About a Microarray Experiment Checklist MAGE: Establishing data formats MicroArray Gene Expression Object model implemented as XML Ontologies: Creating ways to describe data Defined concepts and relationships Terms for annotating experiments Data Transformation: Describing methods of normalization and analysis

MIAME: Information Required to Interpret and Replicate Experimental Design Array Design Biological Samples Hybridizations Measurements Data Normalization and Transformation Brazma et al. Nature Genetics 2001

MIAME Checklist Available from:

Call to Deposit Microarray Experiments in Public Repository Who: Journals: Nature, Genome Res., others Granting agencies: NIH Where: ArrayExpress (EBI) GEO (NCBI) CIBEX (DDBJ, under development) How: MAGE MIAME-compliant systems: MIAMExpress, RAD See also:

MAGE-ML Should Be Used To Transfer Data MAGE-ML is automatically generated as an XML DTD from the MAGE Object Model (MAGE-OM) MAGE-OM represents a data driven model of microarray experiments MAGE was developed following OMG specifications (OMG is an industrial standards group) Spellman et al. Genome Biology 2002 Gene Expression RFP (OMG document) Look for latest GE file under OMG Model at: 076

The MAGE-ML Packages Audit and Security Description BQS BioSequence ArrayDesign * DesignElement Array BioEvent BioMaterial * BioAssay* BioAssayData Experiment * HigherLevelAnalysis * Measurement * Protocol QuantitationType * Correspond to MIAME sections. Note that MAGE is designed to hold MIAME and more.

MAGE UML Model for Experiment Package

MGED Core Ontology Provide defined terms for MAGE v 1.0 documents Controlled vocabularies Complex terms like Age (measurement, initial time point) External resources for terms Built from MGED Ontology v 1.6 Built with input by Ontology Working Group Ontology workshops at U Penn with EBI, Stanford, TIGR Will be extended. Improve limited structure of MAGE 1.0 Cover functional genomics in general Stoeckert & Parkinson, Comparative and Functional Genomics 2003

MGED Ontology is in DAML

ArrayExpress MIAMExpress RAD MAGE-ML data exchange Ontology instances propagated to submission/annotation web forms Curation of user defined terms, before inclusion in the ontology User defined terms collected via forms BioMaterialPackage BioMaterialCharacteristics Sex C C C C Gender documentation: Subclass of sex applicable to heterogametic species (i.e., those in which the sexes produce gametes of markedly different size). Males produce small numerous gametes. Females produce small numbers of large gametes. Hermaphrodites are individuals with both male and female characteristics. Mixed refers to a population of individuals with more than one type of gender. used in individuals: female, hermaphrodite,male,mixed_sex,unknown_sex

MGED Promotes Other Standards Efforts in Functional Genomics Standards and Ontologies in Functional Genomics (SOFG) A multitude of overlapping standards can contribute to the problem First meeting was held Nov at the Wellcome Trust Campus, Hinxton, UK Proteomics HUPO Proteomics Standards Initiative Protein-protein interactions Mass spectrometry

Join the MGED Effort in Microarray Standards Standards are driven by the communities who are willing to help develop them. Go to meetings where tutorials are held AAAS, Denver Feb, 2003 MGED 6, Aix-Provence Sept Join discussion lists Communicate with the people who are working on the standards Have the opportunity to contribute Visit

Thank you!

Active MAGE Community Exists MAGE discussion list Links from MAGE Jamborees and tutorials Programming Jamboree IV: Stanford Dec 6-10, 2002 MAGEstk (software tool kit) Freely available software collection (MIT open source license) Includes MAGE-ML Parsers for: Perl -- Software complete and functional Java -- Very nearly functional C months to completion Data annotation tool (MIAME Express)