EBI is an Outstation of the European Molecular Biology Laboratory. MAGE-TAB - The ArrayExpress Production Experience Helen Parkinson, PhD.

Slides:



Advertisements
Similar presentations
The ArrayExpress Gene Expression Database: a Software Engineering and Implementation Perspective Ugis Sarkans European Bioinformatics Institute.
Advertisements

The MGED Ontology: Providing Descriptors for Microarray Data Trish Whetzel Department of Genetics Center for Bioinformatics University of Pennsylvania.
Visualisationmodule Catherine Leroy, Pierre Marguerite, Bhuwan Tiwari, Niran Abeygunawardena, Sergio Contrino, Anna Farne, Ele Holloway, Gaurab Mukherjee,
 Goals Unambiguous description of how the investigation was performed Consistent annotation, powerful queries and data integration  Details NOT model.
Transcriptomics Patrick Kemmeren European Bioinformatics Institute Genomics Lab, UMC Utrecht.
Data, data standards and sharing Dr Daniel Swan Bioinformatics Support Unit
Using ArrayExpress. ArrayExpress is an international public repository for well-annotated microarray data, including gene expression, comparative genomic.
1 ArrayExpress and MAGE Jamboree II Ugis Sarkans, EBI.
ArrayExpress and Gene Expression Atlas: Mining Functional Genomics data Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
Gene expression services: ArrayExpress and the Gene Expression Atlas Contact: Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
ArrayExpress and Expression Atlas: Mining Functional Genomics data Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
The European Bioinformatics Institute MIAME and Ontologies for Sample Description Helen Parkinson Microarray Informatics Team European Bioinformatics Institute.
Viewing & Getting GO COST Functional Modeling Workshop April, Helsinki.
1 st (RSBI) ISA-Tab Workshop – Scope and Outcome  Tackle today's need for exchange of multi-omics experiments Evaluate the ISA-TAB straw-man (incomplete)
1 Update on ArrayExpress & standards Ugis Sarkans, EBI.
Support for MAGE-TAB in caArray 2.0 Overview and feedback MAGE-TAB Workshop January 24, 2008.
The MGED Society Facilitating Data Sharing and Integration with Standards CTSA Omics Data Standards Working Group Chris Stoeckert Dept. of Genetics and.
Test1 April 2004 Microarray Data Management Jianwei (Jerry) Li.
EBI is an Outstation of the European Molecular Biology Laboratory. EBI Bioinformatics Roadshow ILRI/BecA Nairobi Campus 2 nd - 3 rd March 2011.
The Functional Genomics Experiment Model (FuGE) Andy Jones School of Computer Science and Faculty of Life Sciences, University of Manchester.
Antje Rossmanith, Roche 14th German CDISC User Group, 25-Sep-2012
ArrayExpress and Gene Expression Atlas: Mining Functional Genomics data Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
Copyright OpenHelix. No use or reproduction without express written consent1.
ArrayExpress and Expression Atlas: Mining Functional Genomics data Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
1 Using caArray to Share Pre- Publishing Data Fan Lin Ph.D. Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT.
MIAMExpress development and local installation DESPRAD Meeting,November 2002 Mohammad shojatalab
The European Bioinformatics Institute MGED ontology for consistent annotation of microarray experiments Manchester Bioinformatics Week Ontologies Workshop1.
Abstract BarleyBase is a USDA-funded public repository for plant microarray data. BarleyBase houses raw and normalized expression data from the 22K Affymetrix.
Presentation on SubmissionTrackingTool: by Anjan Sharma.
1 MIAME The MIAME website: © 2002 Norman Morrison for Manchester Bioinformatics.
ArrayExpress – a public database for microarray gene expression data Helen Parkinson Microarray Informatics Team European Bioinformatics Institute MGED.
2 st ISA-TAB workshop Outcome/Summary (to date) Workshops on Data Standards (WODS) – EBI, Cambridge, UK 16 th, 17 th and 18 th June 2008 This workshop.
Ontologically Modeling Sample Variables in Gene Expression Data James Malone EBI, Cambridge, UK.
VectorBase Gene expression data in VectorBase Fotis Kafatos, George Christophides, Bob MacCallum & Seth Redmond Imperial College London (thanks also to.
EBI is an Outstation of the European Molecular Biology Laboratory. Anatomy ontology ArrayExpress Helen Parkinson,
The European Bioinformatics Institute Atlas of Gene Human Gene Expression Proposal - resources Alvis Brazma, Tom Freeman and Helen Parkinson.
Gene Expression Data Annotation – an application of the cell type ontology Helen Parkinson, PhD 19 May 2010.
Copyright OpenHelix. No use or reproduction without express written consent1.
1 maxdLoad The maxd website: © 2002 Norman Morrison for Manchester Bioinformatics.
MIAMExpress development October 2002 Mohammad shojatalab
Documentation NCRR Documentation for BioPSE/SCIRun and map3d All this great software and you want documentation too!?
The European Bioinformatics Institute MAGE-OM and ArrayExpress a brief introduction to the database model Helen Parkinson European Bioinformatics Institute.
EMBL- EBI Wellcome Trust Genome Campus Hinxton, Cambridge, CB10 1SD, UK Standards and infrastructure for managing experimental metadata Philippe Rocca-Serra,
ArrayExpress – a public database for microarray gene expression data Helen Parkinson Microarray Informatics Team European Bioinformatics Institute MGED.
MIAMExpress and the development of annotation ontologies for gene expression experiments Ele Holloway Microarray Informatics European Bioinformatics Institute.
The Functional Genomics Experiment Object Model (FuGE) Andrew Jones, School of Computer Science, University of Manchester MGED Society.
A plant-specific annotation and submission tool for the incorporation of Arabidopsis gene expression data into ArrayExpress, the EBI’s public DNA microarray.
CaDSR Software Users Meeting 3.1 Requirements Review 9/19/2005 caDSR Software Team Host: Denise Warzel NCICB, Assistant Director, caDSR.
PROGNOCHIP-BASE, FORTH-ICS 1 PrognoChip-BASE: An Information System for the Management of Spotted DNA MicroArray Experiments Extension of BASE v
Alvis Brazma, Johan Rung, Ugis Sarkans, Thomas Schlitt, Jaak Vilo European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge,
TEMBLOR review meeting - EMBL-EBI, Hinxton, October 20 th 2003 Integration of J-Express with ArrayExpress Partner 20 University of Bergen Inge Jonassen.
1 Outline Standardization - necessary components –what information should be exchanged –how the information should be exchanged –common terms (ontologies)
- EVS Overview - Biomedical Terminology and Ontology Resources Frank Hartel, Ph.D. Director, Enterprise Vocabulary Services NCI Center for Bioinformatics.
The MGED Ontology W3C Workshop on Semantic Web for life Sciences October 27, 2004 Presented by Liju Fan MGED Ontology Working Group Senior Scientist, KEVRIC.
Master headline RDFizing the EBI Gene Expression Atlas James Malone, Electra Tapanari
ArrayExpress and Gene Expression Atlas: Mining Functional Genomics data Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
1 ArrayExpress Ugis Sarkans, EBI. 2 Overview Underlying standards –MIAME –MAGE* Data submission Data access –annotations –actual data –array design descriptions.
The European Bioinformatics Institute ArrayExpress – a public database for microarray gene expression data Helen Parkinson Microarray Informatics Team.
EBI is an Outstation of the European Molecular Biology Laboratory. Tutorial 5: ChEBI - On-line Submission and Curation.
CaArray User Community Meeting Feature Overview and Review of MAGE-TAB Update and Export Specification Call in: Participant Passcode:
ArrayExpress - a Public Repository for Microarray Based Gene Expression Data European Bioinformatics Institute - EMBL outstation and German Cancer Research.
Describing Bioinformatic Metadata at EBI James Malone
ArrayExpress Ugis Sarkans EMBL - EBI
Overview and Demo of CaIntegrator2 A Tool for Publishing and Analyzing Integrated Study Data.
T3/Tutorials: Data Submission
Exploiting semantic technologies to build an application ontology
Using ArrayExpress.
ArrayExpress and Gene Expression Atlas: Mining Functional Genomics data Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
ADE EDIS READ & Optimizer TRAINING Colorado Department of Education
How to store and visualize RNA-seq data
Presentation transcript:

EBI is an Outstation of the European Molecular Biology Laboratory. MAGE-TAB - The ArrayExpress Production Experience Helen Parkinson, PhD

Content All change at ArrayExpress Data acquisition Validation Extension Downloads Long Term Future Tutorial – submitting in MAGETAB format

MAGEML AE M.EXPRESS MAGETABULATOR Tracking M.EXPRESS MAGETABULATOR AE 2 MAGETAB MIGRATION MAGETAB

Data acquisition MAGETAB data acquisition is integrated with existing tab2mage submissions MAGETAB export is being added to the MIAMExpress system All MAGE-ML submissions will be converted to MAGETAB We will unify data acquisition on MAGETAB We decided to do most curation/validation/ontology matching at the end for MAGETAB submissions MAGETAB makes curator edit and user update much easier Human readable tab delimited formats=efficient curation 1600 Experiments processed (1600/3700) All curated Subset of ArrayExpress MAGETAB data will be re-curated at migration

Automated processing and validation Sections MAGETAB Column Headers MAGTAB Column Orders MAGETAB Content – length, terms External data files – released monthly vs. ArrayExpress content MIAME score DW candidates

Extensibility Solexa data Proteomics Metabolomics Array Genotype data (Gen2Phen) Association study data (Gen2Phen, Engage) Locus specific SNP data Clinical Data …..

Downloads All ArrayExpress data will be available in MAGETAB format now (exported direct from AE) ~90% is currently available and passes checks (issues with MAGE-OM->MAGETAB) More ontology term sources will be added incrementally – NCI thesaurus/OBI/ArrayExpress Factor Ontology Beta MAGETAB ArrayExpress Bioconductor Module (Huber, Kauffman) All MAGETAB generation code is available All validation code is available

Ontologies Working to develop OBI to replace MGED ontology Generating a sample/factor ontology for ArrayExpress based on data content Developed in Protégé/OWL format Will be served from OLS Also mapping to external ontologies for samples e.g NCI thesaurus Text mining to annotate external data using dictionaries based on NCI thesaurus and some custom ones (GEOimporter, tab2mage->MAGETAB) Data import, meta analysis

Future: ArrayExpress and Community ArrayExpress Submission in MAGETAB ADF format All ArrayExpress ADF in MAGETAB format Alpha ArrayExpress-MAGETAB BioConductor MAGETAB importer AE2 AE2 data migration More people post their MAGETAB examples and we agree on a gold std validated set for typical cases Community lists of MAGETAB supportive tools where people can register their interests and describe their applications (like GO tools) Addressing HLA MAGETAB model, firm up the spec Decide what factors really are, and whether the MAGE case is still valid – controlled vs uncontrolled variables instead? Issues with global variables - inter experiment comparison of compounds needs to know dose even if dose doesn’t vary in an experiment

Acknowledgments Anna Farne Ele Holloway James Malone Margus LukkArrayExpress Production Team Helen Parkinson Tim Rayner Faisal Rezwan Eleanor Williams Mengyao Zhao Holly Zheng Mohammad Shojatalab ArrayExpress Development Team Funding EC - FELICS, EMERALD, Gen2Phen, MUGEN NIH - MAGE grant

Tutorial Creation of MAGETAB templates Completion of a pre-made template Curation Scoring and validation templates Viewing Data in ArrayExpress Backend of the template generation/tracking system