Using ArrayExpress.

Slides:



Advertisements
Similar presentations
EBSCO Discovery Service
Advertisements

PubMed.
Oncomine Database Lauren Smalls-Mantey Georgia Institute of Technology June 19, 2006 Note: This presentation contains animation.
The National Center for Biotechnology Information (NCBI) a primary resource for molecular biology information Database Resources.
Library Online Catalog Tutorial Pentagon Library Last Updated March 2008.
The Rice Functional Genomics Program of China cDNA microarray database (RIFGP-CDMD) consists of complete datasets, including the probe sequences, microarray.
Abstract BarleyBase ( is a USDA-funded public repository for plant microarray data. BarleyBase houses raw and normalized expression.
1 Welcome to the Protein Database Tutorial This tutorial will describe how to navigate the section of Gramene that provides collective information on proteins.
Basic Genomic Characteristic  AIM: to collect as much general information as possible about your gene: Nucleotide sequence Databases ○ NCBI GenBank ○
PAZAR DATABASE CHIP-SEQ DEPOSIT Wyeth Wasserman.
MIAME Minimum Information About a Microarray Experiment
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Data Extraction cDNA arrays Affy arrays. Stanford microarray database.
Welcome to the Turnitin.com Instructor Quickstart Tutorial ! This brief tour will take you through the basic steps teachers and students new to Turnitin.com.
NCBI resources III: GEO and expression data analysis Yanbin Yin Fall
Using ArrayExpress. ArrayExpress is an international public repository for well-annotated microarray data, including gene expression, comparative genomic.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Comparing protein structure and sequence similarities Sumi Singh Sp 2015.
An introduction to using the AmiGO Gene Ontology tool.
Wiley Online Library. About Wiley Online Library Wiley Online Library hosts the world's broadest and deepest multidisciplinary collection of online resources.
ARCHIBUS Log On Instructions. Log Into ARCHIBUS Web Central Log In Screen 1.Open your Internet browser. 2.Enter the URL to view the ARCHIBUS Login Page.
ArrayExpress and Gene Expression Atlas: Mining Functional Genomics data Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
Gene expression services: ArrayExpress and the Gene Expression Atlas Contact: Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
ArrayExpress and Expression Atlas: Mining Functional Genomics data Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
Gene Expression Omnibus (GEO)
Test1 April 2004 Microarray Data Management Jianwei (Jerry) Li.
EBI is an Outstation of the European Molecular Biology Laboratory. EBI Bioinformatics Roadshow ILRI/BecA Nairobi Campus 2 nd - 3 rd March 2011.
ArrayExpress and Gene Expression Atlas: Mining Functional Genomics data Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
Copyright OpenHelix. No use or reproduction without express written consent1.
ArrayExpress and Expression Atlas: Mining Functional Genomics data Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
PLEXdb Plant Expression database Ethalinda Cannon Iowa State University January 15th, 2007.
Abstract BarleyBase is a USDA-funded public repository for plant microarray data. BarleyBase houses raw and normalized expression data from the 22K Affymetrix.
GENOME-CENTRIC DATABASES Daniel Svozil. NCBI Gene Search for DUT gene in human.
Review of Array Express Thomas, M.D. Georgia Institute of Technology 21 June, 2006.
1 maxdLoad The maxd website: © 2002 Norman Morrison for Manchester Bioinformatics.
MIAMExpress and the development of annotation ontologies for gene expression experiments Ele Holloway Microarray Informatics European Bioinformatics Institute.
Grants.gov Application Process. Grants.gov 5-Step Process Accessing and submitting an application for an announcement (opportunity) through Grants.gov.
PROGNOCHIP-BASE, FORTH-ICS 1 PrognoChip-BASE: An Information System for the Management of Spotted DNA MicroArray Experiments Extension of BASE v
Alvis Brazma, Johan Rung, Ugis Sarkans, Thomas Schlitt, Jaak Vilo European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge,
Analysis of GEO datasets using GEO2R Parthav Jailwala CCR Collaborative Bioinformatics Resource CCR/NCI/NIH.
Gene Expression Omnibus (GEO)
1 Outline Standardization - necessary components –what information should be exchanged –how the information should be exchanged –common terms (ontologies)
Introduction to Morpho RCN Workshop Samantha Romanello Long Term Ecological Research University of New Mexico.
ArrayExpress and Gene Expression Atlas: Mining Functional Genomics data Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
Primary vs. Secondary Databases Primary databases are repositories of “raw” data. These are also referred to as archival databases. -This is one of the.
Applied Bioinformatics Week 9 Jens Allmer. Theory I Gene Expression Microarray.
Introduction and Applications of Microarray Databases Chen-hsiung Chan Department of Computer Science and Information Engineering National Taiwan University.
PubMed Basics Barbara A. Wood, MLIS Calder Library University of Miami Miller School of Medicine.
ArrayExpress Ugis Sarkans EMBL - EBI
GEO (Gene Expression Omnibus) Deepak Sambhara Georgia Institute of Technology 21 June, 2006.
2 Copyright © 2008, Oracle. All rights reserved. Building the Physical Layer of a Repository.
Web Resources for Genomics Kei Cheung, Ph.D. Assistant Professor Yale Center for Medical Informatics (MBB 452a Genomics & Bioinformatics) Oct. 8, 2003.
T3/Tutorials: Data Submission Uploading genotype experiments
T3/Tutorials: Data Submission
Getting GO annotation for your dataset
T3/Tutorials: Data Submission
The Business Source Databases Advanced Searching
Using the Advanced Search Guided Style Find Fields on
PubMed Database Interface (Basic Course Module 4 Part B)
ArrayExpress and Gene Expression Atlas: Mining Functional Genomics data Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
How to store and visualize RNA-seq data
Using the Advanced Search Guided Style Find Fields on
Gene Expression Omnibus (GEO)
Code Analysis, Repository and Modelling for e-Neuroscience
Welcome to the Quantitative Trait Loci (QTL) Tutorial
Download from Zotero Home Page
PubMed Database Interface (Basic Course: Module 4)
Presentation transcript:

Using ArrayExpress

ArrayExpress is an international public repository for well-annotated microarray data, including gene expression, comparative genomic hybridization (CGH) and chromatin-immunoprecipitation (ChIP) experiments.

ArrayExpress has three major goals: 1.Serve the scientific community as a repository for data supporting publications 2.Provide easy access to high-quality data in a standard format. 3.Facilitate the sharing of microarray designs and experimental protocols.

ArrayExpress has two major components: 1. ArrayExpress experiment repository – the main database containing complete data supporting publications. 2. ArrayExpress gene expression profile data warehouse – contains gene-indexed expression profiles from a curated subset of experiments from the repository.

Search for experiments by entering ArrayExpress experiment accession Options for sorting and filtering your results. Search for experiments by entering ArrayExpress experiment accession numbers or keywords (e.g. RNAi, breast cancer) in the query box on the left-hand panel.

ID - the unique ArrayExpress accession number of the experiment. Experiment accession numbers are in the format of E-XXXX-n, where XXXX is a code for the source of the data. Experiments and array designs in ArrayExpress are given unique accession numbers in the format of E-XXXX-n for experiments A-XXXX-n for array designs XXXX represents a four letter code and n is a number e.g. E-MEXP-568, A-UHNC-18.

Title - the curated title for the experiment

Hybs - the total number of hybridizations in the experiment

Species - the species of the samples used (can be multiple)

Date - the date that the data were loaded into ArrayExpress

Processed – direct link to the processed data as a zip file (brown icon indicates that this exists)

Raw – a direct link to the raw data (brown/grey icon indicates that this exists/not exists). A wedge shaped icon indicates Affymetrix .CEL files

More – a link to the ArrayExpress advanced interface where you can get subsets of each data file by gene, hybridization and QuantitationTypes (columns in the data file).

Click anywhere on an experiment row and it will expand to allow you see more details about this experiment and see where the term you searched for appears.

Title - curated title of the experiment

MIAME score - this is a score to indicate how close to full MIAME-compliance an experiment is, with a score of 5 being the highest. One point each is given for sufficient annotation of the associated array design essential sample annotation including at least one experimental factor and the species of all samples raw data files for each hybridization final processed (normalized) data for the hybridizations in the experiment essential laboratory and data processing protocols

Sample annotation – a link to .2columns.xls which is a file containing a list of the samples, the experimental factor values associated with these samples and the corresponding data files

Array – the ArrayExpress accession number(s) for the array design(s) used in the experiment. Clicking on the accession number opens a new browser window showing more information about the array design in the advanced query interface.

Downloads – links to the FTP server directory containing data files and sample and hybridization information for the experiment, and to the data retrieval page for the experiment in the advanced user interface

Experiment design – links to a diagram of the sample relationships in .png and .svg format.

Protocols – there is a link taking you to a page listing all the protocols used in the experiment.

Citation - details about any publications that relate to the data, including links to the online article and to the PubMed entry where available

Detailed sample annotation - a link to. sdrf Detailed sample annotation - a link to .sdrf.xls which contains information about the samples, the relationships between the samples, extracts, labeled extracts, hybridizations and data files.

Contact - the name of the experiment submitter

Design types - terms describing design types of the experiment Design types - terms describing design types of the experiment. These can include biological, methodological and technology types e.g. disease state, strain or line, compound treatment, in-vivo, dye swap, co-expression, binding site identification.

Description - the description of the experiment as supplied by the submitter

Factor values - a list of the experimental factor values in the experiment

The four letter code in the accession number generally indicates the source of the MAGE-ML file that was used to load the data into the ArrayExpress database. Sources include our own submission tools (MEXP for MIAMExpress and TABM for Tab2MAGE) as well as MAGE-ML submitted from other organizations or microarray data management tools. The 4 letter code does not necessarily tell you which organization performed the experiment or manufactured the array design. Some experiments have also been extracted from the Gene Expression Omnibus (GEO) at the NCBI. MIAME describes the Minimum Information About a Microarray Experiment that is needed to enable the interpretation of the results of the experiment unambiguously and potentially to reproduce the experiment.