Abstract BarleyBase is a USDA-funded public repository for plant microarray data. BarleyBase houses raw and normalized expression data from the 22K Affymetrix.

Slides:



Advertisements
Similar presentations
Misha Kapushesky November 28, 2003 Expression Profiler: Next Generation.
Advertisements

Garnet.arabidopsis.org.uk Beatrice Schildknecht NASC Data Availability and NASC tools NASC Nottingham Arabidopsis Stock Centre
13:10:58 A New Tool for Mapping Microarray Data onto the Gene Ontology Structure ( Abstract e GOn (explore Gene Ontology) is a.
Visualisationmodule Catherine Leroy, Pierre Marguerite, Bhuwan Tiwari, Niran Abeygunawardena, Sergio Contrino, Anna Farne, Ele Holloway, Gaurab Mukherjee,
The Rice Functional Genomics Program of China cDNA microarray database (RIFGP-CDMD) consists of complete datasets, including the probe sequences, microarray.
Abstract BarleyBase ( is a USDA-funded public repository for plant microarray data. BarleyBase houses raw and normalized expression.
Basic Genomic Characteristic  AIM: to collect as much general information as possible about your gene: Nucleotide sequence Databases ○ NCBI GenBank ○
Minimum Information About a Microarray Experiment - MIAME MGED 5 workshop.
NYU Microarray Database (NYUMAD)
DNA Microarray Bioinformatics - #27611 Program Normalization exercise (from last week) Dimension reduction theory (PCA/Clustering) Dimension reduction.
Microarray GEO – Microarray sets database
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Dimension reduction : PCA and Clustering Slides by Agnieszka Juncker and Chris Workman.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Dimension reduction : PCA and Clustering Christopher Workman Center for Biological Sequence Analysis DTU.
Demonstration Trupti Joshi Computer Science Department 317 Engineering Building North (O)
Microarray Analysis Software at NIH. BRB ArrayTools Visualization and Statistical analysis of gene expression data Features –Excel Add-in –Flexible Data.
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
NCBI resources III: GEO and expression data analysis Yanbin Yin Fall
Using ArrayExpress. ArrayExpress is an international public repository for well-annotated microarray data, including gene expression, comparative genomic.
GCB/CIS 535 Microarray Topics John Tobias November 15 th, 2004.
Microarray Data Analysis Illumina Gene Expression Data Analysis Yun Lian.
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
1 ArrayExpress and MAGE Jamboree II Ugis Sarkans, EBI.
Gene expression services: ArrayExpress and the Gene Expression Atlas Contact: Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
Data Curation and Management activities within the UCT Computational Biology Group Dr Nicky Mulder.
Support for MAGE-TAB in caArray 2.0 Overview and feedback MAGE-TAB Workshop January 24, 2008.
Gramene Objectives Develop a database and tools to store, visualize and analyze data on genetics, genomics, proteomics, and biochemistry of grass plants.
Gene Expression Omnibus (GEO)
Test1 April 2004 Microarray Data Management Jianwei (Jerry) Li.
Copyright OpenHelix. No use or reproduction without express written consent1.
PLEXdb Plant Expression database Ethalinda Cannon Iowa State University January 15th, 2007.
1 MIAME The MIAME website: © 2002 Norman Morrison for Manchester Bioinformatics.
Agenda Introduction to microarrays
Dr Paul Lewis Lecturer in Bioinformatics Lecturer in Bioinformatics Cardiff University Cardiff University Biostatistics & Bioinformatics Unit Biostatistics.
Copyright OpenHelix. No use or reproduction without express written consent1.
1 maxdLoad The maxd website: © 2002 Norman Morrison for Manchester Bioinformatics.
Introduction to caArray caBIG ® Molecular Analysis Tools Knowledge Center April 3, 2011.
Genomics Laboratory University Medical Center Utrecht... Microarray technology group microarray production and use Transcription regulation genome-wide.
Copyright OpenHelix. No use or reproduction without express written consent1.
3/24/2005 TIGP 1 Bioinformatics for Microarray Studies at IBS Pei-Ing Hwang, Ph.D. Mar. 24, 2005.
MIAMExpress and the development of annotation ontologies for gene expression experiments Ele Holloway Microarray Informatics European Bioinformatics Institute.
A plant-specific annotation and submission tool for the incorporation of Arabidopsis gene expression data into ArrayExpress, the EBI’s public DNA microarray.
Dimension reduction : PCA and Clustering Slides by Agnieszka Juncker and Chris Workman modified by Hanne Jarmer.
PROGNOCHIP-BASE, FORTH-ICS 1 PrognoChip-BASE: An Information System for the Management of Spotted DNA MicroArray Experiments Extension of BASE v
Lao H. Saal 1,3,*, Carl Troein 2,*, Johan Vallon-Christersson 1,*, Sofia Gruvberger 1, Björn Samuelsson 2, Åke Borg 1 and Carsten.
Alvis Brazma, Johan Rung, Ugis Sarkans, Thomas Schlitt, Jaak Vilo European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge,
Analysis of GEO datasets using GEO2R Parthav Jailwala CCR Collaborative Bioinformatics Resource CCR/NCI/NIH.
Computing Co-Expression Relationships Wen-Dar Lin.
1 Outline Standardization - necessary components –what information should be exchanged –how the information should be exchanged –common terms (ontologies)
1 ArrayTrack Demonstration National Center for Toxicological Research U.S. Food and Drug Administration 3900 NCTR Road, Jefferson, AR
Data Mining at PLEXdb : Plant and Plant Pathogen Gene Expression Database.
A collaborative tool for sequence annotation. Contact:
ArrayExpress and Gene Expression Atlas: Mining Functional Genomics data Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL
TEMBLOR mid-term review Participation in DESPRAD project Bernd Drescher Robert Wagner.
Applied Bioinformatics Week 9 Jens Allmer. Theory I Gene Expression Microarray.
GeWorkbench Overview Support Team Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT and Harvard.
CaArray User Community Meeting Feature Overview and Review of MAGE-TAB Update and Export Specification Call in: Participant Passcode:
Introduction and Applications of Microarray Databases Chen-hsiung Chan Department of Computer Science and Information Engineering National Taiwan University.
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
CCLE Cancer Cell Line Encyclopedia Alexey Erohskin.
Bioinformatics Shared Resource Introduction to Gene Expression Omnibus (GEO) bsrweb.sanfordburnham.org
ArrayExpress Ugis Sarkans EMBL - EBI
GEO (Gene Expression Omnibus) Deepak Sambhara Georgia Institute of Technology 21 June, 2006.
Web Resources for Genomics Kei Cheung, Ph.D. Assistant Professor Yale Center for Medical Informatics (MBB 452a Genomics & Bioinformatics) Oct. 8, 2003.
Using ArrayExpress.
How to store and visualize RNA-seq data
Dimension reduction : PCA and Clustering
Cancer Cell Line Encyclopedia
Presentation transcript:

Abstract BarleyBase is a USDA-funded public repository for plant microarray data. BarleyBase houses raw and normalized expression data from the 22K Affymetrix Barley1 and Arabidopsis ATH1 GeneChips, presently the only two available Affymetrix high-density arrays from plants, along with experiment and sample information. BarleyBase features a web-based, MIAME-compliant, experiment submission tool, BarleyExpress. BarleyExpress allows users to efficiently submit and manage their experiment descriptions, array design and expression analysis information. BarleyBase contains a broad set of query and display options at all data levels, from experiment, hybridization to probe set and probe levels. Users can query microarray elements by expression profile and by biological information of the probe sets. Probe set queries are seamlessly integrated with visualization and analysis tools such as scatter plots, the R statistical toolbox, and data filters. BarleyBase collaborates with PlantGDB and Gramene databases to perform gene prediction and cross-species comparison at the genome level using the Barley1 GeneChip exemplar sequences. BarleyBase is accessible at BARLEYBASE – AN EXPRESSION PROFILING DATABASE FOR CEREAL GENOMICS Xiaoyun Tang, Jian Gong, Jianqiang Xin, Lishuang Shen, Stacy Turner, Rico A. Caldo, Dan Nettleton, Roger P. Wise, Julie A. Dickerson* Virtual Reality Applications Center, Iowa State University, Ames, Iowa Acknowledgments 1.BarleyBase is funded by USDA-NRI/CGP # ; USDA-CSREES North American Barley Genome Project; USDA Initiative for Future Agriculture and Food Systems (IFAFS) # PlantGDB, Gramene, KEGG, TAIR for providing tools or genomic data. 3.Many people who provided technical support and advice on BarleyBase development. Fig. 2. BarleyBase Homepage BarleyExpress Features MIAME-compliant, web-based data submission and annotation tool Experiment, array design, protocol, sample, expression submissions Enforces plant ontology in collaboration with Gramene. Uses controlled vocabulary for descriptions wherever possible First database to explicitly capture information on experiment factors and levels for presenting experiment in factorial design. Images and other supporting information can be uploaded. Minimal requirements on user’s computer skills and effort. Flexible access control for submitters to designate individuals or groups access to their private data before publication. BarleyBase Data Model BarleyBase uses a hierarchical data model to store gene expression data that is based on the Affymetrix GeneChip data formats. The highest level data structure is experiment, each of which contains one or more treatments, each treatment has one or more samples as replicates, each sample has one or more hybridizations. Protocols are associated with experiment at the hybridization level. Five types of tables: Array, Expression, Experiment, Protocol, Submitter. Follows MIAME principles recommended by MGED and implemented in MIAMExpress, but removes the Extract level and captures the information for hybridization protocol. Added statistical experimental design factors fields. Using plant ontology and controlled vocabulary in experiment description. Biological annotation for microarray probe sets and exemplars. Presently, only stores expression data from Affymetrix GeneChips. Data Access Download complete data sets for experiment annotation, raw and normalized expression data in MAGE-ML, comma-separated values (CSV), or cel-file formats. Experiment, hybridization and probe set browse & query. Query and filter probe sets by expression profiles. Search by biological criteria: annotation keywords, sequence, probe set names, pathway or gene family membership. Data set management and creation for filtered probe sets. Owner-controlled, group access to private submissions. Visualization & Analysis Visualization for experiments, hybridizations, probe sets, and probes. Data analysis uses data sets obtained from probe set filtering. Analysis methods include hierarchical clustering, k-means partitioning, PCA, SOM, and multi-dimensional scaling (MDS) Identification of differentially expressed and co-expressed genes. Most data analysis & visualizations use R and Bioconductor. Probe alignments with exemplar sequence. Gene prediction through interconnections with PlantGDB database. Cross-species comparative genomics through the Gramene database. Future Plans 1.Cross-experiment analysis. 2.Visualization and analysis tool development. 3.Barley1 exemplar annotation. BarleyExpress Submission Steps Experiment design information submission. Submit experiment factors and factor level as treatments. Batch upload raw GeneChip data. Associate raw data files with each studied treatment. Protocol submission – optional. Sample preparation details for each hybridization. Finalize experiment submission. Grant access to designated individuals and groups. Data Acquisition & Processing Experiment and expression raw data submission by submitter. BarleyBase normalizes submitted raw data. Methods are the statistical algorithm from Affymetrix MAS 5 and RMA (Robust Multi-Array Analysis) from Bioconductor. Compute summary statistics and graphs for raw and normalized expression data for summary and quality diagnostics. Store all types of data in an open-source MySQL database. BarleyBase assigns unique accession numbers to experiments, hybridizations & samples. BarleyBase generates MAGE-ML files and CSV files for batch download. Experiment submission and associated data are available for online access and analysis. Fig. 1. BarleyBase Overview Fig. 3. Major Steps in Experiment Submission Fig. 6. Graphs for Hybridization Expression & Cluster Fig. 5. Probe Set Query and Result Visualization BarleyBase: BarleyBase.org Fig. 4. Probe Alignment with Barley1 GeneChip Exemplar Batch Download MAGE-ML Raw Data CSV BarleyBase Overview BarleyBase Data Processing Pipeline Internet User BarleyExpress MAS5.0 RMA Query & Analysis