Presentation is loading. Please wait.

Presentation is loading. Please wait.

Bioinformatics Shared Resource Introduction to Gene Expression Omnibus (GEO) bsrweb.sanfordburnham.org

Similar presentations


Presentation on theme: "Bioinformatics Shared Resource Introduction to Gene Expression Omnibus (GEO) bsrweb.sanfordburnham.org"— Presentation transcript:

1 Bioinformatics Shared Resource Introduction to Gene Expression Omnibus (GEO) bsrweb.sanfordburnham.org http://www.youtube.com/ncbinlm

2 GEO Database: www.ncbi.nlm.nih.gov/geo Public repository that archives and distributes expression data Microarray data and Next-gen Sequencing Data; RNA-seq User-friendly Web based tools to explore data Approximately a billion measurements recorded and available to search 100 organisms and thousands of different expression analysis platforms GEO expression data submission Pre-requisite for publications including expression data Step by step web deposit Proper preparation of sample data spreadsheets and submission forms GEO query Use search terms (text) to locate relevant DataSets or gene profiles Search for and download complete sets of data (including raw array data) Provides on-the-fly data analysis using the built in R stats tools (interesting!) bsrweb.sanfordburnham.org

3 GEO database structure The data is carefully structured around - platforms - the array type - samples - the single sample on a chip -series - the grouping of samples These are the basic building blocks of GEO

4 Linked data tables make a GEO record Affymetrix chip GPL570 HG-U133plus2 Time point at X hrs The samples grouped to make a Series or DataSet

5 How to find data in GEO http://www.ncbi.nlm.nih.gov/geo/ Study level Gene Level GDS4165 Exp profiles of homologs Curated DataSets The Complete Lists

6 After locating data: download ALL data and files, inc platform WARNING! These formats can be inconsistent ALL data and files in XML values of the expression data reliable approach: download the RAW files (chp files for Affy, idat for illumina) and reprocess them

7 Key words in Search box

8 Study type in Search box

9 Select by Study Type and Organism Link to short read archive Compressed txt files

10 GEO DataSet Analysis Tools Compare 2 sets of samples (T-tests) Precomputed Cluster Heatmaps R analysis for differentially expressed genes LIVE DEMO!!!!

11 GEO2R: Analyze GEO microarray Data retrieve a list of differentially expressed genes Use search to find datasets of interest

12 Click on Link to GEO2R

13 The R tool in GEO

14 With help from Bioinformatics Shared Resource Format and submit datasets to GEO Large scale statistical analysis Wide variety of analytical techniques (TFBS search) Advanced data plotting for figures Sequence Analysis (RNA-seq)


Download ppt "Bioinformatics Shared Resource Introduction to Gene Expression Omnibus (GEO) bsrweb.sanfordburnham.org"

Similar presentations


Ads by Google