Microarray Data Analysis Roy Williams PhD; Burnham Institute for Medical Research.

Slides:



Advertisements
Similar presentations
Chapter 10 Excel: Data Handling or What do we do with all that data?
Advertisements

1 An Introduction to Pivot Tables Using Excel 2000.
Training Manual HOW TO LOAD A DELIMITED FILE IN X88S PRODUCT PANDORA.
GoMiner: (Zeeberg et al., Genome Biology, March 2003) For Tour of GoMiner: Advance using forward arrow.
® Microsoft Office 2010 Excel Tutorial 1: Getting Started with Excel.
XP New Perspectives on Microsoft Office Excel 2003 Tutorial 1 1 Microsoft Office Excel 2003 Tutorial 1 – Using Excel To Manage Data.
The Maize Inflorescence Project Website Tutorial Nov 7, 2014.
Excel Formatting and Editing Worksheets Microsoft Office 2010 Fundamentals 1.
How to Work With Affymetrix .Cel Files in geWorkbench
Database Features. Lists n An Excel worksheet can be used like a table in a relational database. n In Excel, such a table is called a list. n Each row.
Saving and Preparing School Finance Data from GEMS Expenditures Paul Taylor, OPI School Finance.
By Hrishikesh Gadre Session II Department of Mechanical Engineering Louisiana State University Engineering Equation Solver Tutorials.
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
XP 1 ﴀ New Perspectives on Microsoft Office 2003, Premium Edition Excel Tutorial 1 Microsoft Office Excel 2003 Tutorial 1 – Using Excel To Manage Data.
GCB/CIS 535 Microarray Topics John Tobias November 8th, 2004.
Data Extraction cDNA arrays Affy arrays. Stanford microarray database.
® Microsoft Office 2010 Excel Tutorial 1: Getting Started with Excel.
Gene Set Enrichment Analysis Petri Törönen petri(DOT)toronen(AT)helsinki.fi.
Tutorial - Analysis of Microarray Data Microarray Core E Consortium for Functional Glycomics Funded by the NIGMS.
1 Excel Lesson 2 Formatting and Editing Worksheets Microsoft Office 2010 Fundamentals Story / Walls.
Viewing & Getting GO COST Functional Modeling Workshop April, Helsinki.
Introduction to p:IGI-3 Integrated Geochemical Interpretation Ltd. Hallsannery, Bideford, Devon EX39 5HE, UK.
European Computer Driving Licence Syllabus version 5.0 Module 4 – Spreadsheets Chapter 22 – Functions Pass ECDL5 for Office 2007 Module 4 Spreadsheets.
Lesson No:9 MS-Word Tools, Mail Merge and working with Tables CHBT-01 Basic Micro process & Computer Operation.
An Introduction to Designing, Executing and Sharing Workflows with Taverna Nowgen, Next Gen Workshop 17/01/2012.
1 Welcome to the GrameneMart Tutorial A tool for batch data sequence retrieval 1.Select a Gramene dataset to search against. 2.Add filters to the dataset.
Technology ICT Core: Spreadsheets. Spreadsheets A spreadsheet is a table consisting of Rows and Columns Where a row and a column meet, the box is called.
Basic features for portal users. Agenda - Basic features Overview –features and navigation Browsing data –Files and Samples Gene Summary pages Performing.
PIVOT TABLES AND CHARTS CS1100 Computer Science and its Applications CS1100Pivot tables and charts1.
Agenda Introduction to microarrays
Managing Data Modeling GO Workshop 3-6 August 2010.
® Microsoft Office 2010 Excel Tutorial 1: Getting Started with Excel.
Manatees of Florida. Standard: MAFS.912.S-ID.1.1: Represent data with plots on the real number line (dot plots, histograms, and box plots). MAFS.912.S-ID.1.3:
CellFateScout step- by-step tutorial for a case study Version 0.94.
Office Management Tools II Ms Saima Gul. Office Management Tools II Ms Saima Gul.
Analysing Data with Excel Viewing Help To view Help 1.On the Start menu, point to Programs, and then click Microsoft Excel. 2.On the Help menu,
Copyright OpenHelix. No use or reproduction without express written consent1.
FIX Eye FIX Eye Getting started: The guide EPAM Systems B2BITS.
Microsoft ® Office Excel 2003 Training Using XML in Excel SynAppSys Educational Services presents:
1 ArrayTrack Demonstration National Center for Toxicological Research U.S. Food and Drug Administration 3900 NCTR Road, Jefferson, AR
Excel 2013 PivotTables Making Information Usable.
SUPPLEMENTAL FIGURES AND TABLES. Supplementary Table 1: List of new and improved features in GSEA-P version 2 Java software. Examples and screenshots.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Excel Basics. Differentiating between worksheets and spreadsheets Differentiating between workbooks and worksheets.
NUGO Expression File Creator (by Caroline Reiff) Many programs in GenePattern require as input an expression dataset in RES or GCT file format. In order.
XP New Perspectives on Microsoft Office Access 2003, Second Edition- Tutorial 8 1 Microsoft Office Access 2003 Tutorial 8 – Integrating Access with the.
Welcome to the GrameneMart Tutorial A tool for batch data sequence retrieval 1.Select a Gramene dataset to search against. 2.Add filters to the dataset.
Ing. Martina Majorová, FEM SUA Statistics Lecture 2 – Introduction to MS Excel 2003.
AN INTRODUCTION TO GENE EXPRESSION ANALYSIS BY MICROARRAY TECHNIQUE (PART II) DR. AYAT B. AL-GHAFARI MONDAY 10 TH OF MUHARAM 1436.
Introduction to Oncomine Xiayu Stacy Huang. Oncomine is a cancer-specific microarray database and has a web-based data-mining platform aimed at facilitating.
Bioinformatics Shared Resource Introduction to Gene Expression Omnibus (GEO) bsrweb.sanfordburnham.org
Roy Williams PhD Sanford | Burnham Medical Research Institute.
URL PHONE FAX ADDRESS #909, VENTURE VALLEY, 958, GOSAEK-DONG, GWONSEON-GU,SUWON,
Introduction to Excel EC 151 Principles of Microeconomics Block 3,
URL PHONE FAX ADDRESS #909, VENTURE VALLEY, 958, GOSAEK-DONG, GWONSEON-GU,SUWON,
Microarray Technology and Data Analysis Roy Williams PhD Sanford | Burnham Medical Research Institute.
MESA A Simple Microarray Data Management Server. General MESA is a prototype web-based database solution for the massive amounts of initial data generated.
Lessons Copy and Paste Text Drag and Drop Text 2-Saving Documents 3- Printing 4-Inserting Tables Modifying Page Layout Format Page Margins Insert a Blank.
Day 2: MS Excel for Beginners Aniko Balogh CEU Computer & Statistics Center
Pathway Informatics 16th August, 2017
MS-EXCEL PART 2.
Data Visualizer.
Microsoft Office Access 2003
Pathway Informatics December 5, 2018 Ansuman Chattopadhyay, PhD
Microsoft Office Access 2003
Technology ICT Core: Spreadsheets.
A drag and drop exercise can be created using Word quite easily using tables, text boxes and ensuring the document is saved properly.
Presentation transcript:

Microarray Data Analysis Roy Williams PhD; Burnham Institute for Medical Research

Aims Load normalised illumina data into GeneSpring (undiff verses diff stem cells) Cancel GeneSpring normalisation Define biological replicates Discover significantly differentially regulated genes (undiff verses diff stem cells) Compare this list to Gene Ontologies Attempt to make a conclusion

Essential Tools GeneSpring: Download Demo version Quantiles Normalised Stem Cell Dataset Select data from either: –StemCellCommunity.org database –NIH GeneExpressionOmnibus database –

Stem Cell Microarray Database Automatically exports normalised data table!!! GCT table format is widespread!!!

After QC for low confidence genes (P<0.99) Note: ~50 replicate beads per array Median Outliers 25% quartile 75% quartile BAD CHIP BOXPLOT REPRESENTATION OF DATA SPREAD CHIP NUMBER SIGNAL INTENSITY

The effect of quantiles Normalisation on the filtered 36 data sets IMPORTANT: use non-linear normalisation >library(affy) >Qdata <- normalize.quantiles(Rawdata) All same range

Normalised Tutorial Dataset Using this tool generated the dataset: outputTueJul gct (.gct is portable!) AES_derived Neural stem cells AES_derived Neural stem cells AES_derived Neural stem cells EES cells_ undifferentiated FES cells_ undifferentiated DES cells_ undifferentiated

Genome Import: File ->import genome Load the illumina chip information into GeneSpring

Drag and Drop loads datafile

Define columns using drop down boxes

Define sample attributes (or not)

Give new experiment a name and save

Define new experiment normalisations and parameters New data set loaded appears here

Delete the default normalisations

ALL GONE!

IMPORTANT! DEFINE REPLICATES (PARAMETERS)

Define the custom parameter “group”

Groups define the replicates (use exactly the same text!)

Change the interpretation to look at the “group” replicates

Filter on expression level: Remove the genes which are not expressed (ie absent)

Filtering leaves 25,923 of 49,009 genes: save list as GenesPresent

Reset colour bar range Right click on Colour bar

Using the now default interpretation the data is in 2 groups

To find differentially expressed gene filter on Volcano Plot

With these filters gives a list of ~1800 genes: save list

Export the list with the averaged data: copy annotated GeneList

Select annotations to export

Paste average data into an Excel worksheet

Export all the data and see it is highly reproducible

Check list against GeneOntologies: Development is major group

Look for Pathways significantly differentially expressed Load into Nextbio and Ingenuity GSEA is a good free alternative for pathways ( GeneSet enrichment analysis ) GenePattern is a good free alternative to GeneSpring ( )