Presentation is loading. Please wait.

Presentation is loading. Please wait.

GCB/CIS 535 Microarray Topics John Tobias November 15 th, 2004.

Similar presentations


Presentation on theme: "GCB/CIS 535 Microarray Topics John Tobias November 15 th, 2004."— Presentation transcript:

1 GCB/CIS 535 Microarray Topics John Tobias November 15 th, 2004

2 Overview Similarity / Clustering Applications Gene List Annotation Statistical Significance of Over Representation Public Data Formats and Databases

3 Overview Similarity / Clustering Applications Gene List Annotation Statistical Significance of Over Representation Public Data Formats and Databases

4 Sample Clustering Quality Control Class Discovery

5 Similarity to Pattern of Interest Can use real or hypothetical gene Rank all other genes by similarity

6 Hierarchical Clustering Group by expression No clear number of clusters Can define clusters by “pruning tree”

7 Binning by Expression K-means Clustering Self Organizing Maps QT Clustering

8 Binning By Expression K-means and SOM Groups genes into pre-determined number of clusters K-means Self Organizing Map

9 Comparing Clusters Another Look Hierarchical tree trimmed to 6 clusters K-means 6 clusters Coincidence between two methods

10 Binning By Expression QT Clustering Control quality and minimum size of clusters Genes may remain unclustered

11 Overview Similarity / Clustering Applications Gene List Annotation Statistical Significance of Over Representation Public Data Formats and Databases

12 The Gene Ontology (GO) http://www.geneontology.org/ Network of defined biological terms Three main branches Biological Process Molecular Function Cellular Component

13 Gene List Annotation Pathways Functional Groups Affymetrix GeneSpring DAVID GoMiner GenMapp http://apps1.niaid.nih.gov/david/

14 Identifiers to Knowledge

15 Ingenuity Pathway Analysis http://www.ingenuity.co m Curated Interaction and Pathway Database Mine literature as it relates to gene list Associate function with both gene lists and interaction networks

16 Overview Similarity / Clustering Applications Gene List Annotation Statistical Significance of Over Representation Public Data Formats and Databases

17 Gene List Overlap with Pathways GeneSpring EASE S+ArrayAnalyzer

18 Over Representation Genes on array - pathway X or O - overall 50/50 Does a gene list over represent one of the pathways? Fisher Exact Test

19 EASE Expression Analysis Systematic Explorer http://david.niaid.nih. gov/david Statistical analysis of category over- representation Many choices of category lists available

20 EASE Output

21 Overview Similarity / Clustering Applications Gene List Annotation Statistical Significance of Over Representation Public Data Formats and Databases

22 MGED Microarray Gene Expression Data (MGED) Society Organization devoted to facilitation of sharing microarray data CBIL group at UPenn key contributors Focus on standards for microarray data annotation and exchange Creation of software and databases

23 MIAME Minimum Information About a Microarray Experiment required to interpret and verify the results Required by many journals Explicit guidelines for: Sample description Experimental design Array technology Protocols Analytical methods

24 Public Microarray Databases ArrayExpress (EBI) http://www.ebi.ac.uk/arrayexpress/ GEO (NCBI) http://www.ncbi.nlm.nih.gov/geo/ CIBEX (NIG) http://cibex.nig.ac.jp/

25 Contact Information Penn Bioinformatics Core - 13th Floor Blockley Hall John Tobias - 1314 - jtobias@pcbi.upenn.edu Reserve Computers http://core.pcbi.upenn.edu/


Download ppt "GCB/CIS 535 Microarray Topics John Tobias November 15 th, 2004."

Similar presentations


Ads by Google