Presentation is loading. Please wait.

Presentation is loading. Please wait.

Analysis of GO annotation at cluster level by H. Bjørn Nielsen Slides from Agnieszka S. Juncker.

Similar presentations


Presentation on theme: "Analysis of GO annotation at cluster level by H. Bjørn Nielsen Slides from Agnieszka S. Juncker."— Presentation transcript:

1 Analysis of GO annotation at cluster level by H. Bjørn Nielsen Slides from Agnieszka S. Juncker

2 Sample Preparation Hybridization Array design Probe design Question Experimental Design Buy Chip/Array Statistical Analysis Fit to Model (time series) Expression Index Calculation Advanced Data Analysis ClusteringPCAClassification Promoter Analysis Meta analysisSurvival analysisRegulatory Network Normalization Image analysis The DNA Array Analysis Pipeline Comparable Gene Expression Data GO annotations

3 Gene Ontology Gene Ontology (GO) is a collection of controlled vocabularies describing the biology of a gene product in any organism There are 3 independent sets of vocabularies, or ontologies: Molecular Function (MF) –e.g. ”DNA binding” and ”catalytic activity” Cellular Component (CC) –e.g. ”organelle membrane” and ”cytoskeleton” Biological Process (BP) –e.g. ”DNA replication” and ”response to stimulus”

4 Gene Ontology structure

5 GO structure, example 2

6 KEGG pathways KEGG PATHWAYS: –collection of manually drawn pathway maps representing our knowledge on the molecular interaction and reaction networks, for a large selection of organisms 1. Metabolism –Carbohydrate, Energy, Lipid, Nucleotide, Amino acid, Other amino acid, Glycan, PK/NRP, Cofactor/vitamin, Secondary metabolite, Xenobiotics 2. Genetic Information Processing 3. Environmental Information Processing 4. Cellular Processes 5. Human Diseases 6. Drug Development

7 KEGG pathway example 1

8 KEGG pathway example 2

9 Cluster analysis and GO Analysis example: Partitioning clustering of genes into e.g. 10 clusters based on expression profiles Assignment of GO terms to genes in clusters Looking for GO terms overrepresented in clusters

10 Hypergeometric test The hypergeometric distribution arises from sampling from a fixed population. 10 balls We want to calculate the probability for drawing 7 or more white balls out of 10 balls given the distribution of balls in the urn 20 white balls out of 100 balls

11 Yeast cell cycle Time series experiment: Gene expression profiles: Time Y Y Y Y Y Y Y Gene1 Gene2 Sampling

12 Exercise Find it on the course page


Download ppt "Analysis of GO annotation at cluster level by H. Bjørn Nielsen Slides from Agnieszka S. Juncker."

Similar presentations


Ads by Google