Functional Genomics: Functional genomic datasets, Biological networks, Integrating genomic datasets. BIO520 Bioinformatics, Jim Lund.



Functional genomics
Genome-scale experiments to understand the function of all the proteins: what they do and how they interact.
Many different experimental designs
– Different kinds of information generated.
Each has experimental limitations
– Coverage: full genome, or limited?
– False positives.
– False negatives.
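The three limitations above can be made concrete by scoring a screen against a trusted reference set. The sketch below is only illustrative: the function name, the gene labels, and the reference set are hypothetical, and a pair missing from the reference is counted as a false positive even though it might be a real, undocumented interaction.

```python
def screen_quality(observed, reference, genes_tested, genome_size):
    """Summarize coverage and error counts for one interaction screen.

    observed, reference: iterables of (gene, gene) pairs; order within a pair is ignored.
    genes_tested: genes actually included in the screen.
    genome_size: total number of genes in the genome.
    """
    obs = {frozenset(p) for p in observed}
    ref = {frozenset(p) for p in reference}
    tested = set(genes_tested)
    # Only reference pairs whose genes were both tested could have been found.
    testable_ref = {p for p in ref if p <= tested}
    return {
        "coverage": len(tested) / genome_size,
        "true_positives": len(obs & ref),
        "false_positives": len(obs - ref),
        "false_negatives": len(testable_ref - obs),
    }


# Hypothetical toy data: 4 of 8 genes were screened.
example = screen_quality(
    observed=[("A", "B"), ("B", "C"), ("A", "D")],
    reference=[("B", "A"), ("C", "D"), ("E", "F")],
    genes_tested={"A", "B", "C", "D"},
    genome_size=8,
)
print(example)
```

Reference pairs involving untested genes (here E and F) are treated as a coverage gap rather than as false negatives.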

The Two-Hybrid System for identifying protein/protein binding pairs
Two hybrid proteins are generated with transcription factor domains.
Both fusions are expressed in a yeast cell that carries a reporter gene whose expression is under the control of binding sites for the DNA-binding domain.
[Figure labels: reporter gene; bait protein with binding domain; prey protein with activation domain]

The Two-Hybrid System
Interaction of bait and prey proteins localizes the activation domain to the reporter gene, thus activating transcription.
Since the reporter gene typically codes for a survival factor, yeast colonies will grow only when an interaction occurs.
[Figure labels: reporter gene; bait protein with binding domain; prey protein with activation domain]
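The selection logic of the two-hybrid screen can be sketched as a simple rule: a colony grows only if bait and prey interact. This is an idealized model with no false positives or false negatives, and the protein names and the set of true interactions are hypothetical.

```python
def colony_grows(bait, prey, true_interactions):
    """Idealized reporter: transcription (and growth) only if bait and prey bind."""
    return frozenset((bait, prey)) in true_interactions


def two_hybrid_screen(baits, preys, true_interactions):
    """Test every bait against every prey and return the pairs that yield colonies."""
    return [
        (bait, prey)
        for bait in baits
        for prey in preys
        if colony_grows(bait, prey, true_interactions)
    ]


# Hypothetical proteins and interactions, for illustration only.
true_interactions = {frozenset(("X", "Y")), frozenset(("X", "Z"))}
hits = two_hybrid_screen(baits=["X", "W"], preys=["Y", "Z", "V"],
                         true_interactions=true_interactions)
print(hits)
```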

Interactions shown as a network

Networks
When methods of detecting functional linkages are applied to all the proteins of an organism, a network of interacting, functionally linked proteins can be traced.
As methods improve for detecting protein linkages, it seems likely that most of the proteins will be included in the network.
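A minimal sketch of how pairwise linkages become a network, and how to ask what share of proteins falls in the largest connected piece. The protein names and pairs are hypothetical, and the helper names are chosen here for illustration.

```python
from collections import defaultdict


def build_network(pairs):
    """Turn a list of (protein, protein) linkages into an undirected adjacency map."""
    network = defaultdict(set)
    for a, b in pairs:
        network[a].add(b)
        network[b].add(a)
    return dict(network)


def connected_components(network):
    """Return the connected components of the network, largest first."""
    seen = set()
    components = []
    for start in network:
        if start in seen:
            continue
        component = set()
        stack = [start]
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(network[node] - component)
        seen |= component
        components.append(component)
    return sorted(components, key=len, reverse=True)


# Hypothetical linkages between six proteins.
pairs = [("P1", "P2"), ("P2", "P3"), ("P4", "P5"), ("P3", "P6")]
network = build_network(pairs)
components = connected_components(network)
largest_fraction = len(components[0]) / len(network)
print(components, largest_fraction)
```

As more linkages are detected, `largest_fraction` would be expected to approach 1, which is one way to read the claim that most proteins end up in the network.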

What do you miss?
Tertiary interactions
Regulated interactions
– Subcellular localization dependent
– Cofactor dependent (e.g. hormone-regulated)
Low-affinity (Kd > 10^-6 M)
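Why low affinity is a problem can be illustrated with the standard equilibrium expression for simple 1:1 binding, where the fraction of bait occupied is [prey] / ([prey] + Kd). The sketch below assumes free prey is in excess and uses a hypothetical prey concentration; it does not model how much occupancy a reporter needs to switch on.

```python
def fraction_bound(free_prey_molar, kd_molar):
    """Fraction of bait occupied at equilibrium for simple 1:1 binding."""
    return free_prey_molar / (free_prey_molar + kd_molar)


# Hypothetical free prey concentration of 100 nM.
prey = 1e-7
for kd in (1e-9, 1e-8, 1e-7, 1e-6, 1e-5):
    print(f"Kd = {kd:.0e} M -> fraction bound = {fraction_bound(prey, kd):.3f}")
```

Under these assumptions, occupancy falls off quickly once Kd rises above the prey concentration, so weak interactions may never activate the reporter.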

Cellular Location
Immunolocalization
– Fusion proteins
Prediction
– Membrane vs non-membrane, improved by homology. Which membrane?
– Nuclear vs cytoplasmic
[Figure labels: YFG, GFP]

Drosophila Fusion Project (FlyTrap)
Exon GFP vector
– Inserts fairly randomly.
Fluorescent sort thousands of embryos.
– Find embryos with an insertion that produces GFP expression.
Image
– Capture and analyze images
Curate by hand.
Computer image analysis and classification.

Developmental Localization

Mouse genomic gene expression
Allen Brain Atlas (ABA) is an interactive, genome-wide image database of gene expression in the mouse and human brain.
17,000 mouse gene expression patterns; cortex expression for 2,000 human genes.

Allen Brain Atlas

3D mouse gene expression project
Single gene expression database for the mouse research community.
Integrated in the Mouse Genome Database (MGD) at the Jackson Laboratory.
10,302 expression entries
[Figure: WT1 expression (red) on a section of the E9 (Theiler Stage 14) embryo from the Edinburgh Mouse Atlas. The gut epithelium is shown in yellow and the neural tube in a blue overlay. WT1 is expressed in the presumptive mesothelium of the coelom and in the intermediate mesoderm (ventral to the somites).]

Methods for discovering protein function
Automated Binding Assays
High Throughput Enzyme Assays

Genome-wide Knockouts
Yeast Genome
– Recombination strategy
Mouse Genome
More in Functional Genomics!!!

Essential vs Non-essential
Transcription similar
– >99% essential genes transcribed
  Transcript level 70% higher
– >90% non-essential transcribed
Genome locations similar
– Not clustered
– Essential genes rarely near telomeres

Why only 20% essential?
Redundant
– 8.5% of non-essential had CLOSE homolog in genome (P< )
Essential in another condition
Marginal Benefit

Resources
YEAST
Saccharomyces Genome Deletion Project
– sequence.stanford.edu/group/yeast_deletion_project/deletions3.html
MOUSE
Mouse Phenome Database
– cgi/phenome/mpdcgi?rtn=docs/home
Knockout Mouse Project
–

Genome-Scale Biochemical Assay
Protein arrays: biochemically active

Databases
Relationships between genes/proteins.
How are different types of experimental data integrated?
– Schema
Data quality
– Who curates?
– Who revises?
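One way to picture the schema question is a single record type that carries the evidence type and its curation history, so that results from different experiments can be merged per gene pair. This is a sketch under assumptions: the field names, gene labels, and source names are hypothetical and do not describe any particular database.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Evidence:
    """One experimental observation linking two genes/proteins (hypothetical schema)."""
    gene_a: str
    gene_b: str
    method: str       # kind of experiment, e.g. "two-hybrid" or "coexpression"
    source: str       # where the record came from
    curated_by: str   # who curates
    revision: int = 1  # bumped each time someone revises the record


def integrate(records):
    """Group records by unordered gene pair and collect the supporting methods."""
    merged = defaultdict(set)
    for record in records:
        merged[frozenset((record.gene_a, record.gene_b))].add(record.method)
    return dict(merged)


# Hypothetical records from three different kinds of experiment.
records = [
    Evidence("G1", "G2", "two-hybrid", "lab_A", "curator_1"),
    Evidence("G2", "G1", "coexpression", "lab_B", "curator_2", revision=3),
    Evidence("G3", "G4", "knockout phenotype", "lab_C", "curator_1"),
]
merged = integrate(records)
for pair, methods in merged.items():
    print(sorted(pair), sorted(methods))
```

Pairs supported by more than one independent method could then be ranked above pairs seen only once, which is one common motivation for integrating datasets.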

Proteome Projects
SwissProt (ExPASy)
–
Saccharomyces Genome Database (SGD)
Gene Function Information
– 2-hybrid, functional assignments, pathways.
–
Yale TRIPLES
– Database of TRansposon-Insertion Phenotypes, Localization, and Expression in Saccharomyces.
2-hybrid databases
–

Pathway and interaction databases
KEGG
– Metabolic and signaling pathways
PUMA
– Metabolic and signaling pathways
DIP
– Protein-protein interactions
BIND
– Molecular and genetic interactions

KEGG pathway map
[Figure: histidine metabolism map, with links to the pentose phosphate cycle and purine metabolism. Node labels include PRPP, phosphoribosyl-ATP, phosphoribosyl-AMP, AICAR, L-histidinol-P, L-histidine, histamine, and carnosine.]

Integrating pathway and expression data
A list of genes that are activated, inactivated, or unaffected when two samples are compared becomes more informative if the genes can be placed on pathway maps from which functions can be deduced.
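A minimal sketch of this mapping step: place the changed genes on each pathway and ask whether the overlap is larger than chance, here with a one-sided hypergeometric test. The test is a common choice rather than something the slides specify, and the gene and pathway names are hypothetical.

```python
from math import comb


def hypergeom_pvalue(genome_size, pathway_size, n_changed, n_overlap):
    """P(overlap >= n_overlap) when n_changed genes are drawn at random from the genome."""
    upper = min(pathway_size, n_changed)
    favorable = sum(
        comb(pathway_size, i) * comb(genome_size - pathway_size, n_changed - i)
        for i in range(n_overlap, upper + 1)
    )
    return favorable / comb(genome_size, n_changed)


def map_to_pathways(changed_genes, pathways, genome_size):
    """For each pathway, list the changed genes on it and an enrichment p-value."""
    changed = set(changed_genes)
    results = {}
    for name, members in pathways.items():
        hits = changed & set(members)
        p = hypergeom_pvalue(genome_size, len(members), len(changed), len(hits))
        results[name] = (sorted(hits), p)
    return results


# Hypothetical genome of 10 genes with two small pathways.
pathways = {"pathway_A": ["g1", "g2", "g3"], "pathway_B": ["g4", "g5", "g6", "g7"]}
results = map_to_pathways(["g1", "g2", "g9"], pathways, genome_size=10)
for name, (hits, p) in results.items():
    print(name, hits, round(p, 4))
```

With real data, the p-values would need correction for testing many pathways at once.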