Network Clustering 7000 Yeast interactions among 3000 proteins.

Slides:



Advertisements
Similar presentations
Uses of Cloned Genes sequencing reagents (eg, probes) protein production insufficient natural quantities modify/mutagenesis library screening Expression.
Advertisements

Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Antibodies Analytical Techniques Utilizing Antibodies: flow cytometry
Recombinant DNA Technology
Ameer Effat M. Elfarash Dept. of Genetics Fac. of Agriculture, Assiut Univ. Gene Expression.
Identification of Protein-Protein Interactions by the Yeast 2-Hybrid System Alliance for Cell Signaling Myriad Genetics Inc.
Review: Amino Acid Side Chains Aliphatic- Ala, Val, Leu, Ile, Gly Polar- Ser, Thr, Cys, Met, [Tyr, Trp] Acidic (and conjugate amide)- Asp, Asn, Glu, Gln.
Novel labeling technologies on proteins
Research Methodology of Biotechnology: Protein-Protein Interactions Yao-Te Huang Aug 16, 2011.
A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae Article by Peter Uetz, et.al. Presented by Kerstin Obando.
Protein Purification and Expression MCB 130L, Lecture 2.
Chip arrays and gene expression data. With the chip array technology, one can measure the expression of 10,000 (~all) genes at once. Can answer questions.
Protein Purification and Expression
Protein domains vs. structure domains - an example.
Introduction to biological networks. protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions.
RT-PCR Obtain the internal region of cDNA 1. Mix crude RNA or poly(A) RNA, cDNA primer (one of the amplification primers in 7.) 2. Heat at 65 0 C for 3-15.
Protein-protein Interactions Hsueh-Fen Juan 2003, Mar 31 NTNU.
Affinity Chromatography Yongting Wang Jan07. What is AC? Affinity chromatography (AC) is a technique enabling purification of a biomolecule with respect.
Polymerase chain reaction: Starting with VERY SMALL AMOUNTS OF DNA (sometimes a few molecules), one can amplify the DNA enough to detect it by electrophoresis.
1 Characterization, Amplification, Expression Screening of libraries Amplification of DNA (PCR) Analysis of DNA (Sequencing) Chemical Synthesis of DNA.
CISC667, F05, Lec24, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) DNA Microarray, 2d gel, MSMS, yeast 2-hybrid.
Microarrays: Theory and Application By Rich Jenkins MS Student of Zoo4670/5670 Year 2004.
Protein-Protein Interaction Screens. Bacterial Two-Hybrid System selectable marker RNA polymerase DNA binding protein bait target sequence target.
Why microarrays in a bioinformatics class? Design of chips Quantitation of signals Integration of the data Extraction of groups of genes with linked expression.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Proteome.
A highly abbreviated introduction to proteomics
MACS® Technology for magnetic isolation of cells and molecules
18.7 Isolation, Purification, and Fractionation of Proteins (1)
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
歐亞書局 PRINCIPLES OF BIOCHEMISTRY Chapter 9 DNA-Based Information Technologies.
MN-B-WP II (BInf 2) Bioinformatische Datenbanken Kay Hofmann – Protein Evolution Group Woche 4: Interaktionsdatenbanken.
Research Methodology of Biotechnology: Protein-Protein Interactions
Protein analysis and proteomics (Part 2 of 2). Many of the images in this powerpoint presentation are from Bioinformatics and Functional Genomics by Jonathan.
Last Class 1.Junctions: Occluding Junctions, Anchoring Junctions, Communicating Junctions 2. Occluding Junctions: Tight Junction 3. Anchoring Junctions:
(D) Crosslinking Interacting proteins can be identified by crosslinking. A labeled crosslinker is added to protein X in vitro and the cell lysate is added.
Biological Dynamics Group Central Dogma: DNA->RNA->Protein.
Lecture 5 Post-genomics. Functional genomics (A) Identifying genes from the sequence (B) Gene expression profiling (transcriptome) (C) Model systems.
Finish up array applications Move on to proteomics Protein microarrays.
Microbial Biotechnology Philadelphia University
Protein Interaction (domain domain interaction) Bioinformatics in Biosophy Park, Jong Hwa MRC-DUNN Hills Road Cambridge CB2 2XY England 1 Next : 02/06/2001.
Discovering Macromolecular Interactions. An experimental strategy for identifying new molecular actors in a process candidate approach general screen.
Proteome and interactome Bioinformatics.
Protein-protein interactions “The Interactome” Yeast two-hybrid analysis Yeast two-hybrid analysis Protein chips Protein chips Biochemical purification/Mass.
Proteomics The science of proteomics Applications of proteomics Proteomic methods a. protein purification b. protein sequencing c. mass spectrometry.
Functional Genomics Functional genomic datasets Biological networks Integrating genomic datasets BIO520 BioinformaticsJim Lund.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function.
The Mammalian Protein – Protein Interaction Database and Its Viewing System That Is Linked to the Main FANTOM2 Viewer Genome Research (2003) Speaker: 蔡欣吟.
Proteomics Session 1 Introduction. Some basic concepts in biology and biochemistry.
Central dogma: the story of life RNA DNA Protein.
Reverse Interactomics
Proteomics, the next step What does each protein do? Where is each protein located? What does each protein interact with, if anything? What role does it.
Lecturer: David. * Reverse transcription PCR * Used to detect RNA levels * RNA is converted to cDNA by reverse transcriptase * Then it is amplified.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Announcements: Note that there will be presentations and associated paper summaries for both Thursday and Tuesday classes. The Exam II mean is 81.6 and.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Network Analysis Goal: to turn a list of genes/proteins/metabolites into a network to capture insights about the biological system 1.Types of high-throughput.
Affinity chromatography based strategies “Fishing for partners” Immunoprecipitation approaches The TAP TAG system Two hybrid assay.
Protein-Protein Interactions. A Protein may interact with: –Other proteins –Nucleic Acids –Small molecules Protein Interactions.
Protein-protein Interactions
Interrogation of cross talk between proteins and gene regulatory networks in breast cancer Chambers, Teressa Lee Hiren Karathia Sridhar Hannenhalli.
Protein Seperation Methods
Affinity Chromatography
Protein Complex Discovery
Protein microarrays: prospects and problems
A perspective on proteomics in cell biology
Volume 9, Issue 4, Pages (April 2002)
Protein Complex Discovery
Methods for the Elucidation of Protein-Small Molecule Interactions
Presentation transcript:

Network Clustering 7000 Yeast interactions among 3000 proteins

The Interactome: the Next ‘omic Step Interactome Proteome Transcriptome Genome

protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions Bio-Map

A Parts List Approach to Bike Maintenance What are the shared parts (bolt, nut, washer, spring, bearing), unique parts (cogs, levers)? What are the common parts -- types of parts (nuts & washers)? How many roles can these play? How flexible and adaptable are they mechanically? Where are the parts located?

A Protein may interact with: –Other proteins –Nucleic Acids –Small molecules Protein Interactions

Genome: genes Transcriptome: mRNAs Proteome: proteins > interactions Dimensions of Information Complexity Genomics vs. Post-Genomics Human Genome Human Proteome Transcripts Protein Interaction

Comprehensive Analysis of Complex Protein Structures in the Cell Total Protein Characterization Protein Identification: What’s there Post Translational Modifications: Regulation Quantification: Dynamics Multiprotein Complex/Organelle

Protein-Protein Interactions: The “Interactome” Experimental methods: Mass Spec, yeast 2-hybrid system, microarrays,… Computational techniques: phylogenic profiles, sequence analysis,… 2 challenges: - find which proteins interact (the partners) - find which residues participate in the interactions

Figure 1. General principle of far-Western analysis. The ProFound Far-Western Protein:Protein Interaction Kits follow the non-radiolabeled bait path.

Finding Protein Partners

Approaches

Predicting Protein-Protein Interactions Genome-based approach Proximity of genes on chromosome Genes that appear near each other on a chormosome are often expressed together. They may interact (need confirmation from biology, or annotation) Example: operons Gene 1 Gene 2 Gene 3

Global view of protein family interaction networks for 146 genomes

Structure DB Predicted Human Interactome 

Binding Domain Activation Domain Yeast 2-hybrid system Two hybrid proteins are generated with transcription factor domains Both fusions are expressed in a yeast cell that carries a reporter gene whose expression is under the control of binding sites for the DNA-binding domain

Binding Domain Activation Domain Binding Domain Prot1 Activation Domain Prot2 Yeast 2-hybrid system

Binding Domain Activation Domain Binding Domain Prot1 Activation Domain Prot2 Reporter Gene Promoter Region Binding Domain Prot1 Activation Domain Prot2 mRNA Yeast 2-hybrid system If Prot1 and Prot2 interact: Interaction of bait and prey proteins localizes the activation domain to the reporter gene, thus activating transcription. Since the reporter gene typically codes for a survival factor, yeast colonies will grow only when an interaction occurs.

Principle of the Yeast Two-Hybrid (Y2H) System Reporter Gene DNA Binding Domain Bait Activation Domain Prey ( No Reporter Gene Activity ) Scenario B: Proteins X and Z do not Interact Readout: No growth of yeast colonies Protein Z Protein X DNA Reporter Gene DNA Binding Domain Protein X Bait Protein Y Activation Domain Prey Scenario A: Proteins X and Y do Interact Readout: Yeast colonies grow DNA

Yeast 2-hybrid system In words: A transcription factor is split into 2 domains 2 hybrid proteins are designed, each containing one of the two proteins that are tested If the two proteins interact, the two domains from the transcription factor will interact, causing expression of a (detectable) reporter gene The reporter can be: - essential, in which case the yeast colony dies if the 2 proteins do not interact - reversely, the reporter gene can be attached to a green fluorescent protein Unfortunately, the rate of false positive is high (estimated > 45%)

Bait Design Fragments of proteins representing folded domains are often more effective than the full-length protein in identifying physiologically relevant interactions If the domain structure of a given bait protein was already established, the specific baits were designed to represent one or more folded domains. For cases in which domain structure was not available, a variety of secondary structure prediction algorithms were used to predict domains and thus direct bait design. Baits were designed to cover the entire protein, with several overlapping fragments, as not all baits will work effectively.

Caveats! Y2H data requires validation by secondary assay –Protein complementation –Pull-downs Didn’t observe source library in data analysis Didn’t analyze all bait and prey coordinates to map sites of interaction –Cant assume multi-protein complexes since proteins may be competing for same site of interaction –This level of analysis requires more sophisticated computational approach

Protein-Protein Interactions (1) Affinity Purification (2) Phage Display (3) Yeast 2-hybrid tag affinity support protease cleavage PAGE MS no interaction positive interaction transcription Protein Chips GST-fusion proteins GST-fusion proteins ORF-activation domain fusion cell lysate phage cDNA display library ORF-binding domain fusion wash unbound proteins wash unbound proteins nutritional selection elute bound proteins amplify phage particles grow up surviving yeast colonies MS repeat and/or sequence repeat and/or sequence

General Strategy for Protein Characterization Purification/Enrichment 1-DE2-DESolution Identification Identification Sequencing Sequencing PeptidesProteinor Measurement Analysis Mass Spectrometry

Comprehensive Analysis of Protein-Protein Interactions Co-immunoprecipitation ProteolysisLC/MS/MSLC/LC/MS/MS Identification of Protein Components Identification of Modifications Dynamics of components and modifications Multiprotein Complex Protein Interaction Chromatography Agarose Protein GST Agarose Ig-G Agarose C L TEV TAP-Tagged Proteins Cell Biology/ Genetics

Fishing for Partners Recomb. Protein Histidine-Tag Biotinyl-Tag GST-Protein Flag-Protein Specific Ab Ni +2 Streptavidin Glutathione Specific mAB Protein Bait Ligand

GST “X” “Y” GST pull-down assay Sepharose GSH Sepharose GSH GST “Y”

GST pull-down assay Sepharose GSH GST “X” “Y” Run Western blot Input GST-X GST anti-Y

GST pull-down assay mix and incubate express GST-fusion protein in E.coli pGEX GST gene X prepare protein extract from brain

GST pull-down assay GST-fusion protein GST alone

Protein Bait Specific bait ligand on beads Bait linked to the support Specific Interactions Cellular extract incubated in batch with the immobilised bait Unbound proteins Elution of bait and partner(s) SDS-PAGE separation Fishing for Partners Strategy MS Identification

|BamH1 | |EcoR1 ||SmaI |SalI | XhoI | NotI |... ATC GAA GGT CGT GGG ATC CCC AGG AAT TCC CGG GTC GAC TCG AGC GGC CGC TAG CTT CCA GCA CCC TAG GGG TCC TTA AGG GCC CAG CTG AGC TCG CCG GCG Ile Glu Gly Arg Gly Ile Pro Arg Asn Ser Arg Val Asp Ser Ser Gly Arg... | Factor Xa | Engineered protease site allows removal of fusion partner

His 6 Tag add 6 consecutive His to either end binds metals Addition of a few residues should have minimal effect on recombinant protein Epitope Tag 6-12 amino acids mAb for detection or purification

Immunoprecipitation affinity purification based on isolation of Ag-Ab complexes analyze by gel electrophoresis initially based on centrifugation of large supramolecular complexes [high] and  equal amounts isolation of Ag-Ab complexes protein A-agarose protein G-agarose Bacterial proteins that bind IgG (Fc): protein A (Staphylococcus aureus) protein G (Streptococcus) binds more species and subclasses

Typical IP Protocol 1. Solubilize antigen usually non-denaturing SDS + excess of TX Mix extract and Ab 3. Add protein G-agarose, etc 4. Extensively wash 5. Elute with sample buffer 6. SDS-PAGE 7. Detection protein stain radioactivity G agarose

Protein identification In situ digestion Peptide extraction MALDI-MS

MS-MS of Peptide Mixtures LC MS MS/MS

Counts Mass (m/z) … PPGTGKTLLAK AVANESGANFISVK FYVINGPEIM... Molecular Weight Fragmentation

Leu-Gly-Val-Asp-Glu-

Database IIGHFYDDWCPLK SPAFDSIMAETLK AFDSLPDDIHEK GGILAQSPFLIIK

What can Biacore do for you? SPR (surface plasmon resonance) technology enables real-time detection and monitoring of biomolecular events and provides quantitative information on: 1.Specificity – how specific is the binding between two moelcules? 2.Concentration – how much of a given molecule is present and active? 3.Kinetics – what is the rate of association/dissociation? 4.Affinity – how strong is the binding? 5.Binding partners - provide identification of binding targets by linking SPR to MS

The BIAcore uses an optical method (surface plasmon resonance) to measure changes in refractive index. Macromolecules binding to a sensor surface leads to an increase in refractive index near the surface. How the BIAcore works

A BIAcore sensorgram

MicroArray Analysis of thousands of proteins at one time. Many different types –Antibody arrayed - detect many proteins –Proteins arrayed - detect interacting proteins –Proteins arrayed - detect interacting small molecules –Etc.

Protein:protein interactions

Examples of protein-ligand combinations used in protein microarrays

Gene name Interaction Knowledge from proteomics studies is limited by our inability to analyze efficiently large data sets Proteomics studies highlight the extreme complexity of interactions in a genomic scale. Proteomics is facing the challenge of analyzing large and highly complex and very noisy data sets. Bioinformatics is integrated in proteomics projects to mine data and is becoming more and more important.

CD19 Actin Cytoskeleton PDK1 PI3K  (p110) Btk WISH CAP Dbl CamK II NdkB cdc42 CD22 Fyn SOS2

CD19 Btk Actin Cytoskeleton SOS2 WISH CAP Dbl CamK II NdkB cdc42 PDK1 CD22 Fyn PI3K  (p110) Sam68 Protein 4.1G AIP Cbl-b

Databases of experimental protein interaction data –SPIN-PP: (existing protein-protein interfaces in the PDB) –MIPS: (protein-protein interactions in saccharomyces cerevisae) –InterAct: (protein interactions from literature curation) –DIP: –BIND: Protein Interactions

Protein Interactomics