Protein-protein Interactions June 18, 2015. Why PPI?  Protein-protein interactions determine outcome of most cellular processes  Proteins which are.

Slides:



Advertisements
Similar presentations
Molecular Biomedical Informatics Machine Learning and Bioinformatics Machine Learning & Bioinformatics 1.
Advertisements

MitoInteractome : Mitochondrial Protein Interactome Database Rohit Reja Korean Bioinformation Center, Daejeon, Korea.
PPI network construction and false positive detection Jin Chen CSE Fall 1.
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
Biological networks: Types and sources Protein-protein interactions, Protein complexes, and network properties.
5 EBI is an Outstation of the European Molecular Biology Laboratory. Master title Molecular Interactions – the IntAct Database Sandra Orchard EMBL-EBI.
The IntAct Database Sandra Orchard & Birgit Meldal.
5 EBI is an Outstation of the European Molecular Biology Laboratory. Master title Molecular Interactions – the IntAct Database Sandra Orchard EMBL-EBI.
Computational analysis of protein-protein interactions for bench biologists 2-8 September, Berlin Protein Interaction Databases Francesca Diella.
Research Methodology of Biotechnology: Protein-Protein Interactions Yao-Te Huang Aug 16, 2011.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Protein domains vs. structure domains - an example.
Introduction to biological networks. protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions.
Protein-protein Interactions Hsueh-Fen Juan 2003, Mar 31 NTNU.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Biological networks: Types and origin
Marcotte EM, Pellegrini M, Ng HL, Rice DW, Yeates TO, Eisenberg D. (1999). Detecting protein function and protein-protein interactions from genome sequences.
Protein-Protein Interaction Screens. Bacterial Two-Hybrid System selectable marker RNA polymerase DNA binding protein bait target sequence target.
Protein Interactions and Disease Audry Kang 7/15/2013.
MCB 317 Genetics and Genomics MCB 317 Topic 10, part 5 A Story of Transcription.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
Protein-protein interactions Chapter 12. Stable complex Transient Interaction Transient Signaling Complex Rap1A – cRaf1 Interface 1310 Å 2 Stable complex:
Ch10. Intermolecular Interactions and Biological Pathways
Protein protein interactions
Functional Linkages between Proteins. Introduction Piles of Information Flakes of Knowledge AGCATCCGACTAGCATCAGCTAGCAGCAGA CTCACGATGTGACTGCATGCGTCATTATCTA.
PutidaNET :Interactome database service and network analysis of Pseudomonas putida KT2440 (P. putida KT2440) Korean BioInformation Center (KOBIC) Seong-Jin,
Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and.
MN-B-WP II (BInf 2) Bioinformatische Datenbanken Kay Hofmann – Protein Evolution Group Woche 4: Interaktionsdatenbanken.
Protein analysis and proteomics (Part 2 of 2). Many of the images in this powerpoint presentation are from Bioinformatics and Functional Genomics by Jonathan.
Presentation for Shamir group meeting Interactome under construction: protein-protein interaction and pathway databases 5/1/2011 Based on the papers: Protein-protein.
Interactions and more interactions
Transient Protein-Protein Interactions (TPPI)
Networks and Interactions Boo Virk v1.0.
Finish up array applications Move on to proteomics Protein microarrays.
1 Bio-Trac 40 (Protein Bioinformatics) October 8, 2009 Zhang-Zhi Hu, M.D. Associate Professor Department of Oncology Department of Biochemistry and Molecular.
Discovering Macromolecular Interactions. An experimental strategy for identifying new molecular actors in a process candidate approach general screen.
Proteome and interactome Bioinformatics.
Computational prediction of protein-protein interactions Rong Liu
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
Introduction to the GO: a user’s guide Iowa State Workshop 11 June 2009.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
Introduction to IntAct Pablo Porras Millán, IntAct
TAP(Tandem Affinity Purification) Billy Baader Genetics 677.
Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function.
The Mammalian Protein – Protein Interaction Database and Its Viewing System That Is Linked to the Main FANTOM2 Viewer Genome Research (2003) Speaker: 蔡欣吟.
Reverse Interactomics
Introduction to the GO: a user’s guide NCSU GO Workshop 29 October 2009.
Proteomics, the next step What does each protein do? Where is each protein located? What does each protein interact with, if anything? What role does it.
Biol 729 – Proteome Bioinformatics Dr M. J. Fisher - Protein: Protein Interactions.
Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic Network Science, Vol 292, Issue 5518, , 4 May 2001.
How many interactions are there? ~6,200 genes ~6,200 proteins x 2-10 interactions/protein ~12, ,000 interactions Yeast.
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
Predicting Protein Function Annotation using Protein- Protein Interaction Networks By Tamar Eldad Advisor: Dr. Yanay Ofran Computational Biology.
Copyright OpenHelix. No use or reproduction without express written consent1 1.
1 Genomics Advances in 1990 ’ s Gene –Expressed sequence tag (EST) –Sequence database Information –Public accessible –Browser-based, user-friendly bioinformatics.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
Network Analysis Goal: to turn a list of genes/proteins/metabolites into a network to capture insights about the biological system 1.Types of high-throughput.
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
Affinity chromatography based strategies “Fishing for partners” Immunoprecipitation approaches The TAP TAG system Two hybrid assay.
Protein-protein Interactions
Protein-protein Interactions
Networks and Interactions
Functional organization of the yeast proteome by systematic analysis of protein complexes Presented by Nathalie Kirshman and Xinyi Ma.
Protein-protein Interactions
Protein Complex Discovery
Protein Complex Discovery
Protein-protein Interactions
Presentation transcript:

Protein-protein Interactions June 18, 2015

Why PPI?  Protein-protein interactions determine outcome of most cellular processes  Proteins which are close homologues often interact in the same way  Protein-protein interactions place evolutionary constraints on protein sequence and structural divergence  Pre-cursor to networks

PPI classification  Strength of interaction  Permanent or transient  Specificity  Location within polypeptide chain  Similarity of partners  Homo- or hetero-oligomers  Direct (binary) or a complex  Confidence score

Determining PPIs  Small-scale methods  Co-immunoprecipitation  Affinity chromatography  Pull-down assays  In vitro binding assays FRET, Biacore, AFM  Structural (co-crystals)

PPIs by high-throughput methods  Yeast two hybrid systems  Affinity tag purification followed by mass spectrometry  Protein microarrays  Microarrays/gene co-expression  Implied functional PPIs  Synthetic lethality  Genetic interactions, implied functional PPIs

Yeast two hybrid system Gal4 protein comprises DNA binding and activating domains Binding domain interacts with promoter Measure reporter enzyme activity (e.g. blue colonies) Activating domain interacts with polymerase

Yeast two hybrid system Gal4 protein: two domains do not need to be transcribed in a single protein If they come into close enough proximity to interact, they will activate the RNA polymerase Binding domain interacts with promoter Measure reporter enzyme activity (e.g. blue colonies) Activating domain interacts with polymerase A B Two other protein domains (A & B) interact

Yeast two hybrid system A B  This is achieved using gene fusion  Plasmids carrying different constructs can be expressed in yeast Binding domain as a translational fusion with the gene encoding another protein in one plasmid. Activating domain as a translational fusion with the gene encoding a different protein in a second plasmid. If the two proteins interact, then GAL4 is expressed and blue colonies form

Yeast two hybrid  Advantages  Fairly simple, rapid and inexpensive  Requires no protein purification  No previous knowledge of proteins needed  Scalable to high-throughput  Is not limited to yeast proteins  Limitations  Works best with cytosolic proteins  Tendency to produce false positives

Mass spectrometry  Need to purify protein or protein complexes  Use a affinity-tag system  Need efficient method of recovering fusion protein in low concentration

TAP (tandem affinity purification) Spacer CBP TEV site Protein A Spacer CBP TEV site Protein A Homologous recombination Chromosome PCR product Fusion protein Protein Calmodulin binding peptide

TAP process "Taptag simple" by Chandres - Own work. Licensed under CC BY-SA 3.0 via Wikimedia Commons

TAP  Advantages  No prior knowledge of complex composition  Two-step purification increases specificity of pull-down  Limitations  Transient interactions may not survive 2 rounds of washing  Tag may prevent interactions  Tag may affect expression levels  Works less efficiently in mammalian cells

Other tags  HA, Flag and His  Anti-tag antibodies can interfere with MS analysis  Streptavidin binding peptide (SBP)  High affinity for streptavidin beads  10-fold increase in efficiency of purification compared to conventional TAP tag  Successfully used to identify components of complexes in the Wnt/  -catenin pathway

Nature Cell Biology 4: (2006) The KLHL12-Cullin-3 ubiquitin ligase negatively regulates Wnt-  - catenin pathway by targeting Dishevelled for degradation Used Dsh-2 and Dsh-3 as bait proteins

Binding partners of Bruton’s tyrosine kinase Protein Science 20: (2011) Role in lymphocyte development & B-cell maturation

 MINT – Molecular Interaction Database  >240,000 interactions with 35,000 proteins  Covers multiple speces  DIP -- Database of Interacting Proteins (UCLA)  >79,000 interactions with >27,000 proteins  CCSB – Proteomics base interactomes (Harvard)  Human, viruses, C. elegans, S. cerevisiae  Some unpublished data  IntAct – EBI molecular interaction database  Curated data from multiple sources Databases of protein-protein interactions

Integrated Databases of PPIs  MiMI: Michigan Molecular Interactions  Data merged from several PPI databases; source provenance maintained  Links to literature sources for the PPI  Linked to Entrez Gene, InterPro, Gene ontology  Includes pathway data  Various methods of viewing the data  NOT CURATED  Data only as good as source data

MiMI database

MiMI search results

MiMI Gene Detail Gene Ontology Pathways Interactions

KEGG pathway Each protein name is a link to another page Arrows & lines provide information about the type of interaction

Other viewing options MeSH terms that involve this gene PPI with this gene in Cytoscape Adaptive PubMed search

 On average, two databases curating the same publication agree on 42% of their interactions. Discrepancies between sets of proteins annotated from the same publication are less pronounced, with an average agreement of 62%, but the overall trend is similar  Better agreement on non-vertebrate model organisms data sets than for vertebrates  Isoform complexity is a major issue Literature curation of protein interactions: measuring agreement across databases. Turinsky A.L. et. al. Database, Vol. 2010, Article ID baq026

iRefWeb  Web interface to integrated database of protein- protein interactions  Better review of the records after pulling in the data from the various source databases  Can search by gene name or various IDs, including batch searches.  Does not have the pathway and other information, but has a better measure of confidence of PPI

iRef Web search The search will try to match automatically, both name and species.

MI score: (Mint-inspired) score is a measure of confidence in molecular interactions for interactions between A and B: 1.Total number of unique PubMed publications that support the interactions 2.Cumulative sum of weighted evidence from all 3.The cumulative sum of weighted evidence from all interologs, i.e. interactions containing homologous pairs A' and B'.

Interaction detail

STRING database  Search Tool for the Retrieval of Interacting Genes  Integrates information from existing PPI data sources  Provides confidence scoring of the interactions  Periodically runs interaction prediction algorithms on newly sequenced genomes  v.10 covers >2000 organisms

Networks in STRING database Starting protein

Networks can be expanded 3 indirect interactions

Information about the proteins

Transferring PPI annotation  Most of the high-throughput PPI work is done in model organisms  Can you transfer that annotation a homologous gene in a different organism?

Defining homologs Orthologue of a protein is usually defined as the best- matching homolog in another species  Candidates with significant BLASTP E-value (< )  Having ≥80% of residues in both sequences included in BLASTP alignment  Having one candidate as the best-matching homologue of the other candidate in corresponding organism

Interologs  If two proteins, A and B, interact in one organism and their orthologs, A’ and B’, interact in another species, then the pair of interactions A—B and A’—B’ are called interologs  Align the homologs (A & A’, B & B’) to each other.  Determine the percent identity and the E-value of both alignments  Then calculate the Joint identity and the Joint Evalue Joint identity Joint E-value

Transfer of annotation  Compared interaction datasets between yeast, worm and fly  Assessed chance that two proteins interact with each other based on their joint sequence identities  Performed similar analysis based on joint E-values  All protein pairs with J I ≥ 80% with a known interacting pair will interact with each other  More than half of protein pairs with J E  E -70 could be experimentally verified. Yu, H. et. al. (2004) Genome Res. 14: PMID:

Examples of Protein-Protein Interologs  In C. elegans, mpk-1 was experimentally shown to interact with 26 other proteins (by yeast 2-hybrid)  Ste5 is the homolog of Mpk-1 in S. cerevisiae  Based on the similarity between the interaction partners of mpk-1 and their closest homologs in S. cerevisiae, the interolog approach predicted 5 of the 6 subunits of the Ste5 complex in S. cerevisiae

 This paper has been cited >100 times  Why the interest in predicting protein-protein interactions?  Determining protein-protein interactions is challenging and the high-throughput (genome- wide) methods are still difficult and expensive to conduct  Identifying candidate interaction partners for a targeted pull-down assay is a more viable strategy for most labs

BIPS: BIANA Interolog Prediction Server Based on concept of interolog Pre-defined alignments Can submit list of proteins to get predicted interaction partners Can filter predicted list to increase confidence

Today in computer lab  Tutorial on finding PPIs in your gene list using MiMI or iRefWeb  Exploring a subset of PPIs using the STRING database  Prediction of interactions homologs using the BIPS server  Exercise 4 on protein domain analysis