Protein-protein interactions Chapter 12. Stable complex Transient Interaction Transient Signaling Complex Rap1A – cRaf1 Interface 1310 Å 2 Stable complex:

Slides:



Advertisements
Similar presentations
STRING Prediction of protein networks through integration of diverse large-scale data sets Lars Juhl Jensen EMBL Heidelberg.
Advertisements

Molecular Biomedical Informatics Machine Learning and Bioinformatics Machine Learning & Bioinformatics 1.
Protein structure prediction.. Protein folds. Fold definition: two folds are similar if they have a similar arrangement of SSEs (architecture) and connectivity.
1 Modular Co-evolution of metabolic networks Zhao Jing.
MitoInteractome : Mitochondrial Protein Interactome Database Rohit Reja Korean Bioinformation Center, Daejeon, Korea.
PPI network construction and false positive detection Jin Chen CSE Fall 1.
Computational analysis of protein-protein interactions for bench biologists 2-8 September, Berlin Protein Interaction Databases Francesca Diella.
The STRING database Michael Kuhn EMBL Heidelberg.
Anna R. Panchenko Benjamin A. Shoemaker NCBI / NIH
STRING Modeling of biological systems through cross-species data integration.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Components of a Cell (Eukaryotes) Picture from on-line biology book,on-line biology book,
Cluster analysis of networks generated through homology: automatic identification of important protein communities involved in cancer metastasis Jonsson.
Protein-protein interactions
Biological Gene and Protein Networks
Data-intensive Computing: Case Study Area 1: Bioinformatics B. Ramamurthy 6/17/20151.
Protein-protein interactions Ia. A combined algorithm for genome-wide prediction of protein function. Edward M. Marcotte, Matteo Pellegrini, Michael J.
1 Protein-Protein Interaction Networks MSC Seminar in Computational Biology
Protein interaction Computational (inferred) Experimental (observed)
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Biological networks: Types and origin
Marcotte EM, Pellegrini M, Ng HL, Rice DW, Yeates TO, Eisenberg D. (1999). Detecting protein function and protein-protein interactions from genome sequences.
Affinity chromatography/mass spec Bait protein GST Page 252.
Epistasis Analysis Using Microarrays Chris Workman.
Protein Interactions and Disease Audry Kang 7/15/2013.
Protein-protein Interactions June 18, Why PPI?  Protein-protein interactions determine outcome of most cellular processes  Proteins which are.
BCB 570 Spring Protein-Protein Interaction Networks & methods Julie Dickerson Electrical and Computer Engineering.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Protein Interaction Networks Aalt-Jan van Dijk Applied Bioinformatics, PRI, Wageningen UR & Mathematical and Statistical Methods, Biometris, Wageningen.
A number of slides taken/modified from:
Cédric Notredame (30/08/2015) Chemoinformatics And Bioinformatics Cédric Notredame Molecular Biology Bioinformatics Chemoinformatics Chemistry.
Ch10. Intermolecular Interactions and Biological Pathways
Functional Linkages between Proteins. Introduction Piles of Information Flakes of Knowledge AGCATCCGACTAGCATCAGCTAGCAGCAGA CTCACGATGTGACTGCATGCGTCATTATCTA.
Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and.
Network Analysis and Application Yao Fu
Protein analysis and proteomics (Part 2 of 2). Many of the images in this powerpoint presentation are from Bioinformatics and Functional Genomics by Jonathan.
Modeling Functional Genomics Datasets CVM Lesson 6 11 July 2007Bindu Nanduri.
Biological Pathways & Networks
蛋白质相互作用的生物信息学 高友鹤 中国医学科学院 基础医学研究所. 蛋白质相互作用的生物信息学 1. 实验数据 2. 蛋白质相互作用数据库 3. 高通量实验数据的验证 4. 蛋白质相互作用网络 5. 计算预测蛋白质相互作用.
Interactions and more interactions
Protein-protein interactions Courtesy of Sarah Teichmann & Jose B. Pereira-Leal MRC Laboratory of Molecular Biology, Cambridge, UK EMBL-EBI.
Functional Associations of Protein in Entire Genomes Sequences Bioinformatics Center of Shanghai Institutes for Biological Sciences Bingding.
Improving PPI Networks with Correlated Gene Expression Data Jesse Walsh.
Social behavior of proteins? Rui Alves. Organization of the talk Social behavior of the protein?!?!?!? Using meta text analysis Using phylogenetic profiling.
Network & Systems Modeling 29 June 2009 NCSU GO Workshop.
Proteome and interactome Bioinformatics.
Computational prediction of protein-protein interactions Rong Liu
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
NIGMS Protein Structure Initiative: Target Selection Workshop ADDA and remote homologue detection Liisa Holm Institute of Biotechnology University of Helsinki.
Introduction to IntAct Pablo Porras Millán, IntAct
Anis Karimpour-Fard ‡, Ryan T. Gill †,
Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and.
Other biological databases and ontologies. Biological systems Taxonomic data Literature Protein folding and 3D structure Small molecules Pathways and.
CSCE555 Bioinformatics Lecture 18 Network Biology: Comparison of Networks Across Species Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu.
PPI team Progress Report PPI team, IDB Lab. Sangwon Yoo, Hoyoung Jeong, Taewhi Lee Mar 2006.
Central dogma: the story of life RNA DNA Protein.
I. Prolinks: a database of protein functional linkage derived from coevolution II. STRING: known and predicted protein-protein associations, integrated.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
GO based data analysis Iowa State Workshop 11 June 2009.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Predicting Protein Function Annotation using Protein- Protein Interaction Networks By Tamar Eldad Advisor: Dr. Yanay Ofran Computational Biology.
1 Genomics Advances in 1990 ’ s Gene –Expressed sequence tag (EST) –Sequence database Information –Public accessible –Browser-based, user-friendly bioinformatics.
1 Computational functional genomics Lital Haham Sivan Pearl.
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
Protein-Protein Interactions. A Protein may interact with: –Other proteins –Nucleic Acids –Small molecules Protein Interactions.
Protein-protein Interactions
Protein association networks with STRING
STRING Large-scale data and text mining
Bioinformatics, Vol.17 Suppl.1 (ISMB 2001) Weekly Lab. Seminar
Network biology An introduction to STRING and Cytoscape
Presentation transcript:

Protein-protein interactions Chapter 12

Stable complex Transient Interaction Transient Signaling Complex Rap1A – cRaf1 Interface 1310 Å 2 Stable complex: homodimeric citrate synthase Interface 4890 Å 2 Hydrophobic interfaces “Hydrophilic” interfaces Stable vs. transient protein-protein interactions Multi-domain protein

Using publicly available interaction data 1.There are interactors for your protein in the literature 2.There are databases of interactions where your protein may appear 3.There are homologues of your protein in the protein interaction databases 4.You can predict interactors by other means? 5.This failing, at this point you go back to the bench… Are there know interaction partners for you pet protein? Check if:

Using publicly available interaction data Problems: Low coverage Does not include results from high throughput experiments Gene names may not be consistent 1.Are there interactors for my protein in the literature ?

Using publicly available interaction data 2. Are there databases of interactions where my protein may appear? Some DBs: BIND, MINT (General) + organism specific databases (e.g. MIPS/CYGD) Caution! Check: -the experimental methods used to identify the interaction (e.g. high error rate in large scale yeast-two hybrids) -check the method used to incorporate the interaction in the database (e.g. manual curation vs. literature mining using “intelligent” algorithms)

Experimental techniques Yeast two-hybrid screens MS analysis of tagged complexes Correlated mRNA expression levels

Experimental techniques Yeast two-hybrid screens MS analysis of tagged complexes Correlated mRNA expression levels 90% of genes with conserved co-expression are members of stable complexes Use microarrays to identify co-expression

How good is the data? (von Mering et al., Nature 417:399)

”We estimate that more than half of all current high-throughput interaction data are spurious”

Computational prediction of protein interactions Tryptophan synthetase   fusion 1PII TrpCTrpF Fused in E.coli Unfused in some other genomes (Synechocystis sp. and Thermotoga maritima.) Enright et al (1999) Nature 409:86 Marcotte et al (1999) Science 285: 751 Gene fusion events

Pellegrini et al (1999) PNAS 96: 4285 Computational prediction of protein interactions Phylogenetic profiles

Computational prediction of protein interactions Pre-computed predictions: where to find them?

Identification of functional modules from protein interaction data Graph theory formalisms Custering Messy data Functional modules Pereiral-Leal, Enright and Ouzounis (2003) Proteins in press

14 DIP database Documents protein-protein interactions from experiment –Y2H, protein microarrays, TAP/MS, PDB 55,733 interactions between 19,053 proteins from 110 organisms. Organisms# proteins# interactions Fruit fly705220,988 H. pylori Human E. coli C. elegans Yeast492118,225 Others985401

15 DIP database Duan et al., Mol Cell Proteomics, 2002 Assess quality –Via proteins: PVM, EPR –Via domains: DPV Search by BLAST or identifiers / text URL Dyrk1a GI

16 DIP database Duan et al., Mol Cell Proteomics, 2002 Assess quality –Via proteins: PVM, EPR –Via domains: DPV Search by BLAST or identifiers / text Map expression data

17 DIP/LiveDIP Duan et al., Mol Cell Proteomics, 2002 Records biological state –Post-translational modifications –Conformational changes –Cellular location

18 DIP/Prolinks database Bowers et al., Genome Biol, Records functional association using prediction methods: –Gene neighbors –Rosetta Stone –Phylogenetic profiles –Gene clusters

19 Other functional association databases Phydbac2 (Claverie) Predictome (DeLisi) ArrayProspector (Bork)

20 BIND database Records experimental interaction data 83,517 protein-protein interactions 204,468 total interactions Includes small molecules, NAs, complexes URL Alfarano et al., Nucleic Acids Res, 2005

21 BIND database Displays unique icons of functional classes

22 MPact/MIPS database Records yeast protein-protein interactions Curates interactions: –4,300 PPI –1,500 proteins Guldener et al., Nucleic Acids Res, 2006

23 STRING database Records experimental and predicted protein- protein interactions using methods: –Genomic context –High-throughput –Coexpression –Database/literature mining –URLURL von Mering et al., Nucleic Acids Res., 2005

24 STRING database Graphical interface for each of the evidence types Benchmark against Kegg pathways for rankings

25 STRING database 736,429 proteins in 179 species Uses COGs and homology to transfer annotation

26 More interaction databases IntAct (Valencia) –Open source interaction database and analysis –68,165 interactions from literature or user submissions MINT (Cesareni) –71,854 experimental interactions mined from literature by curators –Uses IntAct data model BioGRID (Tyers) –116,000 protein and genetic interactions

27 InterDom database Predicts domain interactions (~30000) from PPIs Data sources: –Domain fusions –PPI from DIP –Protein complexes –Literature Scores interactions Ng et al., Nucleic Acids Res, 2003

28 Definition of CBM Interacting domain pair – if at least 5 residue-residue contacts between domains (contacts – distance of less than 8 Ǻ) Structure-structure alignments between all proteins corresponding to a given pair of interacting domains Clustering of interface similarity, those with >50% equivalently aligned positions are clustered together Clusters with more than 2 entries define conserved binding mode.

29 DIMA database Pagel et al., Bioinformatics, 2005 Phylogenetic profiles of Pfam domain pairs Uses structural info from iPfam Works well for moderate information content

Prediction of the molecular basis of protein interactions So.. You know your two proteins interact… do you want to know how?

Molecular basis of protein interaction “Tree determinant residues” Rab Ras Rho Arf Ran x REP + _ MSA Prediction Experimental tests Pereira-Leal and Seabra (2001) J. Mol. Biol. Pereira-Leal et al (2003) Biochem. Biophys. Res. Com.

Molecular basis of protein interaction “Tree determinant residues” Continued… Sequence Space algorithm Casari et al (1995) Nat. Struct. Biol 2(2) AMAS (part of a bigger package)

Molecular basis of protein interaction In silico docking Requires 3D structures of components Conformational changes cannot be considered (rigid body)