A Tutorial of the PrePPI Database Presenters: Gabriel Leis and Katrina Sherbina Loyola Marymount University Departments of Biology and Computer Science.

Slides:



Advertisements
Similar presentations
Science Department Open House Integrating Technology into Science Courses.
Advertisements

+ Multi-label Classification using Adaptive Neighborhoods Tanwistha Saha, Huzefa Rangwala and Carlotta Domeniconi Department of Computer Science George.
MitoInteractome : Mitochondrial Protein Interactome Database Rohit Reja Korean Bioinformation Center, Daejeon, Korea.
Deterministic Global Parameter Estimation for a Budding Yeast Model T.D Panning*, L.T. Watson*, N.A. Allen*, C.A. Shaffer*, and J.J Tyson + Departments.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Integration of Protein Family, Function, Structure Rich Links to >90 Databases Value-Added Reports for UniProtKB Proteins iProClass Protein Knowledgebase.
Consistent probabilistic outputs for protein function prediction William Stafford Noble Department of Genome Sciences Department of Computer Science and.
The STRING database Michael Kuhn EMBL Heidelberg.
Training a Neural Network to Recognize Phage Major Capsid Proteins Author: Michael Arnoult, San Diego State University Mentors: Victor Seguritan, Anca.
Cluster analysis of networks generated through homology: automatic identification of important protein communities involved in cancer metastasis Jonsson.
Protein databases Morten Nielsen. Background- Nucleotide databases GenBank, National Center for Biotechnology Information.
25. Lecture WS 2003/04Bioinformatics III1 Integrating Protein-Protein Interactions: Bayesian Networks - Lot of direct experimental data coming about protein-protein.
1 Protein-Protein Interaction Networks MSC Seminar in Computational Biology
August 29, 2002InforMax Confidential1 Vector PathBlazer Product Overview.
QuASI: Question Answering using Statistics, Semantics, and Inference Marti Hearst, Jerry Feldman, Chris Manning, Srini Narayanan Univ. of California-Berkeley.
UCB BioText TREC 2003 Participation Participants: Marti Hearst Gaurav Bhalotia, Presley Nakov, Ariel Schwartz Track: Genomics, tasks 1 and 2.
Blast heuristics Morten Nielsen Department of Systems Biology, DTU.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Functional Linkages between Proteins. Introduction Piles of Information Flakes of Knowledge AGCATCCGACTAGCATCAGCTAGCAGCAGA CTCACGATGTGACTGCATGCGTCATTATCTA.
Introduction to Gene Mining Part B: How similar are plant and human versions of a gene? After completing part B, you will demonstrate How to use NCBI BLASTp.
Additional Unit 2 Lecture Notes New Instructional Design Focus School of Education Additional Unit 2 Lecture Notes New Instructional Design Focus School.
An Ontology for Protein- Protein Interaction Data Karen Jantz CIS Honors Project December 7, 2006.
Information Need Question Understanding Selecting Sources Information Retrieval and Extraction Answer Determina tion Answer Presentation This work is supported.
Improving PPI Networks with Correlated Gene Expression Data Jesse Walsh.
HUMAN-MOUSE CONSERVED COEXPRESSION NETWORKS PREDICT CANDIDATE DISEASE GENES Ala U., Piro R., Grassi E., Damasco C., Silengo L., Brunner H., Provero P.
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Inferring strengths of protein-protein interactions from experimental data using linear programming Morihiro Hayashida, Nobuhisa Ueda, Tatsuya Akutsu Bioinformatics.
Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae Speaker: Chunhui Cai.
Computational prediction of protein-protein interactions Rong Liu
Discovering the Correlation Between Evolutionary Genomics and Protein-Protein Interaction Rezaul Kabir and Brett Thompson
A Study of Residue Correlation within Protein Sequences and its Application to Sequence Classification Christopher Hemmerich Advisor: Dr. Sun Kim.
MolIDE2: Homology Modeling Of Protein Oligomers And Complexes Qiang Wang, Qifang Xu, Guoli Wang, and Roland L. Dunbrack, Jr. Fox Chase Cancer Center Philadelphia,
Meng-Han Yang September 9, 2009 A sequence-based hybrid predictor for identifying conformationally ambivalent regions in proteins.
Metabolic Network Inference from Multiple Types of Genomic Data Yoshihiro Yamanishi Centre de Bio-informatique, Ecole des Mines de Paris.
CSCE555 Bioinformatics Lecture 18 Network Biology: Comparison of Networks Across Species Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu.
PPI team Progress Report PPI team, IDB Lab. Sangwon Yoo, Hoyoung Jeong, Taewhi Lee Mar 2006.
Using structure in protein function annotation: predicting protein interactions Donald Petrey, Cliff Qiangfeng Zhang, Raquel Norel, Barry Honig Howard.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Program Computational Intelligence, Learning, and Discovery Program.
A collaborative tool for sequence annotation. Contact:
Western New York Genetics in Research Partnership Expanding Exposure, Career Exploration and Interactive Projects in Basic Genome Analysis and Bioinformatics.
Introduction to biological molecular networks
GO based data analysis Iowa State Workshop 11 June 2009.
 Signal Transduction transmits signals from outside to the inside of the cell  Integer Linear Programming model is used to unravel STN.
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Create and assess protein networks through molecular characteristics of individual proteins Yanay Ofran et al. ISMB ’06 Presenter: Danhua Guo 12/07/2006.
Final Report (30% final score) Bin Liu, PhD, Associate Professor.
Computer Science and Engineering PhD in Computer Science Monday, November 07, :00 a.m. – 11:00 a.m. Swearingen Conference Room 3A75 Network Based.
Summer Bioinformatics Workshop 2008 BLAST Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State University – Rochester Center
Improving compound–protein interaction prediction by building up highly credible negative samples Toward more realistic drug-target interaction predictions.
BME435 BIOINFORMATICS.
Bioinformatics Overview
Yiming Yang1,2, Abhay Harpale1 and Subramanian Ganaphathy1
Biological Databases By: Komal Arora.
Review of Circadian Gene Database
STRING Large-scale data and text mining
Bioinformatics Madina Bazarova. What is Bioinformatics? Bioinformatics is marriage between biology and computer. It is the use of computers for the acquisition,
miRPathDB: A Specialized Professional Database with Upkeep Concerns
Hood College Master of Science in Bioinformatics (Proposed)
SynechoNET: integrated protein interaction database of Synechocystis
STRING Protein networks from data and text mining
PIR: Protein Information Resource
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
Ensembl Genome Repository.
Similarity of gene expression level (A) and nucleosome occupancy profile (B) between paralog pairs. Similarity of gene expression level (A) and nucleosome.
Network biology An introduction to STRING and Cytoscape
Bioinformatics analysis of the small TbTim amino acid sequences.
Structural features of Rps26a.
Presentation transcript:

A Tutorial of the PrePPI Database Presenters: Gabriel Leis and Katrina Sherbina Loyola Marymount University Departments of Biology and Computer Science October 3, 2013

Outline The PrePPI Database Catalogs Predicted and Experimentally Determined Protein-Protein Interactions Overview of the Method to Determine The Probability of Protein-Protein Interactions Step-by-Step Tutorial on Using the Database PrePPI Is a "Meta" Database That is Electronically Curated Future Plans to Fix Some of the Problems With the Current Version of the Database Conclusion

The PrePPI Database Catalogs Predicted and Experimentally Determined Protein-Protein Interactions Specific to Homo sapiens and Saccharomyces cerevisiae Amino acid sequence of interacting proteins Functional information about the query protein Links to experimental evidence for protein interaction Answers question regarding the Likelihood of an interaction Possible structure of the interacting proteins

Overview of the Method to Determine The Probability of Protein-Protein Interactions Zhang et al. (2012) Nature 490(7421).

Overview of the Method to Determine The Probability of Protein-Protein Interactions Zhang et al. (2012) Nature 490(7421).

Outline The PrePPI Database Catalogs Predicted and Experimentally Determined Protein-Protein Interactions Overview of the Method to Determine The Probability of Protein-Protein Interactions Step-by-Step Tutorial on Using the Database PrePPI Is a "Meta" Database That is Electronically Curated Future Plans to Fix Some of the Problems With the Current Version of the Database Conclusion

Type In the Protein ID of Interest

The Predicted Interactions Are Displayed In Decreasing Order of Likelihood ………

A Likelihood Ratio (LR) Determines How Much the Evidence Contributes to the Predicted PPI c 1 = S = Structural Model c 2 = G = GO Term Similarity c 3 = E = Protein Essentiality (for Yeast) c 4 = M = MIPS Term Similarity (for Yeast) c 5 = C = Co-expression c 6 = P = Phylogenetic Profile Similarity a.edu/PrePPI/help.html

The Prediction and Database LR are Combined to Get the Final Probability for the Interaction Database LR = LR for 1 of 7 categories that an interaction recorded in a database belongs to Category 1 = Interactions present in multiple databases Categories 2 – 7 = interactions present in only 1 database

3D Structure of Interacting Proteins Available for Those With A Structural LR > 50

Outline The PrePPI Database Catalogs Predicted and Experimentally Determined Protein-Protein Interactions Overview of the Method to Determine The Probability of Protein-Protein Interactions Step-by-Step Tutorial on Using the Database PrePPI Is a "Meta" Database That is Electronically Curated Future Plans to Fix Some of the Problems With the Current Version of the Database Conclusion

PrePPI Is a "Meta" Database That is Electronically Curated Developed by the Honig Lab (Columbia University and HHMI) Funding from NIH and China Scholarship Council Licenses available for commercial and academic institutions Latest version released 1/27/2012

Future Plans to Fix Some of the Problems With the Current Version of the Database Problems No evaluation of structure beyond the final LR score No attention paid to oligomerization state No differentiation between compartmental paralogs Future Plans Implement methods to create more reliable structures Expand to other model organisms

Conclusion The PrePPI database catalogs protein-protein interactions both predicted and experimentally determined Likelihood ratios (LR) are computed to determine how probable is a predicted interaction A potential 3D representation of the interacting proteins is available for those interactions with an LR>50 PrePPI is a “meta" database that is electronically curated Future plans include expanding the database to include more organisms and developing methods to create more reliable interaction structures

References Zhang et al. (2012). Structure-based prediction of protein– protein interactions on a genome-wide scale. Nature, 490(7421), Retrieved from html Zhang et al. (2013). PrePPI: a structure-informed database of protein-protein interactions. Nucleic Acids Research, 41, Retrieved from Zhang et al. (Designer). (2012, October 25). Predicting protein–protein interactions using PrePPI [Web Graphic]. Retrieved from e11503_F1.html