PPI team Progress Report PPI team, IDB Lab. Sangwon Yoo, Hoyoung Jeong, Taewhi Lee Mar 2006.

Slides:



Advertisements
Similar presentations
STRING Prediction of protein networks through integration of diverse large-scale data sets Lars Juhl Jensen EMBL Heidelberg.
Advertisements

1 Modular Co-evolution of metabolic networks Zhao Jing.
MitoInteractome : Mitochondrial Protein Interactome Database Rohit Reja Korean Bioinformation Center, Daejeon, Korea.
Using phylogenetic profiles to predict protein function and localization As discussed by Catherine Grasso.
The STRING database Michael Kuhn EMBL Heidelberg.
STRING Modeling of biological systems through cross-species data integration.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Biological Gene and Protein Networks
Data-intensive Computing: Case Study Area 1: Bioinformatics B. Ramamurthy 6/17/20151.
The STRING Database What it does and how it interfaces to other resources The STRING Database What it does and how it interfaces to other resources Christian.
Subsystem Approach to Genome Annotation National Microbial Pathogen Data Resource Claudia Reich NCSA, University of Illinois, Urbana.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
>>> Korean BioInformation Center >>> KRIBB Korea Research institute of Bioscience and Biotechnology GS2PATH: Linking Gene Ontology and Pathways Jin Ok.
Protein-protein interactions Chapter 12. Stable complex Transient Interaction Transient Signaling Complex Rap1A – cRaf1 Interface 1310 Å 2 Stable complex:
Ch10. Intermolecular Interactions and Biological Pathways
Metagenomic Analysis Using MEGAN4
Functional Linkages between Proteins. Introduction Piles of Information Flakes of Knowledge AGCATCCGACTAGCATCAGCTAGCAGCAGA CTCACGATGTGACTGCATGCGTCATTATCTA.
Beyond the Human Genome Project Future goals and projects based on findings from the HGP.
Functional Associations of Protein in Entire Genomes Sequences Bioinformatics Center of Shanghai Institutes for Biological Sciences Bingding.
EGAN: Exploratory Gene Association Networks by Jesse Paquette Biostatistics and Computational Biology Core Helen Diller Family Comprehensive Cancer Center.
Gene Regulatory Network Inference. Progress in Disease Treatment  Personalized medicine is becoming more prevalent for several kinds of cancer treatment.
Improving PPI Networks with Correlated Gene Expression Data Jesse Walsh.
BLAST: A Case Study Lecture 25. BLAST: Introduction The Basic Local Alignment Search Tool, BLAST, is a fast approach to finding similar strings of characters.

Copyright OpenHelix. No use or reproduction without express written consent1.
Network & Systems Modeling 29 June 2009 NCSU GO Workshop.
Discovering the Correlation Between Evolutionary Genomics and Protein-Protein Interaction Rezaul Kabir and Brett Thompson
Cell Signaling Ontology Takako Takai-Igarashi and Toshihisa Takagi Human Genome Center, Institute of Medical Science, University of Tokyo.
Genomics for Librarians Stuart M. Brown, Ph.D. Director, Research Computing, NYU School of Medicine.
Introduction to the GO: a user’s guide Iowa State Workshop 11 June 2009.
Anis Karimpour-Fard ‡, Ryan T. Gill †,
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill
Metabolic Network Inference from Multiple Types of Genomic Data Yoshihiro Yamanishi Centre de Bio-informatique, Ecole des Mines de Paris.
P HYLO P AT : AN UPDATED VERSION OF THE PHYLOGENETIC PATTERN DATABASE CONTAINS GENE NEIGHBORHOOD Presenter: Reihaneh Rabbany Presented in Bioinformatics.
Structural Models Lecture 11. Structural Models: Introduction Structural models display relationships among entities and have a variety of uses, such.
EB3233 Bioinformatics Introduction to Bioinformatics.
Generic Database. What should a genome database do? Search Browse Collect Download results Multiple format Genome Browser Information Genomic Proteomic.
Anis Karimpour-Fard 1, Corrella Detweiler 2, Ryan T. Gill 3, and Lawrence Hunter 1 1 University of Colorado School of Medicine 2 MCD-Biology, University.
I. Prolinks: a database of protein functional linkage derived from coevolution II. STRING: known and predicted protein-protein associations, integrated.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
Functional and Evolutionary Attributes through Analysis of Metabolism Sophia Tsoka European Bioinformatics Institute Cambridge UK.
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
A collaborative tool for sequence annotation. Contact:
Introduction to biological molecular networks
GO based data analysis Iowa State Workshop 11 June 2009.
David Wishart February 18th, 2004 Lecture 3 BLAST (c) 2004 CGDN.
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
A Tutorial of the PrePPI Database Presenters: Gabriel Leis and Katrina Sherbina Loyola Marymount University Departments of Biology and Computer Science.
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
Create and assess protein networks through molecular characteristics of individual proteins Yanay Ofran et al. ISMB ’06 Presenter: Danhua Guo 12/07/2006.
1 Computational functional genomics Lital Haham Sivan Pearl.
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
High throughput biology data management and data intensive computing drivers George Michaels.
Bioinformatics Overview
Networks and Interactions
Data-intensive Computing: Case Study Area 1: Bioinformatics
Protein association networks with STRING
STRING Large-scale data and text mining
FLiPS Functional Linkage Prediction Service.
Bioinformatics Madina Bazarova. What is Bioinformatics? Bioinformatics is marriage between biology and computer. It is the use of computers for the acquisition,
STRING Protein networks from data and text mining
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
LESSON 1 INTNRODUCTION HYE-JOO KWON, Ph.D /
Bioinformatics, Vol.17 Suppl.1 (ISMB 2001) Weekly Lab. Seminar
Network biology An introduction to STRING and Cytoscape
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
SUBMITTED BY: DEEPTI SHARMA BIOLOGICAL DATABASE AND SEQUENCE ANALYSIS.
Presentation transcript:

PPI team Progress Report PPI team, IDB Lab. Sangwon Yoo, Hoyoung Jeong, Taewhi Lee Mar 2006

2 Contents  Introduction  Roles and schedule  Work completed  Work in progress  Work remains to be done  References  Paper work schedule

3 Introduction(1/2)  Protein Protein Interaction (PPI)  단백질 상호작용  Proteins working on the same pathway  Proteins forming a protein complex  Detection of protein interaction  Experiments  Computational methods  Gene Context Analysis  Gene Context Analysis  Utilizing the information of gene location, co-occurrence and fusion events

4 Introduction(2/2)  Issues  How to improve the quality of prediction database  Modeling the gene context with the probability  Scoring the interactions  How to interpret the prediction result  Visualization of the interaction network  Mapping proteins to functions

5 Roles and schedule(1/3)  Roles of members  Sangwon Yoo  Analysis of the existing models and algorithms  Designing a new interpretation method  Hoyoung Jeong  Implementation of the system  Focus on the efficiency of the processing  Taewhi Lee  Preprocessing of the data  BLAST management  User interface

6 Roles and schedule(2/3) Scheduled TaskStatus Data preprocessingdelayed Algorithm analysisdone Algorithm implementationdone User interfacedelayed Total80%

7 Roles and schedule(3/3)  Problems  BLAST search  Time consuming jobs  From 486 sequences to sequences per organism in 168 species  3 or 4 species a day for blast search  Examinations  TOEFL, 전문연구요원선발시험

8 Work completed(1/4)  Implemented algorithms  Phylogenetic profiles method  Using the co-occurrences of the genes  Hypergeometric distribution Genome 1Genome 2Genome 3Genome 4 Gene a1100 Gene b1100 Gene c0101

9 Work completed(2/4)  Gene cluster method  Using the distance between genes in a genome  Finding an operon structure in a microbial organism  Poisson distribution †Operon: An operon is a collection of inter-related genes including one which acts as a switch that governs the expression of the structural genes in the collection.

10 Work completed(3/4)  Gene Neighbor method  Using the order of genes  Finding the conserved ‘close’ †Close: a set of genes occurring on a prokaryotic chromosome if and only if they all occur on the same strand and the gaps between adjacent genes are 300 bp or less

11 Work completed(4/4)  Gene Fusion method  Analysis of gene fusion events  Detecting proteins carrying out consecutive metabolic steps  Detecting proteins being components of molecular complexes  Hypergeometric distribution

12 Work in progress(1/3)  User Interface  Input: NCBI ids, protein name  Output  Make a list of interacting proteins  Drawing the interaction network  Utilizing the public graph drawing API

13 Work in progress(2/3) GI: Query protein 1.public database identifiers 2.gene name, protein name Methods PP GN GC RS confidence Select methods 2. Set confidence value

14 Work in progress(3/3) Functional Links methodidentifierconfidencename PP mfd PP recG GN murG …………

15 Work remains to be done(1/2)  MAR~APR  Input: sequence, other ids  Output  Detailed information  Integration of other application information  Pathway maps  Localization information

16 Work remains to be done(2/2)  MAY~JUN  Input: keyword  Output  Predicted functions  Go terms for molecular function, biological process and cellular component  Research  Improvement of the phylogenetic profile method  Interpretation of the interaction network  Integration of other applications

17 References(1/2)  Prolinks  Institute for Genomics and Proteomics, UCLA  Prolinks : a database of protein functional linkages derived from coevolution Bowers PM, Pellegrini M, Thompson MJ, Fierro J, Yeates TO, Eisenberg D Genome Biology 2004, 5(5):R35  Assigning protein functions by comparative genome analysis: Protein phylogenetic profiles Pellegrini M, Marcotte EM, Thompson MJ, Eisenberg D, Yeates TO PNAS 1999, 96(8):  A combined algorithm for genome-wide prediction of protein function Marcotte EM, Pellegrini M, Thompson MJ, Yeates TO, Eisenberg D Nature 1999, 402:83-86  Providing the appropriate probability models  More prediction links, higher accuracy

18 References(2/2)  String  Search Tool for the Retrieval of Interacting Genes/Proteins  European Molecular Biology Laboratory, Germany  STRING: known and predicted protein-protein associations, integrated and transferred across organisms von Mering C, Jensen LJ, Snel B, Hooper SD, Krupp M, Foglierini M, Jouffre N, Huynen MA, Bork P. Nucleic Acids Res Jan 1;33(Database issue):D  STRING: a database of predicted functional associations between proteins von Mering C, Huynen M, Jaeggi D, Schmidt S, Bork P, Snel B. Nucleic Acids Res Jan 1;31(1):  STRING: a web-server to retrieve and display the repeatedly occurring neighbourhood of a gene Snel B, Lehmann G, Bork P, Huynen MA Nucleic Acids Res Sep 15;28(18):  Using experimental data and expression data  Providing many links to additional information

19 Paperwork schedule  Domestic journal  This semester  Topic: Interaction interpretation method  Author: Sangwon Yoo, Hoyoung Jeong, Taewhi Lee, Mikyoung Lee, Cheolgoo Hur, Hyoung-Joo Kim  Topic: Efficient processing of phylogenetic profile method  Author: Hoyoung Jeong, Sangwon Yoo, Taewhi Lee, Cheolgoo Hur, Hyoung-Joo Kim