Phyre2 Dr. Lawrence Kelley Structural Bioinformatics Group

Slides:



Advertisements
Similar presentations
Assignment of PROSITE motifs to topological regions: Application to a novel database of well characterised transmembrane proteins Tim Nugent.
Advertisements

Secondary structure prediction from amino acid sequence.
Blast to Psi-Blast Blast makes use of Scoring Matrix derived from large number of proteins. What if you want to find homologs based upon a specific gene.
Hidden Markov Models (1)  Brief review of discrete time finite Markov Chain  Hidden Markov Model  Examples of HMM in Bioinformatics  Estimations Basic.
Pfam(Protein families )
EBI is an Outstation of the European Molecular Biology Laboratory. Alex Mitchell InterPro team Using InterPro for functional analysis.
Profile Hidden Markov Models Bioinformatics Fall-2004 Dr Webb Miller and Dr Claude Depamphilis Dhiraj Joshi Department of Computer Science and Engineering.
1 Profile Hidden Markov Models For Protein Structure Prediction Colin Cherry
Hidden Markov models for detecting remote protein homologies Kevin Karplus, Christian Barrett, Richard Hughey Georgia Hadjicharalambous.
درس بیوانفورماتیک December 2013 مدل ‌ مخفی مارکوف و تعمیم ‌ های آن به نام خدا.
Bioinformatics “Other techniques raise more questions than they answer. Bioinformatics is what answers the questions those techniques generate.” SheAvery
Applications of Hidden Markov Models in the Avian/Mammalian Genome Comparison Christine Bloom Animal Science College of Agriculture University of Delaware.
Sequence Similarity Searching Class 4 March 2010.
Predicting the Function of Single Nucleotide Polymorphisms Corey Harada Advisor: Eleazar Eskin.
Optimatization of a New Score Function for the Detection of Remote Homologs Kann et al.
Protein RNA DNA Predicting Protein Function. Biochemical function (molecular function) What does it do? Kinase??? Ligase??? Page 245.
Non-coding RNA William Liu CS374: Algorithms in Biology November 23, 2004.
Machine Learning for Protein Classification Ashutosh Saxena CS 374 – Algorithms in Biology Thursday, Nov 16, 2006.
Profile-profile alignment using hidden Markov models Wing Wong.
Protein structure (Part 2 of 2).
Protein RNA DNA Predicting Protein Function. Biochemical function (molecular function) What does it do? Kinase??? Ligase??? Page 245.
Protein Modules An Introduction to Bioinformatics.
Protein Structure and Function Prediction. Predicting 3D Structure –Comparative modeling (homology) –Fold recognition (threading) Outstanding difficult.
Introduction to Bioinformatics - Tutorial no. 8 Protein Prediction: - PROSITE - Pfam - SCOP - TOPITS - genThreader.
Predicting Function (& location & post-tln modifications) from Protein Sequences June 15, 2015.
Cédric Notredame (30/08/2015) Chemoinformatics And Bioinformatics Cédric Notredame Molecular Biology Bioinformatics Chemoinformatics Chemistry.
Matlab Bioinformatics Toolkit Evaluation Kanishka Bhutani.
Fast Search Protein Structure Prediction Algorithm for Almost Perfect Matches1 By Jayakumar Rudhrasenan S Primary Supervisor: Prof. Heiko Schroder.
Sequence analysis: Macromolecular motif recognition Sylvia Nagl.
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
Module 3 Sequence and Protein Analysis (Using web-based tools) Working with Pathogen Genomes - Uruguay 2008.
2 o structure, TM regions, and solvent accessibility Topic 13 Chapter 29, Du and Bourne “Structural Bioinformatics”
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Bioinformatics Ayesha M. Khan 9 th April, What’s in a secondary database?  It should be noted that within multiple alignments can be found conserved.
A Tutorial of Sequence Matching in Oracle Haifeng Ji* and Gang Qian** * Oklahoma City Community College ** University of Central Oklahoma.
Protein and RNA Families
Structure prediction: Homology modeling
PROTEIN PHYSICS LECTURES 22-23
Biocomputation: Comparative Genomics Tanya Talkar Lolly Kruse Colleen O’Rourke.
PROTEIN PATTERN DATABASES. PROTEIN SEQUENCES SUPERFAMILY FAMILY DOMAIN MOTIF SITE RESIDUE.
Sequence Based Analysis Tutorial March 26, 2004 NIH Proteomics Workshop Lai-Su L. Yeh, Ph.D. Protein Science Team Lead Protein Information Resource at.
Protein Homologue Clustering and Molecular Modeling L. Wang.
March 28, 2002 NIH Proteomics Workshop Bethesda, MD Lai-Su Yeh, Ph.D. Protein Scientist, National Biomedical Research Foundation Demo: Protein Information.
Point Specific Alignment Methods PSI – BLAST & PHI – BLAST.
Exercises Pairwise alignment Homology search (BLAST) Multiple alignment (CLUSTAL W) Iterative Profile Search: Profile Search –Pfam –Prosite –PSI-BLAST.
Big Data Bioinformatics By: Khalifeh Al-Jadda. Is there any thing useful?!
Doug Raiford Phage class: introduction to sequence databases.
David Wishart February 18th, 2004 Lecture 3 BLAST (c) 2004 CGDN.
Combining Evolutionary Information Extracted From Frequency Profiles With Sequence-based Kernels For Protein Remote Homology Detection Name: ZhuFangzhi.
Computational Biology, Part C Family Pairwise Search and Cobbling Robert F. Murphy Copyright  2000, All rights reserved.
Bioinformatics Research Overview Li Liao Develop new algorithms and (statistical) learning methods > Capable of incorporating domain knowledge > Effective,
HomologyIf twp proteins are homologous, they have a common fold and a common ancestor If two proteins have >25% identity across their entire length, they.
Using the Fisher kernel method to detect remote protein homologies Tommi Jaakkola, Mark Diekhams, David Haussler ISMB’ 99 Talk by O, Jangmin (2001/01/16)
Post translational modification n- acetylation Peptide Mass Fingerprinting (PMF) is an analytical technique for identifying unknown protein. Proteins to.
Genome Annotation (protein coding genes)
Demo: Protein Information Resource
Hidden Markov Models (HMM)
Pfam: multiple sequence alignments and HMM-profiles of protein domains
Genome Annotation Continued
Predict Protein Sequence by Fuzzy-Association Rules
Bioinformatics and BLAST
Bioinformatics Biological Data Computer Calculations +
Sequence Based Analysis Tutorial
PEDNA Notes Mon, Oct 27, 2008 Marc L. Smith
Dr Tan Tin Wee Director Bioinformatics Centre
Sequence Based Analysis Tutorial
Molecular Modeling By Rashmi Shrivastava Lecturer
The future of protein secondary structure prediction accuracy
Bioinformatics Lecture 2 By: Dr. Mehdi Mansouri
Protein Structural Classification
Presentation transcript:

Phyre2 Dr. Lawrence Kelley Structural Bioinformatics Group Imperial College London

Phyre2 SVYDAAAQLTADVKKDLRDSW KVIGSDKKGNGVALMTTLFAD NQETIGYFKRLGNVSQGMAND KLRGHSITLMYALQNFIDQLD NPDSLDLVCS……. Predict the 3D structure adopted by a user-supplied protein sequence

How does Phyre2 work?

Phyre2 ARDLVIPMIYCGHGY Homologous sequences User sequence Search the 10 million known sequences for homologues using PSI-Blast.

An evolutionary fingerprint Phyre2 HMM ARDLVIPMIYCGHGY User sequence PSI-Blast Hidden Markov model Capture the mutational propensities at each position in the protein An evolutionary fingerprint

Phyre2 ~ 65,000 known 3D structures

Phyre2 ~ 65,000 known 3D structures

Phyre2 Extract sequence HAPTLVRDC……. ~ 65,000 known 3D structures

Phyre2 ~ 65,000 known 3D structures Extract sequence HAPTLVRDC……. PSI-Blast

Phyre2 ~ 65,000 known 3D structures Extract sequence HAPTLVRDC……. PSI-Blast HMM Hidden Markov model for sequence of KNOWN structure

Phyre2 ~ 65,000 known 3D structures ~ 65,000 hidden Markov models HMM

Hidden Markov Model Database of Phyre2 Hidden Markov Model Database of KNOWN STRUCTURES ~ 65,000 known 3D structures

An evolutionary fingerprint Phyre2 HMM ARDLVIPMIYCGHGY PSI-Blast Hidden Markov model Capture the mutational propensities at each position in the protein An evolutionary fingerprint

Hidden Markov Model DB of KNOWN Phyre2 HMM ARDLVIPMIYCGHGY PSI-Blast Hidden Markov Model DB of KNOWN STRUCTURES HMM-HMM matching Alignments of user sequence to known structures ranked by confidence. ARDL--VIPMIYCGHGY AFDLCDLIPV--CGMAY Sequence of known structure

Hidden Markov Model DB of KNOWN Phyre2 HMM ARDLVIPMIYCGHGY PSI-Blast Hidden Markov Model DB of KNOWN STRUCTURES HMM-HMM matching ARDL--VIPMIYCGHGY 3D-Model AFDLCDLIPV--CGMAY Sequence of known structure

Hidden Markov Model DB of KNOWN Phyre2 HMM ARDLVIPMIYCGHGY PSI-Blast Hidden Markov Model DB of KNOWN STRUCTURES Very powerful – able to reliably detect extremely remote homology HMM-HMM matching Routinely creates accurate models even when sequence identity is <15% ARDL--VIPMIYCGHGY 3D-Model AFDLCDLIPV--CGMAY Sequence of known structure