PROTEIN MODELLING Presented by Sadhana S.

Slides:



Advertisements
Similar presentations
1 Introduction to Sequence Analysis Utah State University – Spring 2012 STAT 5570: Statistical Bioinformatics Notes 6.1.
Advertisements

Tutorial Homology Modelling. A Brief Introduction to Homology Modeling.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Protein Tertiary Structure Prediction
Structural bioinformatics
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Tertiary protein structure viewing and prediction July 1, 2009 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Lecture 1 BNFO 240 Usman Roshan. Course overview Perl progamming language (and some Unix basics) Sequence alignment problem –Algorithm for exact pairwise.
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Homology modelling ? X-ray ? NMR ?. Homology Modelling !
©CMBI 2002 Homology modelling ? X-ray ? NMR ? Intro Proteins Modelling 8 Steps Detect Threading Alignment Template Side chain Indels Optimize Validate.
The Protein Data Bank (PDB)
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Tertiary protein structure modelling May 31, 2005 Graded papers will handed back Thursday Quiz#4 today Learning objectives- Continue to learn how to manipulate.
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Protein structure prediction May 30, 2002 Quiz#4 on June 4 Learning objectives-Understand difference between primary secondary and tertiary structure.
Protein structure determination & prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray.
1 Protein Structure Prediction Charles Yan. 2 Different Levels of Protein Structures The primary structure is the sequence of residues in the polypeptide.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Homology Modeling of Proteins Abhishek Tripathi Biomedicum, Helsinki.
Protein Structures.
Bioinformatics Ayesha M. Khan Spring 2013.
Protein Structure Prediction and Analysis
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Protein Tertiary Structure Prediction
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Tertiary Structure Prediction Methods Any given protein sequence Structure selection Compare sequence with proteins have solved structure Homology Modeling.
Macromolecular structure
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
COMPARATIVE or HOMOLOGY MODELING
Lecture 10 – protein structure prediction. A protein sequence.
Fast Search Protein Structure Prediction Algorithm for Almost Perfect Matches1 By Jayakumar Rudhrasenan S Primary Supervisor: Prof. Heiko Schroder.
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
© Wiley Publishing All Rights Reserved. Protein 3D Structures.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Construction of Substitution Matrices
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Applied Bioinformatics Week 12. Bioinformatics & Functional Proteomics How to classify proteins into functional classes? How to compare one proteome with.
1 Protein Structure Prediction (Lecture for CS397-CXZ Algorithms in Bioinformatics) April 23, 2004 ChengXiang Zhai Department of Computer Science University.
Module 3 Protein Structure Database/Structure Analysis Learning objectives Understand how information is stored in PDB Learn how to read a PDB flat file.
Structure prediction: Homology modeling
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Predicting Protein Structure: Comparative Modeling (homology modeling)
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Protein Structure Prediction Graham Wood Charlotte Deane.
Homology Modeling 原理、流程,還有如何用該工具去預測三級結構 Lu Chih-Hao 1 1.
Construction of Substitution matrices
Structural alignment methods Like in sequence alignment, try to find best correspondence: –Look at atoms –A 3-dimensional problem –No a priori knowledge.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Lecture 10 CS566 Fall Structural Bioinformatics Motivation Concepts Structure Solving Structure Comparison Structure Prediction Modeling Structural.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Forces and Prediction of Protein Structure Ming-Jing Hwang ( 黃明經 ) Institute of Biomedical Sciences Academia Sinica
Computational Structure Prediction
Protein Structure Prediction and Protein Homology modeling
Protein dynamics Folding/unfolding dynamics
Protein Structure Prediction
Protein Structures.
Molecular Modeling By Rashmi Shrivastava Lecturer
Homology Modeling.
Protein structure prediction.
Basic Local Alignment Search Tool
Homology modeling in short…
Presentation transcript:

PROTEIN MODELLING Presented by Sadhana S

definition Protein structure prediction/protein modelling is the prediction of the three-dimensional structure of protein from its amino acid sequence i.e., the prediction of its folding & its secondary, tertiary, & quaternary structure from its primary structure

Why to predict protein structure? Owing to significant efforts in genome sequencing over nearly three decades, gene sequences from many organism have been deduced. Over 100 million nucleotide sequences from over 300 thousand different organisms have been deposited in the major DNA databases, DDBJ/ EMBL/GenBank totaling almost 200 billion nucleotide bases. Over 5 million of these nucleotide sequences have been translated into amino acid sequences and deposited in the UniProtKB database.

However, the protein sequences themselves are usually insufficient for determining protein function as the biological function of proteins is intrinsically linked to three dimensional protein structure. The most accurate structural characterization of proteins is provided by X-ray crystallography and NMR spectroscopy. Owing to the technical difficulties and labor intensiveness of these methods, the number of protein structures solved by experimental methods lags far behind the accumulation of protein sequences

Many proteins are simply too large for NMR analysis and cannot be crystallized for X-ray diffraction. Protein modeling(computational methods) is the only way to obtain structural information if experimental techniques fail. The ultimate goal of protein modeling is to predict a structure from its sequence with an accuracy that is comparable to the best results achieved experimentally.

Can we predict structure from sequence?

Computational Methods The three major approaches for three- dimensional (3D) structure predictions are Ab initio methods Threading methods Comparative modelling / homology modelling

What is Homology Modelling? It is the prediction of the three-dimensional structure of a given protein sequence (target) based on an alignment to one or more known protein structures (templates). If similarity between the target sequence and the template sequence is detected, structural similarity can be assumed.

Homology Modelling Homology modeling, also known as Comparative modeling of protein is the technique which allows to construct an unknown atomic-resolution model of the "target" protein from: 1. Its amino acid sequence and 2.An experimental 3Dstructure of a related homologous protein (the "template").

Basis for homology modelling? Structure of a protein is uniquely determined by its amino acid sequence Structure is much more conserved than sequence during evolution. Proteins sharing high sequence similarity should have similar protein fold. Higher the similarity, higher is the confidence in the modeled structure.

1. Template recognition & initial alignment 2. Alignment corrections Homology modeling is a multistep process that can be summarized in seven steps: 1. Template recognition & initial alignment 2. Alignment corrections 3. Backbone generation 4. Loop modeling 5. Side-chain modeling 6. Model optimization 7. Model validation

TEMPLATE RECOGNITION Achieved by searching the PDB of known protein structures using the target sequence as the query. Templates can be found using the target sequence as a query for searching using FASTA or BLAST, & PSI-BLAST or PDB-BLAST Select the best template(min.30%) from a library of known protein structures derived from the PDB.

ALIGNMENT Purpose – to propose the homologies between the sites in two or more sequences Insertions & deletions are placed Types Pairwise alignment Multiple alignment

gap penalties Scoring alignments Alignment algorithms Correct alignment is necessary to create the most probable 3D structure of the target. If sequences aligns incorrectly, it will result in false positive or negative results. Important steps to consider: gap penalties Scoring alignments Alignment algorithms

Alignment Corrections Alignments are scored (substitution score) in order to define similarity between 2 amino acid residues in the sequences A substitutions score is calculated for each aligned pair of letters. Alignment algorithms- DPA, BLAST & FASTA

example Structure of alignment 1 and 2 with the template

Alignment Outcome The (true) alignment indicates the evolutionary process giving rise to the different sequences starting from the same ancestor sequence and then changing through mutations (insertions, deletions, and substitutions)

It they are same- side chain is also included BACKBONE GENERATION One simply copies the coordinates of those template residues that show up in the alignment with the model sequence If two aligned residues differ- only backbone coordinates(N, C-alpha, C & O) are copied It they are same- side chain is also included

Backbone Generation For SCRs - copy coordinates from known structures. For variable regions (VR) - copy from known structure, if the residue types are similar; otherwise, use databases for loop sequences.

Loop Modelling Knowledge based- PDB is searched Energy based- energy function is used to judge the quality of loop Molecular modeling/dynamic programs are used

Loop Modelling

Side Chain Modelling 1. Use of rotamer libraries (backbone dependent) 2. Molecular mechanics optimization - Dead-end elimination (heuristic) - Monte Carlo (heuristic) - Branch & Bound (exact)

Model refinement/optimization Idealization of bond geometry Removal of unfavorable non-bonded contacts Performed by energy minimization with force fields such as CHARMM, AMBER, or GROMOS Major errors are removed

Evaluation/validation of the model Internal evaluation Self-consistency checks Assessment of stereochemistry of the model PROCHECK & WHATCHECK External evaluation Tests whether a correct template was used PROSA & VERIFY3D

Applications Designing mutants to test hypotheses about the function of a protein. Identifying active & binding sites. Predicting antigenic epitopes. Simulating protein-protein docking. Confirming a remote structural relationship.

Web servers Swiss- model server (http://www.expasy.ch/swissmod/) CPHModels (http://www.cbs.dtu.dk/services/CPHmodels/) SDSC1 (http://www.cl.sdsc.edu/hm) FAMS (http://www.physchem.pharm.kitasato- u.ac.jp/FAMS/fams.html) ModWeb (http://www.guitar.rockefeller.edu/modweb)

References Zhumur Ghosh & Bibekanand mallik. bioinformatics- Principles & applications. Oxford university press S C Rastogi, N.Mendiratta, & P Rastogi. Bioinformatics- methods & applications. Eastern economy edition. Prentice hall of India. New Delhi Philip.E.Bourne & Helge Wiessig. Structural Bioinformatics. John Wiley & Sons. NewYork C A Orengo, D T Jones & J M Thornton. Bioinformatics- gene, proteins, & computers. BIOS . Scientific Publishers