Polish Infrastructure for Supporting Computational Science in the European Research Space EUROPEAN UNION Examining Protein Folding Process Simulation and.

Slides:



Advertisements
Similar presentations
Polska Infrastruktura Informatycznego Wspomagania Nauki w Europejskiej Przestrzeni Badawczej Institute of Computer Science AGH ACC Cyfronet AGH The PL-Grid.
Advertisements

Protein – Protein Interactions Lisa Chargualaf Simon Kanaan Keefe Roedersheimer Others: Dr. Izaguirre, Dr. Chen, Dr. Wuchty, ChengBang Huang.
Protein Structure and Physics. What I will talk about today… -Outline protein synthesis and explain the basic steps involved. -Go over the Chemistry of.
Undergraduate Exercises with Trp Cage Paula Evans, Chet Fornari, Jeff Hansen, Jennifer Inlow, Larry Merkle.
A 3-D reference frame can be uniquely defined by the ordered vertices of a non- degenerate triangle p1p1 p2p2 p3p3.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Structural bioinformatics
Chapter 9 Structure Prediction. Motivation Given a protein, can you predict molecular structure Want to avoid repeated x-ray crystallography, but want.
Protein-DNA interactions: amino acid conservation and the effects of mutations on binding specificity Nicholas M. Luscombe and Janet M. Thornton JMB (2002)
FLEX* - REVIEW.
Discovery of RNA Structural Elements Using Evolutionary Computation Authors: G. Fogel, V. Porto, D. Weekes, D. Fogel, R. Griffey, J. McNeil, E. Lesnik,
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Tertiary protein structure modelling May 31, 2005 Graded papers will handed back Thursday Quiz#4 today Learning objectives- Continue to learn how to manipulate.
Similar Sequence Similar Function Charles Yan Spring 2006.
High Throughput Processing of the Structural Information of the Protein Data Bank Zoltán Szabadka, Vince Grolmusz Department of Computer Science Eötvös.
Detecting the Domain Structure of Proteins from Sequence Information Niranjan Nagarajan and Golan Yona Department of Computer Science Cornell University.
© Wiley Publishing All Rights Reserved. Biological Sequences.
Protein Structures.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Protein Tertiary Structure Prediction
Cédric Notredame (30/08/2015) Chemoinformatics And Bioinformatics Cédric Notredame Molecular Biology Bioinformatics Chemoinformatics Chemistry.
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space The Capabilities of the GridSpace2 Experiment.
Presented by Qiong Cheng in cs6640 Instructor: Prof. Irene T. Weber Prof. Robert W. Harrison Identify an Obesity Association INSIG2.
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
Guiding Motif Discovery by Iterative Pattern Refinement Zhiping Wang, Mehmet Dalkilic, Sun Kim School of Informatics, Indiana University.
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Levels of Protein Structure
EMBL-EBI Adel Golovin MSDsite The project is funded by the European Commission as the TEMBLOR, contract-no. QLRI-CT under the RTD programme.
A new Approach to Structural Prediction of Proteins Heiko Schröder Bertil Schmidt Jiujiang Zhu School of Computer Engineering Nanyang Technological University.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
T.Jadczyk, Bioinformatics Applications in the Virtual Laboratory Bioinformatics Applications in the Virtual Laboratory Tomasz Jadczyk AGH University of.
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
MSDmotif 1 Adel Golovin Protein Site and Motif search Biosapiense network of excellence.
Construction of Substitution Matrices
Bioinformatics Ayesha M. Khan 9 th April, What’s in a secondary database?  It should be noted that within multiple alignments can be found conserved.
1 Proteins Protein functions include: 1. enzyme catalysts 2. defense 3. transport 4. support 5. motion 6. regulation 7. storage Chapter 3- part 2.
Python + PyMOL Arbitrary Python code possible within python scripts PyMOL functionality available by importing PyMOL's modules cmd, cgo, stored etc. Call.
Module 3 Protein Structure Database/Structure Analysis Learning objectives Understand how information is stored in PDB Learn how to read a PDB flat file.
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Sequence Based Analysis Tutorial March 26, 2004 NIH Proteomics Workshop Lai-Su L. Yeh, Ph.D. Protein Science Team Lead Protein Information Resource at.
3DM: Protein engineering Super-family platforms Bio-Prodict DM super-family systems Henk-Jan Joosten Remko Kuipers Tom v/d Bergh Bas Vroling.
Chemistry XXI Unit 3 How do we predict properties? M1. Analyzing Molecular Structure Predicting properties based on molecular structure. M4. Exploring.
Themes: Structure meets Function
Construction of Substitution matrices
BMC Bioinformatics 2005, 6(Suppl 4):S3 Protein Structure Prediction not a trivial matter Strict relation between protein function and structure Gap between.
Query sequence MTYKLILNGKTKGETTTEAVDAATAEKVFQYANDN GVDGEWTYTE Structure-Sequence alignment “Structure is better preserved than sequence” Me! Non-redundant.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
V diagonal lines give equivalent residues ILS TRIVHVNSILPSTN V I L S T R I V I L P E F S T Sequence A Sequence B Dot Plots, Path Matrices, Score Matrices.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space The Capabilities of the GridSpace2 Experiment.
What is Protein Folding? Implications of Misfolding Computational Techniques Background image: Staphylococcal protein A, Z Domain (
An Efficient Index-based Protein Structure Database Searching Method 陳冠宇.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
Protein Structure and Function. Proteins are organic compounds made from amino acids held together by peptide bonds.
EBI is an Outstation of the European Molecular Biology Laboratory. A web based integrated search service to understand ligand binding and secondary structure.
PROTEIN IDENTIFIER IAN ROBERTS JOSEPH INFANTI NICOLE FERRARO.
Sequence: PFAM Used example: Database of protein domain families. It is based on manually curated alignments.
Computational Structure Prediction
PDBemotif A web based integrated search service to understand ligand binding and secondary structure properties in macromolecular structures.
Getting the Most out of the PDBe
There are four levels of structure in proteins
Sequence Based Analysis Tutorial
Protein Structures.
6th International Conference on
Protein structure prediction.
Volume 124, Issue 5, Pages (March 2006)
What DNA sequence codes for this chain of amino acids...?
Volume 17, Issue 5, Pages (May 2009)
Presentation transcript:

Polish Infrastructure for Supporting Computational Science in the European Research Space EUROPEAN UNION Examining Protein Folding Process Simulation and Searching for Common Structure Motifs in a Protein Family as Experiments in the GridSpace2 Virtual Laboratory T. JADCZYK, M. Malawski, M. Bubak, and I. Roterman ACC Cyfronet AGH Cracow Grid Workshop Krakow,

2 Outline Protein folding process and Fuzzy Oil Drop (FOD) model FOD algorithm FOD experiment workflow Sequences and proteins' structures comparison Tools used Conservation score Searching for common structure motifs - experiment workflow

3 Fuzzy Oil Drop model  Protein Structure – chains, amino acids, atoms  Protein Folding – primary, secondary, tetriary structure  Fuzzy Oil Drop is Based on Kauzmann's Oil-Drop model of protein molecule  Assumes folding process directed by water environment  Extended from discrete one to 3-D Gauss function used to describe the idealized hydrophobicity distribution  Hydrophobic: "Water hating". Amino acids that prefer to be in a non-aqueous (lipid) environment because they cannot make favorable interactions with water.  The highest hydrophobicity concentration is expected in the center of the protein body, with the decrease of its values toward the surface, where the hydrophobicity is expected to be close to zero

4 Fuzzy Oil Drop – algorithm structure  1. Input: PDB file  2. Simplify protein's residues to „effective atoms” representation  3. Find two furthest atoms, Move protein to origin center, rotate  4. Determine theoretical hydrophobicity distribution  5. Calculate observed hydrophobicity,  6. Test similarity of both distributions  7. Store results:  PDBID, chain, chain length, organism, method, function  O/T, O/R values  Future: O/T, O/R profiles

5 Fuzzy Oil Drop – algorithm structure

6 Fuzzy Oil Drop – experiment workflow

7 Fuzzy Oil Drop – GS2 Experiment Workbench

8 Fuzzy Oil Drop – experiment results  Input: PDB Database (March 2011), files, 11.4GB  Final Results: proteins, chains

9 Structure and Sequence comparison  Search for conservative areas to discover protein function or find ligand binding site  Comparison on three levels of protein description:  Amino acid sequence  Structural codes  3D structures  W score to determine area's conservativeness  Used alignment tools:  ClustalW – multiple sequences alignment  Mammoth – multiple structures alignment

10 Structure and sequence comparison - experiment

11 Thanks for your attention!