FLEX* - REVIEW.

Slides:



Advertisements
Similar presentations
Scientific & technical presentation Structure Visualization with MarvinSpace Oct 2006.
Advertisements

Case Study: Dopamine D 3 Receptor Anthagonists Chapter 3 – Molecular Modeling 1.
Label Placement and graph drawing Imo Lieberwerth.
3D Molecular Structures C371 Fall Morgan Algorithm (Leach & Gillet, p. 8)
A 3-D reference frame can be uniquely defined by the ordered vertices of a non- degenerate triangle p1p1 p2p2 p3p3.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Geometric Algorithms for Conformational Analysis of Long Protein Loops J. Cortess, T. Simeon, M. Remaud- Simeon, V. Tran.
Molecular Docking G. Schaftenaar Docking Challenge Identification of the ligand’s correct binding geometry in the binding site ( Binding Mode ) Observation:
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Structural bioinformatics
Tertiary protein structure viewing and prediction July 1, 2009 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
Alignment of Flexible Molecular Structures. Motivation Proteins are flexible. One would like to align proteins modulo the flexibility. Hinge and shear.
Agenda A brief introduction The MASS algorithm The pairwise case Extension to the multiple case Experimental results.
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Docking of Protein Molecules
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Tertiary protein structure modelling May 31, 2005 Graded papers will handed back Thursday Quiz#4 today Learning objectives- Continue to learn how to manipulate.
BL5203: Molecular Recognition & Interaction Lecture 5: Drug Design Methods Ligand-Protein Docking (Part I) Prof. Chen Yu Zong Tel:
Molecular Docking Using GOLD Tommi Suvitaival Seppo Virtanen S Basics for Biosystems of the Cell Fall 2006.
Protein Structure Prediction Samantha Chui Oct. 26, 2004.
Model Database. Scene Recognition Lamdan, Schwartz, Wolfson, “Geometric Hashing”,1988.
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
BIONFORMATIC ALGORITHMS Ryan Tinsley Brandon Lile May 9th, 2014.
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Chapter 9 Superposition and Dynamic Programming 1 Chapter 9 Superposition and dynamic programming Most methods for comparing structures use some sorts.
A genetic algorithm for structure based de-novo design Scott C.-H. Pegg, Jose J. Haresco & Irwin D. Kuntz February 21, 2006.
A computational study of protein folding pathways Reducing the computational complexity of the folding process using the building block folding model.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
Classifier Evaluation Vasileios Hatzivassiloglou University of Texas at Dallas.
Fundamentals of Algorithms MCS - 2 Lecture # 7
Flexible Multi-scale Fitting of Atomic Structures into Low- resolution Electron Density Maps with Elastic Network Normal Mode Analysis Tama, Miyashita,
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
In silico discovery of inhibitors using structure-based approaches Jasmita Gill Structural and Computational Biology Group, ICGEB, New Delhi Nov 2005.
Protein Classification II CISC889: Bioinformatics Gang Situ 04/11/2002 Parts of this lecture borrowed from lecture given by Dr. Altman.
File Storage Organization The majority of space on a device is reserved for the storage of files. When files are created and modified physical blocks are.
SimBioSys Inc.© Slide #1 Enrichment and cross-validation studies of the eHiTS high throughput screening software package.
SimBioSys Inc.© 2004http:// Conformational sampling in protein-ligand complex environment Zsolt Zsoldos SimBioSys Inc., © 2004 Contents:
A two-state homology model of the hERG K + channel: application to ligand binding Ramkumar Rajamani, Brett Tongue, Jian Li, Charles H. Reynolds J & J PRD.
Altman et al. JACS 2008, Presented By Swati Jain.
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
R L R L L L R R L L R R L L water DOCKING SIMULATIONS.
Time-Efficient Flexible Superposition of Medium-sized Molecules Presented by Tamar Sharir (Lemmen & Lengauer)
MINRMS: an efficient algorithm for determining protein structure similarity using root-mean-squared-distance Andrew I. Jewett, Conrad C. Huang and Thomas.
FlexWeb Nassim Sohaee. FlexWeb 2 Proteins The ability of proteins to change their conformation is important to their function as biological machines.
Polish Infrastructure for Supporting Computational Science in the European Research Space EUROPEAN UNION Examining Protein Folding Process Simulation and.
Structural alignment methods Like in sequence alignment, try to find best correspondence: –Look at atoms –A 3-dimensional problem –No a priori knowledge.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Surflex: Fully Automatic Flexible Molecular Docking Using a Molecular Similarity-Based Search Engine Ajay N. Jain UCSF Cancer Research Institute and Comprehensive.
Onlinedeeneislam.blogspot.com1 Design and Analysis of Algorithms Slide # 1 Download From
Mean Field Theory and Mutually Orthogonal Latin Squares in Peptide Structure Prediction N. Gautham Department of Crystallography and Biophysics University.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
ISA Kim Hye mi. Introduction Input Spectrum data (Protein database) Peptide assignment Peptide validation manual validation PeptideProphet.
Find the optimal alignment ? +. Optimal Alignment Find the highest number of atoms aligned with the lowest RMSD (Root Mean Squared Deviation) Find a balance.
Shape2Pose: Human Centric Shape Analysis CMPT888 Vladimir G. Kim Siddhartha Chaudhuri Leonidas Guibas Thomas Funkhouser Stanford University Princeton University.
A new protein-protein docking scoring function based on interface residue properties Reporter: Yu Lun Kuo (D )
Generating, Maintaining, and Exploiting Diversity in a Memetic Algorithm for Protein Structure Prediction Mario Garza-Fabre, Shaun M. Kandathil, Julia.
Simplified picture of the principles used for multiple copy simultaneous search (MCSS) and for computational combinatorial ligand design (CCLD). Simplified.
Reduce the need for human intervention in protein model building
Virtual Screening.
Ligand Docking to MHC Class I Molecules
Complementarity of Structure Ensembles in Protein-Protein Binding
Protein Structure Alignment
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Presentation transcript:

FLEX* - REVIEW

Agenda Introduction Main concepts FlexE FlexS Evaluation and Discussion

Introduction Methods for the prediction of binding properties of molecules to proteins. Classification by the amount of information available about the target protein

The general schema Modeling Algorithm Ligand conformational flexibility Modeling Receptor-ligand interactions Scoring function Base selection Algorithm Base placement Incremental construction

The Ligand conformational flexibility Approximated by a discrete set of conformations. rotatable single bond - modeled by a discrete set of preferred torsion angles from the MIMUMBA DB. Ring system - A set of ring conformations is computed with the program CORINA.

The model of receptor-ligand interactions Modeled by a few special types of interactions hydrogen bonds metal acceptors bonds hydrophobic contacts

The model of protein-ligand interactions – Cont. To each interaction group, we assign: Interaction types Interaction geometry ( center + surface)

Two groups interact if : The centers of the groups lie approximately on the surface of the counter group. The interaction types are compatible The intermolecular interactions can be classified by the strength of their geometric constrains

Scoring function Estimates the free binding energy in the complex The function is additive in the ligand atoms. match score contact score

Overall docking algorithm Ligand fragmentation Select & Place a set of base fragments Construct the ligand by linking the remaining fragments.

Ligand fragmentation The ligand is decomposed into components by cutting at each acyclic bond. Fragmentation is a partition of the components of the molecule, such that every part, called fragment, is connected in the component tree.

Ligand fragmentation Good results are produced if the added fragments are small Every fragment, except for the base fragment consist of only one component.

Selecting a base fragment The problem: Find a fragment which leads to low energy docking solution. Good base fragment properties: Placeability Specificity

Selecting a base fragment –Cont. We look for fragments maximizing the function:

Rules for selecting a set of fragments No base fragment is fully contained in another base fragment Each component occurs in at most two base fragments Each component in a base fragment must be either necessary for the connectivity of the fragment or it must have interaction centers.

The base placement algorithm Goal: find positions of the base fragment in the active site such that sufficient number of favorable interactions between the fragment and the protein can occur simultaneously. Solution: pose clustering.

The base placement algorithm – Cont. Preparation: Store all triangles of interaction points (IP) of the protein in a hash table. Find all the compatible fragment IP’s triangles. Clustering of the legal transformations

The incremental construction algorithm Input: solution set - set of partial placements with the ligands with the ligands constructed up to and including fragment i-1 Output: set of partial placements with the ligands with the ligands constructed up to and including fragment i

The complex construction algorithm – cont. Adding the next fragment in all the possible conformations Reject extended placements that have strong overlap with the receptor or internal overlap with the ligand. Searching for new interactions Optimizing the positions of the partial ligand Selecting a new solution set Clustering the solution set

Optimizing the positions of the partial ligand The placement is optimized when: New interactions are found. The placement contains slightly overlapping atoms between the receptor and the ligand.

Selecting a new solution set Select k best-scoring solution Problem: the scoring values cannot be compared directly when different fragments are involved. Solution: estimate the score of the whole ligand, given a partial placement.

Clustering partial solutions If no placement contains the other, the distance is infinity Otherwise, the distance is defined to be the RMSD of the intersecting atoms. A cluster is reduced to a single placement.

Protein flexibility - motivation Induced fit – side chain or even backbone adjustments upon docking of different ligands to the same protein. Even small conformational changes are critical for docking applications e.g. if a rotate able bond prevents a ligand from binding in the correct position.

Protein flexibelity Main idea: describe the protein structure variations with a set of protein structures representing the flexibility, mutation or alternative models of a protein. The variability considered by flexE is defined by the differences within the given input structures.

United protein description Data structure that administers the protein structures variations. Contains an ensemble of up to 30 possible conformation of the protein. Most of them are low energy conformations of the same protein.

United protein description - construction Superposition Clustering Add picture - 8

Notation Component : all the atoms which belong to the same amino acid or mutation of the amino acid. Contains a backbone part and a side chain part Part : set of instances Instance : one of the alternative conformations.

United protein description - clustering The superimposed structures are combined by clustering each part separately Complete linkage hierarchical cluster The clustered instances can be recombined to form new valid protein structures. (2) Regardless of the structures from which they originally stem. ===>>>>> the structures we dock into are not limited to the original ensemble structures.

Incompatibility Two instances of the united protein description are incompatible if they cannot be realized simultaneously. Logical: two instances are alternative to each other Geometric: two logically compatible instances overlap Structural: two instances of the same chain are unconnected

Incompatibility graph The last type of incoppatibility is aminly meant to avoid the combination of instances belonging to different loop comformations. [!!!!!!! -------- -------- why???????????????]

Incompatibility graph The incompatibility is internally represented as a graph by using the instances as nodes and the connecting pairs of incompatible node by an edge. Valid protein structures correspond to independent set in the graph.

Selection of instances The ligand is placed fragment by fragment into the active site by the incremental construction algorithm. After each construction step, all possible interactions are determined. Apply the scoring function for each instance. We chose the IS with the highest score.

chose the IS with the highest score. Select the optimal IS The IS can be assembled from IS of the connected components. Apply a modified version of the Bron-Kerbosch algorithm.

Evaluation FlexE was evaluated with ten protein structures ensembles containing 105 crystal structure from the PDB. The structures within the ensemble highly similar backbone trace Different conformations for several side chains.

Evaluation – Cont. FlexE finds a ligand position with RMSD below 2 A in 67% of the cases. Average CPU time for the incremental construction algorithm is 5.5 minutes.

Discussion The ensemble approach is able to cope with several side-chains conformations and even movements of loops. Motions of larger backbone segments or even domains movements are not covered by this approach.

flexS - motivation In drug design, often enough, no structural information about a particular receptor is available. Considerable number of different ligands are known together with their binding affinities towards the receptor.

flexS - overview A method for structurally superpositing pairs of ligands, approximating their putative binding site geometry. Main Applications ligand superpositioning Virtual Screening

Implementation in flexS RigFit – fast rigid-body placement using Fourier space methods. Incremental construction Systematic parameter study

Two Base Placement Methods Target: Place a rigid molecule fragment onto the reference ligand Combinatorial placement procedure Numerical placement procedure

RigFit Optimizes the common volume of two molecule expressed by various Gaussian functions associated to different physicochemical properties. Solves the combinatorial placement problem.

Variable Sequence Construction The sequence in which fragments are added is selected dynamically depending on the actual placement. Effective in cases where the flexible test ligand partially extends beyond the reference ligand.

Dynamically selection of the next fragment Each partial placement is associated with a list of candidate fragments. Evaluation of the next fragment considers: The amount of expected overlap with the reference The number of potential interaction in the candidate fragment The size of the substructure tree rooted at the candidate fragment.

Dynamically selection of the next fragment – Cont. Nbus –number of buildup states. Deviation from the original sequence only if a better sequence is found If flexS exceeds Nbus upper limit, it returns to the original sequence

Evaluation The performance of the algorithm depends on the size of the superimposed ligands. In reproduction of 284 alignments, 60% reproduces with RMSD below A.

Questions?

Thank you!

Scoring & Selection strategy Total score fo the partial ligand

FlexS Flow Test ligand Reference ligand fragmentation Placement of the anchor molecule Add a fragment that adopts a discrete set of conformations

The physicochemical model The conformational space of the ligand The model of protein-ligand interactions Scoring function

United protein description - superposition Assumption: highly similar backbone traces -> superposition by fitting the backbone atoms of the particular structures. This procedure emphasizes the differences and improves the fitting in conserved regions of structures. [why ???] Superposition in two steps: 1. 2. Iteration of the first step. Pairs of atoms with a distance greater that a user defined threshold are ignored for the next step.

Surface and interaction geometries