Protein Interfaces, Surfaces and Assemblies

Slides:



Advertisements
Similar presentations
SOMA2 – Drug Design Environment. Drug design environment – SOMA2 The SOMA2 project Tekes (National Technology Agency of Finland) DRUG2000 program.
Advertisements

Configuration management
Protein Structure.
Design Concepts and Principles
EMBL-EBI MSDpisa a web service for studying Protein Interfaces, Surfaces and Assemblies Eugene Krissinel
Recent developments 1) Tests (outlier analysis) and Bug fixing ( with Paul) 2) Regeneration of Values of Bonds and Bond-angles existing all structures.
Case Studies Class 5. Computational Chemistry Structure of molecules and their reactivities Two major areas –molecular mechanics –electronic structure.
A Molecular Replacement Pipeline Garib Murshudov Chemistry Department, University of York 
The Jukebox Orian Paz & Yair Cleper Instructor: Viktor Kulikov Semester: Spring 2009 Final Presentation.
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Molecular Docking Using GOLD Tommi Suvitaival Seppo Virtanen S Basics for Biosystems of the Cell Fall 2006.
High Throughput Processing of the Structural Information of the Protein Data Bank Zoltán Szabadka, Vince Grolmusz Department of Computer Science Eötvös.
26-28 th April 2004BioXHIT Kick-off Meeting: WP 5.2Slide 1 WorkPackage 5.2: Implementation of Data management and Project Tracking in Structure Solution.
Protein Tertiary Structure Prediction
Number of released entries Year. Growth of Molecular Complexity Number of Chains Year Number of Structures Containing that Number of Chains.
Coordinate handling and exploitation An overview of coordinate functionality in CCP4 suite Coordinate functionality in REFMAC group of programs (A. Vaguine)
 To explain the importance of software configuration management (CM)  To describe key CM activities namely CM planning, change management, version management.
Transmembrane proteins in the Protein Data Bank: identification and classification Gabor, E. Tusnady, Zsuzanna Dosztanyi and Istvan Simon Bioinformatics,
EMBL-EBI Adel Golovin MSDsite The project is funded by the European Commission as the TEMBLOR, contract-no. QLRI-CT under the RTD programme.
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Databank in Europe (PDBe)‏ An Introduction.
Configuration Management (CM)
A Molecular Replacement Pipeline Garib Murshudov Chemistry Department, University of York 
BALBES (Current working name) A. Vagin, F. Long, J. Foadi, A. Lebedev G. Murshudov Chemistry Department, University of York.
1 PyMOL Evolutionary Trace Viewer 1.1 Lichtarge Lab Sept. 13, 2010.
EBI is an Outstation of the European Molecular Biology Laboratory. A web service for the analysis of macromolecular interactions and complexes PDBe Protein.
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Gaurav Sahni, Ph.D. Deposition, Validation, Search and Analysis.
EBI is an Outstation of the European Molecular Biology Laboratory. A web service for the analysis of macromolecular interactions and complexes MSD Protein.
Wenjing Wu Computer Center, Institute of High Energy Physics Chinese Academy of Sciences, Beijing BOINC workshop 2013.
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
EMBL-EBI MSDpisa a web service for studying Protein Interfaces, Surfaces and Assemblies Eugene Krissinel
EBI is an Outstation of the European Molecular Biology Laboratory. A web service for the analysis of macromolecular interactions and complexes PDBe Protein.
Bulk Model Construction and Molecular Replacement in CCP4 Automation Ronan Keegan, Norman Stein, Martyn Winn.
Data Integration and Management A PDB Perspective.
EBI is an Outstation of the European Molecular Biology Laboratory. MSDchem and the chemistry of the wwPDB EMBO 22nd-26th September 2008 EMBL-EBI Hinxton.
EBI is an Outstation of the European Molecular Biology Laboratory. Quaternary Structure.
Data Harvesting: automatic extraction of information necessary for the deposition of structures from protein crystallography Martyn Winn CCP4, Daresbury.
EBI is an Outstation of the European Molecular Biology Laboratory. Assessment of macromolecular interactions and identification of macromolecular assemblies.
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Deposition, Validation, Search and Analysis Services.
Macromolecular Structure Database Project EMSD Infra-structure Services for Europe To develop an autonomous structural database capability in Europe
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Gaurav Sahni, Ph.D. Deposition, Validation, Search and Analysis.
EBI is an Outstation of the European Molecular Biology Laboratory. PDBe-PISA a web based service for understanding Protein Interfaces, Surfaces and Assemblies.
SR Users Meeting 10-11th September 2003 CCP4 Release 5.0 Peter Briggs CCP4/CCLRC Daresbury Laboratory.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
Worldwide Protein Data Bank Common D&A Project Sequence Processing Modular Demo May 6, 2010 Project Deliverable.
EMBL-EBI Chemistry & the PDB MSDchem Primary Developer: Dimitris Dimitropoulos.
EBI is an Outstation of the European Molecular Biology Laboratory. PDBeChem The Ligand Database.
17 th October 2005CCP4 Database Meeting (York) CCP4i Database Overview Peter Briggs.
Requirements Engineering Requirements Validation and Management Lecture-24.
CCP4 Molecular Replacement Model Generation Create a CCP4i task for generating Molecular Replacement models. - Selecting suitable PDB entries, based on.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
EBI is an Outstation of the European Molecular Biology Laboratory. PDBe Search Services (PDBelite, PDBePro and BIObar) Sanchayita Sen, Ph.D. PDB Depositions.
EMBL-EBI Eugene Krissinel SSM - MSDfold. EMBL-EBI MSDfold (SSM)
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Databank in Europe (PDBe)‏ An Introduction.
Peter J. Briggs, Alun Ashton, Charles Ballard, Martyn Winn and Pryank Patel CCLRC Daresbury Laboratory, Warrington, Cheshire WA4 4AD, UK The CCP4 project.
EBI is an Outstation of the European Molecular Biology Laboratory. A web based integrated search service to understand ligand binding and secondary structure.
Cysteine Oxidation Prediction Program (COPP): A New Software Program That Predicts Reversible Protein Cysteine Thiol Oxidation Reactions Ricardo Sanchez,
3.3b1 Protein Structure Threading (Fold recognition) Boris Steipe University of Toronto (Slides evolved from original material.
MANAGEMENT INFORMATION SYSTEM
Sequence: PFAM Used example: Database of protein domain families. It is based on manually curated alignments.
Avoiding Redundancy in the Management of Technical Documentation and Models: Requirements Analysis and Prototypical Implementation for Enterprise Architecture.
PDBe Protein Interfaces, Surfaces and Assemblies
Getting the Most out of the PDBe
Database Requirements for CCP4 17th October 2005
Number of released entries
Molecular Docking Profacgen. The interactions between proteins and other molecules play important roles in various biological processes, including gene.
An Integrated Approach to Protein-Protein Docking
Molecular Modeling By Rashmi Shrivastava Lecturer
The temporary site to download BALBES:
The site to download BALBES:
Presentation transcript:

Protein Interfaces, Surfaces and Assemblies PISA Protein Interfaces, Surfaces and Assemblies http://www.ebi.ac.uk/msd-srv/prot_int/pistart.html Eugene Krissinel keb@ebi.ac.uk CCP4 & EBI-MSD

PISA is a tool for the assessment of macromolecular interactions using data provided by protein crystallography. Scope of tasks addressed by PISA: identification and prediction of multimeric states analysis of structure-function relationship analysis and prediction of macromolecular interactions analysis of macromolecular complexation and crystallisation properties of macromolecular interfaces search for interface/structure/assembly homologues active site recognition and analysis macromolecular surface analysis other Project started in 2004, supported by BBSRC research grant 721/B19544

PISA today Web-service hosted by EBI-MSD at http://www.ebi.ac.uk/msd-srv/prot_int/pistart.html provides PISA analysis for all PDB entries and database searches allows upload of PDB and mmCIF files for interactive PISA analysis provides XML download of multimer data, which is used in server applications (BALBES) on molecular replacement works on aminoacid, nucleic acid and ligand structures more than 140,000 external queries served since the release more than 1700 users has a command-prompt stand-alone version

PISA basics PISA is based on chemical thermodynamics: for stable structures in the standard state. DGdiss cannot be calculated exactly. PISA uses semiempirical models with parameters calibrated to available experimental data on multimeric states. Precision of free energy estimates in PISA: ±5 kcal/mol Success rate of PQS prediction: 80-90%

Last year activity – nucleic acids and ligands Extension to include protein-DNA/RNA and ligand interactions Derivation and calibration of interaction parameters Database of ligand interactions (~6000 entries parameterized on atomic level) Tools for database update and semi-automatic calculation of protein-ligand interactions Core algorithm completely rewritten in order to: implement changes needed to adopt protein-DNA/RNA and ligand interactions optimize and speed-up the calculations

Last year activity – ligand control Control over ligand processing: Possibility to exclude certain ligands from processing Choice of ligand processing modes: Automatic Fix all ligands Free all ligands

Last year activity – adaptation for MSD&PDB Interface and presentation improvements at request of PDB/MSD curation teams: Consistent identification of symops in PISA pages Adoption of PDB@RCSB symop nomenclature Automatic generation of REMARK 350 Optimization of final assembly positions Reporting on redundant assemblies (especially when ASU contains a fractional number of assemblies >1) PISA is now employed by both MSD and PDB@RCSB as a mandatory processing tool for all depositions

Last year activity - PISA database PISA database searches by Multimeric state Symmetry number Space group Homomeric type Salt bridges Disulphide bonds List of ligands List of keywords Dissociation energy Assembly ASA Assembly BSA Percent BSA

Last year activity - standalone PISA Command-prompt, stand-alone PISA for inclusion into CCP4 Contains only data-processing part of “big” PISA, i.e. no database For technical reasons, there are code differences from “big” PISA Functionally identical to the corresponding parts of “big” PISA Mimics web-page output of “big” PISA in plain text Provides same XML output as “big” PISA Works as a local server: Maintains sessions Data processing separated from data retrieval Visualization using Rasmol

Standalone PISA example

Last year activity – tune-up and polishing Last percent of improvement takes 99% of all efforts Literally hundreds of small problems solved on everyday basis. Examples: Inference of correct orthogonalisation codes Choice of margins for identification of parallel monomeric units Symmetry number calculations: superposition margins and unit enumeration order Identification and proper treatment of overlapping symmetry mates with fractional occupancy Unique labels for the download/visualisation data and wait pages to avoid caching on remote servers Catching up with EBI systems update PISA is roughly 60,000 C++ statements and small bugs are most probably still there 6 releases over year

Future plans PISA is systematically underperforming on FABs. Possible reasons: Neglect of electrostatic interactions Neglect of entropy absorbance in flexible complexes Both are very difficult problems to address Last percent of improvement takes 99% of efforts

Future plans Analysis of “custom” assemblies Allow for input without crystallographic data Effectively inclusion of NMR entries as well Detection of “custom” assemblies Allow for report on specific assemblies otherwise missed as unstable Automatic prediction of macromolecular interactions and assemblies by homologue search in PISA database Assessment of crystal “quality” Identification of fake PDB entries and depositions

Fake PDBs 2i07 2icc 2ice 2icf 2hr0 BSA 20% 51% 24% 10% Interfaces per chain 9.56 7 6.12 8.33 3.5 Min. interfaces / chain 8 4 2 Connected crystal Yes