Recommendations and Questions wwPDB/CCDC/D3R Ligand Validation Workshop Center for Integrative Proteomics Research, Rutgers 7/30-31/2015 Group D, Academic.

Slides:



Advertisements
Similar presentations
Refinement of a pdb-structure and Convert A. Search for a pdb with the closest sequence to your protein of interest. B. Choose the most suitable entry.
Advertisements

Scientific & technical presentation Structure Visualization with MarvinSpace Oct 2006.
VARUNA – Towards a Grid- based Molecular Modeling Environment CICC/MACE – Meeting May 22, 2006 Mookie Baik Department of Chemistry & School of Informatics.
Continuous improvement of macromolecular crystal structures Tom Terwilliger (Los Alamos National Laboratory) DDD WG member ECM 2012: Diffraction Data Deposition.
CCP4 Molecular Graphics (CCP4MG)
EBI is an Outstation of the European Molecular Biology Laboratory. PDBeChem The Ligand Database.
Introduction to protein x-ray crystallography. Electromagnetic waves E- electromagnetic field strength A- amplitude  - angular velocity - frequency.
Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
Data activities of the International Union of Crystallography Brian McMahon IUCr 5 Abbey Square Chester CH1 2HU
Recent developments 1) Tests (outlier analysis) and Bug fixing ( with Paul) 2) Regeneration of Values of Bonds and Bond-angles existing all structures.
1 PharmID: A New Algorithm for Pharmacophore Identification Stan Young Jun Feng and Ashish Sanil NISSMPDM 3 June 2005.
1 Software Requirement Analysis Deployment Package for the Basic Profile Version 0.1, January 11th 2008.
AMBER. AMBER 7 What is AMBER? –A collective name for a suite of programs that allow users to carry out molecular dynamic simulations. –And a set of molecular.
A COMPLEX NETWORK APPROACH TO FOLLOWING THE PATH OF ENERGY IN PROTEIN CONFORMATIONAL CHANGES Del Jackson CS 790G Complex Networks
Alternative Methodologies Ken Peffers UNLV March 2004.
Protein Primer. Outline n Protein representations n Structure of Proteins Structure of Proteins –Primary: amino acid sequence –Secondary:  -helices &
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
High Throughput Processing of the Structural Information of the Protein Data Bank Zoltán Szabadka, Vince Grolmusz Department of Computer Science Eötvös.
Management and Distribution of Chemical Data in the Protein Data Bank John Westbrook, Dimitris Dimitropoulos, Jasmine Young, Peter Rose, Philip E. Bourne.
Protein Interfaces, Surfaces and Assemblies
Module 2: Structure Based Ph4 Design
Number of released entries Year. Growth of Molecular Complexity Number of Chains Year Number of Structures Containing that Number of Chains.
Bringing Structure to Biology: Small Molecules and the PDBe
Information Need Question Understanding Selecting Sources Information Retrieval and Extraction Answer Determina tion Answer Presentation This work is supported.
IDInstanceStatusSelectionDetailsScore LIGA505 LIGAND CODE MATCHED XYZ XYPA503CLOSE MATCH XYPA504NO MATCH MANA500PASSED GLC A501 PASSED NAG A502 PASSED.
Increasing the Value of Crystallographic Databases Derived knowledge bases Knowledge-based applications programs Data mining tools for protein-ligand complexes.
Usability Issues Documentation J. Apostolakis for Geant4 16 January 2009.
The IUPAC Stability Constants Database (SC-Database) The definitive collection of all significant published metal-complex stability constants Title Structure.
SMART Teams: Students Modeling A Research Topic Jmol Training 101!
BALBES (Current working name) A. Vagin, F. Long, J. Foadi, A. Lebedev G. Murshudov Chemistry Department, University of York.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
Flexible Multi-scale Fitting of Atomic Structures into Low- resolution Electron Density Maps with Elastic Network Normal Mode Analysis Tama, Miyashita,
ISM 5316 Week 3 Learning Objectives You should be able to: u Define and list issues and steps in Project Integration u List and describe the components.
OARE Module 5A: Scopus (Elsevier). Table of Contents About Scopus (Elsevier) Using Scopus Search Page Results/Refine Search Pages Download, PDF, Export,
X-ray Validation Package Present status Swanand Gore PDBe D&A meeting : 21-Oct-2010.
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by.
Crystallographic Databases I590 Spring 2005 Based in part on slides from John C. Huffman.
In silico discovery of inhibitors using structure-based approaches Jasmita Gill Structural and Computational Biology Group, ICGEB, New Delhi Nov 2005.
***Adding items to your Etudes Homepage*** Log into Etudes
Altman et al. JACS 2008, Presented By Swati Jain.
Data Integration and Management A PDB Perspective.
EBI is an Outstation of the European Molecular Biology Laboratory. MSDchem and the chemistry of the wwPDB EMBO 22nd-26th September 2008 EMBL-EBI Hinxton.
Data Harvesting: automatic extraction of information necessary for the deposition of structures from protein crystallography Martyn Winn CCP4, Daresbury.
EBI is an Outstation of the European Molecular Biology Laboratory. Sanchayita Sen, Ph.D. PDB Depositions Validation & Structure Quality.
Worldwide Protein Data Bank wwPDB Common D&A Project November 24, 2009 November 24, 2009 Steering Committee Project Update.
Rosetta Steven Bitner. Objectives Introduction How Rosetta works How to get it How to install/use it.
Worldwide Protein Data Bank Common D&A Project Sequence Processing Modular Demo May 6, 2010 Project Deliverable.
May 2007 Registration Status Small Group Meeting 1: August 24, 2009.
EMBL-EBI Chemistry & the PDB MSDchem Primary Developer: Dimitris Dimitropoulos.
EBI is an Outstation of the European Molecular Biology Laboratory. PDBeChem The Ligand Database.
Worldwide Protein Data Bank wwPDB Common D&A Project Full Project Team Meeting Rutgers March 16-19, 2010.
Polish Infrastructure for Supporting Computational Science in the European Research Space EUROPEAN UNION Examining Protein Folding Process Simulation and.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
EBI is an Outstation of the European Molecular Biology Laboratory. A web based integrated search service to understand ligand binding and secondary structure.
Sequence: PFAM Used example: Database of protein domain families. It is based on manually curated alignments.
Administrators and System Administrators
PDBe Protein Interfaces, Surfaces and Assemblies
PDBemotif A web based integrated search service to understand ligand binding and secondary structure properties in macromolecular structures.
Getting the Most out of the PDBe
Dimitris Dimitropoulos
Complete automation in CCP4 What do we need and how to achieve it?
1.b What are current best practices for selecting an initial target ligand atomic model(s) for structure refinement from X-ray diffraction data?
Compare and Contrast Testing of... Stand Alone Applications
Volume 25, Issue 11, Pages e3 (November 2017)
Progress Report in REFMAC
Ligand Binding to the Voltage-Gated Kv1
Volume 10, Issue 6, Pages (June 2002)
Outcome of the First wwPDB/CCDC/D3R Ligand Validation Workshop
Ton Spek Utrecht University The Netherlands Vienna –ECM
Presentation transcript:

Recommendations and Questions wwPDB/CCDC/D3R Ligand Validation Workshop Center for Integrative Proteomics Research, Rutgers 7/30-31/2015 Group D, Academic and Industrial Crystallographers: Kathleen Aertgeerts (co-chair) David Brown (co-chair) Seth Harris Tobias Krojer Alan Mark Guy Montelione Robert Nolte John Spurlino Chenghua Shao Oliver Smart Paul Emsley (PM session)

Recommendations

1. Recommended X-Ray Structure Refinement Workflow Validation (see next slides) Refined protein+ligand CIF/PDB CIF, PDB, PNG, MOL2 SMILES String for ligand Available software: CORINA, ELBOW, GRADE, Afitt, ACEDRG, PRODRUG, ATB Available software: Coot, Phenix, Buster, Refmac Potential ambiguity in chirality, tautomeric state and charge Wiki-style educational recommendation on good practices on ligand chemistry and structural solution, with community cooperation and contribution.

1b. Dictionary/model Dictionary: Key restraints used in refinement that can be from multiple sources. To incorporate rotation freedom of certain bonds, and certain degree of freedom for conformation flexibility. Model: Set of 3D coordinates of ligand to start modeling and refinement process. To find low- energy conformation(s). To combine tools and manual process.

2. Validation of ligand during model building and refinement cycle Comparison of B values on protein vs ligand Consideration of occupancy in refinement on ligand; consideration of multiple conformations on ligand; Consideration of disordered moiety of ligand Restraints in mmcif vs observed geometry in refinement process Database methods (e.g. Mogul) or automatic computational scientific software that assesses ligand geometry during refinement (to be developed) Issues(breakout session): covalent ligands, unnatural amino acids RSCC/RSR/LLDF, and difference density explanation. Alternate modeling, e.g. test hypothesis of the extra density being water Include hydrogen atoms to ligand and its binding site residues that facilitates interpretation of protein-ligand interaction

3. Validation during PDB deposition Full ligand should be enumerated, and author defines ligand of interest (e.g. LIG vs ATOM/HETATM) in the PDB/CIF model file Restraints dictionary in mmCIF file mmCIF – Ligand definition (Recommend to include into mmCIF energy term interpretation, and refinement program to output required files for deposition) Slider picture of ligand quality assessment (general and conditional on resolution) RSR, RSCC values at atom and ligand level Develop simple and clear metrics on ligand quality at atomic level Difference electron density figure with fitted ligand Additional column of uncertainty measure(TBD, quantitative) per atom in mmcif that can be captured in visualization programs, e.g. well-defined/ill-defined in NMR VTF; no density with color code; Automated computational scientific tools available on web; software to predict reasonable geometry. And distributable package for local clients Batch deposition process Make CAVEAT more obvious and request for authors to fix/explain issues Protein-ligand interaction: clash score, interaction fingerprint and energy. To compare a new structure’s ligand to the existing validated structures; fragment fitting comparison.

3b. Additional optional information provided by authors during PDB deposition Available QC data on ligand (e.g. NMR, MS) Binding data. In batch mode deposition, to have access to the experimental binding data for the set. Author’s processing details/comments in fields specific to individual ligand and its refinement process Other info (e.g. source)

4. Ligand Validation during journal submission wwPDB validation report including enhanced ligand validation (Buster report as example). Highlight CAVEAT and author’s response. Initial omit density before ligand is loaded (with the final ligand model overlay); difference electron density figure with fitted ligand. Recommend disclosure of fitted ligand and binding pocket. Provide web-access to the coordinates, SF, and map coefficients for reviewers Re-refinement on any existing structure should refer to the original structure/publication, as well as new deposition made

5a. Recommendation on existing PDB archived co-crystal structures: what users want Flag of bad structures, or bad ligands using validation tools. Display slider bar for ligand(s). Alert authors of the entries identified above Possible automatic re-calculation on alternate modeling for the co-crystallized ligands identified above, which could motivate CASP- like computation competition and development of new methods.

5. Recommendation on existing PDB archived co-crystal structures Update on the model by the same author (or PDB) keeps the same PDB code with versioning, no requirement for obsolete, and requires mandatory description for the reason of update. Re-refinement of any structure done by different/same authors, using same data, new PDB code should refer to the original PDB code and data (current practice at wwPDB) Capture curated comments from authors/users on the PDB web

6. Recommendation for ligand chemistry description Agree to all the recommendations in the doc Indicator of the exact charge or tautomer state in the model (author provided, or unknown) Standardize atom naming convention, e.g. InChi canonicalization

Questions/ Points of discussion

Questions: Refinement vs Validation: Validation can be performed during refinement, after refinement, during validation, during PDB process. What is the best practice? Buster’s ligand review example (see bottom) and its implication on ligand validation process. What is ligand? (e.g. Glycerol, Sodium ion can be relevant ligands but mostly are solvent/buffer). To let author specifically mark what is significant for structure for referee review?

Questions (cont) Occupancy review, e.g. how to deal with zero occupancy? B factor review, e.g. how to deal with B factors that are very high? Validation components needs to be distributed to the community? Accessibility of critical software to diverse academic research groups, so that all users are able to generate files for ligand modeling, validation, and deposition. Inconsistent outcome between components, e.g. Mogul vs OpenEye. Leading to direction of cross validation? Density fitting restraints at lower resolution may have problems and ambiguity. Special cases that are valid can be outlying against reference. How to highlight and deal with it?

Questions (cont) Ligand completeness issue. To set artificial occupancy (e.g. zero) can complicate B factor. The current problems with refinement programs: covalent, metal, etc. Automatic tools at web to assess/predict ligand validity. Batch deposition output from in-house sources/databases should be handled, and how?

Questions (cont) Explanation for unfitted density? Especially the presence of difference density close to the ligand atoms. To include validation components in refinement?