Analysis of the Quality of NMR Protein Structures With A Structure Calculated From Your NMR Data, How Do You Determine the Accuracy and Quality of the.

Slides:



Advertisements
Similar presentations
Protein NMR terminology COSY-Correlation spectroscopy Gives experimental details of interaction between hydrogens connected via a covalent bond NOESY-Nuclear.
Advertisements

Quality of Protein Crystal Structures in the PDB Eric. N Brown, Lokesh Gakhar and S. Ramaswamy.
Experimental Techniques in Protein Structure Determination Homayoun Valafar Department of Computer Science and Engineering, USC.
Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.
Solving NMR Structures II: Calculation and evaluation What NMR-based (solution) structures look like the NMR ensemble inclusion of hydrogen coordinates.
Computing Protein Structures from Electron Density Maps: The Missing Loop Problem I. Lotan, H. van den Bedem, A. Beacon and J.C. Latombe.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Incorporating additional types of information in structure calculation: recent advances chemical shift potentials residual dipolar couplings.
Solving NMR structures II: Calculation and evaluation The NMR ensemble Methods for calculating structures distance geometry, restrained molecular dynamics,
Solving NMR structures I --deriving distance restraints from crosspeak intensities in NOESY spectra --deriving dihedral angle restraints from J couplings;
Thomas Blicher Center for Biological Sequence Analysis
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Protein Structure Prediction and Analysis
How NMR is Used for the Study of Bio-macromolecules Analytical biochemistry Comparative analysis Interactions between biomolecules Structure determination.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Computer-Assisted Drug Design (1) i)Random Screening ii)Lead Development and Optimization using Multivariate Statistical Analyses. iii)Lead Generation.
COMPARATIVE or HOMOLOGY MODELING
02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC.
Representations of Molecular Structure: Bonds Only.
NMR Analysis of Protein Dynamics Despite the Typical Graphical Display of Protein Structures, Proteins are Highly Flexible and Undergo Multiple Modes Of.
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
Comparing Data from MD simulations and X-ray Crystallography What can we compare? 3D shapes (Scalar coupling constants, a.k.a. J-values, nuclear Overhauser.
Biomolecular Nuclear Magnetic Resonance Spectroscopy BASIC CONCEPTS OF NMR How does NMR work? Resonance assignment Structure determination 01/24/05 NMR.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
-1/2 E +1/2 low energy spin state
Computer Simulation of Biomolecules and the Interpretation of NMR Measurements generates ensemble of molecular configurations all atomic quantities Problems.
Biomolecular Nuclear Magnetic Resonance Spectroscopy FROM ASSIGNMENT TO STRUCTURE Sequential resonance assignment strategies NMR data for structure determination.
Metabolomics Metabolome Reflects the State of the Cell, Organ or Organism Change in the metabolome is a direct consequence of protein activity changes.
Precision and Accuracy Agreement Indices in HSP An Introduction to Rietveld Refinement using PANalytical X’Pert HighScore Plus v2.2d Scott A Speakman,
Protein NMR Part II.
EBI is an Outstation of the European Molecular Biology Laboratory. Sanchayita Sen, Ph.D. PDB Depositions Validation & Structure Quality.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Developing a Force Field Molecular Mechanics. Experimental One Dimensional PES Quantum mechanics tells us that vibrational energy levels are quantized,
Structural alignment methods Like in sequence alignment, try to find best correspondence: –Look at atoms –A 3-dimensional problem –No a priori knowledge.
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
How NMR is Used for the Study of Biomacromolecules Analytical biochemistry Comparative analysis Interactions between biomolecules Structure determination.
Automated Refinement (distinct from manual building) Two TERMS: E total = E data ( w data ) + E stereochemistry E data describes the difference between.
Molecular dynamics (MD) simulations  A deterministic method based on the solution of Newton’s equation of motion F i = m i a i for the ith particle; the.
Structure Prediction and Modeling of a Eukaryotic Member of the Major Facilitator Superfamily Gaurav Narale.
Areas of Spectrum. Remember - we are thinking of each amino acid as a spin system - isolated (in terms of 1 H- 1 H J-coupling) from the adjacent amino.
Computational Structure Prediction
Protein Structure Prediction and Protein Homology modeling
Douglas Kojetin, Ph.D. UC College of Medicine
NMR Spectroscopy – Part 2
1. Pure Protein (0.3 mL, mM; ~ 10 mg)
Backbone Dynamics of the 18
R. Elliot Murphy, Alexandra B. Samal, Jiri Vlach, Jamil S. Saad 
Barley lipid-transfer protein complexed with palmitoyl CoA: the structure reveals a hydrophobic binding site that can expand to fit both large and small.
Carlos R. Baiz, Andrei Tokmakoff  Biophysical Journal 
Homology Modeling.
Protein structure prediction.
Volume 13, Issue 9, Pages (December 2015)
Volume 15, Issue 9, Pages (September 2007)
Clare-Louise Towse, Steven J. Rysavy, Ivan M. Vulovic, Valerie Daggett 
Increased Reliability of Nuclear Magnetic Resonance Protein Structures by Consensus Structure Bundles  Lena Buchner, Peter Güntert  Structure  Volume.
Carlos R. Baiz, Andrei Tokmakoff  Biophysical Journal 
The Arginine-Rich RNA-Binding Motif of HIV-1 Rev Is Intrinsically Disordered and Folds upon RRE Binding  Fabio Casu, Brendan M. Duggan, Mirko Hennig 
A Conformational Switch in the CRIB-PDZ Module of Par-6
Volume 20, Issue 12, Pages (December 2012)
Comparison of X-Ray Crystal Structure of the 30S Subunit-Antibiotic Complex with NMR Structure of Decoding Site Oligonucleotide-Paromomycin Complex  Stephen.
The structure of the complex of plastocyanin and cytochrome f, determined by paramagnetic NMR and restrained rigid-body molecular dynamics  Marcellus.
Unmasking the Annexin I Interaction from the Structure of Apo-S100A11
Volume 4, Issue 2, Pages (February 1996)
Volume 23, Issue 6, Pages (June 2015)
Solution Structure of the Proapoptotic Molecule BID
Backbone Dynamics of the 18
Protein structure prediction
Presentation transcript:

Analysis of the Quality of NMR Protein Structures With A Structure Calculated From Your NMR Data, How Do You Determine the Accuracy and Quality of the Structure? Consistency with Known Protein Structural Parameters  bond lengths, bond angles, dihedral angles, VDW interactions, etc  all the structural details discussed at length in the beginning Consistency with the Experimental DATA  distance constraints, dihedral constraints, RDCs, chemical shifts, coupling constants  all the data used to calculate the structure Consistency Between Multiple Structures Calculated with the Same Experimental DATA Overlay of 30 NMR Structures

Analysis of the Quality of NMR Protein Structures As We have seen before, the Quality of X-ray Structures can be monitored by an R-factor No comparable function for NMR Requires a more exhaustive analysis of NMR structures

Analysis of the Quality of NMR Protein Structures Root-Mean Square Distance (RMSD) Analysis of Protein Structures A very common approach to asses the quality of NMR structures and to determine the relative difference between structures is to calculate an rmsd  an rmsd is a measure of the distance separation between equivalent atoms  two identical structures will have an rmsd of 0Å  the larger the rmsd the more dissimilar the structures 0.43 ± 0.06 Å for the backbone atoms 0.81 ± 0.09 Å for all atoms

Analysis of the Quality of NMR Protein Structures Root-Mean Square Distance (RMSD) Analysis of Protein Structures A variety of approaches can be used to measure an RMSD  only backbone atoms  exclude disordered regions  only regions with defined secondary structure  only the protein’s active-site region  on a per-atom or per-residue basis rmsd difference between NMR and X-ray structure

Analysis of the Quality of NMR Protein Structures Root-Mean Square Distance (RMSD) Analysis of Protein Structures Using XPLOR to measure an RMSD  only one PSF file can be used  the structures being compared have to be consistent with the PSF file  can only compare similar structures/sequence  can not compare structural homologs end set precision 3 end evaluate ($1= “PROTEIN_ave.min") set display=all_all.rms end display $1 set display=back_all.rms end display $1 coor disp=comp for $2 in ( “PROTEIN_dg_103.sam" “PROTEIN_dg_3.sam“. “PROTEIN_dg_18.sam" “PROTEIN_dg_84.sam" ) Read in the PSF file and identify the reference structure Name the files to contain the rmsd data Compare the coordinates Loop through a list of files to compare against PROTEIN_ave.min

Analysis of the Quality of NMR Protein Structures loop fil2 coor coor select ((not hydro) and (resid 4:81 or resid 90:111)) fit end coor select ((not hydro) and (resid 4:81 or resid 90:111)) rms end evaluate ($20=$result) set display all_all.rms end display $2 $20 coor select ((name ca or name c or name n) and (resid 4:81 or resid 90:111)) fit end coor select ((name ca or name c or name n) and (resid 4:81 or resid 90:111)) rms end set display back_all.rms end evaluate ($20=$result) display $2 $20 end loop fil2 stop Continuation of XPLOR Script The following set of commands are executed for each protein in the list First, the structures are aligned based on the selected atoms Then, an rmsd is calculated based on the selected atoms and written to the specified file Selection tool is very flexible and you can choose a variety of atom names, types, residue names and types and chains

Analysis of the Quality of NMR Protein Structures Some Caveats About (RMSD) Analysis of Protein Structures rmsd is a global measure  an unexpected high or low value may reflect only a region of the protein that is ordered/disordered or similar/dissimilar For an ensemble of NMR structures, rmsd implies precision  accuracy can be inferred by comparing structures determined by multiple methods and groups High rmsd is an indicator of protein dynamics  This is generally true, but it strongly depends on the completeness and accuracy of analyzing the NOESY data.  regions of mobility need to be confirmed by experimental measurments  high rmsd is directly related to the number of distance constraints

Literature Reports of NMR Protein Structures RMSD provides a means to evaluate the quality of the structures  Always calculate multiple structures (10-100) with a single experimental data set  identify the lowest energy structures (10-30)  calculate an “average” structure (SA)  minimize the “average” structure (SA) r against the NMR experimental constrains  calculate rmsd between the ensemble of structures ( ), (SA), (SA) r and x-ray structure (if available) Analysis of the Quality of NMR Protein Structures

What is an “Average” NMR Structure? Average the x,y,z-coordinate positions for each atom over the entire ensemble of structures Structure 1: ATOM 1 N MET Structure 2: ATOM 1 N MET Structure 3: ATOM 1 N MET Average Structure: ATOM 1 N MET Align the structures and calculate an average

Analysis of the Quality of NMR Protein Structures Is the “Average” NMR Structure a Real Structure? No-it is a distorted structure  level of distortions depends on the similarity between the structures in the ensemble  provides a means to measure the variability in atom positions between an ensemble of structures Expanded View of an “Average” Structure Some very long, stretched bonds Position of atoms are so scrambled the graphics program does not know which atoms to draw bonds between Some regions of the structure can appear relatively normal

Analysis of the Quality of NMR Protein Structures Refine or Minimize the “Average” NMR Structure Remove “bad” bond lengths, angles, etc Minimize against the experimental NMR data Is this structure a “true” representation of the average of the ensemble of structures?  A point of discussion in the NMR community  an overlay of the refined average structure with the ensemble of structures will place the average in the center of the structures Alternative view is to select one structure from the ensemble  Does this represent the average of the ensemble? Overlay of backbone atoms for the refined minimized structure (blue) and ensemble of 30 structures (yellow)

Analysis of the Quality of NMR Protein Structures Structural Statistics Provide Another Approach to Quantify the Quality of an NMR Structure r.m.s deviations from experimental restraints energies  NOE violations, torsional angle violations, repulsive forces, Lennard-Jones electrostatics (not a target function in refinement) deviations from idealized covalent geometry PROCHECK  G-factor, Ramachandran, bad contacts hydrogen bond energy

REMARK FILENAME=“PROTEIN.sam" REMARK =============================================================== REMARK overall, bonds, angles, improper,vdw, REMARK energies: , , , , , REMARK cdih, noe, coup, shift, rama, coll REMARK energies: , , , , , REMARK =============================================================== REMARK bonds, angles, impropers, cdih, noe, coup REMARK RMS: E-03, , , , E-02, REMARK shifts RMS a, b: , REMARK =============================================================== REMARK cdih coup noe REMARK violations : REMARK shifts: 55 REMARK =============================================================== REMARK REMARK jcoup rms-d: REMARK =============================================================== REMARK DATE:16-Feb-04 17:50:26 created by user: Analysis of the Quality of NMR Protein Structures Energies, violations and rms deviations are routinely printed as headers in XPLOR Structure files

Analysis of the Quality of NMR Protein Structures. cons inter (all) (all) end print threshold = collapse evaluate ($rms_col=$result) evaluate ($violations_col=$violations) print threshold = noe evaluate ($rms_noe=$result) evaluate ($violations_noe=$violations) print threshold = cdih evaluate ($rms_cdih=$result) evaluate ($violations_cdih=$violations) print threshold = bonds evaluate ($rms_bonds=$result) print threshold = angles evaluate ($rms_angles=$result) evaluate ($violations_angles=$violations) print threshold = impropers evaluate ($rms_impropers=$result) evaluate ($violations_improp=$violations). Obtaining rms deviations of experimental restraints, energies and deviations from idealized geometry by simply printing the values from XPLOR Structural Statistics Provide Another Approach to Quantify the Quality of an NMR Structure Sets the thresholds for which violations will be reported

Analysis of the Quality of NMR Protein Structures. ========== spectrum 1 restraint 2745 ========== set-i-atoms 93 ALA HN set-j-atoms 78 VAL O R = NOE= 1.80 (- 0.30/+ 0.50) Delta= E(NOE)= ========== spectrum 1 restraint 2747 ========== set-i-atoms 76 GLY HN set-j-atoms 95 VAL O R = NOE= 1.80 (- 0.30/+ 0.50) Delta= E(NOE)= NOEPRI: RMS diff. = 0.040, #(violat.> 0.1)= 69 of 2750 NOEs NOEPRI: RMS diff. class INTR = 0.004, #(viol.> 0.1)= 0 of 483 NOEs NOEPRI: RMS diff. class SHOR = 0.035, #(viol.> 0.1)= 22 of 825 NOEs NOEPRI: RMS diff. class LONG = 0.040, #(viol.> 0.1)= 20 of 651 NOEs NOEPRI: RMS diff. class SEQ = 0.046, #(viol.> 0.1)= 10 of 719 NOEs NOEPRI: RMS diff. class H-BO = 0.104, #(viol.> 0.1)= 17 of 72 NOEs X-PLOR>evaluate ($rms_noe=$result) EVALUATE: symbol $RMS_NOE set to E-01 (real) X-PLOR>evaluate ($violations_noe=$violations) EVALUATE: symbol $VIOLATIONS_NOE set to (real).. Typical Output of an XPLOR restraint analysis where violated restraints are listed followed by a summary Violated NOE restraint, average distance, error and energy RMS summary and number of violated restraints

Analysis of the Quality of NMR Protein Structures As We Discussed Before, PROCHECK is a Very Valuable Tool For Accessing The Quality of a Protein Structure ► Correct ,     distribution ► Comparison of main chain and side-chain parameters to standard values

Analysis of the Quality of NMR Protein Structures Predict NOEs from the Experimental Structure and Compare to the NOESY Data short distances observed in the structure should correlate with NOE cross-peaks missing NOE peaks may arise because of:  degeneracy  too close to diagonal  overlap with noise  disordered region of the structure while ($3 < 113) loop select evaluate ($4 = $3 + 1) noe set display $2 end set print $2 end predict from=(resid $3 and name h*) to=(resid $4:113 and name h* ) cutoff=4.5 end evaluate ($3=$3+1) end loop select end loop dist Lists all the predicted NOEs between the selected atom pairs within the given distance cut-off ======================================== set-i-atoms 2 THR HN set-j-atoms 1 MET HT1 1 MET HT2 1 MET HT3 ( ( )^-1/6 ) = ======================================== set-i-atoms 2 THR HN set-j-atoms 1 MET HA ( ( )^-1/6 ) = ======================================== set-i-atoms 2 THR HN set-j-atoms 1 MET HB1 1 MET HB2 ( ( )^-1/6 ) = ========================================

Analysis of the Quality of NMR Protein Structures NMR R-factor difference between expected and observed NOEs  expected NOEs  structure  observed NOEs  NMR spectra  also includes unassigned NOEs  perfect fit would yield R = 0 R-factors have not been readily adapted in NMR community  affected by completeness of assignments, peak overlap, sensitivity, noise, extent of data (RDCs, coupling constants, etc  trends with rmsd without complications Journal of Biomolecular NMR, 17: 137–151, 2000.