Refinement procedure Copy your best coordinate file to “prok-native-r1.pdb”: cp yourname-coot-99.pdb prok-native-r1.pdb Start refinement phenix.refine.

Slides:



Advertisements
Similar presentations
Quality of Protein Crystal Structures in the PDB Eric. N Brown, Lokesh Gakhar and S. Ramaswamy.
Advertisements

Phasing Goal is to calculate phases using isomorphous and anomalous differences from PCMBS and GdCl3 derivatives --MIRAS. How many phasing triangles will.
Introduction to protein x-ray crystallography. Electromagnetic waves E- electromagnetic field strength A- amplitude  - angular velocity - frequency.
Methods: X-ray Crystallography
Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
Structure Outline Solve Structure Refine Structure and add all atoms
Protein Planes Bob Fraser CSCBC Overview Motivation Points to examine Results Further work.
Computing Protein Structures from Electron Density Maps: The Missing Loop Problem I. Lotan, H. van den Bedem, A. Beacon and J.C. Latombe.
A Brief Description of the Crystallographic Experiment
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Refinement of Macromolecular structures using REFMAC5 Garib N Murshudov York Structural Laboratory Chemistry Department University of York.
Protein-a chemical view A chain of amino acids folded in 3D Picture from on-line biology bookon-line biology book Peptide Protein backbone N / C terminal.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Can protein model accuracy be identified? Morten Nielsen, CBS, BioCentrum, DTU.
Macromolecular structure refinement Garib N Murshudov York Structural Biology Laboratory Chemistry Department University of York.
Thomas Blicher Center for Biological Sequence Analysis
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Proteins are made by linking amino acids Protein Structure Review and Refinement Introduction Brian Bahnson Dept of Chemistry & Biochemistry, University.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Can protein model accuracy be identified? Morten Nielsen, CBS, BioCentrum, DTU.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Protein Basics Protein function Protein structure –Primary Amino acids Linkage Protein conformation framework –Dihedral angles –Ramachandran plots Sequence.
A PEPTIDE BOND PEPTIDE BOND Polypeptides are polymers of amino acid residues linked by peptide group Peptide group is planar in nature which limits.
Proteins: Levels of Protein Structure Conformation of Peptide Group
Two parts to successful model building BUILDING TOOLS –how to use Coot –Initiate trace of protein chain (“Place helix here”) –Test sidechain assignments.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Introduction to Macromolecular X-ray Crystallography Biochem 300 Borden Lacy Print and online resources: Introduction to Macromolecular X-ray Crystallography,
What are proteins? Proteins are important; e.g. for catalyzing and regulating biochemical reactions, transporting molecules, … Linear polymer chain composed.
Protein Secondary Structure Lecture 2/19/2003. Three Dimensional Protein Structures Confirmation: Spatial arrangement of atoms that depend on bonds and.
Basic Computations with 3D Structures
Proteins. Proteins? What is its How does it How is its How does it How is it Where is it What are its.
Model-Building with Coot An Introduction Bernhard Lohkamp Karolinska Institute June 2009 Chicago (Paul Emsley) (University of Oxford)
02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC.
Patterson Space and Heavy Atom Isomorphous Replacement
Data quality and model parameterisation Martyn Winn CCP4, Daresbury Laboratory, U.K. Prague, April 2009.
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
Coot Tools for Model Building and Validation
Protein Planes Bob Fraser Protein Folding 882 Project November, 2006.
The ‘phase problem’ in X-ray crystallography What is ‘the problem’? How can we overcome ‘the problem’?
Molecular visualization
Overview of MR in CCP4 II. Roadmap
Ligand fitting and Validation with Coot Bernhard Lohkamp Karolinska Institute June 2009 Chicago (Paul Emsley) (University of Oxford)
Phasing Today’s goal is to calculate phases (  p ) for proteinase K using PCMBS and EuCl 3 (MIRAS method). What experimental data do we need? 1) from.
1. Diffraction intensity 2. Patterson map Lecture
EBI is an Outstation of the European Molecular Biology Laboratory. Validation & Structure Quality.
EBI is an Outstation of the European Molecular Biology Laboratory. Sanchayita Sen, Ph.D. PDB Depositions Validation & Structure Quality.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Pattersons The “third space” of crystallography. The “phase problem”
Atomic structure model
Crystallography -- Lecture 22 Refinement and Validation.
Before Beginning – Must copy over the p4p file – Enter../xl.p4p. – Enter../xl.hkl. – Do ls to see the files are there – Since the.p4p file has been created.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Refinement is the process of adjusting an atomic model to:
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Forward and inverse kinematics in RNA backbone conformations By Xueyi Wang and Jack Snoeyink Department of Computer Science UNC-Chapel Hill.
Today: compute the experimental electron density map of proteinase K Fourier synthesis  (xyz)=  |F hkl | cos2  (hx+ky+lz -  hkl ) hkl.
Automated Refinement (distinct from manual building) Two TERMS: E total = E data ( w data ) + E stereochemistry E data describes the difference between.
CommonCoot Common Coot (Fulica atra) (Fulica atra)
Computational Structure Prediction
Common Coot (Fulica atra).
Refinement procedure for native structure
Model Building and Refinement for CHEM 645
Phasing Today’s goal is to calculate phases (ap) for proteinase K using MIRAS method (PCMBS and GdCl3). What experimental data do we need? 1) from native.
Reduce the need for human intervention in protein model building
Protein Planes Bob Fraser CSCBC 2007.
Nobel Laureates of X Ray Crystallography
Goals for Today Introduce automated refinement and validation.
Goals for Today Introduce automated refinement and validation.
Protein structure prediction.
Note for 2019 If you get positive peaks on the sulfurs after phenix.refine, try setting all the B-factors to a constants, such as 5.00 Å2 or Å2.
Protein structure prediction
Presentation transcript:

Refinement procedure Copy your best coordinate file to “prok-native-r1.pdb”: cp yourname-coot-99.pdb prok-native-r1.pdb Start refinement phenix.refine prok-native-r1.pdb prok-native-mcollazo.mtz

 obs Structure Refinement Schematic Reciprocal Space Real Space |F obs-native | | F obs-PCMBS | | F obs-EuCl 3 | Fit Map FT (Coot) FT (Phenix) |F calc | in  |F obs -F calc |  |F obs | |F calc | out Manual Refinement Build atoms to FT (Coot) 2F obs -F calc map Fit |F obs | Move atoms to  calc Manual Refinement FT (Phenix) |F calc | in Automatic Refinement experimental map F obs -F calc map coordinates (prok-native-r1_refine_001.pdb) coordinates (prok-native-r2.pdb) coordinates (prok-native-r1.pdb)

Get a sorted list of F obs -F calc peaks Ramachandran plot Kleywegt plot Incorrect Chiral Volumes Unmodeled Blobs Difference Map peaks Check/Delete Waters Geometry Analysis Peptide Omega Analysis Rotamer Analysis Density Fit Analysis Probe Clashes NCS differences Pukka Puckers Alignment vs. PIR

F obs -F calc reveals errors in model Positive density Negative density Real Space Refine and drag Or Autofit Rotamer

F obs -F calc reveals errors in model Real Space Refine and drag Or Autofit Rotamer

water

Other solvent

Goals for Today Automated Refinement of ProK –Phenix –R work and R free for your model. Manual Refinement of ProK –correct errors with Coot Automated Refinement of ProK –Phenix –R work and R free for your model. Validate ProK model (web server) Awards Refine ProK-PCMBS complex Go forth wielding the tools of X-ray crystallography and discover the secrets of other biological macromolecules.

REAL vs RECIPROCAL Real Space Manual Local Improvement in the model is limited by the quality of the phases Large radius of convergence Reciprocal Space Automatic Global Improved phases will lead to improved maps and improved interpretability and improved model. Small radius of convergence

Radius of convergence Manual adjustments improve radius of convergence Rupp Torsion angle C  -C 

Reciprocal Space Target function: E data (R-factor) Move atoms to minimize the R-factor. Minimize the discrepancy between F obs and F calc. Specifically, minimize E E data =  w(F obs -F calc ) 2 Over all hkl. Past--We used least squares minimization to refine. Now--Maximum likelihood allows for non-random error model. Given this model, what is the probability that the given set of data would be observed.

Importance of supplementing the Data to Parameter Ratio in crystallographic refinement. PARAMETERS Each atom has 4 parameters (variables) to refine: x coordinate y coordinate z coordinate B factor In proteinase K there are approximately 2000 atoms to refine. This corresponds to 2000*4= 8000 variables. At 1.7 A resolution we have 25,000 observations. About 3 observations per variable. The reliability of the model is still questionable. Adding stereochemical restraints is equivalent to adding observations DATA At 2.5 A resolution we have 8400 observations (data points) (Fobs). When # of observations= # of variables A perfect fit can be obtained irrespective of the accuracy of the model.

Automated Refinement (distinct from manual building) Two TERMS: E total = E data ( w data ) + E stereochemistry E data describes the difference between observed and calculated data. w data is a weight chosen to balance the gradients arising from the two terms. E stereochemistry comprises empirical information about chemical interactions between atoms in the model. It is a function of all atomic positions and includes information about both covalent and non-bonded interactions.

E stereochemistry (Geometry) –BOND LENGTHS & ANGLES have ideal values. Engh & Huber dictionary.  CHIRALITY of  -carbons –PLANARITY of peptide bonds and aromatic side chains –NONBONDED CONTACTS -two atoms cannot occupy the same space at the same time -TORSION ANGLE PREFERENCES side chains have preferred rotamers. –some values of  and  are forbidden. -Ramachandran. Not restrained- used for validation. e loop_ _chem_comp_bond.comp_id _chem_comp_bond.atom_id_1 _chem_comp_bond.atom_id_2 _chem_comp_bond.type _chem_comp_bond.value_dist _chem_comp_bond.value_dist_esd ALA N CA single ALA CA CB single ALA CA C single ALA C O double

Jeopardy clue: The appearance of the atomic model when stereochemical restraints are not included in crystallographic refinement. What is spaghetti, Alex? E total =E stereochemistry + w data E data

restrained not restrained

2 nd Jeopardy clue: The value of the R-factor resulting when stereochemical restraints are not included in crystallographic refinement. What is zero, Alex? E total =E stereochemistry + w data E data

An atomic model should be validated by several unbiased indicators Low RMS deviations in bond lengths and angles does not guarantee a correct structure R free is an unbiased indicator of the discrepancy between the model and the data. The data used in this R-factor calculation were not used in determining atomic shifts in the refinement process. Ramachandran plot is unbiased because phi and psi torsion angles are not restrained in the refinement process. The need for Cross-Validation

Stop Here Now, use COOT to correct errors in Phenix refined model: –prok-native_refine_001.pdb –Spend 15 minutes Run Phenix after COOT Resume discussion on structure validation while Phenix is running.

O N H BACKBONE AMIDE

N O Asn H H O N H BACKBONE AMIDE 2.8 Å BAD

N O H H O N H BACKBONE AMIDE 2.8 Å Asn GOOD

ERRAT examines distances between non-bonded atoms. Reports the deviations of C-C, C-N, C-O, N-N, N-O, O-O distances from distributions characteristic of reliable structures.

Verify 3D plot Indicates if the sequence has been improperly threaded through the density. It measures the compatibility of a model with its sequence. Evaluate for each residue in the structure: (1)Surface area buried (2) Fraction of side-chain area covered by polar atoms (3) Local secondary structure and compare to ideal library values for each amino acid type. Report the fraction of residues with score greater than 0.2 Backwards trace Correct trace

Submit coordinates to SAVS server Google for “UCLA SAVES” Continue with discussion on solving the ProK-PCMBS complex structure.

F Plan for today: Solve structure of ProK-PCMBS complex S ProK active site Cys74 PMB: p-chloromercuribenzoylsulfonate H Cl Hg SO 3

The beauty of isomorphism proteina (Å)b (Å)c (Å)  ProK ° ProK+PCMBS ° R iso =15.2% What is maximum possible R iso ? What is minimum possible R iso ? Initial phases: phases from native proteinase K structure  calc ProK. F obs amplitudes: Use |F Prok-PCMBS | data measured earlier in the course. Why don’t we have to use Heavy atoms? Why don’t we have to use Molecular Replacement?  x,y,z  =1/V*  |F obs |e -2  i(hx+ky+lz-  calc )

F o -F c Difference Fourier map  x,y,z  =1/V*  |F obs -F calc |e -2  i(hx+ky+lz-  calc ) Here, F obs will correspond to the Proteinase K-PMSF complex. F calc will correspond to the model of Proteinase K by itself after a few cycles of automated refinement. Positive electron density will correspond to features present in the PMSF complex that are not in the native structure. Negative electron density will correspond to features present in the native structure that should be removed in the inhibitor complex. After model building, do more automated refinement and then validate.

4 Key Concepts When to use isomorphous difference Fourier to solve the phase problem. How to interpret an Fo-Fc Difference Fourier map. Expected values of RMS deviation from ideal geometry methods of cross-validation

Validate protein structure by Running SAVES server grep -v hex prok-native_refine_001.pdb >prok-pmsf.pdb

Name _______________________ Refinement statisticsProteinase K native Proteinase K- PMSF Resolution Molecules in asymmetric unit1 Solvent content (%)36.3 Matthews coefficient (Å 3 /Da)1.9 Number of reflections used R work R free RMSD Bond lengths RMSD Bond angles Ramachandran plot: favored Ramachandran plot: allowed Ramachandran plot: generously allowed Ramachandran plot: outliers Number of atoms: protein Number of atoms: solvent Errat overall quality factor percentage with Verify3D score>0.2

R R Cis vs. Trans peptide CC C O N CC peptide plane R R C O N CC CC

Cis OK with glycine or proline R H CC C O N CC peptide plane R CC C O N CC Steric hindrance equivalent for cis or trans.

Steric hindrance equivalent for cis or trans proline. R CC CN CC peptide plane O R CC CN O CC CC CC CC CC CC CC