Overview of the Phase Problem

Slides:



Advertisements
Similar presentations
Reciprocal Space Learning outcomes
Advertisements

Intensities Learning Outcomes By the end of this section you should: understand the factors that contribute to diffraction know and be able to use the.
Phasing Goal is to calculate phases using isomorphous and anomalous differences from PCMBS and GdCl3 derivatives --MIRAS. How many phasing triangles will.
FORCE VECTORS, VECTOR OPERATIONS & ADDITION COPLANAR FORCES
Introduction to protein x-ray crystallography. Electromagnetic waves E- electromagnetic field strength A- amplitude  - angular velocity - frequency.
Methods: X-ray Crystallography
Planes in Lattices and Miller Indices
Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
X-Ray Crystallography
Bob Sweet Bill Furey Considerations in Collection of Anomalous Data.
Internal – External Order We described symmetry of crystal habit (32 point groups) We also looked at internal ordering of atoms in 3-D structure (230 space.
CHAPTER 2 : CRYSTAL DIFFRACTION AND PG Govt College for Girls
Machine Transformations
Solid State Physics 2. X-ray Diffraction 4/15/2017.
Expression of d-dpacing in lattice parameters
A Brief Description of the Crystallographic Experiment
Twinning in protein crystals NCI, Macromolecular Crystallography Laboratory, Synchrotron Radiation Research ANL Title Zbigniew Dauter.
Hanging Drop Sitting Drop Microdialysis Crystallization Screening.
3. Crystals What defines a crystal? Atoms, lattice points, symmetry, space groups Diffraction B-factors R-factors Resolution Refinement Modeling!
Fourier transform. Fourier transform Fourier transform.
19 Feb 2008 Biology 555: Crystallographic Phasing II p. 1 of 38 ProteinDataCrystalStructurePhases Overview of the Phase Problem John Rose ACA Summer School.
Overview of the Phase Problem
Phasing based on anomalous diffraction Zbigniew Dauter.
The Effects of Symmetry in Real and Reciprocal Space Sven Hovmöller, Stockholm Univertsity Mirror symmetry 4-fold symmetry.
LIAL HORNSBY SCHNEIDER
Copyright © 2013, 2009, 2005 Pearson Education, Inc. 1 5 Systems and Matrices Copyright © 2013, 2009, 2005 Pearson Education, Inc.
Patterson Space and Heavy Atom Isomorphous Replacement
In serial femtosecond crystallography (SFX) with hard X-ray free-electron laser as light source, a set of three-dimensional single-crystal diffraction.
The ‘phase problem’ in X-ray crystallography What is ‘the problem’? How can we overcome ‘the problem’?
PHYS 430/603 material Laszlo Takacs UMBC Department of Physics
Diffraction Basics Coherent scattering around atomic scattering centers occurs when x-rays interact with material In materials with a crystalline structure,
Chem Patterson Methods In 1935, Patterson showed that the unknown phase information in the equation for electron density:  (xyz) = 1/V ∑ h ∑ k.
Chem Structure Factors Until now, we have only typically considered reflections arising from planes in a hypothetical lattice containing one atom.
Lesson 22 SIR2011 Patterson Theory DIRDIF Patty Solve your structures.
Phasing Today’s goal is to calculate phases (  p ) for proteinase K using PCMBS and EuCl 3 (MIRAS method). What experimental data do we need? 1) from.
1. Diffraction intensity 2. Patterson map Lecture
THE PHASE PROBLEM Electron Density
Page 1 X-ray crystallography: "molecular photography" Object Irradiate Scattering lens Combination Image Need wavelengths smaller than or on the order.
Methods in Chemistry III – Part 1 Modul M.Che.1101 WS 2010/11 – 8 Modern Methods of Inorganic Chemistry Mi 10:15-12:00, Hörsaal II George Sheldrick
Lesson 13 How the reciprocal cell appears in reciprocal space. How the non-translational symmetry elements appear in real space How translational symmetry.
Lesson 13 How the reciprocal cell appears in reciprocal space. How the non-translational symmetry elements appear in real space How translational symmetry.
X-ray diffraction X-rays discovered in 1895 – 1 week later first image of hand. X-rays have ~ 0.1 – few A No lenses yet developed for x-rays – so no possibility.
What is the problem? How was the problem solved?
Protein Structure Determination Lecture 4 -- Bragg’s Law and the Fourier Transform.
Pattersons The “third space” of crystallography. The “phase problem”
Atomic structure model
Anomalous Differences Bijvoet differences (hkl) vs (-h-k-l) Dispersive Differences 1 (hkl) vs 2 (hkl) From merged (hkl)’s.
Electron Density Structure factor amplitude defined as: F unit cell (S) = ∫ r  (r) · exp (2  i r · S) dr Using the inverse Fourier Transform  (r) =
Calculation of Structure Factors
Electromagnetism Around 1800 classical physics knew: - 1/r 2 Force law of attraction between positive & negative charges. - v ×B Force law for a moving.
Absolute Configuration Types of space groups Non-centrosymmetric Determining Absolute Configuration.
Before Beginning – Must copy over the p4p file – Enter../xl.p4p. – Enter../xl.hkl. – Do ls to see the files are there – Since the.p4p file has been created.
Interpreting difference Patterson Maps in Lab this week! Calculate an isomorphous difference Patterson Map (native-heavy atom) for each derivative data.
X-ray Crystallography Kalyan Das. Electromagnetic Spectrum to 10 nM 400 to 700 nM to nM 10 to 400 nM 700 to 10 4 nM X-ray was discovered.
--Experimental determinations of radial distribution functions --Potential of Mean Force 1.
Methods in Chemistry III – Part 1 Modul M.Che.1101 WS 2010/11 – 9 Modern Methods of Inorganic Chemistry Mi 10:15-12:00, Hörsaal II George Sheldrick
Phasing in Macromolecular Crystallography
Fourier transform from r to k: Ã(k) =  A(r) e  i k r d 3 r Inverse FT from k to r: A(k) = (2  )  3  Ã(k) e +i k r d 3 k X-rays scatter off the charge.
Today: compute the experimental electron density map of proteinase K Fourier synthesis  (xyz)=  |F hkl | cos2  (hx+ky+lz -  hkl ) hkl.
Lecture 3 Patterson functions. Patterson functions The Patterson function is the auto-correlation function of the electron density ρ(x) of the structure.
Crystallography : How do you do? From Diffraction to structure…. Normally one would use a microscope to view very small objects. If we use a light microscope.
Amyloid Precursor Protein (APP)
CHARACTERIZATION OF THE STRUCTURE OF SOLIDS
Phasing Today’s goal is to calculate phases (ap) for proteinase K using MIRAS method (PCMBS and GdCl3). What experimental data do we need? 1) from native.
Introduction to Isomorphous Replacement and Anomalous Scattering Methods Measure native intensities Prepare isomorphous heavy atom derivatives Measure.
Nobel Laureates of X Ray Crystallography
Toshiro Oda, Keiichi Namba, Yuichiro Maéda  Biophysical Journal 
S. Takeda, A. Yamashita, K. Maeda, Y. Maeda
r(xyz)=S |Fhkl| cos2p(hx+ky+lz -ahkl)
Evidence of Cholesterol Accumulated in High Curvature Regions: Implication to the Curvature Elastic Energy for Lipid Mixtures  Wangchen Wang, Lin Yang,
Presentation transcript:

Overview of the Phase Problem Protein Crystal Data Phases Structure Overview of the Phase Problem Remember We can measure reflection intensities We can calculate structure factors from the intensities We can calculate the structure factors from atomic positions We need phase information to generate the image

What is the Phase Problem X-ray Diffraction Experiment All phase information is lost x,y.z Fhkl [Real Space] [Reciprocal Space] In the X-ray diffraction experiment photons are reflected from the crystal lattice (planes) in different directions giving rise to the diffraction pattern. Using a variety of detectors (film, image plates, CCD area detectors) we can estimate intensities but we loose any information about the relative phase for different reflections.

Phases Let’s define a phase for an individual atom, fj An atom at xj=0.40, yj=0.25, zj=0.10 for plane [213] fj = 2p[ 2•(0.40) + 1•(0.25) + 3•(0.10)] = 2p(1.) For k = 0 (a 2D case) then For plane [201] fj = 2p[ 2•(0.40) + 1•(0.10)] = 2p(0.) Now to understand what this means….

201 Phases c a fD = 2p[ 2•(0.40) + 1•(0.10)] = 2p(0.) A B G C H D F I E 0° 720° c a 201 planes 4p 360° 2p 1080° 6p 0.4, y, 0.1 fD = 2p[ 2•(0.40) + 1•(0.10)] = 2p(0.)

In General for Any Atom (x, y, z) dhkl 6π dhkl 4π Atom (j) at x,y,z dhkl 2π φ c Plane hkl Remember: We express any position in the cell as (1) fractional coordinates pxyz = xja+yjb+zjc (2) the sum of integral multiples of the reciprocal axes hkl = ha* + kb* + lc*

Phase for Any Atom

Why Do We Need the Phase? Structure Factor Electron Density Fourier transform Inverse Fourier transform Structure Factor Electron Density In order to reconstruct the molecular image (electron density) from its diffraction pattern both the intensity and phase, which can assume any value from 0 to 2, of each of the thousands of measured reflections must be known.

Importance of Phases Phases dominate the image! Hauptman amplitudes with Hauptman phases Karle amplitudes with Karle phases Karle amplitudes with Hauptman phases Hauptman amplitudes with Karle phases Phases dominate the image! Phase estimates need to be accurate

Understanding the Phase Problem The phase problem can be best understood from a simple mathematical construct. The structure factors (Fhkl) are treated in diffraction theory as complex quantities, i.e., they consist of a real part (Ahkl) and an imaginary part (Bhkl). If the phases, hkl, were available, the values of Ahkl and Bhkl could be calculated from very simple trigonometry: Ahkl = |Fhkl| cos (hkl) Bhkl = |Fhkl| sin (hkl) this leads to the relationship: (Ahkl)2 + (Bhkl)2 = |Fhkl|2 = Ihkl

Argand Diagram (Ahkl)2 + (Bhkl)2 = |Fhkl|2 = Ihkl The above relationships are often illustrated using an Argand diagram (right). From the Argand diagram, it is obvious that Ahkl and Bhkl may be either positive or negative, depending on the value of the phase angle, hkl. Note: the units of Ahkl, Bhkl and Fhkl are in electrons.

The Structure Factor f0 The scattering factor for each atom sinΘ/λ f0 Atomic scattering factors Here fj is the atomic scattering factor The scattering factor for each atom type in the structure is evaluated at the correct sinΘ/λ. That value is the scattering ability of that atom. Remember We now have an atomic scattering vector with a magnitude f0 and direction φj .

The Structure Factor Sum of all individual atom contributions real imaginary Individual atom fjs Resultant Fhkl Ahkl Bhkl

Electron Density Remember the electron density (image of the molecule) is the Fourier transform of the structure factor Fhkl. Thus Here V is the volume of the unit cell In practice, the electron density for one three-dimensional unit cell is calculated by starting at x, y, z = 0, 0, 0 and stepping incrementally along each axis, summing the terms as shown in the equation above for all hkl (as limited by the resolution of the data) at each point in space.

Solving the Phase Problem Small molecules Direct Methods Patterson Methods Molecular Replacement Macromolecules Multiple Isomorphous Replacement (MIR) Multi Wavelength Anomalous Dispersion (MAD) Single Isomorphous Replacement (SIR) Single Wavelength Anomalous Scattering (SAS) Direct Methods (special cases)

Solving the Phase Problem SMALL MOLECULES The use of Direct Methods has essentially solved the phase problem for well diffracting small molecule crystals. MACROMOLECULES Today, anomalous scattering techniques such as MAD or SAS are the most common techniques used for de novo structure determination of macromolecules. Both techniques require the presence of one or more anomalous scatterers in the crystal.

SIR and SAS Methods Need a heavy atom (lots of electrons) or a anomalous scatterer (large anomalous scattering signal) in the crystal. SIR - heavy atoms usually soaked in. SAS - anomalous scatterers usually engineered in as selenomethional labels. Can also be soaked. SIR collect a native and a derivative data set (2 sets total). SAS collect one highly redundant data set and keep anomalous pairs separate during processing. SAS - may want to choose a scatterer or wavelength that enhances the anomalous signal. Must find the heavy atoms or anomalous scatterers can use Patterson analysis or direct methods. Must resolve the bimodal ambiguity. use solvent flattening or similar technique

Heavy Atom Derivatives Heavy atom derivatives MUST be isomorphous Heavy atom derivatives are generally prepared by soaking crystals in dilute (2 - 20 mM) solutions of heavy atom salts (see Table II below for some examples). Crystal cracking is generally a good indication that that heavy atom is interacting with the crystal lattice, and suggests that a good derivative can be obtained by soaking the crystal in a more dilute solution. Once derivative data has been collected, the merging R factor (Rmerge) between the native and derivative data sets can be used to check for heavy atom incorporation and isomorphism. Rmerge values for isomorphous derivatives range from 0.05 to 0.15. Values below 0.05 indicate that there is little heavy atom incorporation. Values above 0.15 indicate a lack of isomorphism between the two crystals.

Finding the Heavy Atoms or Anomalous Scatterers The Patterson function - a F2 Fourier transform with f = 0 - vector map (u,v,w instead of x,y,z) - maps all inter-atomic vectors - get N2 vectors!! (where N= number of atoms) The Difference Patterson Map SIR - |DF|2 = |Fnat - Fder|2 SAS - |DF|2 = |Fhkl - F-h-k-l|2 Patterson map is centrosymmetric - see peaks at u,v,w & -u, -v, -w Peak height proportional to ZiZj Peak u,v,w’s give heavy atom x,y,z’s - Harker analysis Origin (0,0,0) maps vector of atom to itself From Glusker, Lewis and Rossi

Harker Analysis Example Space group P21 Patterson symmetry = Space group symmetry minus translations Example Space group P21 P21 space group symmetry operators x,y,z -x,1/2+y,-z x,y,z -x,1/2+y,-z x,y,z [(x,y,z) - (x,y,z)] [(x,y,z) - (-x,1/2+y,-z)] -x,1/2+y,-z [(-x,1/2+y,-z) – (x,y,z)] [(-x,1/2+y,-z) – (-x,1/2+y,-z)] x,y,z 000 2x,-1/2, 2z -x,1/2+y,-z -2x, 1/2,-2z 000 Harker section v = 1/2 where to look for heavy atom vectors ±2x, 1/2, ±2z Automated programs SOLVE, SHELXD, BNP are available

A Note About Handedness

The Phase Triangle Relationship DOLM = DOLN M Q L FPH = FP + FH O Need value of FH N From Glusker, Lewis and Rossi FP, FPH, FH and -FH are vectors (have direction) FP <= obtained from native data FPH <= obtained from derivative or anomalous data FH <= obtained from Patterson analysis

The Phase Triangle Relationship M Q L O N From Glusker, Lewis and Rossi In simplest terms, isomorphous replacement finds the orientation of the phase triangle from the orientation of one of its sides. It turns out, however, that there are two possible ways to orient the triangle if we fix the orientation of one of its sides.

Single Isomorphous Replacement From Glusker, Lewis and Rossi Note: FP = protein FH = heavy atom FP1 = heavy atom derivative The center of the FP1circle is placed at the end of the vector -FH1. X1 = ftrueor ffalse X2 = ftrueor ffalse The situation of two possible SIR phases is called the “phase ambiguity” problem, since we obtain both a true and a false phase for each reflection. Both phase solutions are equally probable, i.e. the phase probability distribution is bimodal.

Resolving the Phase Ambugity From Glusker, Lewis and Rossi Note: FP = protein FH = heavy atom FP1 = heavy atom derivative The center of the FP1circle is placed at the end of the vector -FH1. X1 = ftrueor ffalse X2 = ftrueor ffalse Add more information: Add another derivative (Multiple Isomorphous Replacement) Use a density modification technique (solvent flattening) Add anomalous data (SIR with anomalous scattering)

Multiple Isomorphous Replacement Note: FP = protein FH1 = heavy atom #1 FH2 = heavy atom #2 FP1 = heavy atom derivative FP2 = heavy atom derivative The center of the FP1 and FP1 circles are placed at the end of the vector -FH1 and -FH2, respectively. X1 = ftrue X2 = ffalse X3 = ffals From Glusker, Lewis and Rossi Exact overlap at X1 dependent on data accuracy dependent on HA accuracy called lack of closure We still get two solutions, one true and one false for each reflection from the second derivative. The true solutions should be consistent between the two derivatives while the false solution should show a random variation.

Similar to noise filtering Solvent Flattening Similar to noise filtering Resolve the SIR or SAS phase ambiguity From Glusker, Lewis and Rossi B.C. Wang, 1985 Electron density can’t be negative Use an iterative process to enhance true phase!

The solvent flattening process was made practical by the introduction of the ISIR/ISAS program suite (Wang, 1985) and other phasing programs such DM and PHASES are based on this approach.

Handedness Can be Determined by Solvent Flattening

Does the Correct Hand Make a Difference? YES! The wrong hand will give the mirror image!

Anomalous Dispersion Methods All elements display an anomalous dispersion (AD) effect in X-ray diffraction For elements such as e.g. C,N,O, etc., AD effects are negligible For heavier elements, especially when the X-ray wavelength approaches an atomic absorption edge of the element, these AD effects can be very large. The scattering power of an atom exhibiting AD effects is: fAD = fn + f' + if” fnis the normal scattering power of the atom in absence of AD effects f' arises from the AD effect and is a real factor (+/- signed) added to fn f" is an imaginary term which also arises from the AD effect f" is always positive and 90° ahead of (fn + f') in phase angle The values of f' and f" are highly dependent on the wave-length of the X-radiation. In the absence AD effects, Ihkl = I-h-k-l (Firedel’s Law). With AD effects, Ihkl ≠ I-h-k-l (Friedel’s Law breaks down). Accurate measurement of Friedel pair differences can be used to extract starting phases if the AD effect is large enough.

Breakdown of Friedel’s Law (Fhkl Left) Fn represents the total scattering by "normal" atoms without AD effects, f’ represents the sum of the normal and real AD scattering values (fn + f'), f" is the imaginary AD component and appears 90° (at a right angle) ahead of the f’ vector and the total scattering is the vector F+++. (F-h-k-l Right) F-n is the inverse of Fn (at -hkl) and f’ is the inverse of f’, the f" vector is once again 90° ahead of f’. The resultant vector, F--- in this case, is obviously shorter than the F+++ vector.

Collecting Anomalous Scattering Data Anomalous scatterers, such as selenium, are generally incorporated into the protein during expression of the protein or are soaked into the crystals in a manner similar to preparing a heavy atom derivative. Bromine, iodine, xeon and traditional heavy atom compounds are also good anomalous scatterers. The anomalous signal, the difference between |F+++| and |F---| is generally about one order of magnitude smaller than that between |FPH(hkl)|, and |FP(hkl)|. Thus, the signal-to-noise (S/n) level in the data plays a critical role in the success of anomalous scattering experiments, i.e. the higher the S/n in the data the greater the probability of producing an interpretable electron density map. The anomalous signal can be optimized by data collection at or near the absorption edge of the anomalous scatterer. This requires a tunable X-ray source such as a synchrotron. The S/n of the data can also be increased by collecting redundant data. The two common anomalous scattering experiments are Multiwavelength Anomalous Dispersion (MAD) and single wavelength anomalous scattering/dfiffraction (SAS or SAD) The SAS technique is becoming more popular since it does not require a tunable X-ray source.

Increasing Number of SAS Structures

Increasing S/n with Redundancy

Multiwavelength Anomalous Dispersion Note: FP = protein FH1 = heavy atom F+PH = F+++ F-PH = F--- F+H” = f”+++ F-H” = f”--- The center of the F+PH and F-PH circles are placed at the end of the vector -F+H” and -F-H” respectively. From Glusker, Lewis and Rossi In the MAD experiment a strong anomalous scatterer is introduced into the crystal and data are recorded at several wavelengths (peak, inflection and remote) near the X-ray absorption edge of the anomalous scatterer. The phase ambiguity resolved a manner similar to the use of multiple derivatives in the MIR technique.

Single Wavelength Anomalous Scattering The SAS method, which combines the use of SAS data and solvent flattening to resolve phase ambiguity was first introduced in the ISAS program (Wang, 1985). The technique is very similar to resolving the phase ambiguity in SIR data. The SAS method does not require a tunable source and successful structure determination can be carried out using a home X-ray source on crystals containing anomalous scatterers with sufficiently large f” such as iron, copper, iodine, xenon and many heavy atom salts. The ultimate goal of the SAS method is the use of S-SAS to phase protein data since most proteins contain sulfur. However sulfur has a very weak anomalous scattering signal with f” = 0.56 e- for Cu X-rays. The S-SAS method requires careful data collection and crystals that diffract to 2Å resolution. A high symmetry space group (more internal symmetry equivalents) increases the chance of success. The use of soft X-rays such as Cr K (= 2.2909Å) X-rays doubles the sulfur signal (f” = 1.14 e-). There over 20 S-SAS structures in the Protein Data Bank.

What is the Limit of the SAS Method f” = 0.56e- using Cu K X-rays

Molecular Replacement Molecular replacement has proven effective for solving macromolecular crystal structures based upon the knowledge of homologous structures. The method is straightforward and reduces the time and effort required for structure determination because there is no need to prepare heavy atom derivatives and collect their data. Model building is also simplified, since little or no chain tracing is required. The 3-dimensional structure of the search model must be very close (< 1.7Å r.m.s.d.) to that of the unknown structure for the technique to work. Sequence homology between the model and unknown protein is helpful but not strictly required. Success has been observed using search models having as low as 17% sequence similarity. Several computer programs such as AmoRe, X-PLOR/CNS PHASER are available for MR calculations.

Molecular Replacement Use a model of the protein to estimate phases Must be a structural homologue (RMSD < 1.7Å) Two step process 1. find orientation of model (red ==> black) 2. find location of orientated model (black ==> blue) px.cryst.bbk.ac.uk/03/sample/molrep.htm

Molecular Replacement Use a model of the protein to estimate phases Need to determine model’s orientation in X1s unit cell Use a Patterson rotation search (a, b, g) zyz convention The coordinate system is rotated by an angle a around the original z axis, then by an angle b around the new y axis, and then by an angle g around the final z axis.

Molecular Replacement Use a model of the protein to estimate phases Need to determine orientated model’s location in X1s unit cell Use an R-factor search Orientated model is stepped through the X1 unit cell using small increments in x, y, and z (eg. x => x+ step) Point where R is lowest represents the correct location Other faster methods are available e.g. PHASER