Download presentation

Presentation is loading. Please wait.

Published byEdward Hatchel Modified over 4 years ago

1
Multidimensional Molecular Replacement. Nicholas M. Glykos & Michael Kokkinidis IMBB, FORTH, Heraklion, Crete, GREECE

2
Rigid-body refinement.

4
2x

7
Rigid-body simulated annealing.

8
The program : Name : Queen of Spades Availability : absolutely free, open-source software, no warranties whatsoever. The distribution includes source code, plenty of documentation, plus pre-compiled executables for Irix, OSF, Linux, Solaris, VMS & windoze. Download the latest version via http://origin.imbb.forth.gr/software/http://origin.imbb.forth.gr/software/ β Current stable version : β, Release 1.0.

9
Using the program : Input : a.pdb file containing the model, and a formatted (ASCII) file containing h,k,l,F,σ(F). Output :.pdb files containing the final coordinates for each model, plus a packing diagram for each solution.

10
Running the program (1) : $ Qs –auto 1 or, $ Qs –auto 2 etc.

11
Running the program (2) : ########################################################## # Target function (can be R-FACTOR, CORR-1 or CORR-2) and # number of minimisations and steps. # TARGETR-FACTOR CYCLES5 STEPS100000000 ############################################################ # Annealing schedule & move size control. # BOLTZMANN START 0.06800 ############################################################ # Reflection selection. # KEEP 0.70 AMPLIT_CUTOFF1.0 SIGMA_CUTOFF2.0 RESOLUTION15.0 3.5.......

12
The algorithm : 1. Assign random initial positions & orientations to all molecules present in the asymmetric unit of the target crystal structure. Calculate F c s from this arrangement. 2. Calculate the R-factor between the F o s and the F c s. Call this R old.

13
The algorithm : 3. Randomly chose and alter the orientation and position of one of the molecules. Calculate the R-factor resulting from the new arrangement (R new ). 4. If R new < R old, then, the new arrangement is accepted and we start again from (3). 5. If the new R-factor is worse, we still accept the move with probability exp[ –(R new – R old ) / T ].

14
The algorithm : 3. Randomly chose and alter the orientation and position of one of the molecules. Calculate the R-factor resulting from the new arrangement (R new ). 4. If R new < R old, then, the new arrangement is accepted and we start again from (3). 5. If the new R-factor is worse, we still accept the move with probability exp[ –(R new – R old ) / T ].

15
Speeding it up : Avoid FFTs : calculate and store (in core) the molecular transform of the search model. Keep a table containing the contribution of each molecule to each reflection. CPU time per step ~ Number of reflections in P1.

16
Annealing schedules : Constant temperature run. Linear temperature gradient (slow cooling). Boltzmann annealing (logarithmic schedule). Heating bath mode.

17
Annealing schedules : Constant temperature run. Linear temperature gradient (slow cooling). Boltzmann annealing (logarithmic schedule). Heating bath mode. The temperature is automatically adjusted in such a way as to keep the fraction of moves performed against the gradient of the target function constant and equal to a user-defined value.

18
Temperature determination : At T=0.3125000, average R=0.59937 At T=0.1562500, average R=0.59707 At T=0.0781250, average R=0.59861 At T=0.0390625, average R=0.59028 At T=0.0195312, average R=0.58783 At T=0.0097656, average R=0.57545 At T=0.0048828, average R=0.55527 At T=0.0024414, average R=0.53016 At T=0.0012207, average R=0.52038 At T=0.0006104, average R=0.51799 At T=0.0003052, average R=0.51524

19
Temperature determination : At T=0.3125000, average R=0.59937 At T=0.1562500, average R=0.59707 At T=0.0781250, average R=0.59861 At T=0.0390625, average R=0.59028 At T=0.0195312, average R=0.58783 At T=0.0097656, average R=0.57545 At T=0.0048828, average R=0.55527 At T=0.0024414, average R=0.53016 At T=0.0012207, average R=0.52038 At T=0.0006104, average R=0.51799 At T=0.0003052, average R=0.51524

20
Temperature determination : At T=0.3125000, average R=0.59937 At T=0.1562500, average R=0.59707 At T=0.0781250, average R=0.59861 At T=0.0390625, average R=0.59028 At T=0.0195312, average R=0.58783 At T=0.0097656, average R=0.57545 At T=0.0048828, average R=0.55527 At T=0.0024414, average R=0.53016 At T=0.0012207, average R=0.52038 At T=0.0006104, average R=0.51799 At T=0.0003052, average R=0.51524

21
Temperature determination : At T=0.3125000, average R=0.59937 At T=0.1562500, average R=0.59707 At T=0.0781250, average R=0.59861 At T=0.0390625, average R=0.59028 At T=0.0195312, average R=0.58783 At T=0.0097656, average R=0.57545 At T=0.0048828, average R=0.55527 At T=0.0024414, average R=0.53016 At T=0.0012207, average R=0.52038 At T=0.0006104, average R=0.51799 At T=0.0003052, average R=0.51524

22
Temperature determination : At T=0.3125000, average R=0.59937 At T=0.1562500, average R=0.59707 At T=0.0781250, average R=0.59861 At T=0.0390625, average R=0.59028 At T=0.0195312, average R=0.58783 At T=0.0097656, average R=0.57545 At T=0.0048828, average R=0.55527 At T=0.0024414, average R=0.53016 At T=0.0012207, average R=0.52038 At T=0.0006104, average R=0.51799 At T=0.0003052, average R=0.51524

23
Temperature determination : At T=0.3125000, average R=0.59937 At T=0.1562500, average R=0.59707 At T=0.0781250, average R=0.59861 At T=0.0390625, average R=0.59028 At T=0.0195312, average R=0.58783 At T=0.0097656, average R=0.57545 At T=0.0048828, average R=0.55527 At T=0.0024414, average R=0.53016 At T=0.0012207, average R=0.52038 At T=0.0006104, average R=0.51799 At T=0.0003052, average R=0.51524

24
Move size control : Constant move size : max(Δt) = d min /max(a,b,c) ) max(Δκ) = d min (in degrees). Move size linearly dependent on current R-factor and time step : max(Δt) = 0.5 R (1.0 - t/t total ) max(Δκ) = π R (1.0 - t/t total )

25
Scaling & bulk solvent correction The default is to scale |Fc|s to |Fo|s using both a scale and a temperature factor even at the relatively low resolution used for molecular replacement calculations.

26
Scaling & bulk solvent correction The default is to scale |Fc|s to |Fo|s using both a scale and a temperature factor even at the relatively low resolution used for molecular replacement calculations. The program implements the exponential scaling model algorithm which allows a computationally efficient and model-independent correction to be applied : F corrected = F p { 1.0 – k sol exp[ -B sol / d 2 ] }

27
Scaling & bulk solvent correction The default is to scale |Fc|s to |Fo|s using both a scale and a temperature factor even at the relatively low resolution used for molecular replacement calculations. The program implements the exponential scaling model algorithm which allows a computationally efficient and model-independent correction to be applied : F corrected = F p { 1.0 – k sol exp[ -B sol / d 2 ] }

28
Examples : An 11D problem. Target structure 1lys, model 2ihl (rmsd 1.52 & 1.56Å). Two molecules of lysozyme per asymmetric unit. Monoclinic space group (P2 1 ), 4Å data. ±20% noise added to error-free data. Solutions appear after ~3.8 hours of CPU time.

29
Examples : A 12D problem. Target structure 1b6q. 30% solvent. Search model : one poly-Alanine helix. One monomer of Rop per a.u. Orthorhombic space group (C222 1 ). Real 15-4Å data. About 120 minutes of CPU time per run.

30
Examples : A 12D problem. Target structure 1b6q. 30% solvent. Search model : one poly-Alanine helix. One monomer of Rop per a.u. Orthorhombic space group (C222 1 ). Real 15-4Å data. About 120 minutes of CPU time per run.

31
Examples : A 12D problem. Target structure 1b6q. 30% solvent. Search model : one poly-Alanine helix. One monomer of Rop per a.u. Orthorhombic space group (C222 1 ). Real 15-4Å data. About 120 minutes of CPU time per run.

32
Examples : A 12D problem. Target structure 1b6q. 30% solvent. Search model : one poly-Alanine helix. One monomer of Rop per a.u. Orthorhombic space group (C222 1 ). Real 15-4Å data. About 120 minutes of CPU time per run. Run1.0-CorrFree 10.27780.3162 20.27440.6903 30.24070.3305 40.26390.3656 50.26320.8358 60.24730.4466 70.25900.4330 80.29370.2821 90.27250.6402

33
Examples : A 12D problem. Target structure 1b6q. 30% solvent. Search model : one poly-Alanine helix. One monomer of Rop per a.u. Orthorhombic space group (C222 1 ). Real 15-4Å data. About 120 minutes of CPU time per run.

34
Examples : A 17D problem. Target structure 1a2p, model 2bni. Three molecules of ribonouclease per asymmetric unit. Trigonal space group (P3 2 ), 15-4Å data. ±10% noise added to error-free data. 2.5 days per run on an Intel PIII at 800MHz.

35
Examples : A 17D problem. Target structure 1a2p, model 2bni. Three molecules of ribonouclease per asymmetric unit. Trigonal space group (P3 2 ), 15-4Å data. ±10% noise added to error-free data. 2.5 days per run on an Intel PIII at 800MHz.

36
Examples : A 23D problem. Target structure : monoclinic form of the A31P Rop mutant containing the equivalent of one 4-α-helix bundle in the asymmetric unit (two monomers). The structure of the orthorhombic form of the same mutant is known (1B6Q.pdb).

37
Examples : A 23D problem. Target structure : monoclinic form of the A31P Rop mutant containing the equivalent of one 4-α-helix bundle in the asymmetric unit (two monomers). The structure of the orthorhombic form of the same mutant is known (1B6Q.pdb). We had been consistently failing to make any progress since December 1998.

38
Examples : A 23D problem. Tried AMoRe & molrep using as search models individual helices, one monomer (helix-turn-helix), or the complete 4-α-helical bundle, with or without side- chains, and at various resolution ranges. Tried X-plor and CNS with several combinations of models, data and PC-refinement protocols. Even did an extensive heavy-atom derivative search.

39
Examples : A 23D problem. Systematic search with AMoRe using one poly-Ala helix as search model : 1. Keep the best 750 models for the first helix (by combining the best 15 orientations with the best 50 positions). 2. For each of those one-helix models, search with a second helix (562,500 models). Keep only those solutions that simultaneously decrease R and increase correlation (29,638 two-helix models). 3. For each of these, search with a third helix (22.2 million models). Keep only those models for which the addition of the third helix both decreased R and increased correlation (273,258 models).

40
Examples : A 23D problem. Systematic search with AMoRe using one poly-Ala helix as search model : 1. Keep the best 750 models for the first helix (by combining the best 15 orientations with the best 50 positions). 2. For each of those one-helix models, search with a second helix (562,500 models). Keep only those solutions that simultaneously decrease R and increase correlation (29,638 two-helix models). 3. For each of these, search with a third helix (22.2 million models). Keep only those models for which the addition of the third helix both decreased R and increased correlation (273,258 models). Best R=0.583, best Corr=0.37.

41
Examples : A 23D problem. Target structure : monoclinic form of 1b6q, model : one poly-Ala helix (13% of atoms). Four helices per asymmetric unit. Space group C2, 15-3.5Å data. Target function 1.0-Corr(Fo,Fc) 36 hours per run on an Intel PIII at 800MHz. Run1.0-CorrFree 10.24370.3509 20.24650.6189 30.24660.5131 40.25570.6295 50.22270.3175

42
Examples : A 23D problem. Target structure : monoclinic form of 1b6q, model : one poly-Ala helix (13% of atoms). Four helices per asymmetric unit. Space group C2, 15-3.5Å data. Target function 1.0-Corr(Fo,Fc) 36 hours per run on an Intel PIII at 800MHz.

43
Examples : A 23D problem.

48
Disadvantages : In most cases, treating the problem as 6n- dimensional is a waste of CPU time. You can only have one search model (ie you can not search simultaneously with your DNA & protein models). The structure of the search model is kept fixed throughout the calculation.

49
Disadvantages : The (putative) evidence from the self-rotation function and/or the native Patterson function are ignored. When the starting model deviates significantly from the target structure, (i) there is no guarantee that the global minimum of any chosen statistic will correspond to the correct solution, (ii) traditional methods may be more sensitive in identifying the correct solution.

50
Advantages : If there are just one or two molecules per asymmetric unit and CPU time is not a problem, the method can be used as a last ditch effort to conclusively show that there is no such thing as a pronounced global minimum (or otherwise ?). The computational procedures differ so much from those used in conventional methods, that the results obtained can be considered as independent.

51
Advantages : The methods only requirement is that the global minimum of the target function (for the given model and data), corresponds to the correct solution.

52
Advantages : The methods only requirement is that the global minimum of the target function (for the given model and data), corresponds to the correct solution. The method does not assume that the self- and cross-vectors are topologically segregated in the Patterson function, and is, thus, more robust in the case of closely-packed structures, or when the molecule deviates significantly from being approximately spherical.

53
Conclusion : n Substituting computing for thinking will almost certainly fail for n 5.

Similar presentations

OK

1 Combination Symbols A supplement to Greenleafs QR Text Compiled by Samuel Marateck ©2009.

1 Combination Symbols A supplement to Greenleafs QR Text Compiled by Samuel Marateck ©2009.

© 2018 SlidePlayer.com Inc.

All rights reserved.

To make this website work, we log user data and share it with processors. To use this website, you must agree to our Privacy Policy, including cookie policy.

Ads by Google