What is Needed for Proper Structure Validation and How to Act upon Validation ALERTS Ton Spek Utrecht University The Netherlands ACA-Denver, july 26, 2016.

Slides:



Advertisements
Similar presentations
Continuous improvement of macromolecular crystal structures Tom Terwilliger (Los Alamos National Laboratory) DDD WG member ECM 2012: Diffraction Data Deposition.
Advertisements

Marshing: Past, Present and Future Ton Spek National Single Crystal Service Facility Utrecht University The Netherlands.
Structure Validation : How to distinguish GOOD and reliable single crystal structures from BAD and UGLY reports A.L.Spek Utrecht University The Netherlands.
Crystal Structure Validation : The IUCr tool to distinguish GOOD and trustable single crystal structures from BAD and UGLY reports Ton Spek Bijvoet Center.
The MISSYM Family: Software for the detection of Missed or Pseudo Symmetry A.L.Spek Utrecht University The Netherlands.
Crystal Structure Validation A Tool to distinguish GOOD and reliable single crystal studies from BAD and UGLY reports. Ton Spek National Single Crystal.
PLATON, New Options Ton Spek, National Single Crystal Structure Facility, Utrecht, The Netherlands. Delft, Sept. 18, 2006.
PLATON TUTORIAL A.L.Spek, National Single Crystal Service Facility,
Structure Comparison, Analysis and Validation Ton Spek National Single Crystal Facility Utrecht University.
CIF, PLATON-2014, SHELXL-2014, VALIDATION & SQUEEZE
Introduction to protein x-ray crystallography. Electromagnetic waves E- electromagnetic field strength A- amplitude  - angular velocity - frequency.
An Update on Current and New Structure Analysis Tools in PLATON Ton Spek, Bijvoet Center for Biomolecular Research, Utrecht University, The Netherlands.
PLATON Validation and Analysis Tools Ton Spek National Single Crystal Service Facility, Utrecht University, The Netherlands. Sevilla, 14-Dec-2010.
Disorder.
PLATON/CheckCIF Issues Ton Spek Utrecht University The Netherlands Bruker User Meeting UCSD, La Jolla, March 22-24, 2012.
Small Molecule Example – YLID Unit Cell Contents and Z Value
PLATON/SQUEEZE Ton Spek Bijvoet Center Utrecht University, The Netherlands. PLATON Workshop Chicago, 24-July-2010.
The PLATON Toolbox Ton Spek National Single Crystal Service Facility, Utrecht University, The Netherlands. Kyoto, 20-Aug-2008.
Automatic Detection of Poor or Incorrect Single Crystal Structures A.L.Spek Utrecht University The Netherlands.
Structure Validation Challenges in Chemical Crystallography Ton Spek Utrecht University, The Netherlands. Madrid, Aug. 26, 2011.
Chem Thermal Ellipsoids Remember that thermal ellipsoids can indicate problems with a refinement even when the R factors seem to indicate a reasonable.
Software Tools for the Analysis of Z’ > 1 Structures A.L.Spek, Utrecht University, National Single Crystal Service Facility The Netherlands. BCA-Meeting,
CheckCIF/PLATON Crystal Structure Validation
The System-S Approach to Automated Structure Determination: Problems and Solutions Ton Spek National Single Crystal Service Utrecht University, The Netherlands.
3. Crystals What defines a crystal? Atoms, lattice points, symmetry, space groups Diffraction B-factors R-factors Resolution Refinement Modeling!
Automated Crystal Structure Validation Ton Spek, National Single Crystal Facility, Utrecht University, Utrecht, The Netherlands Platon Workshop Chicago,
Why Crystal Structure Validation ? Ton Spek, National Single Crystal Facility, Utrecht University, Utrecht, The Netherlands Slovenia, 17-june-2010.
PLATON, AN OVERVIEW Ton Spek National Single Crystal Service Facility, Utrecht University, The Netherlands. Platon Workshop Chicago, 24-July-2010.
Why Small Molecule Crystal Structure Validation ? Ton Spek, National Single Crystal Facility, Utrecht University, Utrecht, The Netherlands Sevilla, 14-Dec-2010.
The Crystallographic Information File (CIF) Description and Usage Ton Spek, Bijvoet Center for Biomolecular Research Utrecht University Sevilla, 14-Dec
SYSTEM-S The Challenge of Automated Structure Determination Ton Spek National Single Crystal Service Utrecht University, The Netherlands.
Structure Validation in Chemical Crystallography with CheckCIF/PLATON Ton Spek, National Single Crystal Service Facility, Utrecht University The Netherlands.
Structure Validation in Chemical Crystallography Ton Spek, Bijvoet Centre for Biomolecular Research, Utrecht University, The Netherlands. CCP4-Leeds, 5-Jan
Structure Validation in Chemical Crystallography Principles and Application Ton Spek, National Single Crystal Service Facility, Utrecht University SAB-Delft,
PLATON and STRUCTURE VALIDATION Ton Spek National Single Crystal Service Facility, Utrecht University, The Netherlands. Goettingen, 13-Oct-2007.
Methods in Chemistry III – Part 1 Modul M.Che.1101 WS 2010/11 – 10 Modern Methods of Inorganic Chemistry Mi 10:15-12:00, Hörsaal II George Sheldrick
Twinning Non-merohedral twinning Merohedral twinning.
On the Proper Reporting and Archival of Crystal Structure Data Ton Spek Utrecht University, NL (ACA2015-Philadelphia)
PLATON, A set of Tools for the Interpretation of Structural Results Ton Spek National Single Crystal Service Facility, Utrecht University,The Netherlands.
Structure Validation Ton Spek, Bijvoet Centre Utrecht University The Netherlands PLATON Course, Utrecht, April 18, 2012.
Update on Selective Editing and Implications for Staff Skills International Trade Conference September 2008 Ken Smart.
PLATON TUTORIAL A.L.Spek, National Single Crystal Service Facility, Utrecht, The Netherlands.
NUI, Galway X-Ray Diffraction Best Practice P. McArdle Feb Saving the X-ray Tube 2.Getting the most from your data 3.Large Unit Cell Parameter Problem.
Ton Spek Utrecht University The Netherlands IUCr-Montreal Aug 11, 2014
Crystallographic Databases I590 Spring 2005 Based in part on slides from John C. Huffman.
Information for (New) Co-editors This presentation was used at the new co-editors induction meeting held during the IUCR Osaka Congress, August It.
1. Diffraction intensity 2. Patterson map Lecture
The PLATON/TwinRotMat Tool for Twinning Detection Ton Spek National Single Crystal Service Facility, Utrecht University, The Netherlands. Delft, 29-Sept-2008.
PLATON, A Multipurpose Crystallographic Tool Ton Spek, National Single Crystal Service Facility, Utrecht, The Netherlands.
The PLATON Toolbox History and Applications Ton Spek Utrecht University, The Netherlands. Bruker User Meeting, UCSD La Jolla, March 22-24, 2012.
Methods in Chemistry III – Part 1 Modul M.Che.1101 WS 2010/11 – 8 Modern Methods of Inorganic Chemistry Mi 10:15-12:00, Hörsaal II George Sheldrick
PLATON/SQUEEZE Ton Spek Bijvoet Center Utrecht University, The Netherlands. PLATON Course Utrecht, April 18, 2012.
Absolute Configuration Types of space groups Non-centrosymmetric Determining Absolute Configuration.
Methods in Chemistry III – Part 1 Modul M. Che
Updates on Validation and SQUEEZE Ton Spek Utrecht University Bruker User Meeting Jacksonville (FL), Jan 19, 2016.
Automated Refinement (distinct from manual building) Two TERMS: E total = E data ( w data ) + E stereochemistry E data describes the difference between.
Dr.V.Jaiganesh Professor
Modeling and Simulation CS 313
The PLATON checkCIF and SQUEEZE Tools
(check)CIF, SHELXL-2014, SQUEEZE
What Makes a Crystal Structure Report Valid?
Software for Crystallographic and Rietveld Analysis
Ton Spek Utrecht University (NL) ECS4-School, Warsaw, July 2-7, 2017
IAEA E-learning Program
Crystal Structure Validation with PLATON
Crystal structure determination
Why Crystal Structure Validation ?
The SQUEEZE Tool in PLATON and its use with SHELXL2013
Ton Spek Utrecht University The Netherlands Vienna –ECM
The PLATON/TwinRotMat Tool for Twinning Detection
Presentation transcript:

What is Needed for Proper Structure Validation and How to Act upon Validation ALERTS Ton Spek Utrecht University The Netherlands ACA-Denver, july 26, 2016

CIF was Introduced in 1991 The main target of many X-ray studies is the 3D molecular structure in the form of coordinates, displacement parameters, bonds and angles etc. The computer readable CIF file was introduced in the 1990’s as a replacement for the printed results. CIF facilitates easy data transfer, archival (CSD), geometry calculations, graphics, publication (Acta Cryst.) and validation (checkCIF) of the results of a crystal structure. A CIF, in its basic form, mainly reports the authors interpretation of the experimental diffraction data determination

However …. A CIF in its basic form is insufficient for proper validation of a structure report, in particular in case of unusual, reported or claimed, results. What is needed is detailed information of the refinement model, including the Fobs 2, Fcalc 2 and sigma(Fobs 2 ) listing file (FCF). An FCF allows for statistical analysis of the fit of the authors structure model on the reflection data (missed twinning, absolute structure etc.).

Good Scientific Practice In addition, it is good scientific practice to also supply and archive the unmerged set of observed data on which a study is based. The subject material might be unique or difficult to obtain ! This practice is now fully implemented in the SHELXL2014 refinement program with the embedding of the.res,.hkl (and.fab) files in an extended CIF format file. Other packages follow. This allows for an alternative refinement with the archived data (e.g. A disorder model refinement versus a published SQUEEZE based refinement)

IUCr checkCIF validation Structure validation based on a CIF was pioneered by the IUCr + Acta Cryst. C with the creation of a validation report, also starting in the early 1990’s Most journals currently require validation reports created by the IUCr checkCIF server for structure reports. Significant knowledge of the crystallographic procedures (and underlying chemistry) will still be needed for the proper interpretation of and acting upon ALERT messages that come in levels A, B, C & G Unfortunately, not all authors, referees and chemical journals appear to have that knowledge available.

ALERTS A,B,C,G Level All ALERTS should be inspected and possibly acted upon (in particular A-ALERTS with a suitable response) The on-line documentation of an ALERT should be consulted. Not all ALERTS are ERRORS but may point to interesting or unusual issues or just offer relevant information. A combination of low level ALERTS (e.g. C or G) may point to serious problems as well. checkCIF is a learning process and is still developing Following is an interesting case study that illustrates the need for a continuous update of checkCIF

QUESTION : Is this Structure Correct ? The Aug IUCr/checkCIF reports no serious ALERTS checkCIF: No Voids No Unusual Contacts Slightly high Difference Density Range R1 = wR2 = S = Rhomin Rhomax 0.47 What would a synthetic chemist’s comment be ?

No, The Structure was Deliberately INVENTED - This devious structure was clevery created by Natalie Johnson et al., Newcastle, UK, aiming to beat checkCIF & FCF. - Her point was that it is easy to invent a fake data set that does not raise checkCIF ALERTS. - So does the deposition of an FCF avoid frauded structures? - She even created diffr. Images. - Presented as an excellent Poster during ECM-2015, Croatia. But: Every crime leaves its traces ….. and we can learn from them.

Unusual Actual difference map Density Clear traces of the ‘Crime’ are in the Difference Density Map Difference map in the CH2 plane The CH 2 Hydrogen atoms at calculated positions are definitely not in F(obs) Expected type of difference map

NATALIE YLID How was Structure ‘Natalie’ created ? YLID ‘Starting Material’ (Bruker) Chemical issue: Se + No example in the CSD for the 6-membered ring System Mogul : Se – C outlier Se

  No H-density No Density on Bonds CURRENT PLATON/CHECKCIF VALIDATION REPORT FOR ‘NATALIE’

On the Positive side (Reported) attempts to beat checkCIF are extremely helpful (thanks Natalie!), not only to address fraud attempts (which have unfortunately happened in the past) but also as a source for additional tests for the detection of author errors. The two added ALERTS are excellent examples for this. The newly introduced ALERTS are helpful for the detection of accidently misplaced Hydrogen atoms. Analyses of the FCF may give further clues as shown next.

Look in Detail at the Reflection Data Partly, such information is in the.ckf file created by PLATON/checkCIF from the FCF data or use e.g. tools available in PLATON The Normal Probability PLOT is not perfect for the fake ‘Natalie’ structure with its low R-factor values

Look at the sigma(I) versus sqrt(I) Plot ‘Nardelli’ (Bruker) versus non-fake Bruker data Poisson Statistics

Look at log(Iobs) versus log(Icalc) Scatter Plot Log(Iobs) Log(iobs) Log(Icalc) Real Data Fake Data Normal spreading at low ILittle speading at low I

Inspect the List of Missing Reflections

Inspect R-value as a function of sin(theta)/lambda Note the very low wR 2 values at higher resolution

FCF-Validation (in.ckf) adds: Analysis of the quality of the refinement Analysis of the difference map (peaks) Detection of void content (SQUEEZE) Detection of missing reflections Detection of outliers Detection of missed twinning Check of the reported absolute structure

Common checkCIF Issues The ADDSYM ALERTS ADDSYM is designed to warn for possible higher symmetry within the supplied coordinate set with a suggestion for a revised space group. The tolerances used are relatively high in order to catch most known mis-assignments. An ADDSYM ALERT not necessarily means an error, but just something to be investigated and reported on. Many structures show pseudo-symmetry, in particular inorganic compounds and (chiral) Z’ = 2 structures The suggested higher symmetry should be tested with the reflection data (merging index) and refinement.

Common checkCIF Issues With multiple components in the unit cell or with polymers it is not always clear what the best choise of the formula unit is. CheckCIF might make a different choise than an author on the basis of chemical insight. The main issue is that the reported Z and related quantities are consistent with the reported formula unit. CheckCIF uses its own f’ and f’’ anomalous dispersion parameter values for a synchrotron based wavelength as a check against those supplied. The correct lambda should be supplied (not e.g. MoKa)

Outliers reported in checkCIF listing Details on missing reflection and outlier ALERTS can be found in the FCF validation report (in the.ckf file 

Inspect the list of Outliers in the.ckf listing

Common Validation Issues A BASF/TWIN refinement needed when the Flack x deviates significantly from zero. Inversion twinning is not automatically included in the refinement model (structure factor calculation) Do not cut data as an R-factor beautifier to resolution 0.6 Ang-1. Keep all measured data. Look carefully at difference maps, in particular at H-atom sites.

Oher Common ALERTS Make sure that at least the main molecule has its CG within the bounds of the unit cell and other species at contact distances The FCF file should have the ‘LIST 4’ style and not ‘LIST 6’. Short contacts: Might indicate missing bridging H or C-H being N or incorrect structure model. Hirshfeld ALERTS: check correct atom type C-C bond density (sign of good data)

Concluding Remarks CheckCIF is intended to be helpful and to protect against oversights rather than being annoying Many ALERTS have their origin in previous ‘incidents’ Some ALERTS can point to interesting features in a structure and might be relevant to be discussed Some ALERTS may be less important in a given context where the main interest is the chemistry. Sometimes, a combination of minor ALERTS can point to a problem that needs to be resolved The set of ALERTS is still fine tuned and extended

Thank you ! More info: (including this powerpoint presentation) Please send suggestions and examples (with data) of (annoying) issues (or ‘frauded’ structures that fail detection by checkCIF) to: