The Crystallographic Information File (CIF) Description and Usage Ton Spek, Bijvoet Center for Biomolecular Research Utrecht University Sevilla, 14-Dec.-2010.



Data Handling in the Pre-Internet Era (or when I started crystallography in the 1960s)

Flexowriter for the creation and editing of programs and data

Data Storage in the Past

Archival of Model Parameters in a Publication (Acta Cryst.)

Archival of Reflection Data in a Publication (Acta Cryst.)

Problems Around 1990
- Multiple data storage media (often hardcopy only)
- No standard computer-readable format for archival and data exchange
- Data entry of published data done by retyping
- No easy numerical checking for referees etc.
- CSD database archival by retyping from the published paper
- Multiple typos and inconsistencies in the published data
- Often incomplete information reported

The CIF Solution
- CIF standard proposal for data archival: S.R. Hall, F.H. Allen, I.D. Brown (1991). Acta Cryst. A47,
- Pioneered and adopted by the International Union of Crystallography
- Adopted early by the author of SHELXL, nowadays the most commonly used refinement program (G.M. Sheldrick)

What is CIF ? From:

Practical Approach
- We ignore here the scary details that are not relevant in the current context.
- We will discuss the file structure.
- We will look at its relevance for publication.
- We will discuss software to edit and check the CIF.
- We will look at software that uses CIF as input.

File Structure
- Both computer and human readable
- ASCII-encoded file
- Free format
- Mostly 80 columns wide [officially 2048]
- Parsable in units
- Data order flexible
- Data name and value associations

Constructs
- data_name, where name is the chosen identifier of the data
- Data associations, e.g.
    _cell_length_a (2)
    _diffrn_radiation_source 'sealed tube'
- Repetition (loop)
    loop_
    _symmetry_equiv_pos_as_xyz
    'x, y, z'
    '-x, y+1/2, -z'
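The three constructs on this slide (data block, name/value association, loop) can be illustrated with a small reader. The following is only a minimal sketch, assuming a well-formed file without semicolon-delimited text fields; the function names `tokenize` and `parse_cif` and the cell length value `12.345(2)` are hypothetical and chosen for illustration. Real work should use an established CIF parser.

```python
import re

# A token is a single-quoted string, a double-quoted string, or a bare word.
TOKEN = re.compile(r"'([^']*)'|\"([^\"]*)\"|(\S+)")

def tokenize(text):
    """Yield (kind, value) pairs; kind is 'str' for quoted values, else 'word'."""
    for line in text.splitlines():
        for m in TOKEN.finditer(line.strip()):
            if m.group(3) is not None:
                if m.group(3).startswith("#"):
                    break  # rest of the line is a comment
                yield ("word", m.group(3))
            else:
                yield ("str", m.group(1) if m.group(1) is not None else m.group(2))

def is_name(token):
    return token[0] == "word" and token[1].startswith("_")

def is_keyword(token):
    low = token[1].lower()
    return token[0] == "word" and (low.startswith("data_") or low == "loop_")

def parse_cif(text):
    """Return {block_name: {data_name: value or list of values}}.
    Assumes every item follows a data_ line and every name has a value."""
    blocks, current = {}, None
    tokens = list(tokenize(text))
    i = 0
    while i < len(tokens):
        tok = tokens[i]
        if is_keyword(tok) and tok[1].lower() != "loop_":
            current = blocks.setdefault(tok[1][5:], {})  # strip 'data_'
            i += 1
        elif is_keyword(tok):  # loop_: read the names, then the values
            i += 1
            names, values = [], []
            while i < len(tokens) and is_name(tokens[i]):
                names.append(tokens[i][1])
                i += 1
            while i < len(tokens) and not (is_name(tokens[i]) or is_keyword(tokens[i])):
                values.append(tokens[i][1])
                i += 1
            for k, name in enumerate(names):
                current[name] = values[k::len(names)]  # one column per name
        elif is_name(tok):  # plain name/value association
            current[tok[1]] = tokens[i + 1][1]
            i += 2
        else:
            i += 1
    return blocks

example = """
data_example
_cell_length_a 12.345(2)
_diffrn_radiation_source 'sealed tube'
loop_
_symmetry_equiv_pos_as_xyz
'x, y, z'
'-x, y+1/2, -z'
"""
blocks = parse_cif(example)
print(blocks["example"]["_symmetry_equiv_pos_as_xyz"])
```

Because values are looked up by data name rather than by position, the order of items in the file does not matter, which is the "data order flexible" property of the format.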

Construct for Text
- Text can be included between semicolons
- Used for Acta Cryst. Sections C & E abstract and comment sections
- Example:
    _publ_section_comment
    ;
    This paper presents the first example of a very important compound.
    ;
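A semicolon-delimited text field can be picked out with a simple line scan. This is a hedged sketch, not the method used by any particular program: it assumes that the opening and closing semicolons each stand in the first column of a line and that the data name is the last item on the line before the opening semicolon. The function name `extract_text_fields` is hypothetical.

```python
def extract_text_fields(cif_text):
    """Return {data_name: text} for every semicolon-delimited text field."""
    fields = {}
    last_name = None
    buffer = None  # None while outside a text field
    for line in cif_text.splitlines():
        if buffer is not None:
            if line.startswith(";"):
                # A semicolon in column 1 closes the field.
                fields[last_name] = "\n".join(buffer).strip()
                buffer = None
            else:
                buffer.append(line)
        elif line.startswith(";"):
            # A semicolon in column 1 opens the field; keep any text after it.
            buffer = [line[1:]]
        else:
            words = line.split()
            if words and words[-1].startswith("_"):
                last_name = words[-1]
    return fields

sample = """
data_example
_publ_section_comment
;
This paper presents the first example of a very important compound.
;
"""
fields = extract_text_fields(sample)
print(fields["_publ_section_comment"])
```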

CIF Example File

CIF Completion
- CIF files are created by the refinement program (e.g. SHELXL).
- Missing data can be added with a text editor, enCIFer (from the CCDC) or publCIF (from the IUCr).
- The syntax can be checked with a locally installed version of the program enCIFer (Freely Available:
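In a CIF, a value that is not known is written as a question mark, so the items that still need completion can be listed mechanically. The sketch below assumes simple one-line name/value pairs only (values inside loops and text fields are not inspected); the function name `find_missing_items` and the sample content are hypothetical.

```python
def find_missing_items(cif_text):
    """List data names whose value is the CIF 'unknown' placeholder '?'."""
    missing = []
    for line in cif_text.splitlines():
        parts = line.split()
        if len(parts) == 2 and parts[0].startswith("_") and parts[1] == "?":
            missing.append(parts[0])
    return missing

sample = """
data_example
_exptl_crystal_description ?
_exptl_crystal_colour ?
_cell_measurement_temperature 150
"""
missing = find_missing_items(sample)
print(missing)
```

A dedicated editor such as enCIFer or publCIF remains the better tool for actually filling in the values, since it also checks the syntax.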

Missing Data PROGRAM enCIFer

Note on Editing the CIF
The purpose of editing the CIF is to add missing information. Some Acta Cryst. authors have been found to polish away less favourable numerical values. This leaves traces, is generally detected by the validation software, and is not good for the career of the culprit…

CIF Applications
- Data archival
- Deposition to the CSD (=> CSD number)
- Supplementary material for publication
- Input for geometry and graphics software, e.g. Mercury (from the CCDC) and PLATON
- Standard format for publications (structure communications) in Acta Cryst. Sections C & E
- Structure validation

Reflection CIF (FCF)

Calculations on Published Structures
- CIF data for a published structure can be obtained from the CCDC.
- FCF data are generally only retrievable from the IUCr website for Acta Cryst. papers.
- PLATON has a tool to re-create .ins and .hkl files for re-refinement with SHELXL.
- Useful to investigate difference maps for more details.

Structure Validation
- Pioneered by the IUCr
- Currently most journals have implemented a validation scheme.
- Papers:
    A.L. Spek (2003). J. Appl. Cryst. 36,
    A.L. Spek (2009). Acta Cryst. D65,

How is Validation Implemented
- Computer-readable structure analysis results in CIF format (Syd Hall & George Sheldrick)
- A file (Check.def) defines the issues that are tested, with levels of severity and associated explanation and advice.
- The tests are executed by the program PLATON.
- The tests can be executed either in-house or through the web-based IUCr checkCIF server.

EXAMPLE OF PLATON GENERATED ALERTS FOR A RECENT PAPER PUBLISHED IN J.Amer.Chem.Soc. (2007) Attracted special attention in Chemical and Engineering News (Referees obviously did not Bother)

Which Key Issues are Addressed
- Missed symmetry ("being Marshed")
- Wrong chemistry (misassigned atom types)
- Too many, too few or misplaced H-atoms
- Missed solvent accessible voids in the structure
- Missed twinning
- Absolute structure
- Data quality and completeness

FCF-VALIDATION
- Check for missing reflections
- Check for CIF/FCF consistency
Automatic twinning detection as part of the IUCr checkCIF procedure
- Detection of ignored twinning
- Detection of applied twinning correction without being reported
(Already available via PLATON/Check)

Common CIF Problem
There is a frequent misunderstanding about the correct value of the 'population' (occupancy) parameter in the CIF for an atom on a special position. For example, a fully occupied atom position on an inversion centre has to be specified as 0.5 in the .res file and as 1.0 in the CIF.
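The relation behind the 0.5 versus 1.0 example can be written out as simple arithmetic. The sketch below assumes that the value in the .res file equals the chemical occupancy divided by the order of the site symmetry group (2 for an inversion centre), while the CIF carries the chemical occupancy itself. The function names are hypothetical.

```python
def res_population(chemical_occupancy, site_symmetry_order):
    """Population as written in the .res file for an atom on a special position."""
    return chemical_occupancy / site_symmetry_order

def cif_occupancy(res_value, site_symmetry_order):
    """Occupancy as it should appear in the CIF, recovered from the .res value."""
    return res_value * site_symmetry_order

# Fully occupied atom on an inversion centre (site symmetry order 2).
in_res = res_population(1.0, 2)
in_cif = cif_occupancy(in_res, 2)
print(in_res, in_cif)
```

An atom on a general position has site symmetry order 1, so both values coincide there and the confusion only arises for special positions.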

Common Validation Problems
- CIF and FCF not from the final refinement
- SHELXL defaults left unchanged
- Completeness (up to 25 degrees): do not cut
- Data names in CIF and FCF not identical
- 'Non-standard' reflection CIFs: twinning, powder, incommensurate structures
- Improper parameter transformations (Uij's)
- DAMP 0 0
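One of the problems listed above, data names that differ between CIF and FCF, can be checked before submission. The sketch below rests on an assumption: that "data names" refers to the data_ block identifiers that pair a CIF block with its FCF block. The function names and sample contents are hypothetical, and a line inside a text field that happens to start with data_ would be misread by this simple scan.

```python
def block_names(text):
    """Return the identifiers of all data_ blocks found in a CIF or FCF."""
    names = []
    for line in text.splitlines():
        stripped = line.strip()
        if stripped.lower().startswith("data_"):
            names.append(stripped.split()[0][5:])  # drop the 'data_' prefix
    return names

def unmatched_blocks(cif_text, fcf_text):
    """Return (blocks only in the CIF, blocks only in the FCF)."""
    cif = set(block_names(cif_text))
    fcf = set(block_names(fcf_text))
    return sorted(cif - fcf), sorted(fcf - cif)

cif_sample = "data_compound1\n_cell_length_a 10.0\n"
fcf_sample = "data_shelx\n_cell_length_a 10.0\n"
only_cif, only_fcf = unmatched_blocks(cif_sample, fcf_sample)
print(only_cif, only_fcf)
```

Two empty lists mean that every block in one file has a partner of the same name in the other.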

Concluding Remarks
- The CIF standard makes it easy to do follow-up calculations for published structures.
- The available information is more complete.
For more information:

Resources
Lectures: public.me.com/ton_spek (Password: Utrecht)
enCIFer:
publCIF: journals.iucr.org/c/services
PLATON: