Visualisation of chemical data Brian McMahon Research & Development Officer International Union of Crystallography 5 Abbey Square Chester CH1 2HU


Visualisation of chemical data
Use cases for publication of crystal structures
Brian McMahon, Research & Development Officer, International Union of Crystallography, 5 Abbey Square, Chester CH1 2HU
DataCite Summer Meeting, Hannover, 7-8 June 2010

International Union of Crystallography
- International Scientific Union
- Publishes 8 research journals:
  - Acta Crystallographica Section A: Foundations of Crystallography
  - Acta Crystallographica Section B: Structural Science
  - Acta Crystallographica Section C: Crystal Structure Communications
  - Acta Crystallographica Section D: Biological Crystallography
  - Acta Crystallographica Section E: Structure Reports Online
  - Acta Crystallographica Section F: Structural Biology and Crystallization Communications
  - Journal of Applied Crystallography
  - Journal of Synchrotron Radiation
- Publishes major reference work: International Tables for Crystallography (8 volumes)
- Promotes standard crystallographic data file format (CIF)

Crystal structure reports: data-rich scientific articles
- 3-d positional coordinates
- Atomic motions
- Molecular geometry
- Chemical bonding
- Crystal packing
- Chemical behaviour arising from structure
Two dedicated IUCr journals: Acta Cryst. C, E
Important part of scientific discussion in many other titles: Acta Cryst. B, D, F

Data that inform the discussion
- Raw data (image plate, diffractometer, film)
- Primary data (structure factors)
- Derived data (six-dimensional structural model)

Structural data sets integral to publication
- Every structural paper has an associated data set (CIF)
- These are free to access, even for subscription journals
- Hence accessible from browsable Table of Contents (as well as from article)
- User can download CIF data set directly
- Or visualise the structure interactively in three dimensions, using a standard view supplied by a visualisation applet (Jmol) or a 'helper' application of the reader's choosing (Mercury)

Enhanced figures
IUCr journals provide an authoring toolkit for creating enhanced figures: data visualisations crafted by the author (but allowing the reader to interact fully with the data and create other visualisations if desired).
The enhanced figure will appear (with caption) as a normal figure in the online journal; the PDF/print editions will have an equivalent static view.
Additional views and interactive features can be added by the author if desired.

Data sources
The structural data (CIF) can be uploaded to the journal production office:
- from the user's hard drive (as part of the article submission process)
- from an external database using a known accession code (e.g. dn3141 for Crystallography Journals Online, 3dez for Protein Data Bank)
- or from a registered data DOI (e.g /pdb3dez/pdb )
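The three kinds of data source listed above could be told apart programmatically. The following is a minimal sketch only: the function names `classify_source` and `doi_url` are hypothetical, the accession-code patterns are assumptions inferred from the two examples on the slide (dn3141, 3dez) rather than official formats, and the DOI used in the usage note is a made-up placeholder.

```python
import re

# Illustrative patterns only (assumptions, not official specifications).
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/\S+$")       # e.g. 10.xxxx/suffix
PDB_PATTERN = re.compile(r"^[0-9][A-Za-z0-9]{3}$")   # e.g. 3dez
IUCR_PATTERN = re.compile(r"^[a-z]{2}\d{4}$")        # e.g. dn3141


def classify_source(reference):
    """Guess which kind of data source a reference string denotes.

    Returns one of "doi", "pdb", "iucr" or "file". Anything that does not
    look like a DOI or an accession code is treated as a local file name.
    """
    ref = reference.strip()
    if DOI_PATTERN.match(ref):
        return "doi"
    if PDB_PATTERN.match(ref):
        return "pdb"
    if IUCR_PATTERN.match(ref):
        return "iucr"
    return "file"


def doi_url(doi):
    """Build the resolver URL for a DOI using the public doi.org proxy."""
    return "https://doi.org/" + doi
```

For example, `classify_source("3dez")` yields `"pdb"`, `classify_source("dn3141")` yields `"iucr"`, and a placeholder such as `"10.1234/example-data"` is classified as `"doi"`. A real submission system would need to validate each code against the database concerned.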

DOIs for crystallographic structures (1)
Article in IUCr Crystallography Journals Online
- Acta Cryst. (2010). C66, o274-o278 [ doi: /S ]. Different hydrogen-bonding modes in two closely related oximes, G. Dutkiewicz, H. S. Yathirajan, R. Ramachandran, S. Kabilan and M. Kubicki
- Describes structures of two molecules: 1-chloroacetyl-3-ethyl-2,6-diphenylpiperidin-4-one oxime, C21H23ClN2O2 (at two distinct temperatures), and 1-chloroacetyl-2,6-diphenyl-3-(propan-2-yl)piperidin-4-one oxime, C22H25ClN2O2
- IUCr identifier: dn3141 points to article and associated data sets
- DOI: /S /dn3141sup1.cif
  - DOI for data set (CrossRef) delivers data file directly
  - One data file, three distinct data sets (file internally partitioned: data_I, data_I100K, data_II)
- Other supplementary data sets available (processed experimental data)
  - DOIs: /S /dn3141Isup2.hkl etc.
  - Separate data file for each of three refined structures
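Because one CIF can hold several data sets, a reader who retrieves the file through its DOI still has to pick out the block of interest. The sketch below shows one simple way to partition CIF text at its `data_` headers. The function name `split_cif_blocks` is hypothetical, and the sample content (cell lengths) is invented for illustration; only the block names come from the slide. It is not a full CIF parser: it skips `data_` strings inside semicolon-delimited text fields but ignores other subtleties such as save frames.

```python
import re

# A data block header is the token 'data_' followed by the block name.
HEADER = re.compile(r"\s*data_(\S+)", re.IGNORECASE)


def split_cif_blocks(cif_text):
    """Return a dict mapping each data block name to its list of lines."""
    blocks = {}
    current = None
    in_text_field = False
    for line in cif_text.splitlines():
        # A ';' in column one opens or closes a multi-line text field.
        if line.startswith(";"):
            in_text_field = not in_text_field
        match = None if in_text_field else HEADER.match(line)
        if match:
            current = match.group(1)
            blocks[current] = []
        elif current is not None:
            blocks[current].append(line)
    return blocks


# Invented sample with the three block names mentioned on the slide.
SAMPLE = """\
data_I
_cell_length_a 10.0
data_I100K
_cell_length_a 9.9
data_II
_cell_length_a 12.3
"""

blocks = split_cif_blocks(SAMPLE)
print(list(blocks))
```

Selecting `blocks["I100K"]` then gives the lines of the low-temperature data set alone.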

DOIs for crystallographic structures (2)
Macromolecular structure from Protein Data Bank
- Orotate phosphoribosyltransferase from Streptococcus mutans
- PDB code: 3dez points to structure, associated data files, sequence, visualizations etc.
- DOI: /pdb3dez/pdb
  - DOI for data set (CrossRef) delivers data file directly
  - One data file, one distinct data set (protein structure)
- DOI of associated publication: /S
  - Acta Cryst. (2010). F66, [ doi: /S ]. Structure of orotate phosphoribosyltransferase from the caries pathogen Streptococcus mutans, C.-P. Liu, R. Xu, Z.-Q. Gao, J.-H. Xu, H.-F. Hou, L.-Q. Li, Z. She, L.-F. Li, X.-D. Su, P. Liu and Y.-H. Dong
- PDB also archives structure factors (processed experimental data sets)
  - One per refined structure
  - No distinct DOI assigned
- PDB does not (yet) archive primary data sets
  - More than one per refined structure

DOIs for crystallographic structures (3)
Crystal structure in U. Southampton eCrystals repository
- 2,2-dibutyl-1,3-propanediol
- eCrystals accession code: 643 points to structure, associated data files, visualizations etc.
- DOI: /ecrystals.chem.soton.ac.uk/643
  - DOI assigned by CrossRef delivers portal page to all associated data files
  - Portal links to raw data sets (archived off-site at STFC Atlas Facility)
  - Portal links to derived information (chemical structure as CML files)
- DOI of associated publication:
  - Publisher-assigned DOI if structure published in peer-reviewed journal
  - unpublished

DOIs for crystallographic structures (4)
Unilever Cambridge Centre for Molecular Informatics / Project XYZ
- Proposal for JISC project (partners: IUCr, BioMedCentral, OKF)
- Explore data overlay journal: linking, validation, rights relating to crystal structures:
  - Published (variety of commercial/society publishers)
  - Unpublished (repositories, laboratory collections)
  - Auto-generated annotation
- DOI:
  - Strategy not yet elaborated. Probably includes:
  - DOI assigned by DataCite delivers portal page to all associated data files
  - Portal links to derived information (chemical structure as CML files)
- DOI of associated publication:
  - Publisher-assigned DOI if structure published in peer-reviewed journal
  - unpublished

DOIs for crystallographic structures: summary
- IUCr Crystallography Journals Online: DOI for article (CrossRef); separate DOI for data set (CrossRef)
- Protein Data Bank: DOI for data set (CrossRef)
- eCrystals / University of Southampton: DOI for data collection (CrossRef)
- Unilever Cambridge Centre for Molecular Informatics / Project XYZ: DOI for data collection? (DataCite)
* Need consistent protocols for retrieving data set from larger data collection