EMBL-EBI Chemistry & the PDB MSDchem Primary Developer: Dimitris Dimitropoulos.

Slides:



Advertisements
Similar presentations
Chapter 4 Chemistry of Carbon
Advertisements

Chemistry of Carbon Molecules
Recognizing the 4 main classes of organic molecules.
What is Matter?.
EBI is an Outstation of the European Molecular Biology Laboratory. PDBeChem The Ligand Database.
INTRODUCTION TO THE BEILSTEIN AND GMELIN DATABASES Margarete Bower Chemistry Library.
Unit 2: Chemical Interactions Chapter 8: Carbon Chemistry
Integration of Protein Family, Function, Structure Rich Links to >90 Databases Value-Added Reports for UniProtKB Proteins iProClass Protein Knowledgebase.
Recent developments 1) Tests (outlier analysis) and Bug fixing ( with Paul) 2) Regeneration of Values of Bonds and Bond-angles existing all structures.
Unit 1: Biochemistry Part III: Organic Chemistry Carbohydrates.
High Throughput Processing of the Structural Information of the Protein Data Bank Zoltán Szabadka, Vince Grolmusz Department of Computer Science Eötvös.
Getting More from CrossFire Helen Schofield MIDAS and UMIST Stephen Briggs Beilstein Information Systems Paul Meehan MIDAS.
Arrays (Chapter 5)‏ Definition Applications One-Dimensional –Declaration –Initialization –Use Multidimensional.
Management and Distribution of Chemical Data in the Protein Data Bank John Westbrook, Dimitris Dimitropoulos, Jasmine Young, Peter Rose, Philip E. Bourne.
Database-Driven Web Sites, Second Edition1 Chapter 8 Processing ASP.NET Web Forms and Working With Server Controls.
Bringing Structure to Biology: Small Molecules and the PDBe
1 Chapter 11: Organic Compounds: Alkanes. 2 ORGANIC COMPOUNDS: In 1828, Friedrich Wöhler first synthesized an organic compound from an inorganic source.
IDInstanceStatusSelectionDetailsScore LIGA505 LIGAND CODE MATCHED XYZ XYPA503CLOSE MATCH XYPA504NO MATCH MANA500PASSED GLC A501 PASSED NAG A502 PASSED.
EMBL-EBI Adel Golovin MSDsite The project is funded by the European Commission as the TEMBLOR, contract-no. QLRI-CT under the RTD programme.
Carbon Chemistry - Properties of Carbon Carbon Atoms and Bonding Carbon atoms and the bonds between them can be modeled in several ways.
The IUPAC Stability Constants Database (SC-Database) The definitive collection of all significant published metal-complex stability constants Title Structure.
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Type Today’s Date Here The Chemistry of Life. Brainteaser What fastens two people yet touches only one? Answer Here.
TOPIC 11 – ORGANIC CHEMISTRY. TOPIC 11 – Regents Review Organic compounds consist of carbon atoms bonded to each other in chains, rings, and networks.
1 Cheminformatics David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Data Integration and Management A PDB Perspective.
EBI is an Outstation of the European Molecular Biology Laboratory. MSDchem and the chemistry of the wwPDB EMBO 22nd-26th September 2008 EMBL-EBI Hinxton.
ATOMS ELEMENTS PERIODIC TABLE MOLECULES COMPOUNDS.
Organic Chemistry. Introduction Organic chemistry is the study of carbon and its compounds. The major sources of carbon are the fossil fuels petroleum,
EBI is an Outstation of the European Molecular Biology Laboratory. PDBeChem The Ligand Database.
Functional Groups CHEMISTRY 11 MS. MCGRATH. Functional Groups A functional group is a portion of a molecule that is a recognizable / classified group.
Macromolecules. Objectives List the elements that make up living things. List the four kinds of macromolecules. Describe carbohydrates, lipids, fats and.
Use of Machine Learning in Chemoinformatics
Description and exemplification use of a Data Dictionary. A data dictionary is a catalogue of all data items in a system. The data dictionary stores details.
Chemistry of Living Things ORGANIC AND INORGANIC COMPOUNDS.
Chemistry of Life. Elements and Compounds Atoms are the smallest unit of matter that cannot be broken down. An element is a substance made up of atoms.
1 Chapter 11: Organic Compounds: Alkanes. 2 ORGANIC COMPOUNDS: In 1828, Friedrich Wöhler first synthesized an organic compound from an inorganic source.
Carbon Chemistry Chapter 8. Essential Questions What are the 3 different arrangements of carbon? What are the 3 different arrangements of carbon? What.
EBI is an Outstation of the European Molecular Biology Laboratory. A web based integrated search service to understand ligand binding and secondary structure.
Essential Questions How does the structure of water make it a good solvent? What are the similarities and differences between solutions and suspensions?
Biomolecules Macromolecules. Organic Compounds An organic compound is any compound that contains atoms of the element carbon. Carbon has 2 electrons in.
Carbon Chemistry Chapter 8. Organic Compounds Organic compounds are compounds composed of carbon based molecules. examples: sugar, starch, fuels, synthetic.
Notes 8-2 Carbon Compounds. Organic compounds Made up of carbon Have similar properties such as melting point, boiling point, odor, electrical conductivity,
Sequence: PFAM Used example: Database of protein domain families. It is based on manually curated alignments.
Sample Registration - Introduction
Protein Proteins are biochemical compounds consisting of one or more polypeptides typically folded into a globular or fibrous form in a biologically functional.
Chemistry 22.3.
Chemistry 8.1.
PDBemotif A web based integrated search service to understand ligand binding and secondary structure properties in macromolecular structures.
What elements are found in a phospholipid?
Organic Chemistry Grade 12
Molecules Element Particle Bond Compound
4.2 Carbon compounds and cells
Carbon Based Molecules
Dimitris Dimitropoulos
Classification of Matter
Classification of Matter
Daylight and Discovery
Atoms, Molecules, Elements, and Compounds
Goal 5 – Elements and Compounds
Living By Chemistry SECOND EDITION
Carbon Chemistry Vocabulary Review
Table of Contents Properties of Carbon Carbon Compounds
Activity #16- Classification of Matter Chart Make the chart below in your notebook: ATOM ELEMENT COMPOUND MIXTURE MOLECULE.
Chapter 10.1: Organic chemistry fundamentals
CARBON AND THE MOLECULAR DIVERSITY OF LIFE The Importance of Carbon
The molecules that form life.
The Chemistry of Carbon
Organic Compounds (molecules that contain Carbon) AKA Carbon Macromolecules Chapter 2-3.
Presentation transcript:

EMBL-EBI Chemistry & the PDB MSDchem Primary Developer: Dimitris Dimitropoulos

EMBL-EBI The chemical database

EMBL-EBI MSDchem ligand dictionary  Complete, clean, up to date collection of all the chemical species and small molecules in the PDB  A ligand in MSDchem is a complete, distinct stereo isomer of a chemical compound  Atoms and element types  Bonds and bond orders  Stereo configuration of atoms and bonds in cases of stereo-isomers (R/S – E/Z)  Atom names and coordinates are not fundamental properties

EMBL-EBI Role in the MSD database  An integral component in the core of MSD database  Relational reference from entities where a molecule or atom name is used in the PDB (protein residues and atoms)  It is not possible for an ATOM line: HETATM 4342 C2 PLA C to be loaded if the “PLA” ligand is not defined or it does not include a “C2” atom.

EMBL-EBI Chemistry and PDB  Eliminate chemical inconsistencies from new PDB entries  Structure and derived properties of a ligand apply automatically to residues and bound molecules that reference it  The basic structure is carefully determined during curation, and a rich set of derived attributes is calculated for each ligand  Graph isomorphism is being applied to check the consistency of the PDB, taking stereo-configuration into account  Old legacy PDB entries are chemically “corrected” when loaded in the MSD database  In thousands of cases errors are identified and corrected, involving most of them times inconsistent naming or different stereo-configuration  Exchanged in cooperation with RCSB and the wwPDB

EMBL-EBI More than just the PDB codes All ligands are modelled as separate inter-related ligands and the appropriate one is referenced  No distinction is made in the PDB between ribo- and deoxyribonucleotides (all are identified with the same residue name i.e., A, C, G, T, U, I)  Modified nucleic acids are given as +A etc regardless of modification  No distinction between different topological variants (12 different variants can be found for HIS in PDB)

EMBL-EBI Derived information  External scientific software (CACTVS, VEGA, CORINA, ACD-labs, CCP4, OELIB) together with in house development has been used to derive:  Stereochemistry (R/S – E/Z) DCF C4' R C3' S C1' R DCM C4' S C3' R C1' S THIOALANINE (ALT) CC(N)C(O)=S - (2S)-2-aminopropanethioic O-acid  Smiles and detailed gifs  Systematic IUPAC names

EMBL-EBI Derived information Fingerprints:  A bit string in hexadecimal form that indicates the presence or not of segments from predefined lists  Useful for fast search and classification  Different libraries of predefined lists can be set  Currently calculated for the CACTVS library (500 segments) MoleculeSegments BitStrin g Fingerprint: 2A

Click to see attributes

EMBL-EBI Search options  By ligand code  By ligand name or synonym  By formula or formula range  By non stereo substructure  By non stereo superstructure  By exact stereo or non stereo structure  By fingerprint similarity  By fragment expression

Activate JME molecule editor

Clear structureDelete atomChange atom type after drawing bonds JME editor allows generation of SMILE string to enter search mode Click when complete

EMBL-EBI Search for ligand structures containing 3-chloro-phenol Results Click to get Details for EAA Get PDB entries and bound molecule instances containing 3-chloro-phenol

EAA details substructure of 3-chloro-phenol

Get the PDB entries that include EAA Get the bound molecule instances and site interaction details Viewing & saving options

EMBL-EBI KWT (1S,6BR,9AS,11R,11BR)-9A,11B-DIMETHYL-1-[(METHYLOXY)METHYL]- 3,6,9-TRIOXO-1,6,6B,7,8,9,9A,10,11,11B-DECAHYDRO-3H-FURO[4,3,2- DE]INDENO[4,5-H][2]BENZOPYRAN-11-YL ACETATE (1S,6bR,9aS,11R,11bR)-1-(methoxymethyl)-9a,11b-dimethyl-3,6,9- trioxo-1,6,6b,7,8,9,9a,10,11,11b-decahydro-3H-furo[4,3,2-de]indeno[4,5- h]isochromen-11-yl acetate PDB residue KWT

EMBL-EBI Formula-fragment expression search Formula expression Example: Search for ligands with more than 10 oxygens no nitrogens and sulphurs Fragment expression Example: Search for ligands with furan rings but not any saturated carbon rings (cyclobutane,cyclopropane,cyclohexane)