SMILES. Simplified molecular input line entry specification The simplified molecular input line entry specification or SMILES is a specification for unambiguously.

Slides:



Advertisements
Similar presentations
How to predict a Product
Advertisements

SMILES Simplified Molecular Input Line Entry System (SMILES) Widely used AND computationally efficient Uses atomic symbols and a set of intuitive rules.
 Single Replacement Reactions + + . General Equation A + BX  AX + B.
(R) and (S) Configuration Both enantiomers of alanine receive the same name in the IUPAC system: 2-aminopropanoic acid. Only one enantiomer is biologically.
IONS AND IONIC COMPOUNDS MONATOMIC IONS are atoms with a positive or negative charge.MONATOMIC IONS are atoms with a positive or negative charge. Taking.
Chapter 9 “Chemical Names and Formulas”
Chapter 3 Ionic Compounds.
Writing Ionic Formulas Chemical Formulas from Names & Names from Chemical Formulas.
Chapter 3 The Composition of Molecules Department of Chemistry and Biochemistry Seton Hall University.
1 mass p ≈ mass n ≈ 1840 x mass e -. 2 Atomic number (Z) = number of protons in nucleus Mass number (A) = number of protons + number of neutrons = atomic.
Chemical Communication Nomenclature Edward A. Mottel Department of Chemistry Rose-Hulman Institute of Technology.
Chemistry 201 Edward A. Mottel Department of Chemistry Rose-Hulman Institute of Technology.
1 Double Bond: C 2 H 4 An sp 2 hybridized C atom has one electron in each of the three sp 2 lobes Top view of the sp 2 hybrid Side view of the sp 2 hybrid.
Lecture 10 Cheminformatics of TCM Y.Z. Chen Department of Pharmacy National University of Singapore Tel: ; ; Web:
Naming Ionic Compounds
Names and Formulas For Ionic Compounds
Chapter 4 Compounds and Their Bonds
Oxidation Numbers (Ox #’s) What are they used for? Why do you need to learn them? to write chemical names and chemical formulas to balance redox equations.
Binary Compounds Metals (fixed oxidation) + Nonmetals Objectives:
Naming Ions 9.1 and 9.2. Review: cations and anions Ions are formed by the gain or loss of electrons Ions are formed by the gain or loss of electrons.
Atoms, Molecules ..
Oxidation: any process by which an entity loses electrons. e.g.2Mg o + O 2 o 2Mg 2+ + O 2- H 2 o + F 2 o 2H + F - Mg and H are oxidised in these examples.
Bell Work Write the electron configuration for an element with an atomic number of 23. Then, draw the electrons in their proper orbitals.
Ionic Compounds Nomenclature. Introduction We use the periodic table to tell us the most commonly expected charges for the ions of elements. For example:
Ions Wednesday January 8, 2014
1 Cheminformatics David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Reactions Reference. Solubility Rules 1.All nitrates, acetates, and chlorates are soluble. 2.All chlorides, bromides, and iodides are soluble except for.
Ionic Bonding (Part IV)
Ionic Compound Names and Formulas. Monovalent Ionic Binary Compounds “+” means lose “-” means gain The number (+1,+2,+3)represents the number of electrons.
Breakthroughs in Organizing Elements Odling: elements can be organized into 13 groups based on physical & chemical properties Pretty close to modern groups.
Chapter 1 Structure and Bonding.
Ions. Remember…  Atomic Number is the number of protons in an atom.  The number of protons equal the number of electrons in a neutral atom.  Atomic.
9.1 Naming Ions > 1 Copyright © Pearson Education, Inc., or its affiliates. All Rights Reserved. Chapter 9 Chemical Names and Formulas 9.1 Naming Ions.
Part I: Describing Chemical Reactions chemical reaction the process by which one or more substances are changed into one or more different substances.
Cyclic Structures Numbers indicate start and stop of ring Same number indicates start and end of the ring, entered immediately following the start/end.
6-1: Ionic Bonding 6-2: Covalent Bonding 6-3: Naming Formulas and Writing Compounds.
1 Elements each element has a unique number of protons in its nucleus the number of protons define the element the number of protons in the nucleus of.
Sec. 7.1 & 9.1: Formation and Naming of Ions Valence Electrons The electrons responsible for the chemical properties of atoms, and are those in the outer.
CHEMICAL SYMBOLS, FORMULAS, AND EQUATIONS Looking at water as a molecule made up of atoms, the building blocks of matter.
Ionic Bonding (Part 3) The Last Step: Adding Polyatomic Ions.
Chemical Names and Formulas
Atoms, Elements and the Periodic table
Drawing Lewis Dot Diagrams
Ionic Compounds.
9/1/11 As you come in… Turn in questions on page 50. Have out B.4 Cornell notes. Pick up 2 handouts. Objectives I can define an ion and the types of.
1.3 Ions and Octet Rule.
3.00_Chemistry of Biology Biology I (Ions)
Chemical Names and Formulas
Naming Ionic Compounds
Aluminum Ion CrO42- Ammonium ion Bromide ion Ba+2 Carbonate ion Cr+2
Binary Ionic Compounds Formula to Name
What elements form ionic compounds? How is an ionic bond formed?
Single Replacement Reactions
Predicting Reactions.
Ionic Compounds.
Putting Atoms Together (7.1)
Chapter 7 Chemical Formulas & Chemical Compounds
Daylight and Discovery
Putting Atoms Together (7.1)
3.2 Names and Formulas of Ionic Compounds
How to Name Chemical Compounds
Drill: Ionic bonding Objective:
Bonding theories.
Formulas.
Names & Formulas of Ionic Compounds
Names and Formulas of Compounds
We will begin Chapter 9 today after a review of shapes and polarity.
Physical Inorganic Chemistry : THE STRENGTH OF ACIDS & BASES
Names & Formulas of Ionic Compounds
Ionic Bonding Ions are charged atoms. This is when an atom has gained or lost an electron. An atom’s ion is based on their outer electron shell. The number.
Presentation transcript:

SMILES

Simplified molecular input line entry specification The simplified molecular input line entry specification or SMILES is a specification for unambiguously describing the structure of chemical molecules using short ASCII stringschemicalmoleculesASCIIstrings SMILES strings can be imported by most molecule editors for conversion back into two-dimensional drawings or three-dimensional models of the moleculesmolecule editorstwo-dimensional three-dimensional

SMILES Simplified Molecular Input Line Entry System (SMILES) Widely used AND computationally efficient Uses atomic symbols and a set of intuitive rules Uses hydrogen-suppressed molecular graphs (HSMG)

Canonical SMILES and Isomeric SMILES The term Canonical SMILES refers to the version of the SMILES specification that includes rules for ensuring that each distinct chemical molecule has a single unique SMILES representation – A common application of Canonical SMILES is for indexing and ensuring uniqueness of molecules in a databasedatabase The term Isomeric SMILES refers to the version of the SMILES specification that includes extensions to support the specification of isotopes, chirality, and configuration about double bonds isotopeschirality – A notable feature of these rules is that they allow rigorous partial specification of chirality.

Graph-based definition In terms of a graph-based computational procedure, SMILES is a string obtained by printing the symbol nodes encountered in a depth- first tree traversal of a chemical graph The chemical graph is first trimmed to remove hydrogen atoms and cycles are broken to turn it into a spanning tree Where cycles have been broken, numeric suffix labels are included to indicate the connected nodes Parentheses are used to indicate points of branching on the tree

SMILES Bonds SINGLE* DOUBLE TRIPLE AROMATIC* * can be omitted -=#:-=#:

SMILES Branches Represented by enclosure in parentheses Can be nested or stacked Examples: CC(O)CC is 2-Butanol OCC(C)C is iso-Butanol OC(C)(C)C is tert-Butanol

SMILES Bonds Ethene Chloroethene 1,1-Dichloroethene cis-1,2-Dichloroethene Trichloroethene Perchloroethene C=C ClC=C ClC(Cl)=C ClC=CCl ClC(Cl)=CCl ClC(Cl)=C(Cl)Cl

SMILES Symbols String of alphanumeric characters and certain punctuation symbols Terminates at the first space encountered when read left to right The ORGANIC SUBSET: B, C, N, O, P, S, F, Cl, Br, I

Other SMILES Atoms Aliphatic or nonaromatic carbon: C Atom in aromatic ring: lowercase letter Designate ring closure with pairs of matching digits, e.g. c1ccccc1 is Benzene, whereas C1CCCCC1 is Cyclohexane

SMILES Charges Specify attached hydrogens and charges in square brackets Number of attached hydrogens is the symbol H followed by optional digit

SMILES Charges [H+] [OH-] [OH3+] [Fe++] [NH4+] proton hydroxyl anion hydronium cation iron(II) cation ammonium cation

SMILES Cyclic Structures Break one single or one aromatic bond in each ring Number in any order – Designate ring-breaking atoms by the same digit following the atomic symbol

Cyclic Structures Numbers indicate start and stop of ring Same number indicates start and end of the ring, entered immediately following the start/end atoms Only numbers 1 – 9 are used A number should appear only twice Atom can be associated w. 2 consecutive numbers, e.g., Napthalene: c12ccccc1cccc2

SMILES Conventions Avoid two consecutive left parentheses if possible Strive for the fewest number of possible branches Tautomeric bonds are not designated; enter the appropriate form

Further Restrictions A branch cannot begin a SMILES notation A branch cannot immediately follow a double- or triple-bond symbol Example: C=(CC)C is invalid, but C(=CC)C or C(CC)=C are valid SMILES

SMILES Fragments Nitro Nitrate Nitrite Sulfonic acid Cyanide/Nitrile Azide Azido N(=O)(=O) ON(=O)(=O) ON(=O) S(=O)(=O)O C#N N=N#N N+=N-

SMILES Metals [Al] [As] [Au] [Be] [Bi] [Cd] [Ca] [Fe] [Hg] [K] [Li] [Mg] [Na] [Ni] [Pt] [Sb] [Sn] [Zn] [Zr]

Disconnected Structures Tetramethyl ammonium bromide C[N+]C(C)C.[Br-]

Isomeric and Chiral SMILES Isomeric configuration indicated by forward and backward slashes: / \ Examples: – trans-1,2-dibromoethene: Br/C=C/Br – cis-1,2-dibromoethene: Br/C=C\Br Chirality indicated by the symbol

Another Application SMILESCAS Database Over 103,000 SMILES notations Input CAS Registry Number Leads to SMILES and thence to a structure search

Example 1 CC(C(C)(C)(Br))C

Example 2

Example 3

Example 4

Example 5

Example 6

Example 7

Example 8

Example 9

Example 10

Example 11

Example 12

Example 13

Example 14

Example 15

Example 16