Molecular Descriptors

Slides:



Advertisements
Similar presentations
ChemAxon in 3D Gábor Imre, Adrián Kalászi and Miklós Vargyas Solutions for Cheminformatics.
Advertisements

Analysis of High-Throughput Screening Data C371 Fall 2004.
1 Sequential Screening S. Stanley Young NISS HTS Workshop October 25, 2002.
3D Molecular Structures C371 Fall Morgan Algorithm (Leach & Gillet, p. 8)
PharmaMiner: Geometric Mining of Pharmacophores 1.
Introduction to Chemical Bonding
Fast Descriptor Calculation for Combinatorial Libraries Geoff Downs & John Barnard Sheffield, UK.
Lipinski’s rule of five
CHAPTER 4 CARBON AND THE MOLECULAR DIVERSITY OF LIFE Section A: The Importance of Carbon 1.Organic chemistry is the study of carbon compounds 2.Carbon.
1 PharmID: A New Algorithm for Pharmacophore Identification Stan Young Jun Feng and Ashish Sanil NISSMPDM 3 June 2005.
Cheminformatics II Apr 2010 Postgrad course on Comp Chem Noel M. O’Boyle.
Identifying Patterns in Road Networks Topographic Data and Maps Henri Lahtinen Arto Majoinen.
Basic Steps of QSAR/QSPR Investigations
Lectures Molecular Bonding Theories 1) Lewis structures and octet rule
Quantitative Structure-Activity Relationships (QSAR) Comparative Molecular Field Analysis (CoMFA) Gijs Schaftenaar.
Regionalized Variables take on values according to spatial location. Given: Where: A “structural” coarse scale forcing or trend A random” Local spatial.
Bioinformatics IV Quantitative Structure-Activity Relationships (QSAR) and Comparative Molecular Field Analysis (CoMFA) Martin Ott.
Lecture 10 Cheminformatics of TCM Y.Z. Chen Department of Pharmacy National University of Singapore Tel: ; ; Web:
Bonding and Isomerism Nanoplasmonic Research Group Organic Chemistry Chapter 1.
Molecular Modeling: Statistical Analysis of Complex Data C372 Dr. Kelsey Forsythe.
1 Chemical Structure Representation and Search Systems Lecture 6. Nov 18, 2003 John Barnard Barnard Chemical Information Ltd Chemical Informatics Software.
Graph theory as a method of improving chemistry and mathematics curricula Franka M. Brückler, Dept. of Mathematics, University of Zagreb (Croatia) Vladimir.
Quantitative Structure-Activity Relationships (QSAR)  Attempts to identify and quantitate physicochemical properties of a drug in relation to its biological.
Lecture 7: Computer aided drug design: Statistical approach. Lecture 7: Computer aided drug design: Statistical approach. Chen Yu Zong Department of Computational.
QSAR Qualitative Structure-Activity Relationships Can one predict activity (or properties in QSPR) simply on the basis of knowledge of the structure of.
Pharmacophore and FTrees
Functional groups / Pharmacological Activity
NUS CS5247 A dimensionality reduction approach to modeling protein flexibility By, By Miguel L. Teodoro, George N. Phillips J* and Lydia E. Kavraki Rice.
Unit 6 Notes – Part II Mr Nelson Bonding & Lone Pairs Electron pairs that are shared are called bonding pairs Electron pairs that are not bonded.
Bonding IB Chemistry 2 Robinson High School Andrea Carver.
Similarity Methods C371 Fall 2004.
Surveillance monitoring Operational and investigative monitoring Chemical fate fugacity model QSAR Select substance Are physical data and toxicity information.
Introduction to Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Roberto Todeschini Viviana Consonni Manuela Pavan Andrea Mauri Davide Ballabio Alberto Manganaro chemometrics molecular descriptors QSAR multicriteria.
Texture. Texture is an innate property of all surfaces (clouds, trees, bricks, hair etc…). It refers to visual patterns of homogeneity and does not result.
C ARBON AND THE M OLECULAR D IVERSITY OF L IFE Chapter 4 Biology – Campbell Reece.
“Topological Index Calculator” A JavaScript application to introduce quantitative structure-property relationships (QSPR) in undergraduate organic chemistry.
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
3D- QSAR. QSAR A QSAR is a mathematical relationship between a biological activity of a molecular system and its physicochemical parameters. QSAR attempts.
Ligand-based drug discovery No a priori knowledge of the receptor What information can we get from a few active compounds.
Paola Gramatica, Elena Bonfanti, Manuela Pavan and Federica Consolaro QSAR Research Unit, Department of Structural and Functional Biology, University of.
June 9, 2009 – Class 35 and 36 Overview
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
Honors Biology Unit 2: Biochemistry Monkemeier.  Organic compounds contain carbon and hydrogen and can exist as solids, liquids or gases.  Scientists.
Selecting Diverse Sets of Compounds C371 Fall 2004.
Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013.
FlexWeb Nassim Sohaee. FlexWeb 2 Proteins The ability of proteins to change their conformation is important to their function as biological machines.
Use of Machine Learning in Chemoinformatics
Bioinformatics in Drug Design and Discovery Unit 2.
Part 2. Physicochemical Properties 1.Rules ( 양혜란 ) 2.Liphophilicity ( 백아름 ) 3.pKa ( 박숙진 ) 4.Solubility ( 전종수, 최영재 ) 5.Permeability ( 김소연, 강경태 )
Computational Approach for Combinatorial Library Design Journal club-1 Sushil Kumar Singh IBAB, Bangalore.
Hybridization Combination of atomic orbitals to form hybrid orbitals OR Hybridization is also way to explain molecular shapes that can't be explained easily.
Chapter 8 “Covalent Bonding” Part 3 Ball-and-stick model.
Bell Work  Bonding / Chap 8 Reading Is Due Today!  Pick Up Note Sheet  Pick Up & Complete Bell Work.
Organic Chemistry The magic of the carbon atom. Organic Chemistry Objectives Bonding of the carbon atom.
Lipinski’s rule of five
Chemical Bonding II: VESPR
CARBON AND MOLECULAR DIVERSITY
Benzene C6H6 This is a very important example in organic chemistry C C C C C C.
Building Hypotheses and Searching Databases
Daylight and Discovery
Virtual Screening.
Carbon and the Molecular Diversity of Life
CZ3253: Computer Aided Drug design Lecture 4: Structural modeling of chemical molecules Prof. Chen Yu Zong Tel:
Topological Index Calculator III
Chemical bonding II UNIT 8
Carbon and the Molecular Diversity of Life
Carbon and the Molecular Diversity of Life
Unit 15 Molecular Compounds
Presentation transcript:

Molecular Descriptors C371 Fall 2004

INTRODUCTION Molecular descriptors are numerical values that characterize properties of molecules Examples: Physicochemical properties (empirical) Values from algorithms, such as 2D fingerprints Vary in complexity of encoded information and in compute time

Descriptors for Large Data Sets Descriptors representing properties of complete molecules Examples: LogP, Molar Refractivity Descriptors calculated from 2D graphs Examples: Topological Indexes, 2D fingerprints Descriptors requiring 3D representations Example: Pharmacophore descriptors

DESCRIPTORS CALCULATED FROM 2D STRUCTURES Simple counts of features Lipinski Rule of Five (H bonds, MW, etc.) Number of ring systems Number of rotatable bonds Not likely to discriminate sufficiently when used alone Combined with other descriptors for best effect

Physicochemical Properties Hydrophobicity LogP – the logarithm of the partition coefficient between n-octanol and water ClogP (Leo and Hansch) – based on small set of values from a small set of simple molecules BioByte: http://www.biobyte.com/ Daylight’s MedChem Help page http://www.daylight.com/dayhtml/databases/medchem/medchem-help.html Isolating carbon: one not doubly or triply bonded to a heteroatom

ACD Labs Calculated Properties http://www.acdlabs.com ACD Labs values now incorporated into the CAS Registry File for millions of compounds I-Lab: http://ilab.acdlabs.com/ Name generation NMR prediction Physical property prediction

Molar Refractivity MR = n2 – 1 MW -------- ----- n2 + 2 d where n is the refractive index, d is density, and MW is molecular weight. Measures the steric bulk of a molecule.

Topological Indexes Single-valued descriptors calculated from the 2D graph of the molecule Characterize structures according to size, degree of branching, and overall shape Example: Wiener Index – counts the number of bonds between pairs of atoms and sums the distances between all pairs

Topological Indexes: Others Molecular Connectivity Indexes Randić (et al.) branching index Defines a “degree” of an atom as the number of adjacent non-hydrogen atoms Bond connectivity value is the reciprocal of the square root of the product of the degree of the two atoms in the bond. Branching index is the sum of the bond connectivities over all bonds in the molecule. Chi indexes – introduces valence values to encode sigma, pi, and lone pair electrons

Kappa Shape Indexes Characterize aspects of molecular shape Compare the molecule with the “extreme shapes” possible for that number of atoms Range from linear molecules to completely connected graph

2D Fingerprints Two types: One based on a fragment dictionary Each bit position corresponds to a specific substructure fragment Fragments that occur infrequently may be more useful Another based on hashed methods Not dependent on a pre-defined dictionary Any fragment can be encoded Originally designed for substructure searching, not for molecular descriptors

Atom-Pair Descriptors Encode all pairs of atoms in a molecule Include the length of the shortest bond-by-bond path between them Elemental type plus the number of non-hydrogen atoms and the number of π-bonding electrons

BCUT Descriptors Designed to encode atomic properties that govern intermolecular interactions Used in diversity analysis Encode atomic charge, atomic polarizability, and atomic hydrogen bonding ability

DESCRIPTORS BASED ON 3D REPRESENTATIONS Require the generation of 3D conformations Can be computationally time consuming with large data sets Usually must take into account conformational flexibility 3D fragment screens encode spatial relationships between atoms, ring centroids, and planes

Pharmacophore Keys & Other 3D Descriptors Based on atoms or substructures thought to be relevant for receptor binding Typically include hydrogen bond donors and acceptors, charged centers, aromatic ring centers and hydrophobic centers Others: 3D topographical indexes, geometric atom pairs, quantum mechanical calculations for HUMO and LUMO

DATA VERIFICATION AND MANIPULATION Data spread and distribution Coefficient of variation (standard deviation divided by the mean) Scaling (standardization): making sure that each descriptor has an equal chance of contributing to the overall analysis Correlations Reducing the dimensionality of a data set: Principal Components Analysis