Klaus Gubernator, Craig James, e Molecules Inc. ACS 232nd National Meeting Division of Chemical Information San Francisco, September 14, 2006 Chemical.

Slides:



Advertisements
Similar presentations
SOMA2 – Drug Design Environment. Drug design environment – SOMA2 The SOMA2 project Tekes (National Technology Agency of Finland) DRUG2000 program.
Advertisements

Confluence Wiki Implementation? 14 June Agenda What? Why? Wow! How? When? 2.
Accessing electronic journals from off- campus This causes lots of headaches, but dont despair, heres how to do it! (Please note – this presentation is.
Databases vs the Internet Coconino Community College Revised August 2010.
Open 24 hours a day 7 days a week The Library. Your subject librarian…. Linda Humphreys x5248 Library 4.04.
Getting started with ENDNOTE Compiled by Helene van der Sandt.
The Deep Web & A Quick Review of CCOL James Backer, Ph.D. Cambridge College.
Schaffer Library of Health Sciences E-Journals Troubleshooting Guide Reference: Circulation: Click here to begin.
Publishing Scientific Data for Electronic Books: Challenges and Opportunities Expanding the Accessibility of Critically Evaluated Data Dr. Joan Fuller,
ACS PUBLICATIONS An Overview of Products & Services A C S P U B L I C A T I O N S H I G H Q U A L I T Y. H I G H I M P A C T.
NATIONAL LIBRARY OF MEDICINE The PubMed ID and Entrez, PubMed and PubMed Central Edwin Sequeira National Center for Biotechnology Information June 21,
Information Literacy in a Rapidly Evolving Educational and Research Environment The Hybrid Research Environment Prof. Monica Berger June 8, 2005.
Eirplay (c) 2009 Web 2.0 and Games The contents of this plan are confidential and are not to be reproduced with express written consent.
Oxford Scholarship Online Lund Online, 18 th October 2007, Lund Sweden Adina Teusan, OUP Online Sales Executive – Scandinavia.
Tunable Machine Vision-Based Strategy for Automated Annotation of Chemical Database ChemReader Jungkap Park, Gus R. Rosania, and Kazuhiro Saitou University.
Lund Online 07/10/2009 Ingolf Kaspar, Regional Sales Manager EBSCO Publishing.
3. Chemical Data and Data Bases. 2 Datasets and Databases Many small datasets are available Several commercial databases of compounds and reactions (e.g.
Hudson Valley Community College Marvin Library GOOGLE SCHOLAR
Internet Research Search Engines & Subject Directories.
Nijhoff Online resources in International Law Seminar on International Law Resources for Science, Education and Business ICSTI and Swets Information Services.
PubMed/History; Accessing Full-Text Articles (module 4.4)
SFU Library services, resources, and research tips for SIAT researchers (or: How libraries are still useful in the age of the Digital Revolution and Breaking.
ETD Repositories Using DSpace Software Andrew Penman The Robert Gordon University 27 th September 2004.
Internet Marketing. What is Marketing? n The strategies and actions firms take to establish a relationship with a consumer and encourage purchases of.
E-Commerce. What is E-Commerce Industry Canada version Commercial activity conducted over networks linking electronic devices (usually computers.) Simple.
Getting started on informaworld™ How do I register with informaworld™? What do I do if I forget my password? My institution does not subscribe to any journals,
Rich Foley - Executive Vice President Academic & Public Markets Helen Wilbur - Vice President Consortia Sales & Marketing Digital ArchivesResearch CollectionseBooks.
Getting started on informaworld™ How do I register my institution with informaworld™? How is my institution’s online access activated? What do I do if.
Module 3: Business Information Systems Chapter 8: Electronic and Mobile Commerce.
Cambridge Journals Online (CJO). CJO – E Publishing Service Content Delivery Site Administration Online Production Online Marketing and Promotion Customer.
English 115 GoogleScholar/ OneSearch Hudson Valley Community College Marvin Library Learning Commons 1.
© Vision ICT Ltd Working in Partnership Vision ICT Ltd is working in partnership with the Society of Local Council Clerks to support Local Councils.
GeNii New Contents Services of NII
A Survey of Patent Search Engine Software Jennifer Lewis April 24, 2007 CSE 8337.
ACS PUBLICATIONS Over a Century of Essential Chemistry on Your Desktop H I G H Q U A L I T Y. H I G H I M P A C T. A C S P U B L I C A T I O N S Robert.
WorldCat Local & World Cat Quick Start a new way to search your library’s resources and the world beyond.
Affordable Sources for Property Data Authors: Carrie Newsom Susan K. Cardinal Last Updated: 3/20/07 A product of the ACS CINF Education Committee.
E-Books at the ETH Zurich Eidgenössische Technische Hochschule Marseille 2nd May 2005Ann McLuckie.
Copyright © 2010 Pearson Education, Inc.Copyright © 2007 Pearson Education, Inc. Slide 1-1 ELC 200 Day 15.
ICOLC Las Vegas March 28, 2003 TDNet E-Management Services for Consortia From E-Journals to E-Resources Michael Markwith President, TDNet Inc.
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Nerinx Hall Library Ebscohost. Library Website—choose online resources.
Wafaa abu zayed E-Business Instructed: safaa Dallual.
1 ENG 350 Children’s Literature A Guide to Resources at the Dick Smith Library.
Some Free Resources on the Internet for Introducing Chemical Information Steven P. Wathen Siena Heights University Adrian, Michigan.
Ordnett and Store norske leksikon IGEP
Welcoming Remarks – and a Very Brief History of U.S. Govt. Chemical Databases and Open Chemistry Marc C. Nicklaus Computer-Aided Drug Design Group Chemical.
RSC Publishing Platform Amanda Sun
Gold Rush Electronic Resource Discovery and Management System George Machovec Colorado Alliance of Research Libraries
Daniel Boivin OCLC Canada OCLC and Access98. AgendaAgenda n What’s new with FirstSearch 4.0 n New FirstSearch or FirstSearch 5.0.
Nikola Tesla Museum Clipping Library Saša Malkov Nenad Mitić Žarko Mijajlović 3 rd SEEDI Int.Conf. Cetinje, Montenegro 14. September 2007.
Forms 5.02 Understand database queries, forms, and reports.
E-resource with no wall and no firewalls Dr.H.S.Siddamallaiah Principal Library and Information officer (Rtd) NIMHANS, Bangalore.
Successful Web searches!. If you type your keywords into Google, you’ll get millions of hits! Is that useful?
Roger Mills February don’t be evil stand on the shoulders of giants.
Open Access & Researcher Support UWTSD Partnership Librarians Conference 5 th May 2016.
Indiana University School of Indiana University ECCR Summary Infrastructure: Cheminformatics web service infrastructure made available as a community resource.
Chapter 9.  Personal Knowledge & Experience  Select familiar topics ▪ Personal knowledge is good support ▪ Examples, illustrations, explanations ▪ From.
Tech Talk: Google Scholar Wednesday, September 25, 2013 MIGUEL 307 Bernadette López-Fitzsimmons Information Services Librarian.
The Sci-tech Information Industry: Changes, Challenges and the CAS Response Presented by Michael Walsh Manager of International Marketing Operations Chemical.
General & Background InformationPractical & Useful DataDetailed, Original Research Encyclopedias Dictionaries Reference Texts Books Safety Information.
Summon - HINARI Search (Module 3)
E-Commerce Lecture 8.
Marvin Library Web Page
Search Engines & Subject Directories
Online Database Searching
Mobilizing EPA’s CompTox Chemistry Dashboard Data on Mobile Devices
Search Engines & Subject Directories
Search Engines & Subject Directories
5.02 Understand database queries, forms, and reports.
Presentation transcript:

Klaus Gubernator, Craig James, e Molecules Inc. ACS 232nd National Meeting Division of Chemical Information San Francisco, September 14, 2006 Chemical Structure Search Engines in Cyberspace

 The web has revolutionized the way we retrieve information  Chemistry is a late participant in this revolution Chemistry on the Internet

NN Search Google Images for “Aspirin”

Acta Cryst. (1975). B31, The crystal structure of 7-amino-2H,4H-vic-triazolo[4,5-c]-1,2,6- thiadiazine 1,1-dioxide (ATT) C. Foces-FocesC. Foces-Foces, F. H. Cano and S. García-BlancoF. H. CanoS. García-Blanco Buy online You may purchase this article in PDF and/or HTML formats. For purchasers in the UK, and for purchasers elsewhere in the European Community who do not have a VAT number, VAT will be added to the price of the article. Format* PDF (US $40, plus US $7 for EC purchases) Structure of “triazolo thiadiazine”

Datasets (which are, in contrast to other dataset lists, available in a structural format) This list will be expanded continuously. Please don't hesitate to make published datasets publicly available here. Currently available: 44 Datasets Note: The Briem/Lessel and Hert/Willett Dataset are only available as MDDR ID's due to license reasons. Please contact MDL for further information on the database. The datasets have nonethless been included here because they are standard datasets for similarity searching. – Andreas BenderMDL  Binary (active/inactive) datasets Binary (active/inactive) datasets  QSAR datasets QSAR datasets  QSPR datasets QSPR datasets  Toxicity datasets Toxicity datasets  Metabolism datasets Metabolism datasets  Permeability datasets Permeability datasets  Docking datasets Docking datasets  Mechanistic datasets Mechanistic datasets  Mixed/Other datasets Mixed/Other datasets Cheminformatics.org

CS(=O)(=O)Nc1ccc(cc1OC2CCCCC2)N(=O)=O CS(=O)(=O)Nc1cc2CCC(=O)c2cc1Oc3ccc(F)cc3F CS(=O)(=O)Nc1cc2CCC(=O)c2cc1Sc3ccc(F)cc3F CS(=O)(=O)Nc1ccc(cc1Sc2ccc(F)cc2F)C(=O)N CS(=O)(=O)Nc1ccc(cc1Sc2ccc(Cl)cc2Cl)S(=O)(=O)N COc1ccc(cc1)c2sc(nc2c3ccc(cc3)S(=O)(=O)C)c4ccccc4Cl COc1ccc(cc1)c2sc(nc2c3ccc(SC)cc3)c4ccccc4Cl CS(=O)(=O)c1ccc(cc1)n2nc(cc2c3ccc(F)cc3)C(F)(F)F CS(=O)(=O)c1ccc(cc1)n2nc(cc2c3ccc(Br)cc3)C(F)(F)F Cc1ccc(cc1)c2cc(nn2c3ccc(cc3)S(=O)(=O)N)C(F)(F)F CS(=O)(=O)c1ccc(cc1)c2snnc2c3ccc(F)cc3 CC(=O)c1nc(c(o1)c2ccc(c(F)c2)S(=O)(=O)N)c3ccccc3 Cc1nc(C2CCCCC2)c(o1)c3ccc(c(F)c3)S(=O)(=O)N CS(=O)(=O)c1ccc(cc1)c2[nH]c(nc2C3CCCCC3)C(F)(F)F CS(=O)(=O)c1ccc(cc1)c2[nH]c(nc2c3ccc(F)cc3)C(F)(F)F CS(=O)(=O)c1ccc(cc1)C2=C(C(=O)OC32CC3)c4ccccc4 CS(=O)(=O)c1ccc(cc1)C2=C(C(=O)OC32CCCC3)c4ccccc4 CS(=O)(=O)c1ccc(cc1)c2cnn(Cc3ccccc3)c(=O)c2c4ccccc4 CS(=O)(=O)c1ccc(cc1)c2nn(Cc3ccccc3)c(c2c4ccc(F)cc4)C(F)(F)F NS(=O)(=O)c1ccc(cc1)c2c(CO)onc2c3ccccc3 CS(=O)(=O)c1ccc(cc1)c2cc(Cl)nn2c3ccc(F)cc3 NS(=O)(=O)c1ccc(cc1)c2cc(nn2c3ccc(F)cc3)C(F)(F)F NS(=O)(=O)c1ccc(cc1)n2nc(cc2c3nc4cccc(F)c4s3)C(F)F Stahl dataset

Unnamed -MTS D V S C C C C N C O C O O C C … M END > $$$$ Yokoyama dataset

Search Genbank for “aattccgg”

C

C

Why is so little chemistry on the web?  Tradition?  Strong providers of subscription services?  Searching for chemical structures is significantly more difficult than text searching?  Chemical identifiers are not standardized?

Open Access Chemical Search Engines PubChem - NIH ChemBank – Harvard ZINC – UCSF ChemDB – UC Irvine ChemExper - Lausanne ChemFinder – CambridgeSoft

www. e molecules.com  New Chemistry Search Engine  A large database of publicly available molecular structures  Launched November 2005  50,000 searches per month, rapidly growing

www. e molecules.com Free chemistry search site for publicly available chemical information

Advanced Search Powerful features:  hit list management  union, intersect, subtract, difference  manual selection  export lists in many formats  persistent hitlists

T O

Content: 16M entries, 5.6M structures Academic and government databases  NIST WebBook  DrugBank  Protein Ligands Chemical suppliers  150 electronic catalogs included Future goal  All publicly available chemical information

Why is it so fast?  Novel chemical search engine technology  Method represents a major departure from previously known algorithms - Molecular keys (MDL) - Fingerprints (Daylight) - Feature Trees (BioSolv)

Search engine technology  Analyze each molecule for distinguishing structural features  Generate all features algorithmically  Normalize features and use them for indexing  Result: very fast searches

Who is e Molecules? Klaus Gubernator Craig A. James Rashmi Mistry

Summary  Free for depositors and users  Very fast search engine  High quality user interface  Rich functionality  Complementary with other engines

Contact Information Klaus Gubernator Skype: emolecules