Name to structure, Structure to name, chemicalize.org Daniel Bonniot Solutions for Cheminformatics.

Slides:



Advertisements
Similar presentations
February 2013 Szilárd Dóránt Scientific & technical Presentation Pipeline Pilot Integration.
Advertisements

Microfilm CAR, Film and Paper Scanners OCR Image Indexing VersaVIEW VersaIMAGE GOLD Twain Paper Scanners Automated Microfilm Readers TWAIN & ISIS Microfilm.
Don’t Type it! OCR it! How to use an online OCR..
Version 5.3, February 2010 Scientific & technical presentation JChem Base.
Scientific & technical presentation JChem Cartridge for Oracle
Scientific & technical presentation Calculator Plugins January 2011.
Instant JChem INFORMATICS MATTERS
Java Solutions for Cheminformatics Feb 2008 Whats new for PP.
Calculator Plugins József Szegezdi, Nóra Máté. ChemAxon Calculator Plugins ChemAxons plugin handling mechanism provides a framework for calculating various.
JChem Web Services Server Jonathan Lee Solutions for Cheminformatics Technical Product Presentation.
Chemical Naming Daniel Bonniot, PhD October 2008.
Nov 2008 Scientific & technical presentation JChem for Excel.
Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics.
Solutions for Cheminformatics
Interfacing the JChem Suite outside of Java Jonathan Lee Solutions for Cheminformatics.
Instant JChem - current status and what's coming soon. Tim Dudgeon Solutions for Cheminformatics.
Java Solutions for Cheminformatics Structure based predictions – new plugins Zsolt Mohácsi, Nóra Máté, József Szegezdi, Ödön Farkas, Gábor Imre, Imre Jákli.
1 György Pirok, Szilárd Dóránt May, 2005 What is Marvin and how to...
Solutions for Cheminformatics Marvin features and news Akos Papp.
June, 2007 Petr Hamernik Extending Instant JChem 2.0 Architecture & API.
June, 2007 Daniel Bonniot IUPAC Naming. Available in Marvin since (April 2007) Present several ways to use it Evaluation Work in progress.
Name to structure, Structure to name, chemicalize.org Daniel Bonniot de Ruisselet Solutions for Cheminformatics.
2008 Accelrys EUGM Pipelining ChemAxon Szilard Dorant Solutions for Cheminformatics.
Physical Reference Data of the NIST Physics Laboratory Presentation for the DASER Symposium Digital Archives for Science & Engineering Research Saturday,
Publishing Process An attempt -- GWB.
Using the free reference management tool Zotero
Visit the ccScan Website Scan, Import, and Automatically File documents to the Cloud SCAN, IMPORT, AND AUTOMATICALLY FILE DOCUMENTS TO SALESFORCE ® Introduction.
1 Scanshell.Net CSSN – Card Scanning Solutions THE ULTIMATE, ALL-IN-ONE CARD-SCANNING SOLUTION.
Multi-Media Handbook The software program that efficiently manages all types of information.
HalFILE 2.1 New Application Features Session I-a.
WEB SERVICES. FIRST AND FOREMOST - LINKS Tomcat AXIS2 -
Microsoft Word By: Phuong Nguyen.
1 How Do I Order From.decimal? Rev 05/04/09 This instructional training document may be updated at anytime. Please visit and check the.
© Paradigm Publishing, Inc Access 2010 Level 2 Unit 2Advanced Reports, Access Tools, and Customizing Access Chapter 8Integrating Access Data.
AIMSweb Progress Monitor Online User Training
EndNote. What is EndNote:  EndNote is referencing software that enables you to create a database of references from your readings. Your database of references.
Getting Started with EndNote X4 Dr. Christiane Holtz, 28.September
XHTML Basics.
Premier Director Document Imaging
Internet Research Techniques Graham Seibert Copyright 2006 This is a segment of the draft version of a large syllabus. I need your feedback to improve.
Introducing new web content management tools for Priority...
AN OVERVIEW OF MAC PDF TOOLS 1. PDF Tools for Mac PDF files can be used either in Windows, Unix or Apple’s Mac OS operating system commonly. It still.
Advanced Workgroup System. RED Advanced Workgroup Systems: Scan Features Copy Print Scan DNSG Software Our Customers Documents Our Customers Documents.
Creating Accessible PDF’s in Adobe Acrobat Professional 7.0.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Inking. 2 Pen Basics You MUST keep your pen tethered at all times. If you lose the stylus, the replacement cost is $30. Buttons should face YOU in garage.
Max Planck Institute for Psycholinguistics Tool development report H. Brugman MPI Nijmegen.
The most powerful high-speed scanning, indexing and OCR solution on the market Supports many high speed scanners: Fujitsu, Canon, Kodak, Epson, Avision,
Confidential, I.R.I.S. © 2005, All rights reserved Discover… The most robust solution to structure, index, compress and convert all your documents into.
May 2009 ChemAxon - What’s New?. What’s new and hot? All products have seen enhancements in the past 12 months BUT WHAT’S REALLY HOT?
Enricher Converter Analyzer Parser & Renderer UNIVERSAL, FAST AND RELIABLE.
Managing References Using the free reference management tool Zotero.
Lesson 12: Creating a Manual and Using Mail Merge.
What is TrinDocs A fully integrated document management system enabling: Archiving Instant Retrieval Workflow & Routing OCR and Intelligent Form Recognition.
U1_a02_copyright text and images ADD NAME HERE. Insert below a copyright free IMAGE that could be used in your health and safety presentation.
GUI Design With The Appx Client Presented By: Gary Rogers.
Graphical Enablement In this presentation… –What is graphical enablement? –Introduction to newlook dialogs and tools used to graphical enable System i.
EndNote: The Next Steps Rebecca Starkey Reference Librarian The Joseph Regenstein Library
Greenstone Building your own collection. Overview Installation Usage Building a collection.
Test Automation For Web-Based Applications Portnov Computer School Presenter: Ellie Skobel.
FIRST COURSE Integration Tutorial 2 Integrating Word, Excel, and Access.
EWB-UMN Wiki Tutorial November 15, 2008 B.Bell 1.Attach a document to your project wiki 2.Edit your project wiki to move your document 3.Add a website.
Text2PTO: Modernizing Patent Application Filing A Proposal for Submitting Text Applications to the USPTO.
XP New Perspectives on Creating Web Pages With Word Tutorial 1 1 Creating Web Pages With Word Tutorial 1.
June 2016, Version Scientific & technical Presentation Pipeline Pilot Integration.
Editing Editing – the process of updating a word processing document to: make changes correct errors make it visually appealing.
CONTENT MANAGEMENT SYSTEM CSIR-NISCAIR, New Delhi
Reference Management Software Tools Zotero - Open Source (Module 12)
JEdit.
Affinity Consulting John A. Federico
Presentation transcript:

Name to structure, Structure to name, chemicalize.org Daniel Bonniot Solutions for Cheminformatics

Overview We will talk about: Name generation (structure to name, s2n) Name import (name to structure, n2s) Name extraction (document to structure, d2s) Name correction (OCR-error fixing) Name highlighting (chemicalize.org)

Structure to name Usage: Plugin in MarvinView and MarvinSketch Label updated in real-time in MarvinSketch Batch: Save As: IUPAC Name in MarvinView Batch: command line (molconvert, cxcalc) Instant JChem 2 Options: Strict IUPAC or traditional Timeout

Structure to name Already stable (focus on n2s) Comparison between and on NCI database (260K structures) Both over 99.9% named, 4% changed More fused names supported (60% to 66%): 5-methyl-6-azatricyclo[ ^{2,7}]tetradeca-1(10),2(7),3,5,8,11,13-heptaene 3-methylbenzo[f]quinoline Better support for ions (e.g. -olate) Stricter IUPAC numbering and priorities Overspecific E/Z labels removed

Name to structure Usage Edit/Import Name... in MarvinSketch Automatic format recognition Paste a name from the clipboard Open IUPAC Name file in MarvinView or MarvinSketch (.name extension) Batch from command line (molconvert)

Name to structure: evaluation Molecule->Name->Molecule on NCI 5.1.0: 90.0% names imported, 69% identical 5.2.2: 97.3% names imported, 91% identical Pubchem data 5.1.0: 87.7% names imported, 94% identical 5.2.2: 97.5% names imported, 95% identical Name to structure: +33% speed

Document to structure Goal Process documents containing text Recognize chemical names Convert them to structures Return locations, names and structures Formats Implemented: text, html, xml Planned: PDF, Doc,... Release upcoming (5.2.3)

OCR error fixing Scanned texts contain numerous recognition errors Examples: L (small L) instead of 1 I instead of l (Il, iL) Uses: By default in d2s Option in n2s (from 5.2.3)

chemicalize.org Adds structural information to existing public webpages Popup window with structure image Link to structure predictions (logP, pKa,...) Searchable structure->webpage index Could be installed natively on custom website, with custom features

Chemicalize.org

Find out more Product descriptions & links Forum Presentations and posters Download