Mats Dahlberg Research Informatics iNovacia AB, Sweden ChemAxon UGM, Budapest June 7 2006 BeeHive a datamining tool at Biovitrum and iNovacia.

Slides:



Advertisements
Similar presentations
Integrating ChemAxon technology into your End User Applications Java solutions for cheminformatics Ver. Mar., 2005.
Advertisements

Version 5.3, April 2010 The ChemAxon Markush project overview and development discussion.
JChem Web Services Server Jonathan Lee Solutions for Cheminformatics Technical Product Presentation.
4 August 2009Copyright © 2009 – Kelaroo, Inc. Kelaroo & ChemAxon Robert D. Feinstein, PhD Vice President & CSO, Kelaroo, Inc.
SOMA2 – Drug Design Environment. Drug design environment – SOMA2 The SOMA2 project Tekes (National Technology Agency of Finland) DRUG2000 program.
ChemAxon's Java Components in a Heterogeneous, Server-Centric Application Environment ChemAxon 2005 User Group Meeting May 19th and 20th, Budapest, Hungary.
Interfacing the JChem Suite outside of Java Jonathan Lee Solutions for Cheminformatics.
UGM, June, 2007 Presenting: Szabolcs Csepregi JChem Base and Cartridge latest.
Instant JChem - current status and what's coming soon. Tim Dudgeon Solutions for Cheminformatics.
Leveraging ChemAxon Cheminformatics in an Integrated Drug Discovery and Development Platform Zhenbin Li, Paul Starbard, Jim Gregory, Donald Chen, Paul.
19 May 2005Copyright © 2005 – Kelaroo, Inc. Kelaroo Applications & ChemAxon Components: Reagent Management Robert D. Feinstein, Ph.D. Kelaroo, Inc. –
Chemaxon's chemo-informatics toolkit integration into the Affectis Data Management System Database Automated Data Integration - Example: IC50 Data generated.
DeltaSofts ChemCart Next Generation Access to Research Data ChemAxon User Group Meeting Budapest, Hungary June 13-14, 2007.
PUBLIC ChemAxon European UGM Building an Electronic Research Habitat at ETC Peter Condron.
An integrated suite of applications using ChemAxon components
2008 Accelrys EUGM Pipelining ChemAxon Szilard Dorant Solutions for Cheminformatics.
Instant JChem 2009 US + EU Seminars Confidential. Copyright© 2009 ChemAxon Kft, Informatics Matters Ltd Instant JChem Instant JChem Seminar series Q
EIONET Training Beginners Zope Course Miruna Bădescu Finsiel Romania Copenhagen, 27 October 2003.
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
Oracle SQL Developer Data Modeler 3.0: Technical Overview March 2011.
Boundless business Broaden your business horizons.
Kensington Oracle Edition: Open Discovery Workflow Meets Oracle 10g Professor Yike Guo.
Sysment Notebook Presentation of a web-based ELN system.
Lecture-7/ T. Nouf Almujally
Lecture 1 Introduction to the ABAP Workbench
1 Copyright © 2011, Oracle and/or its affiliates. All rights reserved.
Single view of customer Support deposit and loan accounts Fully integrated General Ledger module that can be customised according to customer specification.
Building Enterprise Applications Using Visual Studio ®.NET Enterprise Architect.
Chapter 3 Database Management
SmartSQL AlfaTech Software Solutions Application Requirements Document  Radi Bekker  Vladimir Goldman  Marina Shaevich  Alexander Shapiro Team Members:
BMC Control-M Architecture By Shaikh Ilyas
Supplement 02CASE Tools1 Supplement 02 - Case Tools And Franchise Colleges By MANSHA NAWAZ.
Sysment Reaction Tool Presentation of a smart reaction editor application.
Building Ad-Hoc Reports using the SQL Server 2005 Reporting Services (SSRS) Report Builder (SQL307) Adrian Rupp Business Intelligence Solutions Specialist.
Presented By: Shashank Bhadauriya Varun Singh Shakti Suman.
Professional Informatics & Quality Assurance Software Lifecycle Manager „Tools that are more a help than a hindrance”
“This presentation is for informational purposes only and may not be incorporated into a contract or agreement.”
Overview of SQL Server Alka Arora.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
SednaSpace A software development platform for all delivers SOA and BPM.
4 Copyright © 2009, Oracle. All rights reserved. Designing Mappings with the Oracle Data Integration Enterprise Edition License.
© Paradigm Publishing Inc. 9-1 Chapter 9 Database and Information Management.
Information Systems Chapter 5 Building the database Part 1. Unsing Access.
SITools Enhanced Use of Laboratory Services and Data Romain Conseil
Fundamentals of Database Chapter 7 Database Technologies.
1 Accelerated Web Development Course JavaScript and Client side programming Day 2 Rich Roth On The Net
Completing the Model Common Problems in Database Design.
Introduction to Database Management. 1-2 Outline  Database characteristics  DBMS features  Architectures  Organizational roles.
May 2009 ChemAxon - What’s New?. What’s new and hot? All products have seen enhancements in the past 12 months BUT WHAT’S REALLY HOT?
MET280: Computing for Bioinformatics Introduction to databases What is a database? Not a spreadsheet. Data types and uses DBMS (DataBase Management System)
Archivists' Toolkit - CRADLE Presentation, 10 Feb The Archivists’ Toolkit CRADLE Presentation 10 Feb
Oracle Application Express. Program Agenda Oracle Application Express Overview Use Cases Key Features Packaged Applications Packaging Pricing Call to.
Archivists' Toolkit - CDL Presentation, October 17, 2005 The Archivists’ Toolkit Lee Mandell Brad Westbrook.
Informatics Software and Services Jim Shaw BergenShaw International Integrate. Automate. Manage. Your company Logo In collaboration.
PHP Features. Features Clean syntax. Object-oriented fundamentals. An extensible architecture that encourages innovation. Support for both current and.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
The european ITM Task Force data structure F. Imbeaux.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 14 Database Connectivity and Web Technologies.
Kako razvijate PL/SQL pakete? File based PL/SQL development Mitja Golouh SIOUG 2006,
DATAWHERE - MIAB1 DATAWHERE The MIAB Solution to Information Support and Data Conversion/Migration 통신망 연구실 석사 3 학기 임 수 정.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
ThinStructure: An Overview Support for ThinStructure demonstration. Jean Georges Perrin – Annandale, 21 st April 2004.
Overview of Basic 3D Experience (Enovia V6) Concepts
 1- Definition  2- Helpdesk  3- Asset management  4- Analytics  5- Tools.
Building Enterprise Applications Using Visual Studio®
Accessing the Database Server: ODBC, OLE DB, and ADO
CS 174: Server-Side Web Programming February 12 Class Meeting
Roland Knispel Bioeddie.
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
Oracle SQL Developer Data Modeler
Presentation transcript:

Mats Dahlberg Research Informatics iNovacia AB, Sweden ChemAxon UGM, Budapest June BeeHive a datamining tool at Biovitrum and iNovacia

Research Informatics Philosophy All data in Oracle –Safe, pharma industry standard (e.g. many chemical cartridges, ChemAxon, MDL, Accelrys,...) –Data is our asset. Programs come and go. Integration through database layer –...but hidden to the users. Multiple front-ends allowed Applications rapidly adapted to users needs –Close connection developers - users –Workflow support requires full control over the code Unorthodox solutions are allowed –Sometimes quick and dirty development –Sometimes unstable code (but usually fixed quickly...) –Sometimes non-standard technical platform (e.g. Bee language)

BeeHive Function –Main repository for ALL research data (almost) –Used by all project teams –Technical platform for various modules Features –Advanced on-the-fly join of DB table –Versatile handling of lists (compounds, batches, projects...) and Queries –Data grouping (One-line-per-compound) –Fully customisable through meta-data, easy to add new branches (CBT, ELN stats etc) –Structure searching through ChemAxon Oracle cartridge –Built on Bee language from MolSoft LLC, San Diego Status –Moved from MDLs cartridge 2006 –Business critical. Appr 250 users throughout R&D

The heart – just a SQL generator… Defines column types and cost for all joinable columns All possible joins are pre- calculated, travelling salesman problem (more then 300 tables)

Meta data structure Define entities and clean up the dictionaries –Compound numbers, protein targets, batches, plasmids... –One source for every entity possible to validate numbers no misspellings improved data quality This is the core of integration - not a particular client or system None of this comes out-of-the-box! Cross database client Prog 1 Prog 2 Example from Biovitrum

The BEE language High level object oriented scripting language The core interpreter can be extended with powerful modules for –Graphical user interface components –Database connectivity for Oracle, mySQL etc –Molecular objects including chemical drawing –XML-friendly Very compact and efficient code compared with e.g. Java Platform independent (Linux, MacOS X and Windows) Written by Ruben Abagyan & Eugene Raush Code Result

Activity, solubility, chemist etcQuery builder with structural searchingNavigate through all tables BeeHive Overview

Query builder All unique values in drop-down lists No hard-coded values Easy to spot errors

Extraction of data for SAR analysis One compound per line Average IC 50 and SD values Hill number from ActivityBase Structure pop-up window

Systems and applications: BeeHive Modules That Uses JChem CIMS –Chemical Inventory Management System –Keeps track of all chemicals (bottle history, location, risk phrases etc) –Replaced previous MDL system –Fully barcoded (bottles, shelves, people...) –Has improved compliance, reagent availability and speed of inventory work Reagent Search –ACX database of chemical catalogues from CambridgeSoft –Cross-linked to CIMS –Give me all amines under 250 Dal and show in-house on top of the list

Chemical inventory

Reagent searching

Systems and applications: BeeHive Modules /contd/ ChemSpec –Registration of all new compounds –Structure based logic for new compounds and batches –BVT (iNo) number assignment –Connection point for analytical data and requests –Used by all medicinal and analytical chemists

What is next on the list? JChem Calculated properties on all molecule databases –pKa, logP, logD,... Generation of diverse screening sets on the fly (BCUT?)...

Summary - informatics Data sharing is crucial Excel is not enough! No database no modelling Each organisation must define their meta data You need a database administrator Define the data structure first - applications can be improved gradually

RI People and their roles Mats Kihlén –MSc Eng Physics. Head of RI, Ex-computational chemist. Mats Dahlberg –Developer. MSc Computer Science. All-purpose - any language. Mikael Malmgren –DBA. Database architecture & maintenance. Chemical Registration John Marelius –Developer. PhD Computational Chemistry. Lab automation.