1 InstantJChem: a flexible chemical database system G. Marcou, D. Horvath + Laboratoire d’infochimie, Université de Strasbourg, 1, rue Blaise Pascal, 67000.

Slides:



Advertisements
Similar presentations
Scientific & technical presentation JChem Cartridge for Oracle
Advertisements

Scientific & technical presentation Calculator Plugins January 2011.
Instant JChem INFORMATICS MATTERS
History Data Service1 Good Design for Historical source based Databases History Data Service Hamish James.
Chapter 10: Designing Databases
Post Exam Study Database Design
XML DOCUMENTS AND DATABASES
ICDL Software Applications - Database Concepts. Unit 6 Data and Data Representation Database Concepts –File Structure –Relationships Database Design –Data.
What is a Database By: Cristian Dubon.
Database management system (DBMS)  a DBMS allows users and other software to store and retrieve data in a structured way  controls the organization,
The Hierarchical Model
Chapter 12 File Processing and Data Management Concepts
Introduction to Databases
Managing data Resources: An information system provides users with timely, accurate, and relevant information. The information is stored in computer files.
File Systems and Databases
Chapter 14 Getting to First Base: Introduction to Database Concepts.
Geographic Information Systems
A Guide to SQL, Seventh Edition. Objectives Understand the concepts and terminology associated with relational databases Create and run SQL commands in.
Chapter 11 Data Management Layer Design
Information Technology in Organizations
Concepts of Database Management Sixth Edition
Chapter 12 Information Systems. 2 Chapter Goals Define the role of general information systems Explain how spreadsheets are organized Create spreadsheets.
Academic Year 2014 Spring.
Page 1 ISMT E-120 Introduction to Microsoft Access & Relational Databases The Influence of Software and Hardware Technologies on Business Productivity.
Databases & Data Warehouses Chapter 3 Database Processing.
Information storage: Introduction of database 10/7/2004 Xiangming Mu.
CSI315CSI315 Web Development Technologies Continued.
GIS Concepts ‣ What is a table? What is a table? ‣ Queries on tables Queries on tables ‣ Joining and relating tables Joining and relating tables ‣ Summary.
Copyright © 2003 by Prentice Hall Computers: Tools for an Information Age Chapter 13 Database Management Systems: Getting Data Together.
ASP.NET Programming with C# and SQL Server First Edition
Concepts of Database Management Seventh Edition
Introduction to Accounting Information Systems
Introduction to SQL Steve Perry
Information Systems: Databases Define the role of general information systems Describe the elements of a database management system (DBMS) Describe the.
7.1 Managing Data Resources Chapter 7 Essentials of Management Information Systems, 6e Chapter 7 Managing Data Resources © 2005 by Prentice Hall.
End Show 3.4 Database Management System Unit 3. End Show What is a database? It’s an organized collection of data, related to a particular subject or.
Concepts of Database Management Seventh Edition
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Database A database is a collection of data organized to meet users’ needs. In this section: Database Structure Database Tools Industrial Databases Concepts.
Concepts of Database Management Seventh Edition
1.file. 2.database. 3.entity. 4.record. 5.attribute. When working with a database, a group of related fields comprises a(n)…
Chapter 17 Creating a Database.
1 Database Concepts 2 Definition of a Database An organized Collection Of related records.
Chapter 4c, Database H Definition H Structure H Parts H Types.
 2001 Prentice Hall Business Publishing, Accounting Information Systems, 8/E, Bodnar/Hopwood A field may be a single character or number, or it.
Visual C# 2012 How to Program © by Pearson Education, Inc. All Rights Reserved.
Prepared By Prepared By : VINAY ALEXANDER ( विनय अलेक्सजेंड़र ) PGT(CS),KV JHAGRAKHAND.
ITGS Databases.
Concepts of Database Management Eighth Edition Chapter 3 The Relational Model 2: SQL.
Database revision.
Access Chapter 5-Table Tricks, Advanced Queries and Custom Forms.
1 Database Basics: Filemaker 7 Introduction Center for Faculty Development, SJSU Steve Sloan
Chapter 10 Database Management. Data and Information How are data and information related? p Fig Next processing data stored on disk Step.
Microsoft Access Database Creation and Management.
Chapter 13.3: Databases Invitation to Computer Science, Java Version, Second Edition.
Use of Machine Learning in Chemoinformatics
Introduction to Databases Angela Clark University of South Alabama.
1 10 Systems Analysis and Design in a Changing World, 2 nd Edition, Satzinger, Jackson, & Burd Chapter 10 Designing Databases.
Database Overview What is a database? What types of databases are there? How are databases more powerful than spreadsheets?
Data Resource Management Lecture 8. Traditional File Processing Data are organized, stored, and processed in independent files of data records In traditional.
1 Section 1 - Introduction to SQL u SQL is an abbreviation for Structured Query Language. u It is generally pronounced “Sequel” u SQL is a unified language.
Introduction Multimedia initial focus
Chapter 12 Information Systems.
What is a Database and Why Use One?
CIS16 Application Programming with Visual Basic
Data Model.
Database Systems Instructor Name: Lecture-3.
Spreadsheets, Modelling & Databases
Presentation transcript:

1 InstantJChem: a flexible chemical database system G. Marcou, D. Horvath + Laboratoire d’infochimie, Université de Strasbourg, 1, rue Blaise Pascal, Strasbourg

Introduction  The goal is to present InstantJChem for the storage and manipulation of chemical information 1.General presentation 2.Database search 3.Creation of a database from scratch

What is a database?  A database stores data in an ordered form on a precise subject.  A relational database stores information into tables which possess inter-references  A relational database management system (RDBMS) is a software that manages relational databases  InstantJChem is not a database and is not an RDBMS.

What is InstantJChem?  InstantJChem is a friendly interface between a RDBMS, chemical information and the user. User RDBMS Chemical Information

Key concepts of InstantJChem ProjectsSchemaDatabases and TablesEntitiesData TreesViews

Exercise 1 Create a new project names IJCExercises…

Key concept: Project Project contains resources and connections to one or more databases. icon

Exercise 1 …and import the file SC100.SDF in it….

Key concept: Schema Schema/ Database Contains connection to a database and special tables (JChemProperties) icon

Key concept: Database and Tables Table Database and tables are managed by the RDBMS. Actually store information. icon

What can be stored TypeDescription Standard table IntegerLong integer: 232 = TextUser can specify widths of text fields as large as needed. RealReal double-precision DateAllows to store dates. BooleanValue is True or False List (Standard)To store a list of database items JChem table Chemical termsA list of functions evaluated on chemical structures: logD, pKa, tautomers,... StructureChemical structure, automatically created with a Jchem table

Key concept: Entities Entity An entity is a representation of data. icon It is a unique interface to conceptually different types of tables (Standard, Chemical, SQL, Extractions, etc).

Key concept: Data Trees Data Tree A collection of entities and views. icon Organize information using a hierarchy (parent- child relationship between entities).

Exercise 1 ….Customize a browser for it.

Key concept: Views Views An interface to data. icon For simple data, a spreadsheet view is relevant. For complex relational data, a form is mandatory.

Exercise 2 In the SC100 database, search for fluorobenzene and pyridine containing molecules. Use Substructure or Similarity search.

Exercise 2 In the SC100 database, search for fluorobenzene and pyridine containing molecules. Use Substructure or Similarity search. Substructure search: 20 hits Similarity search: 0 hits Substructure search: 14 hits Similarity search: 0 hits Similarity search uses Chemical Hashed Fingerprints defined at database creation.

Chemical Hashed Fingerprints (CHF) Pattern Length: number of bonds of a pattern Fingerprint Length: total number of bits to store the fingerprint Bits per pattern: number of bits a pattern shall set on Efficient annotation to accelerate structure search

Exercise 3 Combine molecule 25 and 89 into a pseudo-molecule to perform a superstructure query.

Exercise 4 Use compound 46 as a Full and Full fragment query to search the database. Repeat after removing the bromide from the query.

Structure Searches

Exercise 5 Search benzene containing compounds, which name contains “pyrimidin” and annotated as “Good” concerning their aqueous solubility.

Exercise 6 Search for compounds with at least one aromatic ring containing at least on Nitrogen atom

Exercise 7 Search for compounds which MolWeight > 200 and not containing a benzene ring

Exercise 8 Search for compounds with MolWeigh > 200, then for compounds without a benzene ring and search for the union of the hit lists.

Execrise 9 Search for compounds possessing more than 4 microspecies at pH=4.0….

Exercise 9 … Export your hit list.

Exercise 10 Import in your project the file ISICCRsm.RDF…

Exercise 10 … Create a Browser for this database

Exercise 11 Search for reactions including an imidazole ring into their reactants then into their products.

Exercise 12 Add to your Schema a new data tree and structure entity named AlkanBoilingPoint…

Exercise 12 … and add a floating point value field named BoilingPoint.

Exercise 13 Add to the AlkanBoilingPoint entity the following data.

Exercise 14 Add to the AlkanBoilingPoint entity a new date field named Date and fill it.

Exercise 15 Add to the AlkanBoilingPoint entity a calculated value of LogP using a Chemicalterm field.

Summary  Create a project and schema  Import data  Search by substructure, superstructure, similarity, and exact match  Search by keyword  Combining queries and result lists  Export query results  Create a new database

Conclusion  InstantJChem is a Chemoinformatics layer above a standard SGDB.  Provides many more Chemoinformatics services (databases overlap, QSPR modeling, plots, enumeration, scripting) SGDB InstantJChem