TCOF 3 :Repositioning of Chemical compounds From Different Classes as part of Virtual Screening Under the Guidance of PI: Dr UCA JALEEL (IISc Research.

Presentation transcript:

TCOF 3: Repositioning of Chemical Compounds from Different Classes as Part of Virtual Screening. Under the guidance of PI: Dr UCA Jaleel (IISc Research Unit, Bangalore). Swati Gandhi [Shah], 3.2 TCOF Fellow (MSc Bioinformatics, The Maharaja Sayajirao University of Baroda). Blog URL: swatigandhishah.wordpress.com

The project repositions compounds from a chemical compound database that is divided into three subclasses: 1) Pesticides 2) Antimicrobial molecules 3) Phytomolecules. Our group is working on the pesticides database. The aim of the project is to develop classes of anti-MTb compounds and reposition them by screening for pesticides that are active against TB, which could then be taken forward towards clinical trials. To find a pesticide database we searched several sources, including PubChem, the PAN (Pesticide Action Network) pesticide database and the EU Pesticide Database, and finally settled on the EPA (Environmental Protection Agency). From the EPA we obtained some 654 pesticide molecules, of which 487 had a structure and SDF file available; the remaining structures were drawn by Ayisha Safeeda using MARVIN and saved in SDF file format. The next slide gives an overview of the project as a flowchart.

Following the flowchart on the previous slide, we have started the WEKA part. The algorithms are applied directly to the dataset, which is split into a training set and a test set in an 80:20 ratio. The WEKA model is generated from these training and test sets; the next slide defines the steps.
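The 80:20 split can be illustrated outside WEKA with a minimal Python sketch. It is not the procedure the group used in WEKA: the function name, the `activity` field and the toy data are hypothetical, and keeping the active/inactive ratio the same in both sets (stratification) is an assumption that the slides do not state.

```python
import random
from collections import defaultdict


def stratified_split(records, label_key="activity", train_fraction=0.8, seed=42):
    """Split records into training and test lists, keeping the class ratio similar in both."""
    by_class = defaultdict(list)
    for record in records:
        by_class[record[label_key]].append(record)

    rng = random.Random(seed)  # fixed seed so the split is reproducible
    train, test = [], []
    for label in sorted(by_class):
        group = by_class[label][:]
        rng.shuffle(group)
        cut = round(len(group) * train_fraction)  # 80% of each class goes to training
        train.extend(group[:cut])
        test.extend(group[cut:])
    return train, test


# Hypothetical toy data: 10 actives and 90 inactives.
compounds = [
    {"id": i, "activity": "active" if i < 10 else "inactive"} for i in range(100)
]
train_set, test_set = stratified_split(compounds)
print(len(train_set), len(test_set))
```

Splitting each class separately matters for screening data, where actives are usually rare: a plain random split could leave the test set with very few actives.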

Module workflow (tools: PubChem, PowerMV, Excel, WEKA (machine learning)):
1) Access the HTS bioassay data and obtain the SDF file of all compounds and the bioassay results (all)
2) Upload the SDF file and generate the descriptor file
3) Open the CSV file in Excel
4) Append the bioassay result corresponding to each compound
5) Select the active and inactive compounds
6) Remove the useless attributes
7) Split the file into training and testing sets
8) Apply classifier algorithms
9) Select the best classifier model (TP %, FP 70%)
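Two of the data-preparation steps in this workflow, appending the bioassay result to each compound and removing useless attributes, can be sketched in Python as below. This is only an illustration: the group did this work in Excel and WEKA, the column names (`compound_id`, `bit_1`, `bit_2`) are hypothetical, and treating "useless" as "takes the same value for every compound" is an assumption.

```python
def append_activity(descriptor_rows, activity_by_id, id_key="compound_id"):
    """Add the bioassay outcome to each descriptor row; rows without an outcome are skipped."""
    merged = []
    for row in descriptor_rows:
        outcome = activity_by_id.get(row[id_key])
        if outcome is not None:
            merged.append({**row, "activity": outcome})
    return merged


def drop_constant_attributes(rows, keep=("compound_id", "activity")):
    """Remove attributes with a single value across all rows (they cannot separate classes)."""
    if not rows:
        return []
    constant = {
        name
        for name in rows[0]
        if name not in keep and len({row[name] for row in rows}) == 1
    }
    return [{k: v for k, v in row.items() if k not in constant} for row in rows]


# Hypothetical descriptor rows (e.g. fingerprint bits) and bioassay outcomes.
descriptors = [
    {"compound_id": "C1", "bit_1": 0, "bit_2": 1},
    {"compound_id": "C2", "bit_1": 0, "bit_2": 0},
    {"compound_id": "C3", "bit_1": 0, "bit_2": 1},
]
bioassay = {"C1": "active", "C2": "inactive"}

table = drop_constant_attributes(append_activity(descriptors, bioassay))
print(table)
```

Here `C3` is dropped because it has no bioassay outcome, and `bit_1` is dropped because it is 0 for every remaining compound.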

The current stage of the project is tuning the model generated by WEKA. We have generated results using different classifiers, such as Naïve Bayes and Random Forest, and are now trying to tune the model to its most stable state by applying a cost matrix to it. The next slide sheds some light on this. The next stage is screening, after which we will proceed further.
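One common way a cost matrix is used with a classifier is to predict the class with the lowest expected cost rather than the most probable class. The Python sketch below illustrates that idea only; it is not the group's WEKA configuration, and the cost values, probabilities and confusion counts are all hypothetical.

```python
def expected_costs(class_probabilities, cost_matrix):
    """Expected cost of predicting each class; cost_matrix[actual][predicted]."""
    classes = list(cost_matrix)
    return {
        predicted: sum(
            class_probabilities[actual] * cost_matrix[actual][predicted]
            for actual in classes
        )
        for predicted in classes
    }


def min_cost_prediction(class_probabilities, cost_matrix):
    """Return the class whose prediction has the lowest expected cost."""
    costs = expected_costs(class_probabilities, cost_matrix)
    return min(costs, key=costs.get)


def total_cost(confusion, cost_matrix):
    """Total misclassification cost of a confusion matrix; confusion[actual][predicted]."""
    return sum(
        confusion[actual][predicted] * cost_matrix[actual][predicted]
        for actual in confusion
        for predicted in confusion[actual]
    )


# Hypothetical costs: missing an active (false negative) is 5x worse than a false positive.
costs = {
    "active": {"active": 0, "inactive": 5},
    "inactive": {"active": 1, "inactive": 0},
}

# A compound the model considers only 30% likely to be active.
probabilities = {"active": 0.3, "inactive": 0.7}
prediction = min_cost_prediction(probabilities, costs)
print(prediction)

# Hypothetical confusion matrix for comparing models by total cost.
confusion = {
    "active": {"active": 8, "inactive": 2},
    "inactive": {"active": 10, "inactive": 80},
}
print(total_cost(confusion, costs))
```

With these values the compound is predicted active even though its probability is below 0.5, because the expected cost of calling it inactive (1.5) exceeds that of calling it active (0.7). Raising the false-negative cost trades more false positives for fewer missed actives, which is the usual motivation in screening.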

Sheet Defining the Results After Applying the Cost Matrix

References:
1) Schierz AC. Virtual screening of bioassay data. J Cheminform Dec 22;1:21.
2) Periwal V et al. Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets. BMC Res Notes Nov 18;4:504.
3) Ekins S et al. Combining Computational Methods for Hit to Lead Optimization in Mycobacterium tuberculosis Drug Discovery. Pharm Res Oct 17. [Epub ahead of print]
4) Environmental Protection Agency

Heartiest thanks and acknowledgement: 1) Prof. Dr. Samir Brahmachari 2) Dr Jaleel (PI, TCOF3) 3) Dr Bheemarao Ugarkar 4) Dr TS Balganesh 5) OSDD Team 6) IISc Research Unit, Bangalore 7) Group members (Yatindra Yadav and Ayisha Safeeda)