1 Overview of Chemical Informatics and Cyberinfrastructure Collaboratory Aug 16 2006 Geoffrey Fox Computer Science, Informatics, Physics Pervasive Technology.

Slides:



Advertisements
Similar presentations
SOMA2 – Drug Design Environment. Drug design environment – SOMA2 The SOMA2 project Tekes (National Technology Agency of Finland) DRUG2000 program.
Advertisements

Indiana University Chemical Informatics Programs Gary Wiggins
From Chemical Information to Cheminformatics: Graduate Programs at Indiana University Gary Wiggins School of Informatics May 21, 2007.
Indiana University School of David Wild – CICC Quarterly Meeting, Jan Page 1 Projects 1-4 update David Wild CICC Quarterly Meeting January 27.
Educational Opportunities in Cheminformatics at IU Gary Wiggins
CICC June meeting IUPUI team: Kelsey Forsythe Malika Mahoui Deepthi Jonnala Usha Cheemakurthi.
VARUNA – Towards a Grid- based Molecular Modeling Environment CICC/MACE – Meeting May 22, 2006 Mookie Baik Department of Chemistry & School of Informatics.
Educational Activities in Cheminformatics at IU Gary Wiggins
Indiana University School of David Wild – Research Overview April Page 1 Research Update, April 2006 David Wild Assistant Professor of Chemical Informatics.
Pulan Yu School of Informatics Indiana University Bloomington Web service based Varuna.Net.
Distributed Drug Discovery Indiana University Purdue University, Indianapolis.
Distributed Drug Discovery William L. Scott Department of Chemistry and Chemical Biology Indiana University Purdue University Indianapolis, Indianapolis.
Building a Chemical Informatics Grid Marlon Pierce Community Grids Laboratory Indiana University.
CICC Chemical Compound Mining Workflows Jungkee (Jake) Kim Community Grids Laboratory.
Fighting Malaria With The Grid. Computing on The Grid The Internet allows users to share information across vast geographical distances. Using similar.
GENOMICS MEETS GRID INTRODUCTION CHRIS THOMPSON. BACKGROUND SR £120M NATIONAL PROGRAMME ON e- SCIENCE £8M BBSRC PROGRAMME ON BIOINFORMATICS AND.
INDIANA UNIVERSITY LIBRARIES Enabling the library in university systems Trial and evaluation in the use of library services away from the library Chris.
S.J. Coles a*, M.B. Hursthouse a, R.A. Stephenson a, P. Cliff b, E. Lyon b, M. Patel b J. Downing c & P. Murray-Rust.
© S.J. Coles 2006 Digital Repositories as a Mechanism for the Capture, Management and Dissemination of Chemical Data Simon Coles School of Chemistry, University.
1 Challenges and New Trends in Data Intensive Science Panel at Data-aware Distributed Computing (DADC) Workshop HPDC Boston June Geoffrey Fox Community.
Educational Activities in Cheminformatics at IU Gary Wiggins
28 October 2005Jeremy Frey, University of Southampton1 “The CombeChem Experience” CICC Workshop 28 October 2005 Bloomington Indiana.
Community Grids Lab CICC Activities Geoffrey Fox, Marlon Pierce Indiana University.
Clouds from FutureGrid’s Perspective April Geoffrey Fox Director, Digital Science Center, Pervasive.
Chemical Informatics and Cyber- infrastructure Building Blocks Chemical Informatics Resources:  Deluge of experimental data > 100,000 compounds screened.
1 E-Chemistry and Web 2.0 Marlon Pierce Community Grids Lab Indiana University.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
The Indiana University School of Informatics Bobby Schnabel: Dean, Indiana University School of Informatics Presented by Geoffrey Fox: Associate Dean for.
Changing the Way Research is Done: Advances in Chemical Information Science and Cheminformatics Gary Wiggins 25 th Anniversary Chemistry Librarianship.
SALSASALSASALSASALSA Digital Science Center June 25, 2010, IIT Geoffrey Fox Judy Qiu School.
1 Gary Wiggins for Geoffrey Fox April 30, 2007 Computer Science, Informatics, Physics Pervasive Technology Laboratories Indiana.
18:15:32Service Oriented Cyberinfrastructure Lab, Grid Deployments Saul Rioja Link to presentation on wiki.
National Center for Supercomputing Applications The Computational Chemistry Grid: Production Cyberinfrastructure for Computational Chemistry PI: John Connolly.
Department of Biomedical Informatics Service Oriented Bioscience Cluster at OSC Umit V. Catalyurek Associate Professor Dept. of Biomedical Informatics.
Grids for Chemical Informatics Randall Bramley, Geoffrey Fox, Dennis Gannon, Beth Plale Computer Science, Informatics, Physics Pervasive Technology Laboratories.
NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE Molecular Science in NPACI Russ B. Altman NPACI Molecular Science Thrust Stanford Medical.
INFSO-RI Enabling Grids for E-sciencE V. Breton, 30/08/05, seminar at SERONO Grid added value to fight malaria Vincent Breton EGEE.
Computational Science and the School of Informatics at Indiana University IU/HBCU STEM Initiative IUPUI April Geoffrey Fox Computer Science, Informatics,
1 Joint meeting of the Molecular Libraries Screening Centers Network (MLSCN) and the Exploratory Centers for Cheminformatics Research (ECCR): Talk II July.
National Center for Supercomputing Applications Barbara S. Minsker, Ph.D. Associate Professor National Center for Supercomputing Applications and Department.
February 27, 2007 University Information Technology Services Research Computing Craig A. Stewart Associate Vice President, Research Computing Chief Operating.
1 Semantic Research Grid Open Grid Forum Web 2.0 Workshop OGF21, Seattle Washington October Geoffrey Fox, Aurel Cami, Ahmet Fatih Mustacoglu, Ahmet.
Indiana University School of David Wild, Geoffrey Fox, Bioinformatics retreat, February Page 1 Chemoinformatics David Wild, Bioinformatics.
Biomedical and Bioscience Gateway to National Cyberinfrastructure John McGee Renaissance Computing Institute
ECCR Overview/MLSCN. NIH Roadmap Series of initiatives designed to pursue major opportunities in biomedical research and gaps in current knowledge that.
1 Joint meeting of the Molecular Libraries Screening Centers Network (MLSCN) and the Exploratory Centers for Cheminformatics Research (ECCR): Talk I July.
1 Web 2.0 and Grids for Scholarly Research Peking University July Geoffrey Fox Computer Science, Informatics, Physics Pervasive Technology Laboratories.
Social Networking for Scientists (Research Communities) Using Tagging and Shared Bookmarks: a Web 2.0 Application Marlon Pierce, Geoffrey Fox, Joshua Rosen,
1 Overview of Chemical Informatics and Cyberinfrastructure Collaboratory October Geoffrey Fox Computer Science, Informatics, Physics Pervasive.
SALSASALSASALSASALSA Digital Science Center February 12, 2010, Bloomington Geoffrey Fox Judy Qiu
Big Data to Knowledge Panel SKG 2014 Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China August Geoffrey Fox
IU OREChem Summary Slides Marlon Pierce, Geoffrey Fox, Sashikiran Challa.
IU Site Update TeraGrid Round Table Craig Stewart, Stephen Simms, Kurt Seiffert November 4, 2010.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
Chemical Informatics and Cyberinfrastructure Collaboratory An NIH-Funded Exploratory Center for Cheminformatics Research Project of the IU School of Informatics.
Indiana University School of Indiana University ECCR Summary Infrastructure: Cheminformatics web service infrastructure made available as a community resource.
Tools and Services Workshop
Joslynn Lee – Data Science Educator
Chemical Informatics and Cyberinfrastructure Collaboratory
Gary Wiggins for Geoffrey Fox
Recap: introduction to e-science
CICC Combines Grid Computing with Chemical Informatics
Cyberinfrastructure and PolarGrid
CICC Chemical Compound Mining Workflows
New Ms and BS Chemical Informatics Programs
PolarGrid and FutureGrid
Chemical Informatics and Cyberinfrastructure Collaboratory
Presentation transcript:

1 Overview of Chemical Informatics and Cyberinfrastructure Collaboratory Aug Geoffrey Fox Computer Science, Informatics, Physics Pervasive Technology Laboratories Indiana University Bloomington IN

2 Capabilities Local Teams, successful Prototypes and International Collaboration set up in 3 initial major focus areas Chemical Informatics Cyberinfrastructure/Grids with services, workflows and demonstration uses building on success in other applications (LEAD) and showing distributed integration of academic and commercial tools Computational Chemistry Cyberinfrastructure/Grids with simulation, databases and TeraGrid use Education with courses and degrees Review of activities suggest we also formalize work in two further areas Chemical Informatics Research – model applicability Interfacing with the User - bench chemist-friendly portal

3 Current Status Web site Wiki chosen to support project as a shared editable web space Building Collaboratory involving PubChem – Global Information System accessible anywhere and at any time – enhance PubChem with distributed tools (clustering, simulation, annotation etc.) and data Adopted Taverna as workflow as popular in Bioinformatics but we will evaluate other systems such as GPEL from LEAD Preparing large set of runs on local Big Red 23 Teraflop supercomputer (OSCAR3 CDK Mopac) Initial results discussed at conferences/workshops/papers Gordon Conferences, ACS, SDSC tutorial First new Cheminformatics courses offered Advisory board set up and met Videoconferencing-based meetings with Peter Murray-Rust and group at Cambridge roughly every 2-3 weeks Good or potentially good interactions with NIH DTP, Scripps, Lilly and Michigan ECCR

4 CICC Senior Personnel Geoffrey C. Fox Mu-Hyun (Mookie) Baik Dennis B. Gannon Marlon Pierce Beth A. Plale Gary D. Wiggins David J. Wild Yuqing (Melanie) Wu Peter T. Cherbas Mehmet M. Dalkilic Charles H. Davis A. Keith Dunker Kelsey M. Forsythe Kevin E. Gilbert John C. Huffman Malika Mahoui Daniel J. Mindiola Santiago D. Schnell William Scott Craig A. Stewart David R. Williams From Biology, Chemistry, Computer Science, Informatics at IU Bloomington and IUPUI (Indianapolis)

5 CICC Advisory Board Alan D. Palkowitz (Eli Lilly) Chris Peterson (Kalypsys) David Spellmeyer (IBM) Dimitris K. Agrafiotis (Johnson & Johnson) Horst Hemmerle (Eli Lilly) James M. Caruthers (Purdue University) Jeremy G. Frey (University of Southampton) Joel Saltz (Ohio State University/University of Maryland/Johns Hopkins University) John M. Barnard (Digital Chemistry) John Reynders (Eli Lilly) Peter Murray-Rust (University of Cambridge) Peter Willett (University of Sheffield) Thompson Doman (Eli Lilly) Val Gillet (University of Sheffield) Industry and Academia Met October 2005 will meet this fall

6 CICC Combines Grid Computing with Chemical Informatics CICCCICC Chemical Informatics and Cyberinfrastucture Collaboratory Funded by the National Institutes of Health Indiana University Department of Chemistry, School of Informatics, and Pervasive Technology Laboratories Science and Cyberinfrastructure. Large Scale Computing Challenges Chemical Informatics is non-traditional area of high performance computing, but many new, challenging problems may be investigated. CICC is an NIH funded project to support chemical informatics needs of High Throughput Cancer Screening Centers. The NIH is creating a data deluge of publicly available data on potential new drugs. CICC supports the NIH mission by combining state of the art chemical informatics techniques with World class high performance computing National-scale computing resources (TeraGrid) Internet-standard web services International activities for service orchestration Open distributed computing infrastructure for scientists world wide NIH PubMed DataBase OSCAR Text Analysis POVRay Parallel Rendering Initial 3D Structure Calculation Toxicity Filtering Cluster Grouping Docking Molecular Mechanics Calculations Quantum Mechanics Calculations IUs Varuna DataBase NIH PubChem DataBase Chemical informatics text analysis programs can process 100,000s of abstracts of online journal articles to extract chemical signatures of potential drugs. OSCAR-mined molecular signatures can be clustered, filtered for toxicity, and docked onto larger proteins. These are classic pleasingly parallel tasks. Top- ranking docked molecules can be further examined for drug potential. Big Red (and the TeraGrid) will also enable us to perform time consuming, multi-stepped Quantum Chemistry calculations on all of PubMed. Results go back to public databases that are freely accessible by the scientific community.

CICC Prototype Web Services Molecular weights Molecular formulae Tanimoto similarity 2D Structure diagrams Molecular descriptors 3D structures InChi generation/search CMLRSS Basic cheminformatics Application based services Compare (NIH) Toxicity predictions (ToxTree) Literature extraction (OSCAR3) Clustering (BCI Toolkit) Docking, filtering,... (OpenEye) Varuna simulation Define WSDL interfaces to enable global production of compatible Web services; refine CML Look at Pipeline Pilot Extend Computational Chemistry (Varuna) Services Routine TeraGrid Big Red use Ready to try Prototype Production on OSCAR3 CDK Mopac Develop more training material Link to screening center via Scripps Next steps? Key Ideas Add value to PubChem with additional distributed services and databases Wrapping existing code in web services is not difficult Provide core (CDK) services and exemplars of typical tools Provide access to key databases via a web service interface Provide access to major Compute Grids

8 Varuna environment for molecular modeling (Baik, IU) QM Database Researcher Simulation Service FORTRAN Code, Scripts Chemical Concepts Experiments QM/MM Database PubChem, PDB, NCI, etc. ChemBioGrid Reaction DB DB Service Queries, Clustering, Curation, etc. Papers etc. Condor TeraGrid Supercomputers Flocks