Ludek Matyska, CESNet, on behalf of HealthGrid. National University of Singapore, May 4th, 2010

Presentation transcript:

Biomedical activities using Grid Technology
Ludek Matyska, CESNet, on behalf of HealthGrid
National University of Singapore, May 4th, 2010
Credits: A. Da Costa, Y. Legré, V. Breton

Outline
1. Grid computing to address biomedical challenges
2. European Grid Infrastructures
3. WISDOM success story
4. Biomedical applications in the EUAsiaGrid project
5. Conclusion


Introduction
THE LIFE SCIENCE COMMUNITY NEEDS BOTH E-SCIENCE AND GRID INFRASTRUCTURES
E-science focuses on creating new research environments for biologists
– Use of the most recent information technologies (semantics, ontologies)
– Design of virtual laboratories where biologists can run experiments and manipulate the knowledge they are familiar with
– Examples: MyGrid (UK) and VLe (Netherlands)
Grid infrastructures provide the resources needed at different levels
– To support bioinformaticians who maintain databases accessed by e-science environments (update, curate, store/duplicate)
– To increase resources for e-science environments when needed
– To enable specific heavy computing or data production projects

Biologists vs Bioinformaticians
1. Biologists need a growing capability to handle all the data relevant to their research topics
– Design of complex analysis workflows
– Knowledge management
2. Bioinformaticians who develop the IT services for biologists need growing resources
– To store, update and curate exponentially growing databases
– To run increasingly complex algorithms on this growing data set
– To build new databases exploiting the growing body of knowledge
3. Biologists and bioinformaticians therefore have different needs
– Biologists need high-level environments and few resources
– Bioinformaticians need large resources to develop and/or update the services needed by biologists

Added Value of Grid for Biologists
– The grid provides, on demand, the centuries of CPU time required
– The grid provides reliable and secure data management services to store and replicate the biochemical inputs and outputs
– The grid offers a collaborative environment for sharing data within the research community working on emerging and neglected diseases

Biomedical challenges
Grid computing can help to solve biomedical challenges:
– Data Grid: storage of vast amounts of complex biological data and federation of distributed data sources
– Computational Grid: mobilization of large CPU resources to address growing data-analysis needs, including algorithmic and computational modelling
– Knowledge Grid: information management and retrieval to extract knowledge

Life Sciences requirements
Identified needs
– Access to grids of clusters and to supercomputers
– Stability and sustainability
– User-friendly interfaces
– Standard access to services (one single API, whatever the middleware)
– Security of medical data
Importance of international standards
– Integration of resources into European infrastructures and European initiatives: ESFRIs, Virtual Physiological Human
Which standards?
– Open Grid Forum
– Web services


European Grid Infrastructures
SINCE 2001, BIOMEDICAL APPLICATIONS HAVE BEEN DEPLOYED ON DISTRIBUTED COMPUTING INFRASTRUCTURES
2001: DataGrid – Deployment of a prototype testbed
2002: DEISA – Distributed European Infrastructure for Supercomputing Applications
2004: EGEE-I – Deployment of a European infrastructure for scientific production
2006: EGEE-II – Further deployment of an infrastructure for scientific production
Cloud computing: Amazon Web Services
2008: EGEE-III – Same as EGEE-II + migration to a federation of national grids
DEISA
…: European Grid Initiative

EGEE Grid Infrastructure
Real Time Monitor k/rtm/
– 240 sites
– ~50k CPUs
– 45 countries
– 5 PB disk
– > 7500 users
– > 150k tasks/day
– > 200 VOs

EGEE Grid Infrastructure
6 scientific areas are covered by the EGEE grid
< 100 applications deployed
Applications per area (columns: 6/2006, 2/2007, 1/…)
Astron. & Astrophysics 289
Comp. Chemistry 62721
Earth Science 16 18
Fusion 234
High-Energy Physics 9117
Life Sciences
Others 41421
Total
Condensed Matter Physics, Comp. Fluid Dynamics, Computer Science/Tools, Civil Protection

European Grid Initiative
1. Goal: long-term sustainability of grid infrastructures in Europe
2. Approach: establish a federated model bringing together National Grid Infrastructures (NGIs) to build the European Grid Infrastructure (EGI)
3. EGI Organisation: coordination and operation of a common multi-national, multi-disciplinary Grid infrastructure
– To enable and support international Grid-based collaboration
– To provide support and added value to NGIs
– To liaise with corresponding infrastructures outside Europe


In silico Drug Discovery
Pharmaceutical development:
– Time-consuming: more than 10 years to develop a new medicine
– Expensive: hundreds of millions of dollars
– Emerging and neglected diseases need fast and cheap answers
Computational tools:
– More and more known and registered protein 3D structures
– More and more libraries of known chemicals
– More and more computing power available
– Better prediction quality from bioinformatics tools, at the price of heavy CPU consumption
Virtual screening on the grid speeds up the process and minimizes costs

WISDOM
WISDOM (World-wide In Silico Docking On Malaria) is an initiative that aims to demonstrate the relevance and the impact of the grid approach to drug discovery for neglected and emerging diseases
– WISDOM-I: Malaria (Plasmepsin)
– DataChallenge: Avian Flu (Neuraminidase)
– WISDOM-II: Malaria (4 targets)
– DataChallenge: Diabetes (Alpha-amylase)
GRIDS: EGEE, Auvergrid, TwGrid, EELA, EuChina, EuMedGrid
EUROPEAN PROJECTS: Embrace, EGEE, BioInfoGrid
INSTITUTES: SCAI, CNU, Academia Sinica of Taiwan, ITB, Unimo Univ., LPC, CMBA, CERN-Arda, HealthGrid, KISTI

Main objectives
Computational activities
– To show the relevance of computational grids in biomedical applications
– To develop an environment to monitor deployments on the grid: the WISDOM Production Environment
– To make the grid accessible to non-expert users
Biological activities
– To establish a virtual screening workflow on computational grids
– To find new inhibitors against neglected diseases

Elements
1. TARGET: 3D structure of a key protein in a disease
2. LIGAND: database of commercially available chemical compounds
3. SOFTWARE for virtual screening: docking, molecular dynamics
– Parallel computations
– Licences if needed (BioSolveIT and CCDC provided free licences for specific projects)
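
Because each target/ligand docking is independent of the others, a screening campaign can be split into many small jobs that run in parallel. The sketch below illustrates that decomposition only; the target names, ligand identifiers and chunk size are hypothetical and do not come from the WISDOM production environment.

```python
from itertools import islice


def make_jobs(targets, ligands, ligands_per_job):
    """Yield (target, ligand_chunk) pairs, one per independent job."""
    for target in targets:
        remaining = iter(ligands)
        while True:
            # Take the next slice of the ligand library for this target.
            chunk = list(islice(remaining, ligands_per_job))
            if not chunk:
                break
            yield target, chunk


# Hypothetical inputs, for illustration only.
targets = ["target_A", "target_B"]
ligands = [f"ligand_{i:04d}" for i in range(10)]
jobs = list(make_jobs(targets, ligands, ligands_per_job=4))
```

Each job carries one target and a slice of the ligand library, so a failed job can be resubmitted without touching the others.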

Targets and Ligands
Project | Protein | Function | Ligands
Malaria | Plasmepsin (PM II) | Hemoglobin degradation | ZINC subset, 1 million
Malaria | Glutathione-S-Transferase (GST) | Detoxification | ZINC, 4.3 million
Malaria | Dihydrofolate Reductase (DHFR) | DNA synthesis | ZINC, 4.3 million
Avian flu | Neuraminidase | Release of new virus | ZINC subset, 300,000 + chemical combinatorial library
Diabetes | Amylase/Gluco-amylase | Carbohydrate cleavage | ZINC subset, 300,000
sc-PDB 7000, PDBall 4000 PDB

Workflow deployed
1. Screening
– High Throughput Docking: Autodock/Flexx/Gold
– Scoring function: binding energy estimation, rank molecules, rank interactions
2. Refinement
– Best conformations from docking
– BEAR (Binding Energy After Refinement): Amber package, MM-MD-MMPB(G)SA
– Solvation
3. Checking
– Best conformations from BEAR
– Complex visualization: UCSF Chimera
– Hydrogen bonding
– Hydrophobic interactions
4. Validation
– Few compounds
– Protein synthesis
– Activity assay: IC50/Ki
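
The screening and refinement steps act as a funnel: a cheap score ranks the whole library, and only the best candidates receive the more expensive re-scoring. A minimal sketch of that idea follows, assuming that lower scores are better and using made-up compound names and values; it is not the WISDOM implementation.

```python
def funnel(docking_scores, rescore, keep_after_docking, keep_after_rescoring):
    """Two-stage filter: rank by docking score, then by a refined score.

    Lower (more negative) scores are treated as better, which is an assumption.
    """
    ranked = sorted(docking_scores, key=docking_scores.get)
    shortlist = ranked[:keep_after_docking]
    # Only the shortlist is passed to the expensive refinement step.
    refined = {name: rescore(name) for name in shortlist}
    return sorted(refined, key=refined.get)[:keep_after_rescoring]


# Hypothetical scores in arbitrary units, not real WISDOM results.
docking = {"c1": -9.1, "c2": -7.4, "c3": -8.8, "c4": -5.0, "c5": -8.0}
refined_energy = {"c1": -20.0, "c3": -31.5, "c5": -25.2}

hits = funnel(docking, refined_energy.get,
              keep_after_docking=3, keep_after_rescoring=2)
```

Note that the final ranking can differ from the docking ranking, which is the reason for re-scoring in the first place.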

Workflow deployed
– Molecular docking: FLEXX/AUTODOCK (catalytic aspartic residues)
– Molecular dynamics: AMBER
– Complex visualization: CHIMERA
– In vitro / in vivo: WET LABORATORY

Two-step screening on the grid
1. Docking results based on:
– Scoring
– Match information
– Different parameter settings
– Knowledge of the binding site
2. Molecular dynamics, starting from docked poses:
– Energy minimization with a distance-dependent dielectric (ε = 4r)
– Molecular dynamics on the active site as well as the ligand atoms
– Final re-minimization
– Re-scoring by MM-PBSA, which estimates energies and entropies from the snapshots contained in trajectory files
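
To make the distance-dependent dielectric concrete: with ε = 4r, the Coulomb pair energy q_i q_j / (ε r) falls off as 1/r² instead of 1/r. The sketch below only illustrates that functional form. The conversion constant is the usual molecular-mechanics value for charges in units of e and distances in Å; the function name and the example charges are assumptions, and the actual force-field settings used in WISDOM are not given in the slides.

```python
COULOMB_KCAL = 332.06  # kcal*Angstrom/(mol*e^2), usual conversion factor


def coulomb_energy(q_i, q_j, r, dielectric_scale=4.0):
    """Pair electrostatic energy (kcal/mol) with eps(r) = dielectric_scale * r.

    q_i and q_j are partial charges in units of e; r is a distance in Angstrom.
    """
    if r <= 0:
        raise ValueError("distance must be positive")
    eps = dielectric_scale * r
    return COULOMB_KCAL * q_i * q_j / (eps * r)


# Hypothetical opposite partial charges at two separations.
e_near = coulomb_energy(0.5, -0.5, 2.0)
e_far = coulomb_energy(0.5, -0.5, 4.0)
```

Doubling the distance divides the energy by four, which mimics solvent screening without explicit water molecules.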

Grid Performances (Docking)
Data challenge | Number of dockings | CPU years | Real time | CPUs used | Produced data size | Crunching factor | Distribution efficiency | Model
Malaria I | 41 million | 80 | 6 weeks | 1700 | 1 TB | 400 | 25% |
Malaria II | 142 million | 400 | 2.5 months | up to … | …,6 TB | 2000 | 40% |
Avian Flu | 4 million | 100 | 1.5 months | … | … GB | … | …% (>80% DIANE) |
Diabetes | 300,… | … | …,5 days | … | … | … | …% | PULL
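
One quantity that follows directly from these figures is the average CPU time per docking. The sketch below reads the Malaria I row as 41 million dockings for 80 CPU years and the Avian Flu row as 4 million dockings for 100 CPU years, which is an interpretation of the extracted table; the function name is illustrative.

```python
SECONDS_PER_YEAR = 365.25 * 24 * 3600


def cpu_seconds_per_docking(cpu_years, dockings):
    """Average CPU time per docking, in seconds."""
    return cpu_years * SECONDS_PER_YEAR / dockings


malaria_1 = cpu_seconds_per_docking(80, 41_000_000)
avian_flu = cpu_seconds_per_docking(100, 4_000_000)
```

Under that reading, a Malaria I docking took roughly one CPU minute on average and an Avian Flu docking roughly thirteen, so the cost per docking varies considerably between challenges.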

Wet Lab results
Protein | No. of compounds tested | Assay type | Reference | No. of actives | IC50 / Ki
PM II | 30 | FRET | Pepstatin A, IC50 = 4.3 nM | 26 / … | … nM – 1.8 µM
Neuraminidase | 185 | Fluorogenic substrate | Oseltamivir | 79/185 (59 > Oseltamivir) |
GST | (… | Colorimetry | S-hexyl glutathione, Ki = 35 µM | 4 / … | … µM
Patent deposited in South Korea: « Pharmaceutical composition preventing and treating malaria comprising compounds that inhibit Plasmepsin II activity and the method of treating malaria using thereof »


EUAsiaGrid
Appropriate scientific areas have been identified for porting and deployment of applications to the grid:
– High Energy Physics
– Computational Chemistry
– Social Science Applications
– Mitigation of Natural Disasters
– Bioinformatics and Biomedical
– Digital Culture and Heritage

Fight diseases [1]
Drug Discovery: large-scale deployment of virtual screening software to identify potential inhibitors
1. Avian Flu: DC2 refined (GVSS)
– 8 avian-flu mutant targets from EGEE DC2
– 20,000 highest-scored ligands from EGEE DC2 results
2. Dengue fever (GVSS)
– Dengue NS3 target
– 300,000 ligands prepared from ZINC
Number of dockings | CPU time | Duration | CPU-cores used on EUAsia VO | Produced data size | Status
120,000 | 3 years | 25 days | 125 | 12.8 GB | Completed
2300,… | … years | 2 months | 268 | 46 GB | Completed
ASGC, AdMU, UPM, CESNET, MIMOS, IAMI, HAII

Fight diseases [2]
3. Vietnamese natural products (WISDOM)
Goal: grid-enable the study of the potential inhibitory action, on important diseases, of chemical compounds extracted from Vietnamese natural products
Status:
– A chemist from INPC spent one month in France (CNRS, HealthGrid)
– Definition of a project-specific structure for the WISDOM Information System (based on the AMGA metadata catalogue)
– Workflow definition: 1) first docking, 2) QSAR, 3) second docking
– "Docking grid service" deployed on the WISDOM Production Environment
Plans:
– Collect information about all chemicals extracted from Vietnamese natural compounds to populate the Information System
– Select targets of interest
– Deploy large-scale virtual screening on the grid
INPC (outside consortium), HealthGrid, CNRS

Monitor diseases
g-Info: Grid-based International Network for Surveillance, to dynamically analyze the influenza molecular biology data made available on public databases
Components: NCBI Influenza Virus Resource, AMGA, Muscle, GBlocks, PhyML
IFI, HealthGrid, CNRS

Share medical data
Help physicians overcome practical issues such as storing, sharing, exchanging and analysing images securely using grid technology
Telemedicine: HOPE is a collaborative platform for doctors and physicians (sharing and simulating)
– Grid site (UI, CE, SE, LFC, WN1, WN2) at IAMI, connected to the VinaREN network
– HOPE installed and configured on the grid node
Medical image compression: decrease the size of medical images without any loss of information, using a JPEG 2000 compression encoder
ITB, IAMI

Integrate Knowledge
Development of Dementia Brain (DBRAIN): integration of gene discovery, protein prediction and drug discovery in a multi-agent system for the diagnosis, therapeutics and treatment of dementia-affected people
Grid-enabled research network and infostructure of University of Malaya (GeRaNIUM): grid-based platform for researchers to access their collective resources, skills, experiences and results in a secure, reliable and scalable manner
– Examples: Neurosciences (altered behaviour in neurodegenerative diseases and addictions); grid initiatives for health, biomedical, disease ecology and biodiversity
IIUM, USM, UTM, UTP, MIMOS, UM


Conclusion
– The grid is well suited to address Life Sciences research and important health concerns of our society
– The grid provides the research community with:
– Essential data components (genomic sequences, metadata…)
– Analytical and modelling capabilities
– An interoperable virtual environment for high-performance distributed computation
– The EUAsiaGrid project intends to bring together European and Asian scientists to strengthen such research and find solutions using the grid

CONTACTS
Ana Lucia DA COSTA
Yannick LEGRÉ
Vincent BRETON
THANKS FOR YOUR ATTENTION