Milanesi Luciano EGEE User Forum, Clermont-Ferrand, France 11-14 February, 2008 BioinfoGRID Project Milanesi Luciano National Research Council Institute.

Presentation transcript:

BioinfoGRID Project
Milanesi Luciano, National Research Council, Institute of Biomedical Technologies, Milan, Italy
EGEE User Forum, Clermont-Ferrand, France, 11-14 February 2008

Milanesi Luciano BioinfoGRID Symposium, Milan December
Networks of resources
New biological and biomedical technology platforms, combined with HPC and GRID technology, will be particularly useful for dealing with the increasing amount, complexity, and heterogeneity of biological and biomedical data. Bioinformatics applications for eHealth have become an ideal research area where computer scientists can apply and further develop new intelligent computation methods, in both experimental and theoretical cases.

BioinfoGRID Project
BioinfoGRID Project web site:

Consortium

BioinfoGRID Objectives
Objective of the BioinfoGRID project

Interaction with related projects
At present the BioinfoGRID project has established co-operation with the following projects and initiatives: EGEE, BELIEF, EMBRACE, EUCHINAGRID, EUMEDGRID, EELA, DILIGENT, ICEAGE, LITBIO, LIBI, HEALTHGRID, WISDOM.

BioinfoGRID Work Packages
Work-package No    Work Package title
WP1                Genomics Applications in GRID
WP2                Proteomics Applications in GRID
WP3                Transcriptomics Applications in GRID
WP4                Database and Functional Genomics Applications
WP5                Molecular Dynamics Applications
WP6                Coordination of technical aspects and relation with Grid infrastructure Projects, user training, application support and resources integration
WP7                Dissemination and Outreach
WP8                Project Management Office

WP1 – Genomics Applications
HUSAR Program Package
[Diagram: the HUSAR package combines GCG, EMBOSS, databases, SRS (Sequence Retrieval System), in-house developments, and third-party programs. Annotations in the diagram: ~130 programs; ~150 programs; >300; prompt updates (daily, weekly); own programs; automated tasks.]

WP1 – Genomics Applications
Integrating W3H, SoapLab and the GRID
[Diagram labels: SoapLab (WebService); W2H (HTML); Interface; Grid API; W3H analysis tasks; Grid CE; Solaris (OS) running "% formatdb …" and "% blastall …"; ScLinux (OS) with the Grid Client toolkit running "% submit_formatdb …" and "% submit_blastall"; ssh target setup, or anywhere else; preliminary setup; any more software?]

WP2 – Proteomics Applications
Perform functional protein analysis on the GRID by applying functional protein domain annotation to large protein families, using GRID resources and related databases. All 518 human protein kinases and 5129 proteins from the non-redundant chain set of the Protein Data Bank were analyzed with the InterProScan applications.

WP2 – Proteomics Applications
Protein surface calculation in GRID: the grid was used to compute the volumetric description of the proteins, obtaining a precise representation of the corresponding surface. Protein interactions could then be quickly screened by means of surface analysis.
– The ProSite domains were analyzed all-against-all
– ATP-E against its inhibitor
– Collagen against integrin
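An all-against-all screening grows quadratically with the number of domains, so it is usually split into independent batches before submission. The sketch below is not the project's code: it assumes unordered pairs of distinct items, and the identifiers and batch size are hypothetical values chosen only for illustration.

```python
from itertools import combinations, islice


def pair_batches(items, batch_size):
    """Yield lists of unordered pairs, with at most batch_size pairs per list.

    Each batch could be handed to one independent grid job (an assumption).
    """
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    pairs = combinations(items, 2)  # every unordered pair exactly once
    while True:
        batch = list(islice(pairs, batch_size))
        if not batch:
            return
        yield batch


# Hypothetical domain identifiers, for illustration only.
domains = ["D1", "D2", "D3", "D4", "D5"]
batches = list(pair_batches(domains, batch_size=4))
total_pairs = sum(len(batch) for batch in batches)
```

For n items this produces n(n-1)/2 pairs, so five items give ten pairs spread over three batches.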

WP3 – Transcriptomics applications
Phylogenetics: Reconstructing the evolutionary history of a group of taxa is a major research thrust in computational biology and a standard part of exploratory sequence analysis. An evolutionary history not only gives relationships among taxa, but is also an important tool for inferring structural, physiological, and biochemical properties of sequences from other similar sequences, and for reconstructing tissue evolution.

WP4 – Databases & Genomics Applications
Work Package 4: Databases and Functional Genomics Applications
– Testing the main biological databases in the Grid environment
  - optimization of storage space, bandwidth, download time
– Testing performance and scalability of database-based applications
  - performance/scalability testing according to various use cases and submission algorithms
– 1 challenge: Gene Analogous Finder
  - 55+ years of computation on a single CPU, not feasible in a local environment.
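The "55+ years on a single CPU" figure can be turned into a rough wall-clock estimate for a grid run. The sketch below is a back-of-the-envelope calculation only: the worker count and the efficiency factor are hypothetical inputs, not numbers reported in the slides, and it assumes the workload divides evenly across workers.

```python
def ideal_wall_days(cpu_years, workers, efficiency=1.0):
    """Estimate wall-clock days for a workload that splits evenly.

    cpu_years  -- total single-CPU time in years
    workers    -- number of CPUs running concurrently (hypothetical)
    efficiency -- fraction of ideal speed-up actually achieved, in (0, 1]
    """
    if workers < 1:
        raise ValueError("workers must be at least 1")
    if not 0.0 < efficiency <= 1.0:
        raise ValueError("efficiency must be in (0, 1]")
    cpu_days = cpu_years * 365.25
    return cpu_days / (workers * efficiency)


# 55 CPU-years spread over 1000 hypothetical workers at ideal efficiency.
days_ideal = ideal_wall_days(55, 1000)
# The same workload if only half of the ideal speed-up is reached.
days_half = ideal_wall_days(55, 1000, efficiency=0.5)
```

Queue times, failures, and database downloads all lower the effective efficiency, which is why the real figure has to be measured rather than assumed.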

WP4 – Databases handling
GridDBManager
– Automatic Updater
  - Timer-based monitoring and update of Grid-ported databases
– Adaptive replica manager
  - Constantly adapts the number of replicas in relation to the usage of each database in the last 10 days
– Version Regression
  - Keeps patches on the Grid to allow regression of each database to an earlier version
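The slides do not say how the adaptive replica manager maps usage to a replica count. One plausible policy, sketched here purely as an assumption, counts accesses inside a 10-day window and scales the replica count proportionally within fixed bounds. The function name, the accesses-per-replica ratio, and the bounds are all hypothetical.

```python
import math
from datetime import date, timedelta


def target_replicas(access_dates, today, window_days=10,
                    accesses_per_replica=100,
                    min_replicas=1, max_replicas=20):
    """Return a replica count proportional to recent usage.

    Only accesses within the last window_days (including today) count.
    The ratio and the bounds are illustrative assumptions.
    """
    cutoff = today - timedelta(days=window_days)
    recent = sum(1 for d in access_dates if cutoff < d <= today)
    wanted = math.ceil(recent / accesses_per_replica)
    return max(min_replicas, min(max_replicas, wanted))


today = date(2007, 12, 10)
# Hypothetical access log: 250 recent accesses and 40 old ones.
log = [date(2007, 12, 5)] * 250 + [date(2007, 11, 1)] * 40
replicas_busy = target_replicas(log, today)
replicas_idle = target_replicas([], today)
```

Keeping a lower bound of one replica means an unused database stays available, while the upper bound limits storage consumption on the grid.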

WP4 – Methods - GridDBManager

WP4 – Methods - DBApp Perf. Testing
Testing performance and scalability of Database-Oriented Bioinformatics Applications (DBApp) in the EGEE GRID
– Testing Performance and Scalability
  - Grid: too many variables (queue time, database download time, queue failures, execution failures)
  - Submission mode: too many variables (number of jobs, rate-limiting settings, resubmission algorithm)
  - Application: too many variables (performance of specific application, location of database)
  - Probing of Grid performance
  - Numeric simulation for all algorithms
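A numeric simulation of a submission strategy can be as simple as a seeded Monte Carlo run. The sketch below is not the project's simulator: it assumes exponentially distributed queue times, a fixed per-attempt failure probability, failures detected as soon as a job leaves the queue, unlimited parallel slots, and a resubmission algorithm that simply retries up to a fixed number of attempts. Every parameter value is hypothetical.

```python
import random


def simulate_campaign(n_jobs, failure_prob, mean_queue_s, run_s,
                      max_attempts, seed=0):
    """Simulate one submission campaign; return (makespan_s, unfinished).

    All jobs are submitted at time 0 and progress independently.
    A failed attempt costs only its queue time (an assumption).
    """
    rng = random.Random(seed)  # seeded for reproducible runs
    finish_times = []
    unfinished = 0
    for _ in range(n_jobs):
        clock = 0.0
        for _attempt in range(max_attempts):
            clock += rng.expovariate(1.0 / mean_queue_s)  # queue wait
            if rng.random() >= failure_prob:  # attempt succeeded
                clock += run_s
                finish_times.append(clock)
                break
        else:
            unfinished += 1  # every attempt failed
    makespan = max(finish_times) if finish_times else 0.0
    return makespan, unfinished


# Hypothetical settings: 150 jobs, 20% failures, 10 min mean queue time.
makespan, lost = simulate_campaign(150, 0.2, 600.0, 3600.0, max_attempts=3)
```

Running the same function with different `max_attempts` or failure rates gives a quick way to compare resubmission policies before spending real grid time.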

WP4 – Methods - DBApp Perf. Testing
Probing Grid performance (Example)
– Grid queue times and reliability
  - Sent 150 jobs in 3 groups of 50 at different times

WP5 – Molecular docking
Viral neuraminidase is considered a valid target for antiviral drugs.

WP5 – Molecular docking
Docking: predict how small molecules bind to a receptor of known 3D structure
There are successful examples
– rapid,
– cost effective…
But there are limitations
– CPU and storage needed
More specific talk by Ana Lucia Da Costa, Wednesday 13th, 11:15 – Room: Bordeaux

WP7 – Dissemination
The following series of events were specifically associated with or organized by the BioinfoGRID project:
– BioinfoGRID Symposium 2007: December 10th-13th 2007, Milan
– BioinfoGRID Session at EGEE '07: October 4th 2007, Budapest
– Biomed Grid School, Varenna, Italy, May 14th-19th 2007
– BioinfoGRID Workshop at Healthgrid 2007 Conference, Geneva, Switzerland, 24th April 2007
– NETTAB 2006 Workshop: Distributed Applications, Web Services, Tools and GRID Infrastructures for Bioinformatics, Santa Margherita di Pula, Sardinia, Italy, July 2006
– BioinfoGRID Initial Training Course, Bari, Italy, March 8th-10th 2006
In addition, the BioinfoGRID project has been represented at 58 national and international conferences and workshops.

WP7 – Dissemination
24 journal articles written within the frame of the BioinfoGRID project:
– 9 - BMC Bioinformatics
– 4 - IEEE Transactions on Nanobioscience
– 3 - Studies in Health Technology and Informatics
– 1 - Journal of Parallel and Distributed Computing
– 1 - Journal of Chemical Information and Modeling
– 1 - Parallel Computing
– 1 - Int. J. of Bioinformatics Research and Applications
– 1 - IEEE Transactions on Systems Science and Applications
– 1 - Nucleic Acids Research
– 1 - BMC Genetics
– 1 - Bioinformatics

WP7 – Dissemination
19 conference proceedings papers produced within BioinfoGRID:
– 6 - NETTAB '06
– 2 - EGEE User Forum 06/07
– 2 - BITS '06
– 2 - HPDC '07
– 1 - EGEE 06/07
– 1 - CAPI 2006
– 1 - Bioinformatics of African Pathogens and Disease Vectors, Nairobi 2007
– 1 - MAS-BIOMED '06 Workshop
– 1 - CCGrid '07 Symposium
– 1 - EvoBIO '08
– 1 - CHEP '07

People Acknowledgments
Cristina Aiftimiei, Roberta Alfieri, Claudio Arlandini, Roberto Barbera, Endre Barta, Francesco Beltrame, Attila Bende, Chiara Bishop, Christophe Blanchet, Ignacio Blanquer, Vincent Bloch, Gianpaolo Bottoni, Vincent Breton, Andrea Calabria, Andrea Caprera, Tiziana Castrignanò, Federica Chiappori, Dario Corrada, Paolo Cozzi, Stefano Cozzini, Enza D’Alba, Pasqualina D’Ursi, Ana Da Costa, Paride Dagna, Giulia De Sario, Davide Di Pasquale, Giacinto Donvito, Vihang Dudhalkar, Peter Ernst, David Fergusson, Geraldine Fettahi, Sandro Fiore, Riccardo Gervasoni, Karl-Heinz Glatting, John Hatton, Ally Hume, Nicolas Jacq, Atul Jain, Miklos Kozlovszky, Giuseppe La Rocca, Yannick Legré, Pietro Liò, Charles Loomis, Mario Marchisio, Hajnal Marton, Rafael Mayo Garcia, Mirco Mazzucato, Giovanni Meloni, Ivan Merelli, Emanuela Merelli, Luciano Milanesi, Elisa Molinari, Ettore Mosca, Georgina Moulton, Loukas Moutsianas, Tibor Nagy, Alessandro Negro, Laszlo Oroszi, Alessandro Orro, Giovanni Paolella, Silvano Paoli, Antonio Pierro, Giorgio Pietro Maggi, Marco Pirola, Raffaele Ponzini, Ivan Porro, Paolo Ramieri, Paolo Romano, Ermanna Rovida, Erika Salvi, Jean Salzemann, Diego Sardaci, Salvatore Scifo, Martin Senger, Giuliano Taffoni, Livia Torterolo, Gabriele Trombetti, Angelica Tulipano, Vania Ugè, Elizabeth van der Wath, Richard van der Wath, Kasam Vinod, Federica Viti, Guy Warner, Ted Wen, Pierfrancesco Zuccato

Projects Acknowledgements
EUGRID ISS e G