Presentation is loading. Please wait.

Presentation is loading. Please wait.

Plateforme de Calcul pour les Sciences du Vivant A Service for Biological Database Replication and Update Jean Salzemann – LPC.

Similar presentations


Presentation on theme: "Plateforme de Calcul pour les Sciences du Vivant A Service for Biological Database Replication and Update Jean Salzemann – LPC."— Presentation transcript:

1 Plateforme de Calcul pour les Sciences du Vivant http://clrpcsv.in2p3.fr A Service for Biological Database Replication and Update Jean Salzemann – LPC IN2P3/CNRS

2 CERN - EGEE User Forum 2 Plateforme de Calcul pour les Sciences du Vivant March 1 st 2006 RUGBI: french project financed by the Gen'homme network: –Grid for biologists –Based on existing technologies (Web Services, globus toolkit 4, native XML databases) –3 sites in France: Grenoble, Lyon, Clermont-Ferrand Biologists are using, most of the time flat files databases, available on ftp repositories. These databases are changing and growing constantly and therefore need regular updates in order to keep the most up to date version available. This service, is an applicative service, integrable in a grid environment, which performs automatically regular updates and propagate them through the grid. Introduction

3 CERN - EGEE User Forum 3 Plateforme de Calcul pour les Sciences du Vivant March 1 st 2006 Service concept Master Service: – Get the information from the information system (Controller) – Compare the states of the databases – Download the differences – Notify the clients Client Service: – Get the information from the information system – Download the differences Grid Compare and download download Inform SE Controler Implemented in java as web Services and tcp socket. Compatible with Axis, Globus Toolkit 3, Globus Toolkit 4. SER Ftp Server

4 CERN - EGEE User Forum 4 Plateforme de Calcul pour les Sciences du Vivant March 1 st 2006 General Architecture in Rugbi Grid SE SE de reference Query and update of information Information System Database Finder Register/ Unregister Delete Callback Grid FTP 2811 Update Database Service Master Update Database Service Client 8080

5 CERN - EGEE User Forum 5 Plateforme de Calcul pour les Sciences du Vivant March 1 st 2006 Main Steps of the process 1. The SER updates its repository and notifies the clients (Comparison + download) 2. The SE gets the notification and download the updates with GridFTP. 3.The SER ask for a REGISTER of the new database and an UNREGISTER of the old version. 4. The SE notifies the success of the deployment to the SER 5. The SER is waiting for a deletion notification of the old version, when it is received, it deletes the old database and propagates this notification through the grid.

6 CERN - EGEE User Forum 6 Plateforme de Calcul pour les Sciences du Vivant March 1 st 2006 The databases –Swissprot, 700 MB – Trembl, 2.4 GB – Pdb, 2.9 GB – Kegg, 13 GB – Embl, 476 GB, 180 GB (release, without annotations) Possibility to add new databases. The databases are described as dynamical XML sheets, containing all the necessary information to make each step of the process. The Data

7 CERN - EGEE User Forum 7 Plateforme de Calcul pour les Sciences du Vivant March 1 st 2006 Pre-deployment XML exemple <install required_architecture="none" required_dbms="none" required_mb_space="200000" required_platform="none">

8 CERN - EGEE User Forum 8 Plateforme de Calcul pour les Sciences du Vivant March 1 st 2006 Deployment with LCG User Interface (Update Service) RLS FTP SERVER SE Copy and registration lcg-cr Comparison and download


Download ppt "Plateforme de Calcul pour les Sciences du Vivant A Service for Biological Database Replication and Update Jean Salzemann – LPC."

Similar presentations


Ads by Google