Enabling Grids for E-sciencE INFSO-RI-508833 When databases met the Grid A New Data Source Oriented CE for GRID G. Taffoni (2,3), E. Ambrosi (1), A. Barisani.

Slides:



Advertisements
Similar presentations
INAF experience in Grid projects C. Vuerli, G. Taffoni, V. Manna, A. Barisani, F. Pasian INAF – Trieste.
Advertisements

LEAD Portal: a TeraGrid Gateway and Application Service Architecture Marcus Christie and Suresh Marru Indiana University LEAD Project (
WP2: Data Management Gavin McCance University of Glasgow November 5, 2001.
The National Grid Service and OGSA-DAI Mike Mineter
Data Management Expert Panel - WP2. WP2 Overview.
Data Management Expert Panel. RLS Globus-EDG Replica Location Service u Joint Design in the form of the Giggle architecture u Reference Implementation.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
Seminar Grid Computing ‘05 Hui Li Sep 19, Overview Brief Introduction Presentations Projects Remarks.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Globus Toolkit 4 hands-on Gergely Sipos, Gábor Kecskeméti MTA SZTAKI
Universität Dortmund Robotics Research Institute Information Technology Section Grid Metaschedulers An Overview and Up-to-date Solutions Christian.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
Workload Management Massimo Sgaravatto INFN Padova.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
WP3 RGMA Deployment Laurence Field / RAL Steve Fisher / RAL.
The Japanese Virtual Observatory (JVO) Yuji Shirasaki National Astronomical Observatory of Japan.
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks G-ICS Grid–Instrument Control System C. Vuerli.
Instruments and Sensors in Grid – ICTP Trieste – 23 April Instruments in Grid: the New Instrument Element C. Vuerli (1,2), G. Taffoni (1,2), I.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
L ABORATÓRIO DE INSTRUMENTAÇÃO EM FÍSICA EXPERIMENTAL DE PARTÍCULAS Enabling Grids for E-sciencE Grid Computing: Running your Jobs around the World.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
DataGrid WP1 Massimo Sgaravatto INFN Padova. WP1 (Grid Workload Management) Objective of the first DataGrid workpackage is (according to the project "Technical.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Information System on gLite middleware Vincent.
London e-Science Centre GridSAM Job Submission and Monitoring Web Service William Lee, Stephen McGough.
A monitoring tool for a GRID operation center Sergio Andreozzi (INFN CNAF), Sergio Fantinel (INFN Padova), David Rebatto (INFN Milano), Gennaro Tortone.
- Distributed Analysis (07may02 - USA Grid SW BNL) Distributed Processing Craig E. Tull HCG/NERSC/LBNL (US) ATLAS Grid Software.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks, An Overview of the GridWay Metascheduler.
AN INTEGRATED FRAMEWORK FOR VO-ORIENTED AUTHORIZATION, POLICY-BASED MANAGEMENT AND ACCOUNTING Andrea Caltroni 3, Vincenzo Ciaschini 1, Andrea Ferraro 1,
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
INFSO-RI Enabling Grids for E-sciencE OSG-LCG Interoperability Activity Author: Laurence Field (CERN)
Scientific Data Grid & China-VO Kai Nan Computer Network Information Center Chinese Academy of Sciences November 27, 2003.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Enabling Grids for E-sciencE Astronomical data processing workflows on a service-oriented Grid architecture Valeria Manna INAF - SI The.
E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA gLite Information System Pedro Rausch IF.
DGC Paris WP2 Summary of Discussions and Plans Peter Z. Kunszt And the WP2 team.
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
SAM Sensors & Tests Judit Novak CERN IT/GD SAM Review I. 21. May 2007, CERN.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America gLite Information System Claudio Cherubino.
Bridging VO and the GRID (a successful experience) Giuliano Taffoni INAF - OATs Trieste Italy.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid2Win : gLite for Microsoft Windows Roberto.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
1 AHM, 2–4 Sept 2003 e-Science Centre GRID Authorization Framework for CCLRC Data Portal Ananta Manandhar.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Data and storage services on the NGS.
INFSO-RI Enabling Grids for E-sciencE Policy management and fair share in gLite Andrea Guarise HPDC 2006 Paris June 19th, 2006.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Gennaro Tortone, Sergio Fantinel – Bologna, LCG-EDT Monitoring Service DataTAG WP4 Monitoring Group DataTAG WP4 meeting Bologna –
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Jean-Philippe Baud, IT-GD, CERN November 2007
Grid Computing: Running your Jobs around the World
Database System Concepts and Architecture
Sergio Fantinel, INFN LNL/PD
Report on GLUE activities 5th EU-DataGRID Conference
Google Sky.
Presentation transcript:

Enabling Grids for E-sciencE INFSO-RI When databases met the Grid A New Data Source Oriented CE for GRID G. Taffoni (2,3), E. Ambrosi (1), A. Barisani (2,3), C. Vuerli (2,3) (1) INFN – CNAF (2) INAF – Informative Systems Unit (3) INAF – Astronomical Observatory of Trieste

Enabling Grids for E-sciencE INFSO-RI Project motivation Limits currently present in the Grid environment: the Grid is able to execute binary code or shell scripts and stores files Main focus on the extension of the existing Resource Manager of Globus for providing transparent access to heterogeneous DS and DSE such as a relational, flat and XML DBMS, file system, document store, content management system, or other

Enabling Grids for E-sciencE INFSO-RI Data Source Engine Concepts Until now, only two types of computational resources, the hardware machines and batch queuing system, have been taken into account as valid Resource Framework Layer instances. Different types of virtual computing machines exist such as Java Virtual Machine, Parallel Virtual Machine and Data Source Engine. The Grid Information System and Data Model have been used to represent hardware computing machines only; a software computational machine resource however can also be represented. We extend the Grid Resource Framework Layer, Information System and Data Model so that a software virtual machine as a Data Source Engine becomes a valid instance for a Grid computing model. We define therefore a new Grid component (G-DSE) that enables the access to a Data Source Engine and Data Source, totally integrated with the Grid Monitoring and Discovery System and Resource Broker. The new Grid Element, finally, can be built on top of the G-DSE component. the Query Element

Enabling Grids for E-sciencE INFSO-RI Data Source Engine Problems May exist a pool of Data Sources on which USERs‘ Query/Transaction can be processed through some Engine (DSE) may be possible a Mapping for Query/Transaction on more LOCAL DSE to a Data Source must be possible to Grant/Revoke all LOCAL DSEs needed for a mapped Query/Transaction must be possible to enter Query/Transaction running state with specialized mechanisms must be possible to Grant/Revoke additional DSE resources request based on DATA SET, TABLE SPACE, etc., at run time may be possible a Query/Transaction Spawning for JOIN intra/inter G-DSEs must be possible to Send Query/Transaction on a DSE must be possible to Receive Query/Transaction from a DSE must be possible to enter Termination requests for cancelling Query/Transaction must be possible to make Query/Transaction Match-Making based on specific Knowledge Base of DSE and DS

Enabling Grids for E-sciencE INFSO-RI G-DSE Model

Enabling Grids for E-sciencE INFSO-RI Globus G-DSE integration gatekeeper JobMangerQueryManger JobProcessQueryProcess Scheduler p-in Pbs/LFS query plug-in Query DB specific driver GRAM GIS RDBMS MDS GRIS Ldap ldif RDBMS Grid Providers (snmp)

Enabling Grids for E-sciencE INFSO-RI Data Source Engine Grid Access Enabled G-DSE supports Data Source (DS) and DSE indexing, monitoring, management and recovery through a rich set of Meta-Data bound to standard GIS. DS have their core engine into G-DSE, that provides a framework for activity and task management. A RSL/JDL Transaction/Query permits a number of tasks to be specified, together with their parameters, inputs, outputs and control flow. The response to a request is generated by the GDSE within a JobQueryManager Session. The GDSE analyses incoming Task and conducts authentication and authorisation The standard Grid WorkLoad Manager constructs an optimised execution graph. GIS will monitor a DS’s and DSE’s status digest produced by its internal monitor. The GDSE has been designed to support dynamic configuration, sessions, transactions, recovery and concurrency.

Enabling Grids for E-sciencE INFSO-RI From G-DSE to QE With the “G-DSE enabled” collective and resource Grid layers, the new QE (Query Element) Grid Element is built QE is a CE able to manage Queries and to connect RDBMS resources via Grid. UI/QE interactions trough a STANDARD LANGUAGE Fully Grid integrated: GIS + RB It is not a metadata service…but can be used for metadata

Enabling Grids for E-sciencE INFSO-RI QE implementation Runs on any linux/unix flavor: GT>2.4.3 Backbends: any DB vendor (MySQL, Oracle, PostgreSQL etc…) + flat files Two protocols: GRAM or WS API: C, C++, python, Java, perl GDSE GRAM SOAP

Enabling Grids for E-sciencE INFSO-RI QE Security Access control using GSI and VOMS –The certificate + roles identify the user permissions on DB And so on… Super user: crate, modify, admin, grant and revoke users…. ANYTHING!!! Standard user: select+ insert Simple user: select

Enabling Grids for E-sciencE INFSO-RI QE Authorization Create granularity in access control: –Permissions on whole DB, on one table, on one row;  “rw” or “ro” or “rwx” etc… –“delegation” The super user from a UI can modify the acl or delegate a new user to do it.

Enabling Grids for E-sciencE INFSO-RI QE Features The present –Interactive query –Off line access –Parallel sql to many DSE: SIMD, MIMD –Redirect output to a SE –Discovery system: BDII integrated. The future –WMS integration –Workflow implementation QE

Enabling Grids for E-sciencE INFSO-RI G-DSE Workflow

Enabling Grids for E-sciencE INFSO-RI QE replication Scalability Reliability Two implementations: –DB replica –Data replica Performances –SIMD –MIMD Partial replica

Enabling Grids for E-sciencE INFSO-RI QE language RSL(SQL) –Globus programs:  Globus-job-run g.dse.name/dbmanager-ODBC -queue ORADB “select a from onetable where…;”  Globusrun -s -r g.dse.name ‘&(executable=“select a from onetable where.. ;”)….  Globus-job-submit g.dse.name/dbmanager-ODBC -queue ORADB “select a from onetable where…;” UI implementation  glite-query-submit -h g.dse.mane -sql “select a from onetable where.. ;” -o gsiftp://se.name/tmp/out.dat  glite-query -h g.dse.mane > interactive usage.

Enabling Grids for E-sciencE INFSO-RI QE & the IS As CE, QE publish its status and its description on GRIS and BDII QE == CE + SE + WN CE publish the status of computational Resource: free CPUs, Memory, queues etc. QE publish: DMBS, load of each DB, number of transactions for DB, table space etc. QE may publish a set of metadata used to locate the DSE resource.

Enabling Grids for E-sciencE INFSO-RI QE BDII ldapsearch -LLL -x -H g.dse.host -b "mds-vo- name=site,o=grid” dn:GlueDSEUniqueID=g.dse.host:2119/dbmanager- ODBC, mds-vo-name=local,o=grid objectClass: GlueCETop objectClass: GlueCE objectClass: GlueDSE objectClass: GlueDSETop objectClass: GlueKey GlueDSEName: TESTDB GlueDSEStateStatus: Production GlueDSEInfoLRMSType: Postgresql GlueDSEInfoLRMSVersion:7.3

Enabling Grids for E-sciencE INFSO-RI QE use case: AstroDB The International Virtual Observatory Alliance (IVOA) was formed in June 2002 with a mission to facilitate the international coordination and collaboration necessary for the development and deployment of the tools, systems and organizational structures necessary to enable the international utilization of astronomical archives as an integrated and interoperating virtual observatory.

Enabling Grids for E-sciencE INFSO-RI The two Grids GRID VO

Enabling Grids for E-sciencE INFSO-RI G-DSE trigger No access to astronomical databases from the Grid middleware chosen for the GRID.IT project Some possible solutions evaluated (Spitfire, OGSA- DAI) A tight correlation between data access/management and computation of paramount importance within the Astronomical Community Collaboration started by INFN and INAF to design, implement and test the G-DSE

Enabling Grids for E-sciencE INFSO-RI AstroDBs Astronomical Resources –Catalogues & Archives structured in DB –Complex DBs Astronomical Standards –VOResource –VOTable –VOQL –SOAP Computational Operations, statistical analysis etc on DataBases –Ex. exploring the values at the extreme of some distribution can take 2 hours

Enabling Grids for E-sciencE INFSO-RI IVOA use case Verify the ability of the GIS to act as VObs Resource: resource localization/description; Astronomical Resource XSD => Astronomical Resource GLUE; BDII ~= Registry Query Language for Registry => ClassAD Requirements; VOQL (XML) query; BDII GlueAstroIdentity = Identifier GlueAstroPublisher=Curation.P ublisher GlueAstroCreator=Curation.Cr eator.Name ETC… g ……..… List of Available resources GlueSchema IVOA compliant info set, based on new set of information providers. GlueSchema IVOA compliant info set, based on new set of information providers.

Enabling Grids for E-sciencE INFSO-RI Complex Workflow Grid Services query exec QE Query (VOQL) CE exec Query(VOQL) CE exec API for applications: work in progress. IVOA Query (VOQL) exec IVOA Resource IVOA Registry Internet Joining VO and Grid: from the Grid to VO.

Enabling Grids for E-sciencE INFSO-RI G-DSE Future Plans G-DSE integrated in the WMS G-DSE is going now to be installed in all sites within the VO INAF (extended tests of G-DSE within that VO in a short time). G-DSE installed and under test at INFN Bari (bioinformatics group). ArcheoGrid G-DSE as FileStorage (some tests done…)

Enabling Grids for E-sciencE INFSO-RI End of Presentation Thank you for your attention