Introduction on R-GMA Shi Jingyan Computing Center IHEP.


Content
- R-GMA
  - R-GMA concept
  - R-GMA components
- Accounting system

Relational Grid Monitoring Architecture: introduction
- Models the information infrastructure of a Grid as a set of Consumers (which request information), Producers (which provide information) and a single Registry (which mediates the communication between Producers and Consumers).
- Imposes a standard query language (a subset of SQL): a producer publishes tuples with an INSERT statement; a consumer queries tuples with a SELECT statement.
- All tuples carry a time-stamp to support monitoring.
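The INSERT/SELECT model above can be illustrated with a minimal sketch. It is not the R-GMA API: an in-memory SQLite database stands in for one virtual table, and the table name `cpuLoad`, its columns, and the helpers `publish` and `query` are all assumptions made for illustration.

```python
import sqlite3
import time

# In-memory SQLite stands in for one R-GMA virtual table (illustration only).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE cpuLoad (site TEXT, load REAL, MeasurementTime REAL)")

def publish(site, load):
    """Producer side: publish one tuple with INSERT, stamped with the current time."""
    conn.execute(
        "INSERT INTO cpuLoad (site, load, MeasurementTime) VALUES (?, ?, ?)",
        (site, load, time.time()),
    )

def query(sql, params=()):
    """Consumer side: run a SELECT and return the matching tuples."""
    return conn.execute(sql, params).fetchall()

publish("IHEP", 0.42)
publish("RAL", 1.30)
rows = query("SELECT site, load FROM cpuLoad WHERE load > ? ORDER BY site", (1.0,))
print(rows)
```

In R-GMA itself the tuples would be held by producers spread across the Grid rather than in one local database; the sketch only shows the shape of the two statements.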

R-GMA Introduction (cont.)

Architecture: [diagram not reproduced in the transcript]

R-GMA Introduction (cont.)
- The information resources of a VO appear as a single virtual database containing a set of virtual tables.
- A single schema contains the name and structure of each virtual table in the system.
- A single registry contains, for each table, a list of the producers that have offered to publish rows for that table.
- A consumer runs an SQL query against a table, and the registry selects the best producers to answer the query in a process called mediation.
- The consumer then contacts each producer directly, combines the information, and returns a set of tuples.
- The mediation process is hidden from the user.
- There is no central repository holding the contents of the virtual table.
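A minimal sketch of the lookup-then-contact flow described above. The class and function names (`Producer`, `Registry`, `consume`) are hypothetical, and the "best producer" selection is reduced to returning every registered producer, since the slides do not say how the registry ranks them.

```python
from dataclasses import dataclass, field

@dataclass
class Producer:
    name: str
    rows: list  # tuples this producer holds for one table

    def answer(self, predicate):
        return [r for r in self.rows if predicate(r)]

@dataclass
class Registry:
    # table name -> producers that have offered to publish rows for it
    offers: dict = field(default_factory=dict)

    def register(self, table, producer):
        self.offers.setdefault(table, []).append(producer)

    def mediate(self, table):
        # Assumption: a real registry would choose among producers; this returns all.
        return list(self.offers.get(table, []))

def consume(registry, table, predicate):
    """Ask the registry which producers to contact, then query each one directly."""
    result = []
    for producer in registry.mediate(table):
        result.extend(producer.answer(predicate))
    return result

registry = Registry()
registry.register("cpuLoad", Producer("ihep", [("IHEP", 0.4)]))
registry.register("cpuLoad", Producer("ral", [("RAL", 1.3), ("RAL", 0.2)]))
busy = consume(registry, "cpuLoad", lambda r: r[1] > 0.3)
print(busy)
```

Note that the registry never sees the tuples themselves; it only holds the list of who publishes to which table, which is why no central repository is needed.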

R-GMA Introduction (cont.)
Producers:
- Primary producer: the user's code periodically inserts tuples, which are then stored internally by the producer. The producer answers consumer queries from its own storage.
- Secondary producer: populates its own storage by running its own query against the virtual table. The user code only sets the process running; the tuples come from other producers.
- On-demand producer: has no internal storage; data is provided by the user code in direct response to a query forwarded to it by the producer service.
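The difference between the three producer types lies in where the tuples come from at query time. The sketch below makes that explicit; the classes and their methods are hypothetical stand-ins, not R-GMA interfaces.

```python
class PrimaryProducer:
    """Stores tuples inserted by user code and answers queries from that store."""
    def __init__(self):
        self.store = []

    def insert(self, row):
        self.store.append(row)

    def answer(self, predicate):
        return [r for r in self.store if predicate(r)]

class SecondaryProducer:
    """Fills its own store by querying other producers; user code only starts it."""
    def __init__(self, sources):
        self.sources = sources
        self.store = []

    def refresh(self):
        self.store = [r for s in self.sources for r in s.answer(lambda _: True)]

    def answer(self, predicate):
        return [r for r in self.store if predicate(r)]

class OnDemandProducer:
    """Keeps no store; user code supplies the data each time a query arrives."""
    def __init__(self, fetch):
        self.fetch = fetch

    def answer(self, predicate):
        return [r for r in self.fetch() if predicate(r)]

site_a = PrimaryProducer()
site_a.insert(("a", 1))
site_b = PrimaryProducer()
site_b.insert(("b", 2))
archive = SecondaryProducer([site_a, site_b])
archive.refresh()
live = OnDemandProducer(lambda: [("c", 3)])
```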

R-GMA Introduction (cont.)
- Consumer: each consumer represents a single SQL SELECT query on the virtual database and obtains the answer tuples from the producers after mediation.
- Mediation: the query is first passed to the Registry to identify which producers, for each virtual table in the query, must be contacted to answer it. This process is called mediation.

R-GMA Introduction (cont.)
Types of query:
- Continuous query: all new tuples that match the query are streamed into the consumer's tuple storage as soon as they are inserted into the virtual table by the producers.
- One-time queries:
  - History query: all versions of any matching tuples are returned.
  - Latest query: only the tuples representing the "current state" are returned.
- Static query: a database-like query; the tuples do not contain R-GMA time-stamps.
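A continuous query behaves like a subscription: only tuples inserted after the consumer registers are delivered. A minimal sketch of that behaviour, with a hypothetical `VirtualTable` class that is not part of R-GMA:

```python
class VirtualTable:
    def __init__(self):
        self.subscribers = []  # one (predicate, tuple_store) pair per continuous consumer

    def subscribe(self, predicate):
        """Register a continuous query; returns the consumer's tuple storage."""
        store = []
        self.subscribers.append((predicate, store))
        return store

    def insert(self, row):
        """Stream each newly inserted tuple to every consumer whose query it matches."""
        for predicate, store in self.subscribers:
            if predicate(row):
                store.append(row)

table = VirtualTable()
table.insert(("IHEP", 0.9))            # inserted before subscribing: not delivered
high_load = table.subscribe(lambda r: r[1] > 0.5)
table.insert(("IHEP", 0.7))            # matches: delivered
table.insert(("RAL", 0.1))             # does not match: skipped
print(high_load)
```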

R-GMA Introduction (cont.)
Retention periods:
- LatestRetentionPeriod: inserted into each tuple published by a Primary Producer; it remains there when the tuple is re-published by a Secondary Producer.
- HistoryRetentionPeriod: each producer declares a HistoryRetentionPeriod for each table to which it publishes tuples.
- A latest query returns only those tuples that have not exceeded their LatestRetentionPeriod for the table.
- A history query returns all versions of tuples that have not exceeded the producer's HistoryRetentionPeriod for the table.
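The two retention rules can be sketched as filters over time-stamped tuples. This is an illustration under assumptions: times are plain numbers in seconds, a `key` field identifies which tuples are versions of the same item, and the field names are hypothetical.

```python
def history_query(rows, now, history_retention):
    """All versions whose age is within the producer's HistoryRetentionPeriod."""
    return [r for r in rows if now - r["ts"] <= history_retention]

def latest_query(rows, now):
    """Newest version per key, kept only while within its own LatestRetentionPeriod."""
    newest = {}
    for r in rows:
        if r["key"] not in newest or r["ts"] > newest[r["key"]]["ts"]:
            newest[r["key"]] = r
    return [r for r in newest.values() if now - r["ts"] <= r["latest_retention"]]

rows = [
    {"key": "ce01", "state": "up",   "ts": 100, "latest_retention": 60},
    {"key": "ce01", "state": "down", "ts": 150, "latest_retention": 60},
    {"key": "ce02", "state": "up",   "ts": 80,  "latest_retention": 60},
]
now = 170
latest = latest_query(rows, now)
history = history_query(rows, now, history_retention=80)
```

Here `ce02` drops out of both results because its only tuple is 90 seconds old, while `ce01` contributes one tuple to the latest query and two versions to the history query.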

R-GMA Introduction (cont.)

Web Service Architecture:
- R-GMA conforms to the Web Services Architecture.
- Six principal services: Primary Producer, Secondary Producer, On-demand Producer, Consumer, Registry and Schema.
- Each service has one WSDL document.
- Clients communicate with the services by exchanging messages.
- The message sequence and format are also specified in the WSDL.

R-GMA Introduction (cont.)
- R-GMA uses "SOAP messaging over http/s" in a request/response pattern.
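To show what a SOAP request body looks like in general, the sketch below builds a SOAP 1.1 envelope with the Python standard library. The operation name `createConsumer`, the parameter `select`, and the namespace `urn:example:rgma-consumer` are invented for illustration; the real operations and message formats are the ones defined in each service's WSDL.

```python
import xml.etree.ElementTree as ET

SOAP_NS = "http://schemas.xmlsoap.org/soap/envelope/"  # SOAP 1.1 envelope namespace
SVC_NS = "urn:example:rgma-consumer"                   # hypothetical service namespace

def build_request(operation, params):
    """Wrap one operation and its parameters in a SOAP envelope, returned as text."""
    ET.register_namespace("soapenv", SOAP_NS)
    envelope = ET.Element(f"{{{SOAP_NS}}}Envelope")
    body = ET.SubElement(envelope, f"{{{SOAP_NS}}}Body")
    op = ET.SubElement(body, f"{{{SVC_NS}}}{operation}")
    for name, value in params.items():
        ET.SubElement(op, f"{{{SVC_NS}}}{name}").text = str(value)
    return ET.tostring(envelope, encoding="unicode")

request_xml = build_request("createConsumer", {"select": "SELECT * FROM cpuLoad"})
print(request_xml)
```

In a request/response exchange this text would be sent as the body of an HTTP(S) POST, and the reply would be another envelope carrying the result or a fault.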

Apel: accounting in LCG-2
- The Apel software is composed of the Apel Log Processor and the Flexible Archiver.
- Apel Log Processor: parses log files to extract job information and publishes it using R-GMA.
- Flexible Archiver: located at the Grid Operations Centre (GOC). It receives the data for the accounting table from all sites participating in the R-GMA configuration, so it holds an amalgamation of the accounting data from every site.

Apel: accounting in LCG-2 (cont.)

Apel Log Processor
- Used to parse the GateKeeper and PBS event logs generated at a site.
- The extracted data is pieced together to form an accounting record detailing the owner of a submitted job and the resources used to execute it.
- Accounting records are then published using R-GMA.
- The records are collated into a centralised repository at the GOC using an R-GMA Secondary Producer.
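As a rough idea of the parsing step, the sketch below splits one PBS accounting line into fields. It assumes the usual PBS layout of `date time;record type;job id;key=value ...` and is not Apel's own parser; the sample line is made up for illustration, and values that contain spaces (for example some job names) would need extra handling.

```python
def parse_pbs_record(line):
    """Split one PBS accounting line into header fields and key=value attributes."""
    timestamp, record_type, job_id, message = line.rstrip("\n").split(";", 3)
    attrs = dict(item.split("=", 1) for item in message.split() if "=" in item)
    return {"timestamp": timestamp, "type": record_type, "job_id": job_id, **attrs}

# Illustrative line, not taken from a real log.
sample = (
    "03/24/2005 10:15:02;E;1234.ce.example.org;user=alice group=atlas "
    "queue=short start=1111659000 end=1111659302 Exit_status=0 "
    "resources_used.cput=00:04:10 resources_used.walltime=00:05:02"
)
record = parse_pbs_record(sample)
print(record["user"], record["resources_used.cput"])
```

Only records of type `E` (job end) carry the `resources_used.*` attributes, so an accounting parser would typically filter on that type before building a record.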

Apel Log Processor (cont.)

Parsed log files:
- /var/log/globus-gatekeeper.log
- /var/log/messages
- /var/spool/pbs/server_priv/accounting
Tables used in Apel:
- EventRecords
- GkRecords
- MessageRecords
- SpecRecords
- LcgRecords (published)

Flexible Archiver

Examples – Two Servlets
The first example:
- Provides a web page as the user interface.
- Creates a consumer to show statistics from the accounting data for the date the user provides.

Examples – Two Servlets (cont.)
The second example:
- Creates a primary producer to publish statistics from the accounting data, which can then be queried from the browser servlet provided by the R-GMA software package.

IHEP Accounting plan
- PBS log file: /var/spool/pbs/server_priv/accounting
- A Perl program analyses the log file to generate the DB data.
- A Java program uses a producer to publish the necessary accounting information by joining the DB data.
- The R-GMA server provides the registry function that maintains the virtual table.
- Accounting information is summarised per user.

+-------------------------+-------------+------+-----+---------+-------+
| Field                   | Type        | Null | Key | Default | Extra |
+-------------------------+-------------+------+-----+---------+-------+
| theDate                 | date        | YES  |     | NULL    |       |
| eventID                 | varchar(60) | YES  |     | NULL    |       |
| siteName                | varchar(30) | YES  |     | NULL    |       |
| localUser               | varchar(20) | YES  |     | NULL    |       |
| localGroup              | varchar(20) | YES  |     | NULL    |       |
| jobName                 | varchar(30) | YES  |     | NULL    |       |
| queueName               | varchar(20) | YES  |     | NULL    |       |
| jobCreateTime           | varchar(10) | YES  |     | NULL    |       |
| jobQueuedTime           | varchar(10) | YES  |     | NULL    |       |
| jobEligibleTime         | varchar(10) | YES  |     | NULL    |       |
| startTime               | varchar(10) | YES  |     | NULL    |       |
| endTime                 | varchar(10) | YES  |     | NULL    |       |
| execHOST                | varchar(30) | YES  |     | NULL    |       |
| resource_List_cput      | time        | YES  |     | NULL    |       |
| resource_List_neednodes | varchar(30) | YES  |     | NULL    |       |
| sessionID               | int(10)     | YES  |     | NULL    |       |
| exitStatus              | int(2)      | YES  |     | NULL    |       |
| resources_Used_cput     | time        | YES  |     | NULL    |       |
| resources_Used_mem      | int(16)     | YES  |     | NULL    |       |
| resources_Used_vmem     | int(16)     | YES  |     | NULL    |       |
| resources_Used_walltime | time        | YES  |     | NULL    |       |
+-------------------------+-------------+------+-----+---------+-------+
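A per-user summary over a table of this shape could be computed with a single GROUP BY query. The sketch below is only an illustration: it uses an in-memory SQLite database instead of MySQL, a hypothetical table name `jobs`, just four of the columns above, made-up sample rows, and CPU and wall-clock times stored as integer seconds rather than the `time` type.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE jobs (theDate TEXT, localUser TEXT, "
    "resources_Used_cput INTEGER, resources_Used_walltime INTEGER)"
)
# Made-up sample rows: (date, user, CPU seconds, wall-clock seconds).
conn.executemany(
    "INSERT INTO jobs VALUES (?, ?, ?, ?)",
    [
        ("2005-03-24", "alice", 250, 302),
        ("2005-03-24", "alice", 100, 120),
        ("2005-03-24", "bob", 40, 60),
    ],
)
# One summary row per user for the requested date.
summary = conn.execute(
    "SELECT localUser, COUNT(*), SUM(resources_Used_cput), "
    "SUM(resources_Used_walltime) "
    "FROM jobs WHERE theDate = ? GROUP BY localUser ORDER BY localUser",
    ("2005-03-24",),
).fetchall()
print(summary)
```

Each summary row (user, job count, total CPU time, total wall-clock time) is the kind of tuple a primary producer could then publish.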