GPIR GridPort Information Repository


Tomislav Urban
Texas Advanced Computing Center

Origins
- HotPage informational data
  - Load, MOTD, node map, etc.
  - Obtained from customized data-gathering scripts
  - MDS 2.0 where available
  - “Static” VO configuration data
- Identified interest in recording historical grid data in support of
  - Workflow / decision-making
  - Job schedulers / brokers
  - Histograms
- Sought to move towards a web services model using XML schema
  - Removes the need to write customized implementations for each new resource

GridPort Information Repository (GPIR)
- Implementation of a web-service-enabled information service
- Evolved from various HotPage, GridPort, TACC and GCE-RG information and web services projects (IAWS)
- Concept demonstrated at SC 02 for TeraGrid and PACI (NPACI/Alliance) resources
  - Called the Information Archival Web Service (IAWS)
  - Based on XML documents stored on a file server
  - Thin clients (Java / Perl) pushed data into the repository
  - Contained XML documents for current grid status as well as archived historical data (HotPage information, other)
- The IAWS was conceptualized in collaboration with SDSC and NCSA

Design Philosophy
- “Aggressive Practicality”
  - Works today with what’s available today
- Comprehensive
  - Portal-centric data set
  - Intended to support the GridPort GCE framework and its data requirements
  - As a web service, can be repurposed to any grid data needs
- Follow Standards
  - OGSI (Grid Services)
  - Emerging data schema (GLUE?)
- Scalable
  - Relational database back-end
- Extensible
  - Easy to add new XML queries, format as needed

Architecture
[Diagram. Labels: Resources; Information Providers (MDS, Web Scraping, OGSA (Future), Other Middleware, Other); Clients (Perl Client, Java Client); GPIR (Ingester WS, Query WS); dB (MySQL, PostgreSQL, Other); Portals (edu.tacc.GPIR Portlets). Connection labels: SOAP-XML, HTTP, JDBC.]

Architecture
- A single GPIR instance may support multiple portals serving various VOs
[Diagram: four VO portals connected to one GPIR instance]
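A minimal sketch of how one repository could serve several VO portals, assuming a relational layout in which resources are linked to VOs through a membership table. The table and column names (vo, resource, vo_resource) and the host names are hypothetical and are not taken from the GPIR DDL; SQLite stands in for MySQL or PostgreSQL only so that the example is self-contained.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a hypothetical multi-VO layout."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE vo (id INTEGER PRIMARY KEY, name TEXT UNIQUE NOT NULL);
        CREATE TABLE resource (id INTEGER PRIMARY KEY, hostname TEXT UNIQUE NOT NULL);
        CREATE TABLE vo_resource (
            vo_id INTEGER NOT NULL REFERENCES vo(id),
            resource_id INTEGER NOT NULL REFERENCES resource(id),
            PRIMARY KEY (vo_id, resource_id)
        );
    """)
    conn.executemany("INSERT INTO vo (name) VALUES (?)", [("vo-a",), ("vo-b",)])
    conn.executemany(
        "INSERT INTO resource (hostname) VALUES (?)",
        [("host1.example.org",), ("host2.example.org",), ("host3.example.org",)],
    )
    # host2 belongs to both VOs: one stored resource, visible from two portals.
    memberships = [
        ("vo-a", "host1.example.org"),
        ("vo-a", "host2.example.org"),
        ("vo-b", "host2.example.org"),
        ("vo-b", "host3.example.org"),
    ]
    conn.executemany(
        """INSERT INTO vo_resource (vo_id, resource_id)
           SELECT vo.id, resource.id FROM vo, resource
           WHERE vo.name = ? AND resource.hostname = ?""",
        memberships,
    )
    return conn


def resources_for_vo(conn, vo_name):
    """Return the host names a portal for the given VO would display."""
    rows = conn.execute(
        """SELECT r.hostname FROM resource r
           JOIN vo_resource vr ON vr.resource_id = r.id
           JOIN vo v ON v.id = vr.vo_id
           WHERE v.name = ? ORDER BY r.hostname""",
        (vo_name,),
    )
    return [row[0] for row in rows]
```

Under this assumed layout each portal issues the same query with its own VO name, so adding a portal for a new VO needs new rows rather than a new repository instance.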

Current Data Sources
- Thin clients
  - Java
  - Perl
- MDS
- GMS
  - http://www.tacc.utexas.edu/grid/gms
- NWS
- “Web Scraping”
  - Cron jobs run periodically on HPC resources, compiling text files that are then accessed via HTTP
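The “web scraping” path can be sketched roughly as follows: a cron job on the resource writes a small text file, and a collector fetches and parses it over HTTP. The `key: value` file format and the function names here are assumptions made for illustration; the slides do not describe the actual file layout.

```python
from urllib.request import urlopen


def parse_status_text(text):
    """Parse 'key: value' lines into a dict (hypothetical file format).

    Blank lines, '#' comments and lines without a colon are skipped.
    """
    status = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition(":")
        if not sep:
            continue  # not a key/value line
        status[key.strip().lower()] = value.strip()
    return status


def fetch_status(url, timeout=10):
    """Fetch a status file over HTTP and parse it."""
    with urlopen(url, timeout=timeout) as response:
        body = response.read().decode("utf-8", errors="replace")
    return parse_status_text(body)
```

A collector would call `fetch_status` with the URL of each resource's status file and hand the resulting values to whatever client pushes updates into the repository.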

Data
- Load: aggregated CPU
- Jobs: individual and aggregated queue
- MOTD
- Nodes: job usage for each machine node
- NWS: based on VO and Click model
- Grid Monitoring (GMS)
  - Based on NCSA Machine Status
- “Static” resource data (query only)
- Extensible through the addition of XML data from any recognized source
  - Need schema
  - Need query

Web Services
- Ingester WS
  - Accepts XML documents containing updates to grid status
- Query WS
  - Provides XML containing query-specific information
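The split between the two services can be illustrated with a small in-process sketch: one call accepts an XML update document, the other returns XML for a specific query. The element names (GridStatusUpdate, Resource, Load, QueryResult), the class, and the in-memory storage are all hypothetical; the real services use SOAP and the project's own XSDs, neither of which is reproduced here.

```python
import xml.etree.ElementTree as ET


def build_load_update(hostname, load):
    """Build a hypothetical XML update document for one resource's load."""
    root = ET.Element("GridStatusUpdate")
    resource = ET.SubElement(root, "Resource", hostname=hostname)
    ET.SubElement(resource, "Load").text = str(load)
    return ET.tostring(root, encoding="unicode")


class InMemoryRepository:
    """Stand-in for the repository: keeps the latest value plus a history."""

    def __init__(self):
        self._latest = {}
        self._history = []

    def ingest(self, xml_text):
        """Ingester role: accept an XML update, return resources updated."""
        root = ET.fromstring(xml_text)
        count = 0
        for resource in root.findall("Resource"):
            hostname = resource.get("hostname")
            load = float(resource.findtext("Load"))
            self._latest[hostname] = load
            self._history.append((hostname, load))
            count += 1
        return count

    def query_load(self, hostname):
        """Query role: return XML with the latest load for one resource."""
        root = ET.Element("QueryResult")
        if hostname in self._latest:
            resource = ET.SubElement(root, "Resource", hostname=hostname)
            ET.SubElement(resource, "Load").text = str(self._latest[hostname])
        return ET.tostring(root, encoding="unicode")
```

A thin client would call `ingest(build_load_update(...))` on a schedule, while a portal would call `query_load` and render the returned XML.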

Current Work
- Migration to PostgreSQL
  - Full feature set
    - Transactionality
    - Etc.
  - Better future J2EE support
    - CMP
    - CMR
- Administration Client
  - Allowing web-based administration of “static” data for all supported VOs would be a huge productivity boost

Supported VOs
- Current
  - The PACI: NPACI, Alliance
  - TACC/University of Texas: TIGRE / State of Texas
    - University of Texas, University of Houston
    - Texas A&M, Texas Tech, Rice
    - Baylor College of Medicine
  - IPG
- Planned
  - ETF

Deployment
- Code available at: http://www.tacc.utexas.edu/grid/gpir
- Consists of:
  - Web Service
  - Example Clients
  - JavaDocs
  - DDL Script for MySQL
  - XML Schema Documents (XSDs)
  - XML Document Examples

Future Directions
- Integration into GridPort 3.0
- J2EE implementation
  - Treat GPIR entities as real objects rather than table rows
- Significant expansion to the data being gathered
- Administration Client
- Reporting and decision making based on historical data

Grid Services
- Intend to implement GPIR as a grid service
  - Inherit OGSI security model
    - GT 3.0 GSI
  - OGSI compliance
  - OGSA compliance
- Will support W3C and GGF standards
  - Web Services
  - Grid Services

Outstanding Issues
- Inflexibility
  - Relational database changes
  - XML schema changes
  - Support for dynamic queries (waiting for standards)
- Inefficiency of dynamic data storage
  - Sampling vs. events
  - Example: the Job table
- Data format standards
  - MDS/GLUE schema
  - INCA?
- Security
  - GSI-based authentication
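One reading of the “Sampling vs. Events” point, sketched under stated assumptions: if a job table is filled by periodic sampling, every live job produces a row at every tick, whereas recording only state changes produces a fixed number of rows per job. The job tuples, state names, and both functions below are hypothetical and only illustrate the row-count difference; they do not describe the actual GPIR Job table.

```python
def sampled_rows(jobs, horizon, interval):
    """Rows written when job state is sampled every `interval` time units.

    Each job is (job_id, submit_time, start_time, end_time).
    """
    rows = []
    for t in range(0, horizon, interval):
        for job_id, submit, start, end in jobs:
            if submit <= t < end:
                state = "queued" if t < start else "running"
                rows.append((t, job_id, state))
    return rows


def event_rows(jobs):
    """Rows written when only state transitions are recorded."""
    rows = []
    for job_id, submit, start, end in jobs:
        rows.append((submit, job_id, "submitted"))
        rows.append((start, job_id, "started"))
        rows.append((end, job_id, "finished"))
    return sorted(rows)
```

With sampling, storage grows with job duration and sampling frequency, and a job that starts and finishes between two ticks is never recorded; with events, each job costs three rows regardless of how long it runs.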