Introduction to OGSA-DAI The OGSA-DAI Team

Slides:



Advertisements
Similar presentations
Experiences with Converting my Grid Web Services to Grid Services Savas Parastatidis & Paul Watson
Advertisements

Abstraction Layers Why do we need them? –Protection against change Where in the hourglass do we put them? –Computer Scientist perspective Expose low-level.
E-Science Data Information and Knowledge Transformation Eldas Building Service Grids with Enterprise Level Data Access Services Alan Gray
Open Grid Service Architecture - Data Access & Integration (OGSA-DAI) Dr Martin Westhead Principal Consultant, EPCC Telephone: Fax:+44.
1 OGSA-DAI Platform Dependencies Malcolm Atkinson for OMII SC 18 th January 2005.
OGSA-DAI Render Demo NeSC Review 18 March Description and Aims The OGSA-DAI Render demo is intended to demonstrate both the Process and Data aspects.
OGSA-DAI Data Access and Integration for the Grid Neil Chue Hong
OMII-UK Steven Newhouse, Director. © 2 OMII-UK aims to provide software and support to enable a sustained future for the UK e-Science community and its.
An Overview of OGSA-DAI Kostas Tourlas
Grid-Enabling Data: Sticking Plaster, Sellotape, & Chewing Gum? Colin C. Venters National Centre for e-Social Science University.
FirstDIG First Data Investigation on the Grid Paul Graham, Terry Sloan, Adam Carter EPCC Ian Gregory, Darren Unwin First South Yorkshire tel:+44 (0)131.
This product includes material developed by the Globus Project ( Introduction to Grid Services and GT3.
Distributed Heterogeneous Data Warehouse For Grid Analysis
17 July 2006ISSGC06, Ischia, Italy1 Agenda Session 26 – 14:30-16:00 An Overview of OGSA-DAI OGSA-DAI today – and future features How to extend OGSA-DAI.
1 e-science & data mining workshop, NeSC, UK, November 30 th, 2004 Terry Sloan EPCC, The University of Edinburgh INWA : using OGSA-DAI.
OGSA : Open Grid Services Architecture Ramya Rajagopalan
OGSA-DAI: Future Work and Wrap-up The OGSA-DAI Team
Database Taskforce and the OGSA-DAI Project Norman Paton University of Manchester.
1 UK NeSC Meeting, November 18 th, 2004 Terry Sloan EPCC, The University of Edinburgh INWA : using OGSA-DAI in a commercial environment.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
The Queen’s University of Belfast The Queen’s University of Belfast GeneGrid : Using OgsaDai in Bioinformatics Noel Kelly Belfast.
Extensible Framework for Data Access & Integration Malcolm Atkinson Director 10 th November 2004.
The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Dataset Caitlin Minteer & Kelly Clynes.
ES Metadata Management Enabling Grids for E-sciencE ES metadata OGSA-DAI NA4 GA Meeting, D. Weissenbach, IPSL, France.
Using OGSA-DAI in a commercial environment Terry Sloan EPCC Telephone:
Resource Monitoring & Service Discovery in GeneGrid Sachin Wasnik Belfast e-Science Centre.
XML Registries Source: Java TM API for XML Registries Specification.
DAIT (DAI Two) NeSC Review 18 March Description and Aims Grid is about resource sharing Data forms an important part of that vision Data on Grids:
OGSA-DAI in OMII-Europe Neil Chue Hong EPCC, University of Edinburgh.
The Anatomy of the Grid Introduction The Nature of Grid Architecture Grid Architecture Description Grid Architecture in Practice Relationships with Other.
1 1 EPCC 2 Curtin Business School & Edinburgh University Management School Michael J. Jackson 1 Ashley D. Lloyd 2 Terence M. Sloan 1 Enabling Access to.
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
Middleware for Grid Computing and the relationship to Middleware at large ECE 1770 : Middleware Systems By: Sepehr (Sep) Seyedi Date: Thurs. January 23,
Grids - the near future Mark Hayes NIEeS Summer School 2003.
Jian Gui WANG New Implementation of Agriculture Models APAN19---Jan New Implementations of Agriculture Models Using Mediate Architecture.
OGSA-DAI.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Grid Services I - Concepts
INFSO-RI Enabling Grids for E-sciencE OGSA DAI Data Access and Integration Marek Ciglan Institute of Informatics, Slovac Academy.
State Key Laboratory of Resources and Environmental Information System China Integration of Grid Service and Web Processing Service Gao Ang State Key Laboratory.
Transition and Evolution Moving to Grid Services.
Mike Jackson EPCC OGSA-DAI Architecture + Extensibility OGSA-DAI Tutorial GGF17, Tokyo.
Amy Krause EPCC OGSA-DAI An Overview OGSA-DAI Technology Update GGF17, Tokyo (Japan)
Experiences with OGSA-DAI : Portlet Access and Benchmark Deepti Kodeboyina and Beth Plale Computer Science Dept. Indiana University.
Supercomputing 2003, UK e-Science Booth 1 First Data Investigation on the Grid: FirstDIG Terry Sloan, Paul Graham, Adam Carter Edinburgh Parallel Computing.
Prof S.Ramachandram Dept of CSE,UCE Osmania University
1 OGSA-DAI Status Report Neil P Chue Hong 20 th May 2005.
1 Registry Services Overview J. Steven Hughes (Deputy Chair) Principal Computer Scientist NASA/JPL 17 December 2015.
OGSA-DAI & DAIT projects Update for TAG Prof. Malcolm Atkinson Director 30 th October 2003.
Neil Chue Hong Project Manager, EPCC OGSA-DAI Requirements Gathering Exercise 2 nd DIALOGUE workshop eSI, 9-10.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
OGSA-DAI Users’ Meeting Introduction Malcolm Atkinson Director 7 th April 2004.
Neil Chue Hong Project Manager, EPCC OGSA-DAI Status and Benchmarks All Hands Meeting 2005 Nottingham, 22 September.
GESA: Grid Economic Services Architecture Steven Newhouse Jon MacLaren
The OGSA-DAI Project Databases and the Grid Neil Chue Hong Project Manager EPCC, Edinburgh
OGSA-DAI Open Grid Services Architecture – Data Access and Integration NeSC Review 18 March 2004.
Data and storage services on the NGS.
GT3 Index Services Lecture for Cluster and Grid Computing, CSCE 490/590 Fall 2004, University of Arkansas, Dr. Amy Apon.
OGSA-DAI Usage Scenarios and Behaviour: Determining good practice Mario Antonioletti EPCC, University of Edinburgh
1 OGSA-DAI: Service Grids Neil P Chue Hong. 2 Motivation  Access to data is a necessity on the Grid  The ability to integrate different data resources.
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
1 A new Architecture for OGSA-DAI Malcolm Atkinson, K. Karasavvas, M. Antonioletti, R. Baxter, A. Borley, N. Chue Hong, A. Hume, M. Jackson, A. Krause,
OGSA-DAI Current Version Guy Warner.
OGSA-DAI.
Amy Krause EPCC OGSA-DAI An Overview OGSA-DAI on OMII 2.0 OMII The Open Middleware Infrastructure Institute NeSC,
UK e-Science OGSA-DAI November 2002 Malcolm Atkinson
Service Oriented Architecture (SOA)
Presentation transcript:

Introduction to OGSA-DAI The OGSA-DAI Team

2 The OGSA-DAI Project  A generic framework for integrating data access and computation –Uniform interface to relational, XML, flat file data resources  Using the grid to take specific classes of computation nearer to the data  Kit of parts for building tailored access and integration applications  Investigations to inform DAIS-WG  One reference implementation for DAIS  Releases publicly available NOW

3 Project Partners Powered by …. Funded by the Grid Core Programme

4 Project Membership Principal Investigators Project Manager Programme Management Board Chair Technical Review Board Chair Research Team IBM Dissemination Team EPCC Team Charaka Mike Ally Amy Mario Malcolm Kostas Norman Paul Neil Andy Simon BrianDave PatrickNeil IBM Development Team

5 OGSA Infrastructure Architecture Grid or Web Service Infrastructure Data Intensive Applications for Science X Compute, Data & Storage Resources Distributed Simulation, Analysis & Integration Technology for Science X Data Intensive X Scientists Virtual Integration Architecture Generic Virtual Data Access and Integration Layer Structured Data Integration Structured Data Access Structured Data Relational XML Semi-structured- Transformation Registry Job Submission Data TransportResource Usage Banking BrokeringWorkflow Authorisation OGSA-DAI

6 Project Status  Current release 4.0 –Globus Toolkit 3.2 compliant –Platform and language independent Java 1.4 Document model  Work concentrated on data access –Wraps data resources without hiding underlying data model –Provide base for higher-level services Distributed Query Processing (DQP) Data federation services

7 Supported Data Resources RelationalXMLOther MySQL Xindice Files DB2 eXist ? Oracle PostgreSQL SQLServer

8 Web Service Architecture Service Registry Service Consumer Service Provider Publish Bind Discover

9 OGSA-DAI Service Architecture DAISGR Service Consumer GDSF GDS Publish Bind Discover

10 OGSA-DAI Services  OGSA-DAI uses three main service types –DAISGR (registry) for discovery –GDSF (factory) to represent a data resource –GDS (data service) to access a data resource  This will change accesses represents DAISGR GDSF GDS Data Resource locates creates

11 GDSF and GDS  Grid Data Service Factory (GDSF) –Represents a data resource –Persistent service Currently static (no dynamic GDSFs) –Cannot instantiate new services to represent other/new databases –Exposes capabilities and metadata –May register with a DAISGR  Grid Data Service (GDS) –Created by a GDSF –Generally transient service –Required to access data resource –Holds the client session

12 Grid Data Service XindiceMySqlOracleDB2  Data source abstraction behind GDS instance –plug in “data resource implementations” for different data source technologies –does not mandate any particular query language or data format Heterogeneity

13 DAISGR  DAI Service Group Registry (DAISGR) –Persistent service –Based on OGSI ServiceGroups –GDSFs may register with DAISGR –Clients access DAISGR to discover Resources Services (may need specific capabilities) –Support a given portType or activity

14 Analyst Registry DAISGR Factory GDSF registerServicefindServiceData  Data resource publication through registry  Data location hidden by factory  Data resource meta data available through Service Data Elements Location

15 Interaction Model: Start up OGSI Container GDSF DAISGR 1. Start OGSI containers with persistent services. 2. Here GDSF represents Frog database.

16 Interaction Model: Registration OGSI Container GDSF DAISGR 3. GDSF registers with DAISGR. Frogs: GSH

17 Interaction Model: Discovery OGSI Container GDSF DAISGR 4. Client wants to know about frogs. Can: (i) Query the GDSF directly if known or (ii) Identify suitable GDSF through DAISGR. Frogs: GSH Mmmmm … Frogs? FindService: Frogs GSH: GDSF

18 Interaction Model: Service Creation OGSI Container GDSF DAISGR 5. Having identified a suitable GDSF client asks a GDS to be created. Frogs: GSH GDS CreateService GSH: GDS

19 Interaction Model: Perform OGSI Container GDSF DAISGR 6. Client interacts with GDS by sending Perform documents. 7. GDS responds with a Response document. 8. Client may terminate GDS when finished or let it die naturally. Frogs: GSH GDS Perform Document Response Document

20 Interaction Model: Summary  Only described an access use case –Client not concerned with connection mechanism –Similar framework could accommodate service-service interactions  Discovery aspect is important –Probably requires a human –Needs adequate definition of metadata Definitions of ontologies and vocabularies - not something that OGSA-DAI is doing …

21 More Complex Behaviour Data Resource Container Client GDS GDT Data Resource Container GDS GDT Deliver data back to the client. Data Resource Deliver data to a third party. Deliver data another GDS. And there's a lot more that you can do …

22 Usage Patterns G A Q S+R Data Q - Query D - Delivery S - Status R - Result U - Update I - Data id Q+D A C G S R G C A Q S D R A G Q+U S RetrieveUpdate/InsertPipeline G2=C G1=P A I Q1 S2 S1 U/R Q2+D Q1+D G2=C A G1=P S2 S1 Q2 U/R Actors - OGSI process - Non-OGSI process A - Analyst C - Consumer G - GDS P - Producer Call Response Data Flow A P G U I Q S A P G U I S Q+D

23 Project Using OGSA-DAI

24 Projects Using OGSA-DAI OGSA-DAI ( AstroGrid ( BioSimGrid ( BioGrid ( Bridges ( eDiaMoND ( FirstDig ( GeneGrid ( GEON ( IU RGRBench ( myGrid ( N2Grid ( ODD-Genes ( OGSA-WebDB ( INWA (

25 Project classification OGSA-DAI Biological Sciences Physical Sciences Commercial Applications Computer Sciences FirstDig INWA Bridges AstroGrid BioSimGrid BioGrid eDiamond myGrid ODD-Genes N2Grid GEON MCS IU RGBench OGSA Web-DB GeneGrid GridMiner

26 Points to Note  Feedback from users largely positive –Good suggestions –Fair criticisms –How OGSA-DAI is being used –Where it succeeds and where it fails –Helping us to capture requirements  Hope to allow user contributions –Plan to establish a policy/framework for this  Engage more with User Community –Meetings scheduled for this year OGSA-DAI mini-workshop at AHM 2004 OGSA-DAI tutorials at various meetings/locations

27  e-Digital MammOgraphy National Database –Mammogram - X-ray of the breast  Built prototype of a national database of mammographic images –In support of the UK Breast screening programme  Employed Grid technologies to facilitate process Thanks to eDiaMonND project and the Digital Database for Screening Mammography for this image.

28  Breast screening in the UK began in 1988 –Women aged screened every 3 Years –Women aged from 2004 –1 View/Breast → 2 views by 2003  UK has –Over 90 Breast screening units throughout the UK –Each one deals with about women on average p.a.  Each centre sees images/year  In → –Screened: 1.4M → 1.5M –Recalled for Assessment : → –Cancers detected : → –Lives per year Saved: 300 → 1250 (by 2010)  Distributed team of doctors perform the analysis

29 DB2 Content Manager DB2 Content Manager DB2 Content Manager DB2 Content Manager DB2 Federation OGSA-DAI Database Files OGSA-DAI Core Services Core Services Core Services Core Services Data Load Training App Training Services UCL KCLUEDCHU Core API Training API Training Application Core & Training API OGSA-DAI Data Load Training App Core & Training API Data Load Training App Core & Training API Data Load Training App Core & Training API

30  eDiaMoND Findings: –OGSA-DAI provides a flexible framework –Dynamically configure the system through discovery –Activities can operate with different levels of granularity –Federation can be introduced at various levels –Good documentation on how to extend the framework Extended Activities to access IBM DB2 Content Manager –Changes between versions broke some things Low level XML issues

31 FirstDIG  Data mining with the First Transport Group, UK –Example: “When buses are more than 10 minutes late there is an 82% chance that revenue drops by at least 10%” –"The results of this exercise will revolutionise the way we do things in the bus industry.“, Darren Unwin, Divisional Manager, First South Yorkshire. OGSA-DAI OGSA-DAI Client Application Data Mining Application

32 INWA  Innovation Node: Western Australia –Informing Business & Regional Policy: Grid-enabled fusion of global data and local knowledge  Project –Run from Nov Aug 2004 –Involved 10 partners (6 UK + 4 Australia)  Aim –Data mine commercially sensitive data –Security an absolute MUST –Employ Grid technologies –Need access to data and computational resources  Demonstrator using: –OGSA-DAI Incorporate data resources –Sun DCG's TOG (Transfer-queue Over Globus) Handle job submission to analyse micro array data

33 Curtin,Australia EPCC,UK INWA Grid Engine BankTelco Grid Engine BankTelco OGSA-DAI TOG Data Browser Telco data Bank data Australian property UK Property

34 INWA: Lessons Learned  Performing Data Integration: –TimeZone date problems  Security issues: –Bugs in JavaCoG in GT3 OGSA-DAI could not switch security for Grid data transfers TOG had no security option –All of these have been fixed  Middleware not mature enough for commercial deployment

35 Why OGSA-DAI?  Why use OGSA-DAI over JDBC? –Can embed additional functionality at the service end Transformations, compressions Third party delivery The extensible activity framework –Avoiding unnecessary data movement –Common interface to heterogeneous data resources Relational, XML databases, and files –Usefulness of the Registry for service discovery Dynamic service binding process Provision of good meta-data is necessary –Language independence at the client end Do not need to use Java –Platform independence Do not have to worry about connection technology, drivers, etc