1 A new Architecture for OGSA-DAI Malcolm Atkinson, K. Karasavvas, M. Antonioletti, R. Baxter, A. Borley, N. Chue Hong, A. Hume, M. Jackson, A. Krause,

Slides:



Advertisements
Similar presentations
Tom Sugden EPCC OGSA-DAI Future Directions OGSA-DAI User's Forum GridWorld 2006, Washington DC 14 September 2006.
Advertisements

Fujitsu Laboratories of Europe © 2004 What is a (Grid) Resource? Dr. David Snelling Fujitsu Laboratories of Europe W3C TAG - Edinburgh September 20, 2005.
Delivery of Industrial Strength Middleware Federated Strengths Agility & Coordination Prof. Malcolm Atkinson Director 21 st January 2004.
Open Grid Service Architecture - Data Access & Integration (OGSA-DAI) Dr Martin Westhead Principal Consultant, EPCC Telephone: Fax:+44.
1 OGSA-DAI Platform Dependencies Malcolm Atkinson for OMII SC 18 th January 2005.
The National Grid Service and OGSA-DAI Mike Mineter
OMII-UK Steven Newhouse, Director. © 2 OMII-UK aims to provide software and support to enable a sustained future for the UK e-Science community and its.
An Overview of OGSA-DAI Kostas Tourlas
OGSA-DAI: Patterns and Functions Supercomputing ‘06 AIST Stand Malcolm Atkinson Director e-Science Institute & e-Science Envoy 14 th November.
Grid-Enabling Data: Sticking Plaster, Sellotape, & Chewing Gum? Colin C. Venters National Centre for e-Social Science University.
Facilitating the use of eInfrastructure: NeSC Training Team Enabling, facilitating and delivering quality training in the UK and Internationally.
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Resource wrappers, web services, grid services Jaspreet Singh School of Computer.
Slides thanks to Steve Lynden Amy Krause EPCC Distributed Query Processing with OGSA-DQP Principles and Architectures for Structured Data Integration:
Data Grids: Globus vs SRB. Maturity SRB  Older code base  Widely accepted across multiple communities  Core components are tightly integrated Globus.
1 An Introduction to OGSA-DAI Konstantinos Karasavvas 13 th September 2005.
Data Definition Language (DDL) Specification notation for defining the database schema –E.g. create table account ( account-number char(10), balance integer)
Mike Jackson EPCC OGSA-DAI Today Release 2.2 Principles and Architectures for Structured Data Integration: OGSA-DAI.
4b.1 Grid Computing Software Components of Globus 4.0 ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson, slides 4b.
17 July 2006ISSGC06, Ischia, Italy1 Agenda Session 26 – 14:30-16:00 An Overview of OGSA-DAI OGSA-DAI today – and future features How to extend OGSA-DAI.
Client-Server Processing and Distributed Databases
Chapter 1 Overview of Databases and Transaction Processing.
Introduction to OGSA-DAI Neil Chue Hong 15 th February 2006 GGF16, Athens.
1 OGSA-DAI: Status and Future Plans Neil Chue Hong.
OGSA-DAI: Future Work and Wrap-up The OGSA-DAI Team
Database Taskforce and the OGSA-DAI Project Norman Paton University of Manchester.
1 Dr. Markus Hillenbrand, ICSY Lab, University of Kaiserslautern, Germany A Generic Database Web Service for the Venice Service Grid Michael Koch, Markus.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
DAIS Grid1 Database Access and Integration Services on the Grid * * Authors: N. Paton, M. Atkinson, V.
San Diego Supercomputer Center SDSC Storage Resource Broker Data Grid Automation Arun Jagatheesan et al., San Diego Supercomputer Center University of.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Introduction to OGSA-DAI The OGSA-DAI Team
DAIT (DAI Two) NeSC Review 18 March Description and Aims Grid is about resource sharing Data forms an important part of that vision Data on Grids:
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
OGSA-DAI in OMII-Europe Neil Chue Hong EPCC, University of Edinburgh.
OGSA-DAI-RDF & Its Ontology Interfaces Isao Kojima and Masahiro Kimoto Data Grid Team, Grid Technology Research Center
1 1 EPCC 2 Curtin Business School & Edinburgh University Management School Michael J. Jackson 1 Ashley D. Lloyd 2 Terence M. Sloan 1 Enabling Access to.
OGSA-DAI.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Data access and integration with OGSA-DAI: OGSA-DQP Steven Lynden University of Manchester.
GO-ESSP Workshop, LLNL, Livermore, CA, Jun 19-21, 2006, Center for ATmosphere sciences and Earthquake Researches Construction of e-science Environment.
Transaction-based Grid Data Replication Using OGSA-DAI Presented by Yin Chen February 2007.
State Key Laboratory of Resources and Environmental Information System China Integration of Grid Service and Web Processing Service Gao Ang State Key Laboratory.
Mike Jackson EPCC OGSA-DAI Architecture + Extensibility OGSA-DAI Tutorial GGF17, Tokyo.
Metadata Mòrag Burgon-Lyon University of Glasgow.
OGSA-DAI Neil Chue Hong 29 th January 2007 OGF19, Chapel Hill.
Amy Krause EPCC OGSA-DAI An Overview OGSA-DAI Technology Update GGF17, Tokyo (Japan)
1 OGSA-DAI Status Report Neil P Chue Hong 20 th May 2005.
Introduction to OGSA-DAI Neil Chue Hong OGSA-DAI Project Manager 14 th February 2006 GGF16, Athens.
OGSA-DAI & DAIT projects Update for TAG Prof. Malcolm Atkinson Director 30 th October 2003.
Neil Chue Hong Project Manager, EPCC OGSA-DAI Requirements Gathering Exercise 2 nd DIALOGUE workshop eSI, 9-10.
OGSA-DAI Users’ Meeting Introduction Malcolm Atkinson Director 7 th April 2004.
Data Manipulation with Globus Toolkit Ivan Ivanovski TU München,
The OGSA-DAI Project Databases and the Grid Neil Chue Hong Project Manager EPCC, Edinburgh
OGSA-DAI Open Grid Services Architecture – Data Access and Integration NeSC Review 18 March 2004.
The National Grid Service Mike Mineter.
OGSA-DAI Usage Scenarios and Behaviour: Determining good practice Mario Antonioletti EPCC, University of Edinburgh
1 OGSA-DAI: Service Grids Neil P Chue Hong. 2 Motivation  Access to data is a necessity on the Grid  The ability to integrate different data resources.
Data Breakout. OGSA Architecture – databases Eldas, OGSA-DAI and GridMiner implement a slightly old version of OGSA / DAIS –Architecture doc describes.
Welcome Grids and Applied Language Theory Dave Berry Research Manager 16 th October 2003.
OGSA-DQP Steven Lynden University of Manchester. Data access & integration with OGSA-DAI: GGF 17 2 Introduction OGSA-DQP is a service based distributed.
OGSA-DAI 简介及其它在 China-VO DAS 系统中的应用 杨阳 中国虚拟天文台研发团队 Chinese Virtual Observatory.
Chapter 1 Overview of Databases and Transaction Processing.
OGSA-DAI Current Version Guy Warner.
OGSA-DAI.
Amy Krause EPCC OGSA-DAI An Overview OGSA-DAI on OMII 2.0 OMII The Open Middleware Infrastructure Institute NeSC,
Database System Concepts and Architecture
UK e-Science OGSA-DAI November 2002 Malcolm Atkinson
Grid Systems: What do we need from web service standards?
Presentation transcript:

1 A new Architecture for OGSA-DAI Malcolm Atkinson, K. Karasavvas, M. Antonioletti, R. Baxter, A. Borley, N. Chue Hong, A. Hume, M. Jackson, A. Krause, S. Laws, N. Paton, J. M. Schopf, K. Tourlas and P. Watson 19 th September 2005

2 Contents: What we talk about Background Requirements Overview of new Architecture Status and Future Work

Background OGSA-DAI Middleware to provide uniform extensible access to data resources  Relational, XML, Files and Indexed Files  Translation and delivery activities Old releases  Globus Toolkit 3 - OGSI New releases  Axis – WS-I  OMII – WS-I+  Globus Toolkit 4 – WSRF Experience  Previous assumption: web services are lightweight  Factory pattern: making a home for session state  Cannot rely on all data resources having sufficient functionality Old Architecture One GDS per client interaction One Data Resource per GDS No explicit Sessions No explicit Transactions Depend on data resources for concurrency management

International Collaboration & Use USA: o Globus Alliance o IBM Corporation o caBIG o BIRN o Indiana University o GridSphere o GEON o LEAD o MCS o NCSA o Secure Data Grid o UNC Japan: o AIST o BioGrid o NAREGI Europe: o CERN o DataMiningGrid o GridMiner o GridSphere o inteligrid o N2Grid o OntoGrid o Provenance o SIMDAT o OMII-EU UK: o OMII o OMII-UK o NGS o NCeSS o NIeeS o AstroGrid o BioSimGrid o BRIDGES o CancerGrid o ConvertGrid o eDiaMonD o EDINA o First Group plc o Fujitsu Labs Europe o GEDDM o GeneGrid o Genomic Technology and Informatics o GOLD o Human Genetics Unit o IBM UK o my Grid o Oracle UK China: o CAS o ChinaGrid o cnGrid o INWA o OMII-China Australia: o Curtin Business School o INWA Tutorials BostonCambridge CERNChicago EdinburghLondon San FranciscoSeattle SeoulSingapore TokyoISSGC 03 to 05 DIALOGUE workshops Columbus, Edinburgh, Indiana, Vienna Chicago, Manchester, San Diego South Korea: o KISTI 1485 registered users downloads

LEAD GeneGrid caBIG BRIDGES OGSA WebDB FirstDIG ConvertGrid eDiaMoND OGSA-DQP Grid Miner Meeting User Requirements

Requirements Web Services now long-running Manage state lifetime within WS Manage concurrency within WS Manage resources within WS Continued support of functionality All previous tasks All previous extensibility Single task & composition Accommodate small and large tasks Standard activities Manageable and Configurable Monitoring and control Persistent & resilient to failures Persistent operational state Recovery Performance Streamed pipelines of tasks Concurrent sessions & transactions Standard coordination protocols Embedded in request documents Multi-request sessions Extensible framework & Increased Power in Requests Application-defined activities Additional data sources Extensible client libraries Standard interfaces – Preserve application investment Client library stability Activity API stability Naming Multiple Data Resources per Data Service Results, streams, sessions, transactions, …

New OGSA-DAI Architecture Tx Request Tx Response External Standard Coordination

8 Components (1) Data Resources (DRs) Currently works with  Relational, XML DBMS, file systems & indexed files Extensibility point  Any kind of data resources can be added  Need not be a single resource – could be a federation Data Services Zero or more DRs Client Toolkit  Protects developers from changes  Common API for all OGSA-DAI platforms

9 Components (2) Activities (and Tasks) Activity is the unit of work  E.g. SQL or X* statement, translation, delivery  Types: Core, Supplied and Extension activities Extensibility point TADD, Requests and Results Composite format  Reduce round-trip latencies, status & results, delegation of work Control flow Client toolkit

10 Components (3) Sessions Sharing and managing state between TADDs  External decisions, retain results for collection/delivery, transactional and security information Implicit, explicit and new sessions Transactions Recovery from failures Consistent transactional framework  Transactional metadata for activities  Logical rules for combining the behaviour of transactional activities Data Identifiers (DIDs) OGSA-DAI Engine (ODE)

Execution Model – Workload Throttle & Session manager TADD Receiver thread To do queue Active session pool Free thread pool Allocater thread Session exists? Capacity exists? Yes  resume no  new session Yes  create session no  wait for session to end

Execution Model – Execution Planning TADD Receiver thread To do queue Active session pool Free thread pool Allocater thread Session thread Resume Session Check authentication unchanged Parse to form task DAG Check authorisation Update DID table Task DAG Optimise task DAG Optimised execution plan Start execution coordination thread Execution coordination thread Monitor execution coordination thread Resume Session Task Form

Execution Model – Processing one Request TADD Optimised execution plan (oep) Execution coordination thread Task Form input Set Tx mode start Create result TADD(s) TADD result Initiate 3 rd party data collection Walk oep launching activity threads activity thread activity thread activity thread activity thread Initiate 3 rd party data delivery detect completions & tidy up Detect overall completion or failures Tidy & close TADD(s) Tidy DIDs hibernate or end Tx hibernate or pool ECT Data Streams

14 Components (4) Special Interfaces WS-DAI Transaction coordination protocols  WS-AtomicTransactions Data Resource Access Manager (DRAM) Configuration and Monitoring Gateway to a set of DRs – via a registry Data Service Description Language (DSDL) Naming Accommodate naming schemes Multiple DRs – TADDs contain DR name

Status and Future Work Release 7 – Partial move to new architecture New DR extensibility point New activities, e.g. multi-source union Reconfigurable services Concurrency within data service Sessions Parts of DQP DQP Release 3 Releases for Friends & Family Transactions (tech. preview of phases)

Status and Future Work 2 Following releases Performance  Execution planning, concurrency, streamed pipelines, better data transfer Functionality  Completed Transactions  Distributed queries  Resilient multi-source unions  Schema and data integration  More data extraction from files & files in transactions New extensibility & configuration  Activity database  Dynamically installing extension activities

Further information The OGSA-DAI Project Site: The DAIS-WG site: OGSA-DAI Users Mailing list General discussion on grid DAI matters Formal support for OGSA-DAI releases OGSA-DAI training courses Questions Please