OGSA-DAI Usage Scenarios and Behaviour: Determining good practice Mario Antonioletti EPCC, University of Edinburgh

1 OGSA-DAI Usage Scenarios and Behaviour: Determining good practice
Mario Antonioletti, mario@epcc.ed.ac.uk
EPCC, University of Edinburgh
http://www.ogsadai.org.uk
AHM 2004, Data Services and Middleware, 2nd September 2004

2 OGSA-DAI
• OGSA-DAI is middleware
  - Middleware should be invisible
  - Provide additional functionality or hide complexity
• Allows uniform access to data resources
  - Data resources: relational and XML databases, files, …
• Provides an extensible framework
  - You can extend functionality to fill any gaps
• We think it works well
  - But need feedback
• Recount how OGSA-DAI is being used
• Some background first …

3 Basic Operational Model
[Diagram labels: Data Resource, Container, DAISGR, Client, GDSF, GDS]

4 Why OGSA-DAI?
• Why use OGSA-DAI over JDBC?
  - Language independence at the client end
    - Do not need to use Java
  - Platform independence
    - Do not have to worry about connection technology, drivers, etc.
  - Can handle XML resources
  - Can embed additional functionality at the service end
    - Transformations
    - Third-party delivery, etc.
    - Avoiding unnecessary data movement
  - Provision of metadata is powerful
  - Usefulness of the registry for service discovery
    - Dynamic service binding process

5 More Complex Behaviour
[Diagram labels: Client, Container, GDS, GDT, Data Resource]
• Deliver data back to the client.
• Deliver data to a third party.
• Deliver data to another GDS.
• And there's a lot more that you can do …

6 Usage Patterns
[Diagram: message-flow patterns labelled Retrieve, Update/Insert and Pipeline]
Messages: Q - Query, D - Delivery, S - Status, R - Result, U - Update, I - Data id
Actors: A - Analyst, C - Consumer, G - GDS, P - Producer (drawn as OGSI or non-OGSI processes)
Arrows: Call, Response, Data Flow

7 Activities are the drivers
• Express a task to be performed by a GDS
• Three broad classes of activities:
  - Statement
  - Transformation
  - Delivery
• Extensible:
  - Easy to add new functionality
  - Does not require modification to the service interface
  - Extensions operate within the OGSA-DAI framework
• Functionality:
  - Implemented at the service
  - Works where the data is (no need to move data back)
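The extensibility point can be sketched in a few lines. This is only an illustration under stated assumptions: `ToyDataService`, `register` and `perform` are hypothetical names and are not the OGSA-DAI API. It shows new activities being added by name while the single service entry point stays unchanged.

```python
from typing import Callable, Dict, List

Rows = List[dict]
Activity = Callable[[Rows], Rows]


class ToyDataService:
    """Hypothetical stand-in for a data service; not the OGSA-DAI API."""

    def __init__(self, rows: Rows) -> None:
        self._rows = rows
        self._activities: Dict[str, Activity] = {}

    def register(self, name: str, activity: Activity) -> None:
        # New functionality is added here; perform() itself never changes.
        self._activities[name] = activity

    def perform(self, name: str) -> Rows:
        # The activity runs at the service, next to the data.
        return self._activities[name](list(self._rows))


rows = [{"gene": "g1"}, {"gene": "g2"}]
service = ToyDataService(rows)
service.register("select_all", lambda data: data)

# Later extension: a new activity, same perform() interface.
service.register("count_rows", lambda data: [{"count": len(data)}])
```

A client that only knows `perform` can call `service.perform("count_rows")` as soon as the extension is registered, without any change on the client side.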

8 OGSA-DAI Deck

9 Building Applications
• Activities are grouped together
  - Perform document
  - Data can flow between activities
• Optimisation
  - Avoids multiple message exchanges
• Can deliver to other GDSs
  - Prerequisite for data integration
• Base middleware for projects requiring data access
  - Some capability for data integration
• That is the theory … now for the practice
• OGSA-DAI is being adopted by a number of projects …
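As a rough sketch of the grouping idea: several activities travel in one request and each consumes the output of an earlier one. The slides do not show the real perform document format, so the list-of-dicts layout, the field names and `run_perform_document` below are all assumptions made for illustration.

```python
from typing import Any, Dict, List

# Hypothetical in-memory stand-in for a perform document: an ordered list of
# activities, each naming the earlier activity whose output it consumes.
Step = Dict[str, Any]


def run_perform_document(document: List[Step], resource: List[dict]) -> Dict[str, Any]:
    """Execute every activity in one request, wiring outputs to inputs."""
    outputs: Dict[str, Any] = {}
    for step in document:
        # The first activity reads the data resource; later ones read a named output.
        source = resource if step["input"] is None else outputs[step["input"]]
        outputs[step["name"]] = step["run"](source)
    return outputs


properties = [
    {"id": 1, "price": 90},
    {"id": 2, "price": 150},
    {"id": 3, "price": 210},
]

document = [
    {"name": "query", "input": None,
     "run": lambda rows: [r for r in rows if r["price"] > 100]},
    {"name": "transform", "input": "query",
     "run": lambda rows: [r["id"] for r in rows]},
    {"name": "deliver", "input": "transform",
     "run": lambda ids: {"delivered": ids}},
]

outputs = run_perform_document(document, properties)
print(outputs["deliver"])
```

The query, transformation and delivery steps run in a single call, so intermediate rows never travel back to the client between steps.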

10 Who is Using OGSA-DAI?
OGSA-DAI (http://www.ogsadai.org.uk)
• AstroGrid (http://www.astrogrid.org/)
• BioSimGrid (http://www.biosimgrid.org/)
• BioGrid (http://www.biogrid.jp/)
• Bridges (http://www.brc.dcs.gla.ac.uk/projects/bridges/)
• eDiaMoND (http://www.ediamond.ox.ac.uk/)
• FirstDig (http://www.epcc.ed.ac.uk/~firstdig/)
• GeneGrid (http://www.qub.ac.uk/escience/projects.php#genegrid)
• GEON (http://www.geongrid.org/)
• IU RGRBench (http://www.cs.indiana.edu/~plale/projects/RGR/OGSA-DAI.html)
• myGrid (http://www.mygrid.org.uk/)
• N2Grid (http://www.cs.univie.ac.at/institute/index.html?project-80=80)
• ODD-Genes (http://www.epcc.ed.ac.uk/oddgenes/)
• OGSA-WebDB (http://www.gtrc.aist.go.jp/dbgrid/)
• MCS (http://www.isi.edu/~deelman/MCS/)
• INWA (http://www.epcc.ed.ac.uk/projects/inwa/)
• GridMiner (http://www.gridminer.org/)

11 Project classification

12 Projects using OGSA-DAI
• These projects form a list of case studies
  - Need to capture requirements
  - How OGSA-DAI is being used
  - Where it succeeds and where it fails
  - Other issues that arise
  - An ongoing process
• Only time to outline salient points from a couple of projects
  - More detail in the paper … but this only gives a top-level overview
    - Ongoing process …
• Solicit more
  - If you have more then please get in touch …

13 eDiaMoND
• e-Digital MammOgraphy National Database
• Built a prototype of a national database of mammographic images in support of the UK breast screening programme
• Employs Grid technologies to facilitate this process
• Mike Brady gave a keynote that went over the details

14 [Architecture diagram. Sites: UCL, KCL, UED, CHU. Per-site labels: Data Load, Training App, Core & Training API, Core Services, OGSA-DAI, DB2 Content Manager. Other labels: DB2 Federation, Database, Files, Training Services, Core API, Training API, Training Application]

15 eDiaMoND Findings
• eDiaMoND Findings:
  - OGSA-DAI provides a flexible framework
  - Dynamically configure the system through discovery
  - Activities can operate with different levels of granularity
  - Federation can be introduced at various levels
  - Upgrading from R3 to R4 broke some things
    - Low-level XML issues
  - Good documentation on how to extend the framework
    - Extended Activities to access IBM DB2 Content Manager

16 INWA Objectives
• Innovation Node Western Australia
  - Informing Business & Regional Policy: Grid-enabled fusion of global data and local knowledge
• Project
  - Ran from Nov 2003 to Aug 2004
  - Involved 10 partners (6 UK + 4 Australia)
• Aim
  - Data mine commercially sensitive data
  - Security an absolute MUST
  - Employ Grid technologies
  - Need access to data and computational resources
• Demonstrator using:
  - OGSA-DAI
    - Incorporate data resources
  - Sun DCG's TOG (Transfer-queue Over Globus)
    - Handle job submission to analyse micro array data

17 [Diagram labels: user@australia; Curtin, Australia; EPCC, UK; INWA; Grid Engine; Bank; Telco; OGSA-DAI; TOG; Data Browser; user@edinburgh; Telco data; Bank data; Australian property; UK Property]

18 INWA: Lessons Learned
• Performing Data Integration:
  - Time zone date problems
• Security issues:
  - Bugs in JavaCoG in GT3
    - OGSA-DAI could not switch security for Grid data transfers
    - TOG had no security option
  - All of these have been fixed
• Middleware not mature enough for commercial deployment
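The slide does not say what the time zone date problems were. One plausible illustration, offered purely as an assumption, is that records made at the same instant in Western Australia and the UK carry different local calendar dates, so a join on the local date misses them. The fixed offsets and the sample timestamps below are invented for the sketch.

```python
from datetime import datetime, timedelta, timezone

# Assumed fixed offsets for illustration: Western Australia (UTC+8) and
# British Summer Time (UTC+1). Real data would need proper zone rules.
PERTH = timezone(timedelta(hours=8))
UK_SUMMER = timezone(timedelta(hours=1))

# Two records describing the same instant, stamped in local time at each site.
perth_record = datetime(2004, 9, 2, 4, 30, tzinfo=PERTH)
uk_record = datetime(2004, 9, 1, 21, 30, tzinfo=UK_SUMMER)

# Joining on the local calendar date fails: 2 September vs 1 September.
local_dates_match = perth_record.date() == uk_record.date()

# Normalising both sides to UTC before comparing dates makes them agree.
perth_utc = perth_record.astimezone(timezone.utc)
uk_utc = uk_record.astimezone(timezone.utc)
utc_dates_match = perth_utc.date() == uk_utc.date()

print(local_dates_match, utc_dates_match)
```

Converting to a common reference such as UTC before integration is one common way to sidestep this class of mismatch; the slides do not state which fix INWA adopted.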

19 ODD-Genes
• OGSA-DAI Demo for Genetics
• Collaboration between
  - EPCC
  - Scottish Centre for Genomic Technology and Informatics (GTI)
  - Human Genetics Unit (HGU)
• ODD-Genes demonstrates:
  - Perform high-speed batch analysis of microarray data on the Grid
  - Browse the results of previous analyses stored in a database
  - View data from arbitrary databases as HTML
  - Discover related databases on the Grid
  - Perform coupled queries on newly discovered databases to provide a richer analysis of gene data

20 ODD-Genes Actors
[Diagram labels: HGU, Mouse Genome Information, OGSA-DAI, GTI, ODD-Genes Webapp, Micro Array Data, TOG, GridEngine, EPCC, Globus, DAISGR]
1. Client
2. EPCC is an example of a computational resource.
3. HGU is an example of a data repository.

21 ODD-Genes Findings
• Data discovery perceived to be very important
  - Map data views: time -> spatial locations
  - Discovery of new resources
• Transparency of data access
  - HGU had an XML database
  - GTI had a relational database
  - Deploy OGSA-DAI and not worry about databases
• Issues
  - Registry maintenance policy
  - Semantics of the discovery process
  - Groups working in the same area but with different schemas; no generic metadata (schemas were the effective metadata)
• Provides an additional tool for researchers

22 Other Projects
• AstroGrid
  - Identified (and fixed) a number of bugs
  - Passed on requirements
• FirstDig
  - Identified a number of bugs
  - Have contributed a data browser to OGSA-DAI
• GeneGrid
  - Interfacing Perl through an OGSA-DAI service to access biological databases
  - Requirement for file support
• EdSkyQuery-G
  - Collaboration between OGSA-DAI & Eldas
  - Based on the SkyQuery project by Johns Hopkins University, Baltimore, USA

23 More on Projects
• MSc at Edinburgh looking at data integration scenarios
  - Benchmarking OGSA-DAI
  - Investigating capabilities
    - GDSActivity allowing perform documents to be executed at other GDSs
    - Identifying further requirements for data integration: control flow (sequence, flow)
    - Question as to whether such capabilities should be included in OGSA-DAI or whether OGSA-DAI should interface with other workflow languages
• MSc at Edinburgh looking at C bindings to the OGSA-DAI CTK
  - For language independence need to provide more of these …
    - Perl, Python … Eiffel!!
• GridMiner
  - Have a really cute logo
  - Have a member of that team currently at NeSC

24 Conclusions
• Still early days
  - Standardisation process not stabilising quickly enough
  - Infrastructure still developing and prone to change
• OGSA-DAI acting as an enabler
  - Showing people what can be done
  - However, is it cracking a nut with a sledgehammer?
• Usage patterns are similar
  - Call for people to work together to solve similar problems
• Problems that are not OGSA-DAI specific
  - Metadata, time zones, security, …
• Data discovery perceived to be important
  - Is this in the scope of what OGSA-DAI should be doing?
• Need to talk to users and gather war stories
  - http://www.ogsadai.org.uk/projects
  - Ongoing process …

25 Further Information
• The OGSA-DAI Project Site:
  - http://www.ogsadai.org.uk
• The DAIS-WG site:
  - http://cs.man.ac.uk/grid-db
• OGSA-DAI Users Mailing list
  - users@ogsadai.org.uk
  - General discussion on grid DAI matters
• Formal support for OGSA-DAI releases
  - http://www.ogsadai.org.uk/support
  - support@ogsadai.org.uk
• OGSA-DAI training courses

