Presentation is loading. Please wait.

Presentation is loading. Please wait.

A PPARC funded project The Grid Data Warehouse Description of prototype work in progress by AstroGrid. Access-Grid lecture to Universities of Leeds and.

Similar presentations


Presentation on theme: "A PPARC funded project The Grid Data Warehouse Description of prototype work in progress by AstroGrid. Access-Grid lecture to Universities of Leeds and."— Presentation transcript:

1 A PPARC funded project The Grid Data Warehouse Description of prototype work in progress by AstroGrid. Access-Grid lecture to Universities of Leeds and Sheffield by Guy Rixon on 2004-02-04

2 2 04-02-2004GDW description: access-grid lecture AstroGrid: the UK Virtual Observatory } Seven UK astronomy departments collaborating to build a Virtual Observatory (VO) for the use of the entire astronomical community.

3 3 04-02-2004GDW description: access-grid lecture IVOA: the community of VO projects

4 4 04-02-2004GDW description: access-grid lecture Purpose of the virtual observatory To combine data from all sources into a data grid. Data grid Private files Archives Live feeds Bibliographies Data sets can be images (mainly in files) or tabular (mainly in RDBMS).

5 5 04-02-2004GDW description: access-grid lecture Example of VO use “ Find brown dwarf candidates: combine optical (e.g. APM catalogue) and IR (e.g. 2MASS) data to select by colour. Combine multi-epoch data to determine proper motions; select high-PM fraction of colour- selected sample. Then use that sample to…” Optical archive IR archive 2 nd epoch Colour sample Refined sample 3 rd epoch

6 6 04-02-2004GDW description: access-grid lecture VO as collection of web sites: no good Each site has different query protocol Results only go to browser, not to RDBMS, reprocessing Results in HTML etc not machine readable Basic web sites are not sufficient for the VO.

7 7 04-02-2004GDW description: access-grid lecture Grid metaphor: electricity supply Loadsa complex equipment Simple delivery to consumer Get your power from any supplier: commodity

8 8 04-02-2004GDW description: access-grid lecture Commodities in astronomy data grid Common s/w on desktop Algorithms Archives Writeable Storage Registry of resources (Processors) Bulk data transport; machine- readable results; combined inside grid Metadata transport

9 9 04-02-2004GDW description: access-grid lecture AstroGrid topology PortalRegistry AlgorithmsWriteable storageArchives Workflow

10 10 04-02-2004GDW description: access-grid lecture Difficult RDBMS operations “Select objects with V-K > 4.5…” (i.e. find ‘red’ objects). U, B, V, ROptical archive service IR archive service J, H, K ? No std. way of combining DBs. No std. way of storing results in RDBMS ?

11 11 04-02-2004GDW description: access-grid lecture Need for data warehouse Join across internet RDBMS Join inside warehouse DB 1000x speed gains

12 12 04-02-2004GDW description: access-grid lecture GDW topology extends AstroGrid Portal File storageArchive Workflow Registry Grid-DB (OGSA-DAI) Warehouse controller Grid-DB (OGSA-DAI)

13 13 04-02-2004GDW description: access-grid lecture GDW people Kona Andrews (Cambridge) Elizabeth Auden (MSSL) Martin Hill (Edinburgh) Tony Linde (Leicester) Clive Page (Leicester) Guy Rixon (Cambridge) Noel Winstanley (Jodrell Bank)

14 14 04-02-2004GDW description: access-grid lecture Current system Portal File storageArchive Workflow Registry Grid-DB (OGSA-DAI) Warehouse controller Grid-DB (OGSA-DAI) Link not implemented yet DB tables preloaded; read-only DB Link temporarily redirected

15 15 04-02-2004GDW description: access-grid lecture Next system (3Q2004) Portal File storageArchive Workflow Registry Grid-DB (OGSA-DAI) Warehouse controller Grid-DB (OGSA-DAI) Limited choice Links implemented properly (GridFTP) Two dedicated installations inside AstroGrid; multi- user

16 16 04-02-2004GDW description: access-grid lecture Ultimate system (2005+) Portal File storageArchive Workflow Registry Warehouse controller Grid-DB (OGSA-DAI) AstroGrid UK e-Science grid / EGEE One node per user; any storage node

17 17 04-02-2004GDW description: access-grid lecture Assessment Basic idea is sound Coding of GDW was quite simple Very difficult to get it all integrated Problems with OGSA-DAI: Performance Data-size limits Can’t get higher functions to work yet Proceed? Yes; need to experiment further Still expect to get science out of it

18 18 04-02-2004GDW description: access-grid lecture Can one use it? Beta testers invited Wait for release of “Iteration 4.1” system (soon!) Wait for release of “Iteration 5” system (3Q2004) to see GDW useful for science AstroGrid final release is at the end of 2004 http://wiki.astrogrid.org/bin/view/Astrogrid/BetaTesting

19 19 04-02-2004GDW description: access-grid lecture That’s all folks!


Download ppt "A PPARC funded project The Grid Data Warehouse Description of prototype work in progress by AstroGrid. Access-Grid lecture to Universities of Leeds and."

Similar presentations


Ads by Google