1 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre The Data Portal Glen Drinkwater.

Slides:



Advertisements
Similar presentations
1 WP2: Data Management Paul Millar eScience All Hands Meeting September
Advertisements

Abstraction Layers Why do we need them? –Protection against change Where in the hourglass do we put them? –Computer Scientist perspective Expose low-level.
Grid Resource Allocation Management (GRAM) GRAM provides the user to access the grid in order to run, terminate and monitor jobs remotely. The job request.
1 CEOS/WGISS20 – Kyiv – September 13, 2005 Paul Kopp SIPAD New Generation: Dominique Heulet CNES 18, Avenue E.Belin Toulouse Cedex 9 France
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
The Community Authorisation Service – CAS Dr Steven Newhouse Technical Director London e-Science Centre Department of Computing, Imperial College London.
PROGRESS: ICWS'2003 Web Services Communication within the PROGRESS Grid-Portal Environment Michał Kosiedowski.
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
Towards the Design and Implementation of the DAME prototype: OGSA Compliant Grid Services on the White Rose Grid Sarfraz A Nadeem University of Leeds.
John Kewley e-Science Centre GIS and Grid Computing Workshop 13 th September 2005, Leeds Grid Middleware and GROWL John Kewley
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
Data Grids: Globus vs SRB. Maturity SRB  Older code base  Widely accepted across multiple communities  Core components are tightly integrated Globus.
Mike Jackson EPCC OGSA-DAI Today Release 2.2 Principles and Architectures for Structured Data Integration: OGSA-DAI.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
© Geodise Project, University of Southampton, Data Management in Geodise Jasmin Wason, Zhuoan Jiao and Marc Molinari Engineering.
C Copyright © 2009, Oracle. All rights reserved. Appendix C: Service-Oriented Architectures.
1 Dr. Markus Hillenbrand, ICSY Lab, University of Kaiserslautern, Germany A Generic Database Web Service for the Venice Service Grid Michael Koch, Markus.
Using the SAS® Information Delivery Portal
GRAM: Software Provider Forum Stuart Martin Computational Institute, University of Chicago & Argonne National Lab TeraGrid 2007 Madison, WI.
Long Term Ecological Research Network Information System LTER Grid Pilot Study LTER Information Manager’s Meeting Montreal, Canada 4-7 August 2005 Mark.
Accounting for the Grid Usage Records and a Resource Usage Service.
CoG Kit Overview Gregor von Laszewski Keith Jackson.
ANSTO E-Science workshop Romain Quilici University of Sydney CIMA CIMA Instrument Remote Control Instrument Remote Control Integration with GridSphere.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 Technologies for distributed systems Andrew Jones School of Computer Science Cardiff University.
© Geodise Project, University of Southampton, Data Management in Geodise Zhuoan Jiao, Jasmin Wason and Marc Molinari
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
National Computational Science National Center for Supercomputing Applications National Computational Science NCSA-IPG Collaboration Projects Overview.
Science Gateway Workshop GGF14 28 th June 2005, Chicago CCLRC Portal Infrastructure to Support Research Facilities Dharmesh Chohan e-Science Grid Technology.
authenticated networked guided environment for learning - secure integration of learning environments with digital libraries - Current.
SEEK EcoGrid l Integrate diverse data networks from ecology, biodiversity, and environmental sciences l Metacat, DiGIR, SRB, Xanthoria,... l EML is the.
XML Web Services Architecture Siddharth Ruchandani CS 6362 – SW Architecture & Design Summer /11/05.
WebServices, GridServices and Firewalls Matthew J. Dovey Technical Manager Oxford e-Science Centre
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Code Applications Tamas Kiss Centre for Parallel.
Communicating Security Assertions over the GridFTP Control Channel Rajkumar Kettimuthu 1,2, Liu Wantao 3,4, Frank Siebenlist 1,2 and Ian Foster 1,2,3 1.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
Grids - the near future Mark Hayes NIEeS Summer School 2003.
Rob Allan Daresbury Laboratory A Web Portal for the National Grid Service Xiaobo Yang, Dharmesh Chohan, Xiao Dong Wang and Rob Allan CCLRC e-Science Centre,
© Geodise Project, University of Southampton, Geodise Middleware & Optimisation Graeme Pound, Hakki Eres, Gang Xue & Matthew Fairman Summer 2003.
Web Portal Design Workshop, Boulder (CO), Jan 2003 Luca Cinquini (NCAR, ESG) The ESG and NCAR Web Portals Luca Cinquini NCAR, ESG Outline: 1.ESG Data Services.
Building the e-Minerals Minigrid Rik Tyer, Lisa Blanshard, Kerstin Kleese (Data Management Group) Rob Allan, Andrew Richards (Grid Technology Group)
The NGS Grid Portal David Meredith NGS + Grid Technology Group, e-Science Centre, Daresbury Laboratory, UK
Mike Jackson EPCC OGSA-DAI Architecture + Extensibility OGSA-DAI Tutorial GGF17, Tokyo.
Experiences with OGSA-DAI : Portlet Access and Benchmark Deepti Kodeboyina and Beth Plale Computer Science Dept. Indiana University.
EGEE User Forum Data Management session Development of gLite Web Service Based Security Components for the ATLAS Metadata Interface Thomas Doherty GridPP.
Metadata for structural science Workshop on research metadata in context Nijmegen, 7–8 September 2010 Simon Lambert STFC e-Science UK.
© Geodise Project, University of Southampton, Data Management in Geodise Zhuoan Jiao, Jasmin Wason & Marc Molinari { z.jiao,
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
Registries, ebXML and Web Services in short. Registry A mechanism for allowing users to announce, or discover, the availability and state of a resource:
Portal Update Plan Ashok Adiga (512)
Hussein Suleman University of Cape Town Department of Computer Science Digital Libraries Laboratory February 2008 Data Curation Repositories:
Representational State Transfer (REST). What is REST? Network Architectural style Overview: –Resources are defined and addressed –Transmits domain-specific.
Introduction to Grid Computing and its components.
© Geodise Project, University of Southampton, Integrating Data Management into Engineering Applications Zhuoan Jiao, Jasmin.
Intro to Web Services Dr. John P. Abraham UTPA. What are Web Services? Applications execute across multiple computers on a network.  The machine on which.
1 AHM, 2–4 Sept 2003 e-Science Centre GRID Authorization Framework for CCLRC Data Portal Ananta Manandhar.
Data Manipulation with Globus Toolkit Ivan Ivanovski TU München,
John Kewley e-Science Centre All Hands Meeting st September, Nottingham GROWL: A Lightweight Grid Services Toolkit and Applications John Kewley.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
The NGS Grid Portal David Meredith NGS + Grid Technology Group, e-Science Centre, Daresbury Laboratory, UK
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Holding slide prior to starting show. Lessons Learned from the GECEM Portal David Walker Cardiff University
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
Grid Portal Services IeSE (the Integrated e-Science Environment)
Lecture 1: Multi-tier Architecture Overview
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
Presentation transcript:

1 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre The Data Portal Glen Drinkwater

2 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre Motivation Council for the Central Laboratory of the Research Councils‘ has many large scale facilities. E.g. ISIS, a pulsed neutron source and Diamond, the UK’s new synchrotron light source. Data Portal Making existing scientific data resources accessible through a single interface. Acting as a broker between scientists, facilities and data. Benefits Repetition of experiments can be avoided. Collaborations can be built by identifying that someone else is working in a similar area. Data about a related material can be found and used to aid a new analysis. Data can be reanalysed when better analysis tools becomes available

3 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre General Architecture Local data Local metadata Facility 1 Local data Local metadata Facility 1 Local data Local metadata Facility 1 Local data Local metadata Facility N Local data Local metadata Facility 6 Local data Local metadata Facility 4 Local data Local metadata Facility 5 User

4 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre General Architecture Local data Local metadata Facility 2 Local data Local metadata Facility N Local data Local metadata Facility 1 CCLRC Data PortalOther Data Portal instances Xml Wrapper

5 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre Core Modules Web Interface, Query and Reply, Lookup and Help. Important function grouped into modules, each modules with a web service interface, interface description in WSDL and communicate via SOAP.

6 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre Additional Modules Access Control, Authentication and Authorisation, Data Transfer, Shopping Cart, User Administration, Facility Administration and Accounting. Functions grouped into modules, each with web services interface, interface description in WSDL, communication via soap. These Modules could be shared or exchanged with their own implementation.

7 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre External Services XML-Wrapper, RasDaMan Insertion and Extraction, Specialised Search, HPCPortal, Visualisation Portal, SRB, other DataPortal Instances. Other services that are linked with the DataPortal, but are not integral part of it. Registered with the Portal and accessible via web services interface.

8 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre Authentication Session Manager Access & Control FACILITY B Web Interface Authentication USER 1: get certificate(name, passphrase) Access & Control FACILITY A 2: getUserPrivileges (certificate) 3: startSession(certificate, permissions) OUTSIDE SERVICE START: Login(name, passphrase) 4: return SID START: Login(name, passphrase, lifetime) FACILITY A ACCESS & CONTROL DATABASE FACILITY A ACCESS & CONTROL DATABASE SESSION MANAGER DATABASE Set permissions in database for all facilities for duration of session Get permissions from database for facility A Session Manager 5: getPermissions(SID) MyProxy

9 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre Basic Search Session Manager XML Wrapper FACILITY B Web Interface Query & Reply User OUTSIDE SERVICE START: basic query (list of facilities, discipline) XML Wrapper FACILITY A 2: getXML (discipline) FACILITY A metadata archive FACILITY B metadata archive SQL selects FINISH: XML results in CLRC scientific metadata format 1: getPermissions(SID) 2: getXML (discipline) FINISH: XML results in CLRC scientific metadata format

10 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre Shopping Cart Web Interface Shopping Cart USER START: Add item to Shopping Cart SHOPPING CART database 1: addToCart(SID, item) FINISH: success/fail Session Manager 2: getCert(SID) 3: add item to database (DN, item) 4: success/fail

11 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre HPC Portal Another e-Science Centre project to develop a Web portal to search for resources and submit HPC applications to a computational Grid. Uses Globus toolkit v2.2 Functionalities include: Resource Management: GRAM. Information Services: MDS. Data Management: GridFTP and GASS. All use GSI security protocol as the connection layer.

12 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre Integrated Portals Visualisation DataPortal HPCPortal HPC Systems Data Systems Web Services Working with GGF Grid Computing Environments Research Group GridFTP GSI Globus

13 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre Signal Sign on How do you have signal sign on? Both HPC and DataPortal have their own Session Managers which rely on Globus Proxy Certificates. Integrated session managers communicate over SSL using mutual authentication between the web servers. Allows certificates to be a transferred between portals allowing single sign on. The certificate can then be used for GSI authentication.

14 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre Single Sign on DataPortalHPC Portal USER 1: Login(username,password,lifetime) Via Authentication Module Dataportal Session Manager 5: RequestCert(SID) 7: YES! 4: isValid(SID) 2: Dataportal SID 3: LoginHPC(SID) HPC Session Manager 6: Certificate 8: HPC Session id FINISH: User is sent to HPC front page to use its services START: Log on to DataPortal then to HPC Portal

15 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre Scenario User logs on to Data Portal and searches for data. The data found is added to the persistent shopping cart. The user could then transfer the data to another machine using GSI FTP, either using the Data Portal or the HPC Portal. Using single sign on, the user could then go directly to the HPC, and the use a remote job run on the data that they have just transferred and using GSIFTP, transfer the results back to their machine for analysis.

16 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre Current Work Converting to a new Xml Wrapper architecture Problems: DTD limits us to one investigation per study. Any new wrappers should be written using the new standard. Only allows one investigation per study (as using the old format). XSLT time consuming to write and must operate on the whole study. Need to transfer the whole studys – this can be large. Data Archive goes down we cannot access study even though it is cached.

17 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre Old Xml Wrapper Architecture DataPortal Xmlwrapper cache Data Archive SQL ‘like’ where clause Studies in XML Csmd v1 format SQL statements and replies

18 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre Solution DataPortal (via Q&R) XMLwrapper: Doc Selector XMLwrapper: Doc Builder Cache XML Document Repository Data Archive Xml Wrapper Framework XQuery XQuery & Result Returned Update Repository with New and changed CSMD XML docs SQL Selects and Results Mapping between Archive schema And CSMD format

19 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre Advantages Xml Wrapper still operates if archive is down. Concurrency much better supported as selecting from the archive is essentially one thread. Only relevant data sent between xml wrapper and Data Portal – also XQuery can include formatting rules which can potentially do away with the need to XSLT.

20 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre Questions?