FirstDIG First Data Investigation on the Grid Paul Graham, Terry Sloan, Adam Carter EPCC Ian Gregory, Darren Unwin First South Yorkshire tel:+44 (0)131.

Slides:



Advertisements
Similar presentations
Database Planning, Design, and Administration
Advertisements

Components of GIS.
Enabling Access to Sound Archives through Integration, Enrichment and Retrieval WP1. Project Management.
Evaluation of a Large-scale VRE Implementation - ELVI Staff and students using the VRE benefit from the greater transparency and communication that it.
SERVING CORPORATES AND INDIVIDUALS ©2012 BUSINESS REPORTING MANAGEMENT SERVICES, INC WELCOME.
Solving Automation Reporting Problems with Dream Report Renee Sikes Applications Engineer Dream Report Brand Manager.
Customer relationship management.
® IBM India Research Lab © 2006 IBM Corporation Challenges in Building a Strategic Information Integration Infrastructure Mukesh Mohania IBM India Research.
Summary Role of Software (1 slide) ARCS Software Architecture (4 slides) SNS -- Caltech Interactions (3 slides)
Institute for Software Science – University of ViennaP.Brezany 1 Databases and the Grid Peter Brezany Institute für Scientific Computing University of.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Institute for Scientific Computing – University of ViennaP.Brezany 1 Databases and the Grid Peter Brezany Institute für Scientific Computing University.
Lecture Nine Database Planning, Design, and Administration
Copyright © 2008 SAS Institute Inc. All rights reserved. SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks.
1 e-science & data mining workshop, NeSC, UK, November 30 th, 2004 Terry Sloan EPCC, The University of Edinburgh INWA : using OGSA-DAI.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
It refers to the software used to manage the database.
Chapter 6– Artifacts of the process
26-28 th April 2004BioXHIT Kick-off Meeting: WP 5.2Slide 1 WorkPackage 5.2: Implementation of Data management and Project Tracking in Structure Solution.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
Chapter 9 Database Planning, Design, and Administration Sungchul Hong.
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
Middleware-Based OS Distributed OS Networked OS 1MEIT Application Distributed Operating System Services Application Network OS.
Geoff Payne ARROW Project Manager 1 April Genesis Monash University information management perspective Desire to integrate initiatives such as electronic.
CONGRESSIONAL SAMPLES FOR APPROXIMATE ANSWERING OF GROUP-BY QUERIES Swarup Acharya Phillip Gibbons Viswanath Poosala ( Information Sciences Research Center,
1 UK NeSC Meeting, November 18 th, 2004 Terry Sloan EPCC, The University of Edinburgh INWA : using OGSA-DAI in a commercial environment.
Microsoft Research Faculty Summit Paul Watson Professor of Computer Science Newcastle University, UK.
EdSkyQuery-G Overview Brian Hills, December
1 EPCC Sun Data and Compute Grids Project Using Sun Grid Engine and Globus to Schedule Jobs Across a Combination of Local.
CS480 Computer Science Seminar Introduction to Microsoft Solutions Framework (MSF)
Using OGSA-DAI in a commercial environment Terry Sloan EPCC Telephone:
DAME: The route to commercialisation Tom Jackson University of York.
Archivists' Toolkit - CRADLE Presentation, 10 Feb The Archivists’ Toolkit CRADLE Presentation 10 Feb
DAIT (DAI Two) NeSC Review 18 March Description and Aims Grid is about resource sharing Data forms an important part of that vision Data on Grids:
Archivists' Toolkit - CDL Presentation, October 17, 2005 The Archivists’ Toolkit Lee Mandell Brad Westbrook.
Max Ong University of Sheffield, UK. AHM 2004 Session 2.3: Workflow Composition, Wednesday 1 st September 2004, 4pm. Workflow Advisor in DAME Abstract.
"How much?": Aggregating usage data from Repositories in the UK Jo Lambert, Ross Macintyre, Paul Needham, Jo Alcock OR2015.
OGSA-DAI in OMII-Europe Neil Chue Hong EPCC, University of Edinburgh.
Towards an e-Science Roadmap Tony Hey Director UK e-Science Core Programme
Federated Database Set Up Greg Magsamen ITK478 SIA.
Usability Talk, 26 th January 2006 Development of Usable Grid Services for the Biomedical Community Prof Richard Sinnott Technical Director National e-Science.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
INFSO-RI Enabling Grids for E-sciencE OGSA DAI Data Access and Integration Marek Ciglan Institute of Informatics, Slovac Academy.
Experiences with OGSA-DAI : Portlet Access and Benchmark Deepti Kodeboyina and Beth Plale Computer Science Dept. Indiana University.
Supercomputing 2003, UK e-Science Booth 1 First Data Investigation on the Grid: FirstDIG Terry Sloan, Paul Graham, Adam Carter Edinburgh Parallel Computing.
Microsoft Management Seminar Series SMS 2003 Change Management.
Geoff Cawood, Terry Sloan Edinburgh Parallel Computing Centre (EPCC) Telephone: EPCC Sun Data and Compute.
Data Integration in Bioinformatics Using OGSA-DAI The BioDA Project Shirley Crompton, Brian Matthews (CCLRC) Alex Gray, Andrew Jones, Richard White (Cardiff.
Neil Chue Hong Project Manager, EPCC OGSA-DAI Requirements Gathering Exercise 2 nd DIALOGUE workshop eSI, 9-10.
Contract Year 1 Review IMT Tilt Thompkins MOS - NCSA 15 May 2002.
GSIM, DDI & Standards- based Modernisation of Official Statistics Workshop – DDI Lifecycle: Looking Forward October 2012.
OGSA-DAI Open Grid Services Architecture – Data Access and Integration NeSC Review 18 March 2004.
ATLAS Database Access Library Local Area LCG3D Meeting Fermilab, Batavia, USA October 21, 2004 Alexandre Vaniachine (ANL)
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
ERP and Related Technologies
Charaka Palansuriya EPCC, The University of Edinburgh An Alarms Service for Federated Networks Charaka.
Operations model Maite Barroso, CERN On behalf of EGEE operations WLCG Service Workshop 11/02/2006.
ESSnet project "Automated data collection and reporting in accommodation statistics" Objectives, achievements and results Köln,
GROUP PresentsPresents. WEB CRAWLER A visualization of links in the World Wide Web Software Engineering C Semester Two Massey University - Palmerston.
A Collaborative e-Science Architecture towards a Virtual Research Environment Tran Vu Pham 1, Dr. Lydia MS Lau 1, Prof. Peter M Dew 2 & Prof. Michael J.
CMS Experience with the Common Analysis Framework I. Fisk & M. Girone Experience in CMS with the Common Analysis Framework Ian Fisk & Maria Girone 1.
OGSA-DAI.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GOCDB4 Gilles Mathieu, RAL-STFC, UK An introduction.
Technology Strategy Update
UK e-Science OGSA-DAI November 2002 Malcolm Atkinson
PHP / MySQL Introduction
Database-Driven Web Sites
An Electronic Borrowing System Using REST
Tom Savel, MD Lead – Grid Technologies Medical Officer NCPHI, CDC
Presentation transcript:

FirstDIG First Data Investigation on the Grid Paul Graham, Terry Sloan, Adam Carter EPCC Ian Gregory, Darren Unwin First South Yorkshire tel:+44 (0)

Description First plc - UK’s largest public transport operator Data sources Huge range – mileage, revenue, fuel, maintenance, routes … Collected – manually, ticket machines, GPS … Disparate DBMS  Acquisitions, historical, OS, physical location, representation … Issues NOT unique to the bus industry Fine for day to day operations, but … Business questions – data from >1 source Complaints vs Lateness, Revenue vs Lost Miles … Aggregation – by service, by day, weekdays only … Introduces challenges for data analysis

Description First South Yorkshire situation No common interface No common reporting process Statistics produced manually when required Labour intensive Not performed often or well Process to produce what is needed Expensive Impractical

Description and Aims Open Grid Services Architecture: Data Access and Integration Assists with the access and integration of data from separate data sources via Grid Services Our remit:To evaluate the suitability of the use of OGSA-DAI in a commercial environment. If OGSA-DAI:  Is appropriate, secure, straightforward to deploy and use …  Does what we need!  Provide feedback to OGSA-DAI team Aims 1. Demonstrate deployment of OGSA-DAI within the First South Yorkshire bus operational environment and learn from it 2. Short data analysis using OGSA-DAI service enabled data sources to answer business questions posed by First South Yorkshire

Status: Workpackages WP 1: Data Source requirements capture (FINISHED) D1.1 Data Source Requirements Capture & D1.2 Organisation Data Schema (COMMERCIAL-IN-CONFIDENCE) WP 2: Development of data interfaces (FINISHED) OGSA-DAI Deployment WP 3: Deployment & refinement of OGSA-DAI (FINISHED) First Data Service Browser User Guide First Data Service Browser Software WP 4: Data mining requirements capture (FINISHED) D4.1 Data Mining Requirements Capture (COMMERCIAL-IN- CONFIDENCE) WP 5: Initial data mining analysis (FINISHED) D5.1 Initial Data Mining Report (COMMERCIAL-IN-CONFIDENCE) WP 6: Data mining detailed analysis (FINISHED) D6.1 Final Data Mining Report (COMMERCIAL-IN-CONFIDENCE)

Technical Achievements 1 Data Mining Combined two databases to answer First’s business questions The Customer Contact System  Microsoft Access  Information on customer complaints e.g. time, service, nature The Mileage database  dBASE IV  Information on bus mileage e.g. lost miles Also investigated Revenue and Schedule Adherence suitability for data mining Produced detailed data mining report

Technical Achievements 2 OGSA-DAI deployment at First South Yorkshire Created Grid Data Services for DBMS previously unsupported by OGSA-DAI MS Access – CCS, dBASE IV – Mileage Investigated GDS for SQL Server and CVS-based DBMS Rigorously exercised use of OGSA-DAI in a commercial setting: Identified numerous areas for improvement in OGSA- DAI Identified new requirements for use of OGSA-DAI in business Confirmed the relevance and potential of OGSA-DAI for business

Technical Achievements 3 Data Service Browser Identified need to aid ‘ease of use’ for OGSA-DAI Middleware Developed a generic Grid Data Service Browser Simple GUI – avoids XML etc Allows SQL queries and updates to databases Enables JOIN queries across databases Will be included in future OGSA-DAI releases … demo later

Achievements – First’s perspective Project has proven that: There is a cost-effective solution that First South Yorkshire can utilise First can get to its data and analyse it in a useful manner With considerably reduced labour time First can produce more accurate and more wide- ranging information for the business management

Achievements “the results of this exercise will revolutionise the way we do things in the bus industry” Darren Unwin Divisional IT Manager

Dissemination Presentations Ernst & Young, WestInfo Services, Strategy & Performance Associates, SingTel Optus, Executive Briefing Centre, Curtin Business School, Curtin University of Technology, Perth Australia, February 24 th, 26 th, Curtin Business School Information Systems Seminar, Curtin University of Technology, Perth, Australia, February 20 th 2004 UK e-Science booth, Supercomputing 2003, Phoenix, USA, November 2003 Flyers UK e-Science All Hands Conference, Nottingham, UK 2-4 September 2003 Posters UK e-Science All Hands Conference, Nottingham, UK 2-4 September 2003 Articles T.M.Sloan, A.Carter, P.J.Graham, D.Unwin, I.Gregory, "First Data Investigation on the Grid: FirstDIG", Proceedings of the 2nd UK e- Science All Hands Meeting, 2-4 September, 2003, Nottingham, UK

Exploitation First Data Service Browser is being used and extended in the INWA project with Curtin Business School, Perth, Australia First are keen to extend their deployment to other databases

Future Plans Project is finished, no effort remaining. Incorporation of First Data Service Browser into future releases of OGSA-DAI First South Yorkshire want to build management reporting applications based on OGSA-DAI

Demo Data Service Browser Accessing three different DBMS Mileage, CCS, MySQL A JOIN – similar to the queries required for the data mining Easy within one DB, requires intermediary steps for distributed DB Without OGSA-DAI would have been impractical Looking at Lost Miles and Customer Complaints

Run the Demo

In Conclusion Successfully demonstrated the use of Grid middleware in a ‘real-world’ environment OGSA-DAI team: Gained (in)valuable feedback Incorporated Data Service Browser First Discovered valuable information from their data which would have otherwise been practically unobtainable Keen to extend to other DBMS