Presentation is loading. Please wait.

Presentation is loading. Please wait.

A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Similar presentations


Presentation on theme: "A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic."— Presentation transcript:

1 A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic Meeting, Paris, November, 2004 Presented by Philip McInerney

2 Topics – Web Data-Delivery Introduction Example - Geoscience Australia JetStream Architecture / Design System built on standard protocols of the web Design elements: Catalog and Geospatial Intelligence JetStream Implementation Consultation, Customisation, Integration The Australian Experience Future Vision Distributed Data Management and Delivery

3 Introduction & Web Data-Delivery Example Geoscience Australia’s GADDS

4 Intrepid’s Web Data-Delivery System Intrepid Geophysics develops and maintains the Intrepid Geophysical Data Management and Processing Software In the last two years we have used … Intrepid’s data management and processing the standard protocols of the world-wide-web OpenDAP standards as the basis for developing a web data- delivery system … called JetStream Introduction

5 Geoscience Australia - GADDS April, 2003 – web data-delivery pilot –Intrepid’s JetStream system installed and successfully delivers a small sample of survey datasets + continental-scale grids –The study is rapidly expanded with the goal of delivering all of GA’s survey datasets November, 2003 – GADDS launched –The Geophysical Archive Data Delivery System (GADDS) is formally launched –GA’s 50Gb archive of magnetic and gravity data are freely available across the web (across the globe!) … with minimal administrative overhead GADDS Example

6 Geoscience Australia - GADDS June, 2004 – Expansion –Upgraded to deliver 256-channel radio- metric line datasets, and multi-band grids (K, U, Th & Total Count) November, 2004 – Vic, Qld data added –Datasets from state government surveys were added to pool of data being served –Data from most states will be included by mid-2005 GADDS Example

7 Define Area of Interest GADDS Example

8 Define Area of Interest GADDS Example

9 Define Area of Interest GADDS Example

10 Define Area of Interest GADDS Example

11 Define Area of Interest GADDS Example

12 Select DataType and Theme GADDS Example

13 Search Results... GADDS Example

14 Metadata Review GADDS Example

15 Metadata Review GADDS Example

16 Metadata Review GADDS Example

17 Metadata Review GADDS Example

18 Dataset Selection GADDS Example

19 Dataset Selection GADDS Example

20 Submit Request GADDS Example

21 Submit Request GADDS Example

22 Data Ready … Download GADDS Example

23 Dataset Download GADDS Example

24 GADDS Example - Summary We used a standard browser We viewed standard web-pages We made some simple choices Area of interest Type of data: Vector or Grid Theme: Magnetics, Gravity, Radiometrics, … We chose to download one survey dataset Selected fields of the dataset Chose the Datum, Projection, and file format We received an when data were ready We used a standard web ‘download to file’ GADDS Example

25 JetStream Architecture/Design 1.Schematic Diagram 2.Design: Catalog 3.Design: Geospatial Intelligence

26 JetStream Client-Server Architecture Apache Web Server Explorer Netscape Web Client TCP/IP HTTP Firewall Tomcat (or equivalent) JetStream Processes JetStream Server JetCat Catalog OpenDAP Drivers Intrepid Processes Data Administrator Binary Datasets Architecture / Design

27 JetStream Server Architecture ApacheTomcat (or equivalent; Servlets Container) Client queries Get additional information Queue (Process) Management - Dataset extract & process - Zipping the requested data - ‘data-ready’ service Download management - Dataset extract & process - Zipping the requested data - ‘data-ready’ service Web ServerJetStream Server JetCat Catalog Interface to a web-browser Client Data Administrator (Acquire_Catalog, …) Binary Datasets Intrepid Processes OpenDAP Drivers In a distributed system the data and processes would be located on ‘remote’ servers Architecture / Design

28 Design: Catalog ApacheTomcat (or equivalent; Servlets Container) Client queries Get additional information Queue (Process) Management - Dataset extract & process - Zipping the requested data - ‘data-ready’ service Download management - Dataset extract & process - Zipping the requested data - ‘data-ready’ service Web ServerJetStream Server JetCat Catalogue Interface to a web-browser Client Data Administrator (Acquire_Catalog, …) Binary Datasets Intrepid Processes OpenDAP Drivers Architecture / Design JetCat Catalog Data Administrator (Acquire_Catalog, …)

29 Design: Catalog The catalog … is at the heart of the JetStream System; all interactive client-queries interrogate the catalog rather than the binary datasets is a very simple data structure … –a flat table; can be ASCII, Access, Oracle table, … –one record per dataset –a small number of essential fields … »Lat/Long limits of the dataset »URL address of the dataset »Data-type and ‘theme’ –additional user-defined fields can contain any other metadata that might be pertinent to the application Architecture / Design

30 Design: Catalog Maintaining the catalog is the essential data-administrative task The automated ‘Acquire_Catalog’ administrative tool ‘harvests’ metadata from the data-files an intelligent ‘Data Manager’ tool facilitates additional manual maintenance of the catalog Architecture / Design

31 Design: Geospatial Intelligence ApacheTomcat (or equivalent; Servlets Container) Client queries Get additional information Queue (Process) Management - Dataset extract & process - Zipping the requested data - ‘data-ready’ service Download management - Dataset extract & process - Zipping the requested data - ‘data-ready’ service Web ServerJetStream Server JetCat Catalogue Interface to a web-browser Client Data Administrator (Acquire_Catalog, …) Binary Datasets Intrepid Processes OpenDAP Drivers Architecture / Design Intrepid Processes

32 Design: Geospatial Intelligence JetStream is geospatially intelligent about a wide variety of industry file types; this means that … it can interpret such files to determine a dataset’s location and extent it can ‘look into’ such files … and intelligently extract subsets of the contained data; Subsets may be spatial subsets, or some subset of fields of the dataset This geospatial intelligence is achieved through ‘Intrepid Processes’ – the functions of the Intrepid Geophysical Data Management and Processing System Architecture / Design

33 Design: Geospatial Intelligence Geospatial intelligence is used … –to maintain the catalog … the ‘Acquire_Catalog’ administrative tool intelligently ‘harvests’ metadata from the data-files themselves the Data Manager tool also uses intelligent analysis of data-files to assist manual administrative tasks. –to present ‘on-the-fly’ previews of the data to the end-user client (e.g. thumbprint image displays) –to extract subsets of data – either spatial or by selected dataset fields – in order to deliver to the client only that subset of data requested; in effect, reducing download time by excluding data not required by the client Architecture / Design

34 Geospatial Intelligence – File Types File-typeBack-Office PreviewWeb-based Delivery Vector Intrepid DBYY Geosoft GDBYY OracleY (… and other RDB’s)Y ESRI Shape filesYY SEG-Y (seismic)Soon Grid Files ERMapperYY GeosoftYY netCDFYY Image Files GeoTiffsYY Jpeg (with.jgw)YY Tiff (with.tfw)YY ECW + AlgorithmsSoon Architecture / Design

35 Geospatial Intelligence – File Types Although JetStream can treat many file-types intelligently – and extract subsets of data from such files - it is also possible to configure the system to deliver any file simply add a file to the catalog … assigning the essential fields of ‘dataset extents’ … and JetStream can deliver that file across the web Nominate specific file-extensions to be ‘associated’ files; e.g. a “.doc” or “.pdf” might be ‘report’ files associated with a survey dataset; the files can be ‘associated’ by using the same base-file-name; the data file, and it’s associated report file, will be web-delivered together Architecture / Design

36 JetStream Implementation Consultation, Customisation, Integration

37 ‘Off-the-Shelf’ Solution ? Yes, but … JetStream is an ‘off-the-shelf’ web data- delivery solution … which must be integrated into a corporation’s business In our experience, implementation requires … –Consultation –Integration with existing data-management systems into existing web-interface systems –Customisation of the system JetStream is flexible, with many options, Balance this with the value of keeping web-pages simple! Implementation

38 Integration with Legacy Systems Consultation needs to identify opportunities of using existing data management systems JetStream can use sources of systematic metadata in various ways … –JetStream’s Catalog can simply be an existing database table … in Access, Oracle, ASCII, … –an existing database table can be used to initially populate a Catalog –if a legacy system is maintained … then the Catalog can be regularly refreshed from that system’s database tables Implementation

39 Integration into Existing Web-Pages Again – consultation should explore opportunities to build upon any existing investment in an organisation’s web- systems For example, PIRSA had developed an Arc-IMS site for map-composition and map-based querying of spatial databases; JetStream was integrated with that system within days of delivery Implementation

40 Integration – PIRSA Example Implementation Existing Arc-IMS investment in web-pages JetStream is added with a single TAB panel

41 Customisation – Corporate Image Despite obvious differences – the GA and PIRSA sites have almost identical JetStream functionality JetStream (an ‘off-the-shelf’ solution) can be implemented behind a customised web-page environment … customised to corporate needs … to maintain a corporate image, or integrate with an existing system, etc. Implementation

42 Customisation – Client Service Despite being a ‘off-the-shelf’ solution, JetStream is very flexible – and there is considerable scope to tailor the web-page interface to clients needs … the GA interface, for example, provides access to metadata to assist the client’s selection the PIRSA site provides much less metadata For a seismic data service, one would want to see seismic line locations at the time of defining an ‘area of interest’ … Implementation

43 Customisation – Client Service Display seismic line location to assist definition of ‘area of interest’ Implementation

44 Customisation – Client Service Preview SEG-Y image … then request to download the SEG-Y data file Implementation

45 Australian Experience Client Perspective Data Provider Perspective

46 GADDS – Customer Reaction Well received by the customer base –Clients have expressed satisfaction with the access via universally available web- browsers –The simple series of web-pages provide sufficient metadata to facilitate effective dataset selection –Clients have found the dataset delivery mechanism – notification by , with a URL link – is effective and practical Australian Experience

47 GADDS – Benefits for GA Assists GA in their charter to ‘make data freely available’ … –Use of the universally available ‘web- browser’, and delivery via standard web- download protocols – ensures ease-of- access for all. (No proprietary software needed by the client). –Clients construct their own queries to find ‘what data are available ?’ Australian Experience

48 GADDS – Benefits for GA Reduced data administration overheads –The JetStream system facilitates the dataset management, with features to assist the maintenance of the system catalogue –Significant reduction in clerical staff. Tasks, such as answering client queries, taking orders, extracting datasets from archives, arranging delivery … are now automated Australian Experience

49 PIRSA Experience JetStream has improved our efficiencies More time is available to add value to the products available More and more stakeholders access our potential field data via JetStream. Data are also being better managed centrally via JetStream. Australian Experience Domenic Calandro, Manager, Geoscience Datasets

50 PIRSA Experience – Client View JetStream is being very well received by our stakeholders, particularly international users, and users with broadband internet access JetStream definitely contributed to our "number 1 status in the world" for delivery of pre- competitive data (Report of the independent Fraser Institute) Australian Experience Domenic Calandro, Manager, Geoscience Datasets Fraser Institute Survey: 100% of respondents considered the South Australian geoscience databases to encourage exploration investment

51 Future Vision Distributed Systems

52 Distributed Data Management We see the JetStream system as having a ‘data management’ function in addition to web-data-delivery In the context of data-management it is essential to think in terms of distributed systems Today many organisations operate on a ‘distributed’ basis – with authority and responsibility distributed to regions. It is frequently impractical to centralise the management of data in such organisations Distributed Systems

53 Distributed Data Management The main advantage of distributed management of data comes from the ‘divide and conquer’ principal … the ‘problem’ remains small ! the regional office has a greater interest in the management of their data At the same time, however, clients in other parts of the organisation may want to know ‘what data are available’ … and request a copy of those data … so … Distributed Systems

54 Data Delivery in a Distributed World The goal … Distributed Systems manage data locally access data globally

55 JetStream in a Distributed World The data, Intrepid Processes and the Catalog can be distributed Distributed Systems ApacheTomcat (or equivalent) JetStream Processes Web ServerJetStream Server JetCat Catalog OpenDAP Drivers Intrepid Processes Data Administrator Binary Datasets

56 JetStream in a Distributed World The data, Intrepid Processes and the Catalog can be distributed Distributed Systems ApacheTomcat (or equivalent) JetStream Processes Web ServerJetStream Server JetCat Catalog OpenDAP Drivers Intrepid Processes Data Administrator Binary Datasets

57 JetStream in a Distributed World The data, Intrepid Processes and the Catalog can be distributed Distributed Systems ApacheTomcat (or equivalent) JetStream Processes Web ServerJetStream Server OpenDAP Drivers Data Administrator Catalog Intrepid Binary Datasets Catalog Intrepid Binary Datasets Catalog Intrepid Binary Datasets

58 JetStream in a Distributed World Advantages Local management of datasets is efficient No administrative overhead of centralisation Can access data globally – only accessing it when I want it Distributed Systems

59 Extending the Vision The ultimate goal is to link from one distributed network to other networks … e.g. “I don’t need to manage those data; the European Union is managing it for me … and I’ll go back and get it when I want it” Distributed Systems

60 Extending the Vision A geologist of Global Petroleum Inc. constructs a query to locate seismic data in the North Sea – and queries the corporate net The query delivers metadata information and download options back to the desktop – from multiple distributed data repositories Distributed Systems Global Petroleum Inc. EuroSeismic Network The query construct is forwarded to a special port of the EuroSeismic network

61 Summary JetStream uses the standard protocols of the web (not re- inventing wheels) For geo-spatial data – we can be intelligent about it; we know its extent, we can extract subsets out from it, we have tools to assist data management (SEG-Y ? Soon) We integrate with existing data management, with existing web-systems; we don’t replace, but build-on existing investment We like the KISS principle … Keep It Simple, Simon! Web-delivery is providing client satisfaction, and benefits to the data-provider – improved client service, improved data management, scope to value-add, reduced costs We believe that distributed data management is practical, and are confident that JetStream can deliver global access with such distributed systems

62 Acknowledgements Thank you for the opportunity to make this presentation to you today Thanks also to … Geoscience Australia –http://www.geoscience.gov.au/gadds Primary Industry & Resources, South Australia –http://www.pir.sa.gov.au/pages/minerals/sarig/sarig.htm


Download ppt "A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic."

Similar presentations


Ads by Google