Top Banner
A flexible and generic web- service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic Meeting, Paris, November, 2004 Presented by Philip McInerney
62

A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Dec 15, 2015

Download

Documents

Jenifer Mowdy
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

A flexible and generic web-servicefor the delivery of geophysical data

Experiences from 2 years of Intrepid’s JetStream System in

Australia

EuroSeismic Meeting, Paris, November, 2004

Presented by Philip McInerney

Page 2: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Topics – Web Data-Delivery

• Introduction• Example - Geoscience Australia• JetStream Architecture / Design

• System built on standard protocols of the web• Design elements: Catalog and Geospatial Intelligence

• JetStream Implementation• Consultation, Customisation, Integration• The Australian Experience

• Future Vision• Distributed Data Management and Delivery

Page 3: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Introduction&

Web Data-Delivery Example

Geoscience Australia’s GADDS

Page 4: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Intrepid’s Web Data-Delivery System

• Intrepid Geophysics develops and maintains the Intrepid Geophysical Data Management and Processing Software

• In the last two years we have used …• Intrepid’s data management and processing• the standard protocols of the world-wide-web• OpenDAP standards

as the basis for developing a web data-delivery system … called JetStream

Introduction

Page 5: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Geoscience Australia - GADDS

• April, 2003 – web data-delivery pilot– Intrepid’s JetStream system installed and

successfully delivers a small sample of survey datasets + continental-scale grids

– The study is rapidly expanded with the goal of delivering all of GA’s survey datasets

• November, 2003 – GADDS launched– The Geophysical Archive Data Delivery System

(GADDS) is formally launched– GA’s 50Gb archive of magnetic and gravity data

are freely available across the web (across the globe!) … with minimal administrative overhead

GADDS Example

Page 6: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Geoscience Australia - GADDS

• June, 2004 – Expansion– Upgraded to deliver 256-channel radio-

metric line datasets, and multi-band grids (K, U, Th & Total Count)

• November, 2004 – Vic, Qld data added– Datasets from state government surveys

were added to pool of data being served– Data from most states will be included by

mid-2005

GADDS Example

Page 7: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Define Area of InterestGADDS Example

Page 8: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Define Area of InterestGADDS Example

Page 9: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Define Area of InterestGADDS Example

Page 10: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Define Area of InterestGADDS Example

Page 11: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Define Area of InterestGADDS Example

Page 12: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Select DataType and ThemeGADDS Example

Page 13: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Search Results ...GADDS Example

Page 14: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Metadata ReviewGADDS Example

Page 15: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Metadata ReviewGADDS Example

Page 16: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Metadata ReviewGADDS Example

Page 17: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Metadata ReviewGADDS Example

Page 18: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Dataset SelectionGADDS Example

Page 19: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Dataset SelectionGADDS Example

Page 20: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Submit RequestGADDS Example

Page 21: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Submit RequestGADDS Example

Page 22: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Email: Data Ready … DownloadGADDS Example

Page 23: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Dataset DownloadGADDS Example

Page 24: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

GADDS Example - Summary

• We used a standard browser• We viewed standard web-pages• We made some simple choices

• Area of interest• Type of data: Vector or Grid• Theme: Magnetics, Gravity, Radiometrics, …

• We chose to download one survey dataset• Selected fields of the dataset• Chose the Datum, Projection, and file format

• We received an email when data were ready• We used a standard web ‘download to file’

GADDS Example

Page 25: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

JetStream Architecture/Design

1. Schematic Diagram

2. Design: Catalog

3. Design: Geospatial Intelligence

Page 26: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

JetStream Client-Server Architecture

Apache

Web Server

ExplorerNetscape

Web Client

TCP/IP

HTTP

Firewall

Tomcat (or equivalent)

JetStreamProcesses

JetStream Server

JetCat Catalog

OpenDAPDrivers

Intrepid Processes

Data Administrator

Binary Datasets

Architecture / Design

Page 27: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

JetStream Server Architecture

Apache Tomcat (or equivalent; Servlets Container)

Client queries

Get additional information

Queue (Process) Management - Dataset extract & process - Zipping the requested data - Email ‘data-ready’ service

Download management

- Dataset extract & process - Zipping the requested data - Email ‘data-ready’ service

Web Server JetStream Server

JetCat Catalog

Interface to a web-browser

Client

Data Administrator (Acquire_Catalog, …)

Binary Datasets

Intrepid Processes

OpenDAP Drivers

In a distributed system the data and processes would be located on ‘remote’ servers

Architecture / Design

Page 28: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Design: Catalog

Apache Tomcat (or equivalent; Servlets Container)

Client queries

Get additional information

Queue (Process) Management - Dataset extract & process - Zipping the requested data - Email ‘data-ready’ service

Download management

- Dataset extract & process - Zipping the requested data - Email ‘data-ready’ service

Web Server JetStream Server

JetCat Catalogue

Interface to a web-browser

Client

Data Administrator (Acquire_Catalog, …)

Binary Datasets

Intrepid Processes

OpenDAP Drivers

Architecture / Design

JetCat Catalog

Data Administrator (Acquire_Catalog, …)

Page 29: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Design: Catalog

• The catalog …• is at the heart of the JetStream System; all

interactive client-queries interrogate the catalog rather than the binary datasets

• is a very simple data structure …– a flat table; can be ASCII, Access, Oracle table, …– one record per dataset– a small number of essential fields …

» Lat/Long limits of the dataset» URL address of the dataset» Data-type and ‘theme’

– additional user-defined fields can contain any other metadata that might be pertinent to the application

Architecture / Design

Page 30: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Design: Catalog• Maintaining the catalog is the essential data-administrative task • The automated ‘Acquire_Catalog’ administrative tool ‘harvests’

metadata from the data-files • an intelligent ‘Data Manager’ tool facilitates additional manual

maintenance of the catalog

Architecture / Design

Page 31: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Design: Geospatial Intelligence

Apache Tomcat (or equivalent; Servlets Container)

Client queries

Get additional information

Queue (Process) Management - Dataset extract & process - Zipping the requested data - Email ‘data-ready’ service

Download management

- Dataset extract & process - Zipping the requested data - Email ‘data-ready’ service

Web Server JetStream Server

JetCat Catalogue

Interface to a web-browser

Client

Data Administrator (Acquire_Catalog, …)

Binary Datasets

Intrepid Processes

OpenDAP Drivers

Architecture / Design

Intrepid Processes

Page 32: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Design: Geospatial Intelligence

• JetStream is geospatially intelligent about a wide variety of industry file types; this means that …

• it can interpret such files to determine a dataset’s location and extent

• it can ‘look into’ such files … and intelligently extract subsets of the contained data; Subsets may be spatial subsets, or some subset of fields of the dataset

• This geospatial intelligence is achieved through ‘Intrepid Processes’ – the functions of the Intrepid Geophysical Data Management and Processing System

Architecture / Design

Page 33: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Design: Geospatial Intelligence

• Geospatial intelligence is used …– to maintain the catalog …

• the ‘Acquire_Catalog’ administrative tool intelligently ‘harvests’ metadata from the data-files themselves

• the Data Manager tool also uses intelligent analysis of data-files to assist manual administrative tasks.

– to present ‘on-the-fly’ previews of the data to the end-user client (e.g. thumbprint image displays)

– to extract subsets of data – either spatial or by selected dataset fields – in order to deliver to the client only that subset of data requested; in effect, reducing download time by excluding data not required by the client

Architecture / Design

Page 34: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Geospatial Intelligence – File TypesFile-type Back-Office Preview Web-based Delivery

Vector    

Intrepid DB Y Y

Geosoft GDB Y Y

Oracle Y (… and other RDB’s) Y

ESRI Shape files Y Y

SEG-Y (seismic) Soon Soon

     

Grid Files    

ERMapper Y Y

Geosoft Y Y

netCDF Y Y

     

Image Files    

GeoTiffs Y Y

Jpeg (with .jgw) Y Y

Tiff (with .tfw) Y Y

ECW + Algorithms Soon Soon

Architecture / Design

Page 35: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Geospatial Intelligence – File Types

• Although JetStream can treat many file-types intelligently – and extract subsets of data from such files - it is also possible to configure the system to deliver any file

• simply add a file to the catalog … assigning the essential fields of ‘dataset extents’ … and JetStream can deliver that file across the web

• Nominate specific file-extensions to be ‘associated’ files; e.g. a “.doc” or “.pdf” might be ‘report’ files associated with a survey dataset; the files can be ‘associated’ by using the same base-file-name; the data file, and it’s associated report file, will be web-delivered together

Architecture / Design

Page 36: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

JetStream Implementation

Consultation, Customisation, Integration

Page 37: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

‘Off-the-Shelf’ Solution ? Yes, but …

• JetStream is an ‘off-the-shelf’ web data-delivery solution … which must be integrated into a corporation’s business

• In our experience, implementation requires …– Consultation– Integration

• with existing data-management systems• into existing web-interface systems

– Customisation of the system• JetStream is flexible, with many options, • Balance this with the value of keeping web-pages

simple!

Implementation

Page 38: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Integration with Legacy Systems

• Consultation needs to identify opportunities of using existing data management systems

• JetStream can use sources of systematic metadata in various ways …– JetStream’s Catalog can simply be an existing

database table … in Access, Oracle, ASCII, …– an existing database table can be used to initially

populate a Catalog– if a legacy system is maintained … then the

Catalog can be regularly refreshed from that system’s database tables

Implementation

Page 39: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Integration into Existing Web-Pages

• Again – consultation should explore opportunities to build upon any existing investment in an organisation’s web-systems

• For example, PIRSA had developed an Arc-IMS site for map-composition and map-based querying of spatial databases; JetStream was integrated with that system within days of delivery

Implementation

Page 40: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Integration – PIRSA ExampleImplementation

Exist

ing A

rc-IM

S

inve

stm

ent i

n

web-p

ages

JetS

tream

is

added

with

a

single

TAB p

anel

Page 41: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Customisation – Corporate Image

• Despite obvious differences – the GA and PIRSA sites have almost identical JetStream functionality

• JetStream (an ‘off-the-shelf’ solution) can be implemented behind a customised web-page environment … customised to corporate needs … to maintain a corporate image, or integrate with an existing system, etc.

Implementation

Page 42: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Customisation – Client Service

• Despite being a ‘off-the-shelf’ solution, JetStream is very flexible – and there is considerable scope to tailor the web-page interface to clients needs …

• the GA interface, for example, provides access to metadata to assist the client’s selection

• the PIRSA site provides much less metadata

• For a seismic data service, one would want to see seismic line locations at the time of defining an ‘area of interest’ …

Implementation

Page 43: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Customisation – Client Service

Display seismic line location to assist definition of ‘area of interest’

Implementation

Page 44: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Customisation – Client Service

Preview SEG-Y image … then request to download the SEG-Y data file

Implementation

Page 45: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Australian Experience

Client Perspective

Data Provider Perspective

Page 46: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

GADDS – Customer Reaction

• Well received by the customer base– Clients have expressed satisfaction with

the access via universally available web-browsers

– The simple series of web-pages provide sufficient metadata to facilitate effective dataset selection

– Clients have found the dataset delivery mechanism – notification by email, with a URL link – is effective and practical

Australian Experience

Page 47: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

GADDS – Benefits for GA

• Assists GA in their charter to ‘make data freely available’ …– Use of the universally available ‘web-

browser’, and delivery via standard web-download protocols – ensures ease-of-access for all. (No proprietary software needed by the client).

– Clients construct their own queries to find ‘what data are available ?’

Australian Experience

Page 48: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

GADDS – Benefits for GA

• Reduced data administration overheads– The JetStream system facilitates the

dataset management, with features to assist the maintenance of the system catalogue

– Significant reduction in clerical staff. Tasks, such as answering client queries, taking orders, extracting datasets from archives, arranging delivery … are now automated

Australian Experience

Page 49: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

PIRSA Experience

• JetStream has improved our efficiencies• More time is available to add value to

the products available• More and more stakeholders access our

potential field data via JetStream.• Data are also being better managed

centrally via JetStream.

Australian Experience

Domenic Calandro,Manager, Geoscience Datasets

Page 50: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

PIRSA Experience – Client View

• JetStream is being very well received by our stakeholders, particularly international users, and users with broadband internet access

• JetStream definitely contributed to our "number 1 status in the world" for delivery of pre-competitive data (Report of the independent Fraser Institute)

Australian Experience

Domenic Calandro,Manager, Geoscience Datasets

Fraser Institute Survey:100% of respondents considered the South Australian geoscience databases to encourage exploration investment

Page 51: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Future Vision

Distributed Systems

Page 52: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Distributed Data Management

• We see the JetStream system as having a ‘data management’ function in addition to web-data-delivery

• In the context of data-management it is essential to think in terms of distributed systems

• Today many organisations operate on a ‘distributed’ basis – with authority and responsibility distributed to regions. It is frequently impractical to centralise the management of data in such organisations

Distributed Systems

Page 53: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Distributed Data Management

• The main advantage of distributed management of data comes from the ‘divide and conquer’ principal …

• the ‘problem’ remains small !• the regional office has a greater interest in the

management of their data

• At the same time, however, clients in other parts of the organisation may want to know ‘what data are available’ … and request a copy of those data … so …

Distributed Systems

Page 54: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Data Delivery in a Distributed World

• The goal …

Distributed Systems

manage data locally

access data globally

Page 55: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

JetStream in a Distributed World

The data, Intrepid Processes and the Catalog can be distributed

Distributed Systems

Apache Tomcat (or equivalent)

JetStreamProcesses

Web Server JetStream Server

JetCat Catalog

OpenDAPDrivers

Intrepid Processes

Data Administrator

Binary Datasets

Page 56: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

JetStream in a Distributed World

The data, Intrepid Processes and the Catalog can be distributed

Distributed Systems

Apache Tomcat (or equivalent)

JetStreamProcesses

Web Server JetStream Server

JetCat Catalog

OpenDAPDrivers

Intrepid Processes

Data Administrator

Binary Datasets

Page 57: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

JetStream in a Distributed World

The data, Intrepid Processes and the Catalog can be distributed

Distributed Systems

Apache Tomcat (or equivalent)

JetStreamProcesses

Web Server JetStream Server

OpenDAPDrivers

Data Administrator

Catalog

IntrepidBinary

Datasets

Catalog

IntrepidBinary

Datasets

Catalog

IntrepidBinary

Datasets

Page 58: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

JetStream in a Distributed World

• Advantages• Local management

of datasets is efficient

• No administrative overhead of centralisation

• Can access data globally – only accessing it when I want it

Distributed Systems

Page 59: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Extending the Vision

• The ultimate goal is to link from one distributed network to other networks …

• e.g. “I don’t need to manage those data; the European Union is managing it for me … and I’ll go back and get it when I want it”

Distributed Systems

Page 60: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Extending the Vision

• A geologist of Global Petroleum Inc. constructs a query to locate seismic data in the North Sea – and queries the corporate net

• The query delivers metadata information and download options back to the desktop – from multiple distributed data repositories

Distributed Systems

Global PetroleumInc. EuroSeismic

Network

• The query construct is forwarded to a special port of the EuroSeismic network

Page 61: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Summary• JetStream uses the standard protocols of the web (not re-

inventing wheels)• For geo-spatial data – we can be intelligent about it; we

know its extent, we can extract subsets out from it, we have tools to assist data management (SEG-Y ? Soon)

• We integrate with existing data management, with existing web-systems; we don’t replace, but build-on existing investment

• We like the KISS principle … Keep It Simple, Simon!• Web-delivery is providing client satisfaction, and benefits

to the data-provider – improved client service, improved data management, scope to value-add, reduced costs

• We believe that distributed data management is practical, and are confident that JetStream can deliver global access with such distributed systems

Page 62: A flexible and generic web-service for the delivery of geophysical data Experiences from 2 years of Intrepid’s JetStream System in Australia EuroSeismic.

Acknowledgements

• Thank you for the opportunity to make this presentation to you today

• Thanks also to …• Geoscience Australia

– http://www.geoscience.gov.au/gadds

• Primary Industry & Resources, South Australia– http://www.pir.sa.gov.au/pages/minerals/sarig/sarig.htm