Xenia Package ain/XeniaPackage


Xenia Package ain/XeniaPackage ain/XeniaPackageV2

Problems Xenia is intended to address
- Research instrumentation grants that will collect observation data but lack a data management/sharing component beyond archiving datalogger files
- Low-volume data (< 100,000 records per hour) from in-situ observational platforms or system arrays (e.g. 1 to 1000 platforms collecting observations per hour) at any geographic scale (local, regional, national, etc.)
- Bridging the gap between raw data collection and the organization and sharing of data using previously developed products, services and standards (leveraging earlier work against new data providers)
- Fostering a standardization of products and services via a common, openly shared technical infrastructure (common database schema and product support scripts)

Problems Xenia is not intended to address
- High-volume data (millions of records per hour) such as gridded model outputs, HF radar, etc.
- High-volume data problems are at this time better addressed using traditional file processing techniques, where data management can suggest output file formats (such as images, shapefiles, etc.) and metadata that are conducive to search and usage needs.

Table Schema
- Basic tables
- Extended, support tables

Table Schema - Basic
- Main tables used for storing organization->platform->sensor->observation data
- Not using geospatial indexing initially (can be added) to keep things simple

- Current database implementation is in PostgreSQL, but should be portable to MySQL, etc. later.
- Output products are developed on a Linux system using mostly Perl scripts.
- The data dictionary captured from earlier development is held in the lookup tables for m_type_id (m_* = measurement), which can vary by standard name (sea_water_temperature, sea_water_salinity) and unit of measure (celsius, fahrenheit, psu).
- All measurements are stored in the multi_obs table with their corresponding timestamp, location and QC. Multiple observation types are stored similarly, varying by their m_type_id index.
- Each measurement can/will provide a lookup for sensor id and possibly collection id.
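The organization->platform->sensor->observation layout described above can be sketched roughly as follows. This is only an illustration, not the actual Xenia schema: it uses Python's built-in sqlite3 rather than PostgreSQL, and every column name other than m_type_id (row_id, short_name, m_date, m_value, qc_flag, and so on), the m_type table name, and the sample rows are assumptions made for the example.

```python
import sqlite3

# Hypothetical, simplified table definitions. Only the table names
# organization, platform, sensor, multi_obs and the m_type_id key come
# from the slides; everything else is assumed for illustration.
SCHEMA = """
CREATE TABLE organization (
    row_id INTEGER PRIMARY KEY,
    short_name TEXT NOT NULL
);
CREATE TABLE platform (
    row_id INTEGER PRIMARY KEY,
    organization_id INTEGER NOT NULL REFERENCES organization(row_id),
    short_name TEXT NOT NULL
);
CREATE TABLE m_type (
    row_id INTEGER PRIMARY KEY,
    standard_name TEXT NOT NULL,   -- e.g. sea_water_temperature
    uom TEXT NOT NULL              -- e.g. celsius
);
CREATE TABLE sensor (
    row_id INTEGER PRIMARY KEY,
    platform_id INTEGER NOT NULL REFERENCES platform(row_id),
    m_type_id INTEGER NOT NULL REFERENCES m_type(row_id)
);
CREATE TABLE multi_obs (
    row_id INTEGER PRIMARY KEY,
    sensor_id INTEGER NOT NULL REFERENCES sensor(row_id),
    m_type_id INTEGER NOT NULL REFERENCES m_type(row_id),
    m_date TEXT NOT NULL,          -- timestamp
    m_lon REAL,                    -- location
    m_lat REAL,
    m_value REAL,
    qc_flag INTEGER                -- quality control result
);
"""


def build_demo_db():
    """Create an in-memory database holding two made-up observations."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.execute("INSERT INTO organization VALUES (1, 'example_org')")
    conn.execute("INSERT INTO platform VALUES (1, 1, 'buoy_1')")
    conn.execute("INSERT INTO m_type VALUES (1, 'sea_water_temperature', 'celsius')")
    conn.execute("INSERT INTO m_type VALUES (2, 'sea_water_salinity', 'psu')")
    conn.execute("INSERT INTO sensor VALUES (1, 1, 1)")
    conn.execute("INSERT INTO sensor VALUES (2, 1, 2)")
    conn.executemany(
        "INSERT INTO multi_obs "
        "(sensor_id, m_type_id, m_date, m_lon, m_lat, m_value, qc_flag) "
        "VALUES (?, ?, ?, ?, ?, ?, ?)",
        [
            (1, 1, "2008-06-05T12:00:00Z", -79.0, 33.0, 24.5, 0),
            (2, 2, "2008-06-05T12:00:00Z", -79.0, 33.0, 35.1, 0),
        ],
    )
    return conn


def observations_by_name(conn, standard_name):
    """Walk multi_obs back up to its sensor, platform and organization."""
    return conn.execute(
        """
        SELECT o.short_name, p.short_name, t.uom, m.m_date, m.m_value
        FROM multi_obs m
        JOIN m_type t ON t.row_id = m.m_type_id
        JOIN sensor s ON s.row_id = m.sensor_id
        JOIN platform p ON p.row_id = s.platform_id
        JOIN organization o ON o.row_id = p.organization_id
        WHERE t.standard_name = ?
        ORDER BY m.m_date
        """,
        (standard_name,),
    ).fetchall()
```

Keeping every observation type in one multi_obs table means a new measurement type only needs a new lookup row, not a new table.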

Table Schema - Extended
- Additional tables used for supporting quality control tests and user/group notification
- Additional support tables for collections and quality control will be added

[Diagram: Xenia data flow. Labels: Format Convention / No Convention; Xenia Relational Database; SQL, Web Screen-Scrape, ASCII Fields + Key File, SEACOOS netCDF, XML; SQL conversion script. Products: Time Series Graphs; Maps/WMS; Animations; Archival files by Obs/Platform (CSV, netCDF, shapefile, etc.); Latest Data by Obs/Platform (KML/Google Earth, etc.; XML/RSS/WFS?); Quality Control; Notification]

Quality Control and Notification
Initial quality control tests are intended to flag/notify on observations by:
- Range tests: values outside of the acceptable range (range low, range high)
- Continuity tests: values change too much within a specific time interval
Optional notification of users or user groups when QC tests fail
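A minimal sketch of how the range and continuity tests might look is given here. The function names, the tuple layout, and the choice to compare each reading against the last value that passed the range test are assumptions for illustration, not the actual Xenia QC scripts.

```python
from datetime import datetime, timedelta


def range_test(value, range_low, range_high):
    """Pass when the value lies inside [range_low, range_high]."""
    return range_low <= value <= range_high


def continuity_test(prev, curr, max_change, interval):
    """Pass unless the value changed by more than max_change between two
    readings taken no more than `interval` apart.

    prev and curr are (timestamp, value) tuples. Readings further apart
    than `interval` are not compared and count as a pass.
    """
    (t0, v0), (t1, v1) = prev, curr
    if t1 - t0 > interval:
        return True
    return abs(v1 - v0) <= max_change


def run_qc(series, range_low, range_high, max_change, interval, notify=None):
    """Return [(timestamp, value, [failed test names])] for flagged readings.

    `notify` is an optional callback standing in for user/group notification.
    """
    failures = []
    last_good = None  # last reading that passed the range test
    for reading in series:
        timestamp, value = reading
        failed = []
        in_range = range_test(value, range_low, range_high)
        if not in_range:
            failed.append("range")
        if last_good is not None and not continuity_test(
            last_good, reading, max_change, interval
        ):
            failed.append("continuity")
        if failed:
            failures.append((timestamp, value, failed))
            if notify is not None:
                notify(timestamp, value, failed)
        if in_range:
            last_good = reading
    return failures
```

Comparing against the last in-range reading is one possible design choice: it keeps a single spike from also flagging the good reading that follows it.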

Time Series Graphs/Data
- Web request for graph only (can be placed as needed in other website contexts), webpage (graph + data), or download of time series data at specific platform sensors

- Maps/WMS (Web Map Service) via MapServer
- Map animations via ImageMagick, Gifsicle, AniS
- DODS/OPeNDAP access to basic tables (organization, platform, sensor, multi_obs)

Latest and Archival products
- Guiding concept is to make products available at both regional scale (same observation/product across all platforms) and local scale (same platform across all observations/products).
- Often a regional product can tie into a local one: a regional water temperature map allows a user to select a water temperature graph at a specific platform listed on the map.
- Products and design are divided temporally between latest, recent (0-6 weeks) and archival (3+ weeks and older).
- Latest products are continually generated with new data (hourly), whereas recent and archival products may be generated at periodic intervals (daily, weekly).

Xenia latest, recent, archival table structure for observations. Oldest observations stored to files.
- Latest: past several hours (new data)
- Recent: 0-6 weeks
- Archival: 3+ weeks to 1-2 years (possibly tables separated by year, month, etc.)
- Archival file: 1-2+ years (files separated by product/year/month)
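The tiering above can be illustrated with a small age-based classifier. This is a sketch under stated assumptions: the slides say only "several hours" and "1-2 years", so LATEST_WINDOW (6 hours) and TABLE_RETENTION (2 years) are assumed values, and the tier names are hypothetical. Because the recent (0-6 weeks) and archival (3+ weeks) ranges overlap, the function returns every tier that applies.

```python
from datetime import datetime, timedelta, timezone

# RECENT_MAX and ARCHIVAL_MIN come from the slides; the other two
# boundaries are assumptions standing in for "several hours" and "1-2 years".
LATEST_WINDOW = timedelta(hours=6)           # assumed
RECENT_MAX = timedelta(weeks=6)
ARCHIVAL_MIN = timedelta(weeks=3)
TABLE_RETENTION = timedelta(days=2 * 365)    # assumed


def storage_tiers(obs_time, now):
    """Return every storage tier an observation of this age falls into."""
    age = now - obs_time
    if age < timedelta(0):
        raise ValueError("observation time is in the future")
    tiers = []
    if age <= LATEST_WINDOW:
        tiers.append("latest")
    if age <= RECENT_MAX:
        tiers.append("recent")
    if ARCHIVAL_MIN <= age <= TABLE_RETENTION:
        tiers.append("archival")
    if age > TABLE_RETENTION:
        tiers.append("archival_file")
    return tiers
```

For example, an observation four weeks old falls into both the recent and archival tiers under these boundaries.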

Latest data products
XML schema convention (ObsKML, my term/schema)
- Regularly (hourly) produced XML file containing all latest measurements organized by organization->platform->observations. Designed for cross-system aggregation needs.
- Regularly (hourly) produced XML files (1 per platform) containing all latest measurements within that platform. Designed for local use, similar to an RSS feed for each platform.
- Regularly (hourly) produced XML files (1 per observation) containing all latest measurements of the same observation type. Designed for cross-system aggregation needs focusing on a specific observation.
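The organization->platform->observations nesting of the aggregate feed might be produced along these lines. The element and attribute names here (latest, organization, platform, observation, type, uom, time) are hypothetical and are not the actual ObsKML schema; the sketch only illustrates the grouping.

```python
import xml.etree.ElementTree as ET


def build_latest_feed(observations):
    """Group flat observation records into nested XML elements.

    Each record is a dict with the (assumed) keys: organization,
    platform, obs_type, uom, time, value.
    """
    root = ET.Element("latest")
    org_elements = {}
    platform_elements = {}
    for obs in observations:
        org_name = obs["organization"]
        if org_name not in org_elements:
            org_elements[org_name] = ET.SubElement(
                root, "organization", {"name": org_name}
            )
        platform_key = (org_name, obs["platform"])
        if platform_key not in platform_elements:
            platform_elements[platform_key] = ET.SubElement(
                org_elements[org_name], "platform", {"name": obs["platform"]}
            )
        obs_element = ET.SubElement(
            platform_elements[platform_key],
            "observation",
            {"type": obs["obs_type"], "uom": obs["uom"], "time": obs["time"]},
        )
        obs_element.text = str(obs["value"])
    return root


def feed_to_string(root):
    """Serialize the feed to a unicode XML string."""
    return ET.tostring(root, encoding="unicode")
```

The per-platform and per-observation feeds could be derived the same way by filtering the input records before building the tree.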

Latest data products
- Example of latest XML feed used to populate the Carolinas Coast application and potentially further systems or Xenia instances

Latest data products
- KML (Keyhole Markup Language), the XML format used to visualize data in Google Earth and potentially other 3D globes such as NASA WorldWind and ESRI ArcExplorer

Archival data products
- CSV (Comma Separated Value) files viewable using Excel
- Archival folders/files separated by observation type or platform and by month (or some manageable regular timestep), for file download according to user regional/local interest
- Other output file formats (netCDF, shapefiles, etc.) are archived with a similar folder/file organization
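One way the folder/file separation by observation type or platform and by month could be laid out is sketched below. The directory layout, file naming, column list and record keys are all assumptions for illustration; the slides do not specify them.

```python
import csv
from collections import defaultdict
from pathlib import Path

# Assumed CSV columns and record keys.
FIELDS = ["time", "platform", "obs_type", "value", "uom"]


def archive_path(root, group_by, obs):
    """Build <root>/<group_by>/<key>/<year>/<key>_<year>_<month>.csv.

    group_by is 'obs_type' (regional interest) or 'platform' (local
    interest). Timestamps are assumed to be ISO 8601 strings.
    """
    if group_by not in ("obs_type", "platform"):
        raise ValueError("group_by must be 'obs_type' or 'platform'")
    year, month = obs["time"][0:4], obs["time"][5:7]
    key = obs[group_by]
    return Path(root) / group_by / key / year / f"{key}_{year}_{month}.csv"


def write_archives(root, observations, group_by):
    """Write one CSV per group and month; return the sorted file paths."""
    buckets = defaultdict(list)
    for obs in observations:
        buckets[archive_path(root, group_by, obs)].append(obs)
    for path, rows in buckets.items():
        path.parent.mkdir(parents=True, exist_ok=True)
        with path.open("w", newline="") as handle:
            writer = csv.DictWriter(handle, fieldnames=FIELDS, extrasaction="ignore")
            writer.writeheader()
            writer.writerows(sorted(rows, key=lambda row: row["time"]))
    return sorted(buckets)
```

Writing the same records twice, once per grouping, gives users both the regional (by observation type) and local (by platform) download trees.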

Archival data products
- CSV (Comma Separated Value) files (exchange format) viewable using ODV (Ocean Data View) for CTD/bottle analysis

Archival data products
- netCDF for analysis using ncBrowse

Xenia aggregation, replication, redundancy
With several distributed Xenia systems, these systems could feed each other using either the same latest XML feed or a direct copy of table data offered by each Xenia instance.
[Diagram: individual instances Xenia A, Xenia B, Xenia C, Xenia D, Xenia E, Xenia F; aggregated instances Xenia A,B,C and Xenia D,E,F; combined instance Xenia A,B,C,D,E,F; backups Xenia Backup A,B,C,D,E,F and Xenia Backup D,E,F]
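As a rough illustration of one instance aggregating the latest feeds of several others, the sketch below merges already-parsed observation records and drops duplicates. The record keys and the rule that the first copy seen wins are assumptions; the slides do not describe how overlapping data would be reconciled.

```python
def merge_feeds(*feeds):
    """Merge observation records from several instances, without duplicates.

    Each feed is an iterable of dicts with the (assumed) keys
    organization, platform, obs_type, time and value. Records are
    treated as the same observation when the first four keys match,
    and the first copy encountered is kept.
    """
    merged = {}
    for feed in feeds:
        for obs in feed:
            key = (obs["organization"], obs["platform"], obs["obs_type"], obs["time"])
            merged.setdefault(key, obs)
    return [merged[key] for key in sorted(merged)]
```

Under this scheme an aggregate such as "Xenia A,B,C" would be the merge of the three individual feeds, and a backup could be built by merging the aggregates.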