Vegetation Plot Management: A National Plots Database Demo Funding: National Science Foundation (DBI-9906838) John Harris - NCEAS Robert K. Peet - University.

Slides:



Advertisements
Similar presentations
Natural Business Services for Construct Users Mark Barnard R&D Manager – Natural Business Services.
Advertisements

The VegBank taxonomic datamodel Robert K. Peet Sponsored by: The Ecological Society of America US National Science Foundation Produced at: The National.
Test Case Management and Results Tracking System October 2008 D E L I V E R I N G Q U A L I T Y (Short Version)
The North American Carbon Program Google Earth Collection Peter C. Griffith, NACP Coordinator; Lisa E. Wilcox; Amy L. Morrell, NACP Web Group Organization:
METS at UC Berkeley Part I: Generating METS Objects.
VegBank.org: a Permanent, Open-Access Archive for Vegetation Plot Data. Michael T. Lee 1, Michael D. Jennings 2, Robert K. Peet 1. Interacting with the.
Bookshelf.EXE - BX A dynamic version of Bookshelf –Automatic submission of algorithm implementations, data and benchmarks into database Distributed computing.
Plant Systematics databases: Users perspectives Robert K. Peet, University of North Carolina In collaboration with The National Center for Ecological Analysis.
Technical Architectures
ORACLE Lecture 1: Oracle 11g Introduction & Installation.
Data models for Community information Robert K. Peet, University of North Carolina John Harris, Nat. Center for Ecol. Analysis & Synthesis Michael D. Jennings,
VegBank A vegetation field plot archive Sponsored by: The Ecological Society of America - Vegetation Classification Panel Produced at: The National Center.
EcoInformatics & Vegetation Science. The symposium message Plant community ecology is on the brink of a dramatic transformation that will be made possible.
The VegBank taxonomic datamodel Robert K. Peet Sponsored by: The Ecological Society of America US National Science Foundation Produced at: The National.
M.Sc. Course, Dept. of Informatics and Telecommunications, University of Athens S.Hadjiefthymiades “Web Application Servers” Basics on WAS WAS are necessary.
November 2011 At A Glance GREAT is a flexible & highly portable set of mission operations analysis tools that increases the operational value of ground.
Confidential ODBC May 7, Features What is ODBC? Why Create an ODBC Driver for Rochade? How do we Expose Rochade as Relational Transformation.
“This presentation is for informational purposes only and may not be incorporated into a contract or agreement.”
Advanced Java New York University School of Continuing and Professional Studies.
MAHI Research Database Data Validation System Software Prototype Demonstration September 18, 2001
SDPL 2001Notes 7: XML Web Sites1 7 XML Web-Site Architectures n How (and why) to apply XML techniques in the implementation of Web sites? 7.1 XML, Databases,
ISpheres Project. Project Overview iSpheresCore iSpheresImage Demonstration References.
11/16/2012ISC329 Isabelle Bichindaritz1 Web Database Application Development.
® IBM Software Group © 2007 IBM Corporation J2EE Web Component Introduction
Upgrading to IBM Cognos 10
Archivists' Toolkit - CRADLE Presentation, 10 Feb The Archivists’ Toolkit CRADLE Presentation 10 Feb
Putting it all together Dynamic Data Base Access Norman White Stern School of Business.
XML Registries Source: Java TM API for XML Registries Specification.
Project Overview Graduate Selection Process Project Goal Automate the Selection Process.
WEB BASED DATA TRANSFORMATION USING XML, JAVA Group members: Darius Balarashti & Matt Smith.
Chad Berkley NCEAS National Center for Ecological Analysis and Synthesis (NCEAS), University of California Santa Barbara Long Term Ecological Research.
METS at UC Berkeley Generating METS Objects. Background Kinds of materials: –primarily imaged content & tei encoded content archival materials: manuscripts.
Team Members Team Members Tim Geiger Joe Hunsaker Kevin Kocher David May Advisor Dr. Juliet Hurtig November 8, 2001.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
Project Overview Graduate Selection Process Project Goal Automate the Selection Process.
Federated Database Set Up Greg Magsamen ITK478 SIA.
Vegetation Data Management: VegBank Funding: National Science Foundation (DBI ) January 8, 2002 John Harris - NCEAS.
M4 Alpha Release August 9 th February 8th, 2006 M4 Alpha Release Modified Agenda Review M4 Objectives, General Concepts and Design Review M4 Development.
Data Integration and Management A PDB Perspective.
NMNH EMu DAMS Integration Project Rebecca Snyder Smithsonian, NMNH.
The VegBank taxonomic datamodel Sponsored by: The Ecological Society of America - Vegetation Classification Panel Produced at: The National Center for.
Carolina Environmental Program 1 UNC Chapel Hill A New Control Strategy Tool within the Emissions Modeling Framework Alison M. Eyth Carolina Environmental.
The VegBank Data Model. Biodiversity data structure Taxonomic database Plot/Inventory database Occurrence database Plot Observation/ Collection Event.
Application Development
Semantic Phyloinformatic Web Services Using the EvoInfo Stack Speaker: John Harney LSDIS Lab, Dept. of Computer Science, University of Georgia Mentor(s):
The challenge of biodiversity: Plot, organism and taxonomic databases Robert K. Peet University of North Carolina The National Plots Database Committee.
VegBank A vegetation field plot archive Produced at: The National Center for Ecological Analysis and Synthesis Principal Investigators: Robert K. Peet,
Condor Technology Solutions, Inc. Grace Performance Chemicals HRIS Intranet Project.
Recent Enhancements to Quality Assurance and Case Management within the Emissions Modeling Framework Alison Eyth, R. Partheepan, Q. He Carolina Environmental.
ViaSQL Technical Overview. Viaserv, Inc. 2 ViaSQL Support for S/390 n Originally a VSE product n OS/390 version released in 1999 n Identical features.
The challenge of organism identity --- The flora of the Southeast The flora of the Southeast as a case study Robert K. Peet University of North Carolina.
The challenge of biodiversity: Plot, organism and taxonomic databases Robert K. Peet University of North Carolina The National Plots Database Committee.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
2) Database System Concepts and Architecture. Slide 2- 2 Outline Data Models and Their Categories Schemas, Instances, and States Three-Schema Architecture.
VegBank A vegetation field plot archive Produced at: The National Center for Ecological Analysis and Synthesis Principal Investigators: Robert K. Peet,
The ECOST Web-based platform for data providers and for data users.
XML 2002 Annotation Management in an XML CMS A Case Study.
ISC321 Database Systems I Chapter 2: Overview of Database Languages and Architectures Fall 2015 Dr. Abdullah Almutairi.
9 Copyright © 2004, Oracle. All rights reserved. Getting Started with Oracle Migration Workbench.
The Holmes Platform and Applications
Data sharing and exchange: Experiences within the
Vegetation Data Management:
Lecture 8 Database Implementation
Web Engineering.
PHP / MySQL Introduction
Taxonomic and Community Classification Resources and Standards
The Re3gistry software and the INSPIRE Registry
SDMX Tools Overview and architecture
Oracle SQL Developer Data Modeler
Best Practices in Higher Education Student Data Warehousing Forum
Presentation transcript:

Vegetation Plot Management: A National Plots Database Demo Funding: National Science Foundation (DBI ) John Harris - NCEAS Robert K. Peet - University of North Carolina

Agenda: Over-Arching Concepts Project Overview · Impact · Database Design · System Architecture · Challenges Use-Case Example: Wisconsin Data Data Management Recommendations Future Directions

The US National Plots Database Project supported by: National Center for Ecological Analysis & Synthesis U.S. National Science Foundation USGS-BRD Gap Analysis Program ABI / The Nature Conservancy Project organized and conducted by: Robert K. Peet, University of North Carolina Marilyn Walker, USDA Forest Service & U. Alaska Dennis Grossman, The Nature Conservancy / ABI Michael Jennings, USGS-BRD & UCSB John Harris, NCEAS

Collection Integration Archival Extraction Analysis Publication

Development Cycle Preliminary design Build prototype #1 interface User evaluates interface Evaluation studied by designer Design modification are made Build prototype #n interface Supported by 3 other NCEAS Developers Database Design: Aug – Jan Interface Design: Nov – Feb Backend Development: Jan Interface Development: Mar – Backend Version: Prototype 3 Interface Version: Prototype 1 Expected Beta Release: Late Sept. – Mid Oct.

Plots Database Design

 Support the National Vegetation Classification.  Provide a comprehensive facility to store the most commonly collected vegetation plot data attributes.  Provide the user with a large number of user-defined attributes to store not-so-commonly collected data.  Integrate plots with the dynamic plant taxonomy and vegetation community data. Design meeting September 2000, NCEAS Plots Database Design Goals

Taxonomy Module Smithsonian meeting: Peet-Taswell model vs Berendsohn model FGDC Biological Nomenclature Working Group Update on ABI & HDMS Prospects for implementation The difficult choice

 Logical separation of a "taxonomic name" with the "taxonomic concept", so that taxonomic data can be stored at the most 'atomic' level without ambiguity  The ability to incorporate multiple organizations' 'views' of how a taxonomic name is applied to a taxonomic concept  The ability to link a taxonomic name used in the Plots database with a 'name - concept' pair in the taxonomic database. *Although one can store vegetation community data in the same database table-structure as the plant taxonomy database, we have implemented two separate table structures and have created two separate data sets. Taxonomy Database Design Goals

Representative tools reflect the desire to have the following features: High performance Robust Open architecture Platform neutral Scaleable Development Choices

 Java -- Write once, Cross platform – Linux, Windows, MacOS*  Java Servlet -- Dynamic, database-driven, web content  JDBC -- Connect to any database - Oracle, PostgreSql, SQL Server backend  Swing -- Classy interface tools  Beans -- Reusable components Features - JAVA * Not tested yet :-)

XML: is the format for structured data on the Web. Simple and flexible data conversions, using XSLT Straightforward to write generic tools which export parts of a relational database as XML encoded data, or even to write generic code that serializes Java (or other) objects as XML data structures. Examples later… Features - XML

An Example Workflow Using Wisconsin Plots Data What data integration means to us Taxonomic / Semantic Integration Data formatting for database ingestion General Comments about Current Format Data Parsing Transformation to XML standard Legacy Data Loader

Plots Data Integration & DB Ingestion Reformat by Hand Research MS Access MS Excel Perl Shell scripts ? Integration What is meant by data integration? … Plots DB

Taxonomic Integration Carya ovata (Miller)K. Koch Carya carolinae-sept. (Ashe) Engler & Graebner Carya ovata (Miller)K. Koch sec. Gleason 1952 sec. Radford et al Splitting one species into two illustrates the ambiguity often associated with scientific names. If you encounter the name “Carya ovata (Miller) K. Koch” in a database, you cannot be sure which of two meanings applies. Integration

Semantic Integration of Plot Attributes ‘Basic yet Important’ Integration Cover Scales Strata Dimensions Environmental Attributes

Integration Parse Data from Forms into Table Structure to be Transformed into XML Consistent with the Database Structure Text Forms Columnar Tables XML

Integration Parsed Data Text Forms Columnar Tables

Integration Transform Parsed Data to XML Consistent with the Plots Database Columnar Tables Plots DB XML Legacy Data Loader Data Definition (XML)

Integration Data Definition (XML) – Single file siteData.csv ’,’ site data plotCode authorPlotCode 1 communityName 2 …

Integration Data Definition (XML) – Multiple files vegData.csv siteData.csv site data authorPlotCode '+' ‘,’ species plotName authorPlotCode 1 scientificName taxonName 2

Integration Plots Database XML

Existing Prototype Functionality

Vegetation Database Client

Vegetation Desktop Database Client

Extra slides to follow:

General Data Management Practices general formats weird formats unusable formats modeled the software after the way that people collect plots data -- at least that is what I thought At times tortuous path to the database in terms of reformating class indicies (these are rectified at the plots loading software step)

Management Case: Example from Wisconsin  Baraboo Hills -- Collected Yesterday  PEL -- Legacy Data

Data Transformation of Forms