Andrews LTER Information Management

Slides:



Advertisements
Similar presentations
Luquillo Experimental Forest Information Management: a Long-Term Ecological Research system to deposit documented data ready for analysis and synthesis.
Advertisements

V Alyssa Rosemartin 1, Lee Marsh 1, Ellen Denny 1, Bruce Wilson USA National Phenology Network, Tucson, AZ; 2 - Oak Ridge National Laboratory, Oak.
The North American Carbon Program Google Earth Collection Peter C. Griffith, NACP Coordinator; Lisa E. Wilcox; Amy L. Morrell, NACP Web Group Organization:
An Operational Metadata Framework For Searching, Indexing, and Retrieving Distributed GIServices on the Internet By Ming-Hsiang.
HydroServer A Platform for Publishing Space- Time Hydrologic Datasets Support EAR CUAHSI HIS Sharing hydrologic data Jeffery.
EcoTrends: synthesizing long-term ecological data from across the US and beyond Project Leader: Debra Peters, USDA ARS, Jornada Experimental Range, Jornada.
LTER IM Articulation Work: Developing Community Web Recommendations Nicole Kaplan (SGS), Karen Baker (CCE, PAL), Barbara Benson (NTL), Eda Melendez-Colom.
2009 Mid–Term Review El Verde Field Station June 4, 2009.
Oregon Spatial Data Library Partnership Metadata Training OU Knight Library Eugene, Oregon December 3, 2009 Kuuipo Walsh Institute for Natural Resources.
Jennifer A. Dunne Santa Fe Institute Pacific Ecoinformatics & Computational Ecology Lab Rich William, Neo Martinez, et al. Challenges.
Geospatial Data Management for Ecological Research Organizations Theresa Valentine, Adam Skibbe, and Jamie Hollingsworth, LTER Network.
Caro-COOPS Data Management: Metadata. Cast-Net addresses the need for improved connectivity among coastal observing systems by creating a regional framework.
Development of a Community Hydrologic Information System Jeffery S. Horsburgh Utah State University David G. Tarboton Utah State University.
Integrating Historical and Realtime Monitoring Data into an Internet Based Watershed Information System for the Bear River Basin Jeff Horsburgh David Stevens,
Center for Environmental Studies Arizona State University Digital Research Records at Center for Environmental Studies Peter McCartney.
EAP ILTER 9 July 2007 Don Henshaw Andrews Experimental Forest LTER Pacific Northwest Research Station, USFS Forest Service Oregon State University Corvallis,
SAFARI 2000 Data Activities at the ORNL DAAC Bob Cook, Les Hook, Stan Attenberger, Dick Olson, and Tim Rhyne Oak Ridge National Laboratory.
The Natural Resources Digital Library Needs, Partners, and Challenges Bonnie Avery, Janine Salwasser, & Janet Webster Oregon State University.
ClimDB/HydroDB (ClimHy) Integration ClimHy has been migrated from AND to LNO and will remain status quo in 2011 – Public page (
Data Integration, Analysis, and Synthesis Matthew B. Jones National Center for Ecological Analysis and Synthesis University of California Santa Barbara.
A different story Melendez,  The role of Information Management in the evolution of Informatics: two perspectives  About Informatics and Information.
ClimDB/HydroDB A web harvester and data warehouse for hydrometeorological data 2011 StreamChemDB Oct Yang Xia (LTER Network Office, University of.
Planning for Arctic GIS and Geographic Information Infrastructure Sponsored by the Arctic Research Support and Logistics Program 30 October 2003 Seattle,
Getting Ready for the Future Woody Turner Earth Science Division NASA Headquarters May 7, 2014 Biodiversity and Ecological Forecasting Team Meeting Sheraton.
Bonanza Creek LTER Information Management Information Management at BNZ Information Management Data & Metadata Website & Communication Education.
1 Integrated Services Program The Virginia Metadata Training Workshop Summer, 2006 Lyle Hornbaker Integrated Services Program
Science Environment for Ecological Knowledge: EcoGrid Matthew B. Jones National Center for.
LTER Moving from Data to Information Management John Porter.
The Long Term Ecological Research Network LTER. LTER Network Vision, Mission and Goals Network Vision: A society in which exemplary science contributes.
Translation to the New TCO Panel Beverly Law Prof. Global Change Forest Science Science Chair, AmeriFlux Network Oregon State University.
Data Management Practices for Early Career Scientists: Closing Robert Cook Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN.
Chad Berkley NCEAS National Center for Ecological Analysis and Synthesis (NCEAS), University of California Santa Barbara Long Term Ecological Research.
DATA-MODEL ASSIMILATION CHALLENGES AND OPPORTUNITIES IN THE LTER PROGRAM Debra Peters Lead Research Scientist, USDA ARS, Jornada Experimental Range, Las.
EcoTrends: a multi-agency synthesis project Project Leader: Debra Peters, USDA ARS, Jornada Experimental Range, Jornada Basin LTER & Sevilleta LTER, Las.
Research Design for Collaborative Computational Approaches and Scientific Workflows Deana Pennington January 8, 2007.
Ecoinformatics Workshop Summary SEEK, LTER Network Main Office University of New Mexico Aluquerque, NM.
Current and Potential Uses for GIS in Academic Arctic Research Michael F. Goodchild University of California Santa Barbara.
Introduction to Morpho BEAM Workshop Samantha Romanello Long Term Ecological Research University of New Mexico.
LTER Data Management Margaret O’Brien Santa Barbara Coastal Long Term Ecological Research (LTER) Project Santa Barbara Channel Biodiversity Observation.
Network Information System EML status of LTER sites Iñigo San GilSep IM meeting, Estes Park ‘06.
Network Information System EML status of LTER sites Iñigo San GilAug 5th 2005 IM meeting, Montreal ‘05.
Laura Russell Programmer VertNet Buenos Aires (Argentina) 28 September 2011 Training course on biodiversity data publishing and.
Cyberinfrastructure to promote Model - Data Integration Robert Cook, Yaxing Wei, and Suresh S. Vannan Oak Ridge National Laboratory Presented at the Model-Data.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
Introduction to Morpho RCN Workshop Samantha Romanello Long Term Ecological Research University of New Mexico.
The US Long Term Ecological Research (LTER) Network: Site and Network Level Information Management Kristin Vanderbilt Department of Biology University.
Information Management Jornada Basin LTER. Jornada Information management system Six major components: a)Data management implementation/process b)Management.
LUQUILLO LONG-TERM ECOLOGICAL RESEARCH PROGRAM 18th Annual Meeting LUQUILLO LONG-TERM ECOLOGICAL RESEARCH PROGRAM Information Management Report January.
SEEK Science Environment for Ecological Knowledge l EcoGrid l Ecological, biodiversity and environmental data l Computational access l Standardized, open.
U.S. Environmental Protection Agency Central Data Exchange Pilot Project Promoting Geospatial Data Exchange Between EPA and State Partners. April 25, 2007.
Metadata ESA Workshop. In this session we will discuss…  Metadata: what are they? and why should they be created?  Metadata standards  Creating metadata.
Global Change Master Directory (GCMD) Mission “To assist the scientific community in the discovery of Earth science data, related services, and ancillary.
Data and Metadata Archiving: Atlantic Coast Environmental INdicators Consortium (ACE INC) Lexia M. Valdes June 11, 2003 R
Theresa Valentine Spatial Information Manager Corvallis Forest Science Lab.
NVS New Zealand National Vegetation Survey. What is NVS? NVS (National Vegetation Survey) – New Zealand’s largest archive facility for plot-based vegetation.
Long Term Ecological Research Network Information System LTER EML Status LTER Information Manager’s Meeting 28 July 2004 Mark Servilla
NASA Earth Exchange (NEX) A collaborative supercomputing environment for global change science Earth Science Division/NASA Advanced Supercomputing (NAS)
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
Data Management: Data Processing Types of Data Processing at USGS There are several ways to classify Data Processing activities at USGS, and here are some.
Cushing – EIM Integrating Ecological Data Notes from the Grasslands ANPP Data Integration Project evergreen.edu LTER Network Office,
EEMB 595P Winter 2011 SBC LTER Research Seminar Instructor: S. Holbrook Time :Wednesdays noon-1 pm Room: MSRB auditorium 5-JanOrganizational meeting SBC.
IODE Ocean Data Portal - technological framework of new IODE system Dr. Sergey Belov, et al. Partnership Centre for the IODE Ocean Data Portal.
John Porter A BRIEF HISTORY OF DATA SHARING IN THE U.S. LTER NETWORK.
Strategies for NIS Development
Network Information System Advisory Committee (NISAC)
Lecture 8 Database Implementation
Flanders Marine Institute (VLIZ)
Presentation transcript:

Andrews LTER Information Management

AND – Andrews LTER ARC – Arctic Tundra LTER BES – Baltimore Ecosystem Study BNZ – Bonanza Creek LTER CAP – Central Arizona-Phoenix LTER CDR – Cedar Creek LTER CWT – Coweeta LTER FCE – Florida Coastal Eeverglades GCE – Georgia Coastal Ecosystem HBR – Hubbard Brook LTER JRN – Jornada Basin KBS – Kellogg Biological Station KNZ – Konza LTER LUQ – Luquillo LTER MCM – McMurdo Dry Valleys NET – LTER Network Office NWT – Niwot Ridge LTER NTL – North Temperate Lakes PAL – Palmer Station PIE – Plum Island Ecosystem SBC – Santa Barbara Coastal SEV – Sevilleta LTER SGS – Shortgrass Steppe VCR – Virginia Coast Reserve LTER - Long Term Ecological Research Network

Biodiversity/Arthropods s 1970s 1990s1960s1950s 1948 US Forest Service Information Management Decomposition/C dynamics Soils/nutrient dynamics Vegetation successionForest-stream interactionsDisturbance/landscape Hydrology/Small watershed/Climate IBP LTER forestry researchecosystem research Time Line

H. J. Andrews Experimental Forest Long-Term Ecological Research (LTER) Site Information Management Personnel  Don Henshaw U.S.D.A. Forest Service, Pacific Northwest Research Station  Suzanne Remillard Oregon State University, Forest Science Department  Theresa Valentine U.S.D.A. Forest Service, Pacific Northwest Research Station  Fred Bierlmaier Oregon State University, Forest Science Department

Research Study Databases Downloads Downloads ComponentOnline Climate Vegetation Hydrology Carbon & Nutrients Biodiversity Soils Disturbance Stream-Forest Ecophysiology Total Forest Science Data Bank (FSDB) Summary of LTER Online Databases and Usage (July 2005):

Information System Development PeriodPlatformMetadata storage Data storage Primary tool 1980’sMainframePaper formsASCIIFortran / SAS Transition period ’sLocal Area Network File Server Desktop RDBMS ASCII/ RDBMS SAS / Foxpro Transition period ’sDatabase server/ Web Server (UNIX) RDBMS Server/ XML RDBMSSQLServer/ Visual Foxpro

Online LTER Component of FSDB  140 databases (metadata and data) Over 500 data tables Includes 25 spatial databases –50 more in preparation (300 coverages / Shape files) Over 100 tables updated annually 10 new databases per year  Additional 40 legacy databases not online

Andrews LTER Data Access Policy Data Release  Type I status Data is freely available within 2 years of collection Users must agree to terms of use agreement  Type II status Legal issues –Sensitive resources, personal privacy, copyright laws Data quality assurance issues –Quality too low, new techniques, legacy data Publication issues –Vulnerable PI (grad student or post-doc), long sampling interval

User Registration summary

Why researchers don’t share data (Vogel, EOS 1998)  No professional credit  Loss of proprietary rights to data  Fear of others publishing data  Costs of data documentation  Lack of proper quality assurance

Data Use Agreement  Assures data provider of ethical use of data  Provides citation for data source  Gives protection through disclaimer  Requires notification of data usage  Requests copies of derivative publications  Encourages good scientific citizenship

Systematic Approach to Ecological Research Develop Research Objectives Define Statistical Design/Sampling Protocols Design Database, Begin Documentation Collect Data / Do Field Work Capture or Enter Data, Assure Data Quality Analyze and Interpret Data Synthesize Research Results Archive Data and Publish Research Findings on the Web

Ecoinformatics Challenges  Ecologists use heterogeneous data from many sources for synthesis  Data volumes increasing exponentially  Requirements for metadata quality/quantity increasing  Expectations for data accessibility increasing

H. J. Andrews Experimental Forest Telemetry System RF Base Station Repeater/Monitoring Repeater Only Key Monitoring Only Cable Connection Radio Link CR10 Data Logger Upper Lookout Met Station Vanilla Leaf Met Station CR23X CR10 Central Met Station CR10 Roswell Ridge Repeater WS 1, 2, 7 Gauging Stations Mack Cr., WS 8 Gauging Stations RF Base Station Primary Met Station T-1 Connection CR10 Andrews HQ Site OSU Network /WWW WS 3, 6, 9, 10 Gauging Stations Blue Ridge Repeater

… Two radio repeater Blue River Ridge Radio-equipped data loggers Watershed 1 Radio Links Spread Spectrum Telemetry: Internet Connected Data Loggers Airshed Project on Watershed mhz spread spectrum radio Andrews Headquarters Control computer: data logger communication Tower anemometers, sap flow, soil moisture, and other sensors Serial server LAN WWW

Increasing Demand for Information Resources  Re-use of long-term study data and other information resources  Synthesis of cross-site data in large-scale collaborative efforts  Real-time data distribution from a network of environmental observatories  Demands for easy discovery and access via the web  Dynamic compilation of resource metadata

Data Management to Information Management  More information products to manage: Non-spatial databases Spatial data (GIS coverages and remote sensing images) Publications Analytical Tools: models, software Photographs, Images, etc. Taxonomic Collections Documents (study plans, methods manuals, web documents) Data request/download histories, web logs

Publications Analytical Tools Data Sets GIS Coverages Images Information Products Main Catalog Related Objects Search Object Tables People Keywords Species Locations Data Object Tables Remote Sensing Outline of Metadata Database

Simplified view of the normalized metadata structure DatabasesPublication Catalog Image Cross-referenceRequest_history Personnel Grants Organization Project Attribute Enumerated domain Methodology Related File Distribution Entity Taxonomic keyword Theme keyword Place keyword Analytical Tool

Databases List of all Databases Entity List of all entities (tables) Enum_domain Database-specific and shared codes Taxonomic_classif Shared species codes Place_keyword Shared location codes Attribute Global list of all attributes (database-specific) Study Data: Entities and Attributes Domain_type Domain classification Methodology Shared list of all methods

Web Search

Data Catalog on the web

Metadata on the web

Metadata Standards  LTER (Non-geospatial metadata) Origins in IBP (1970’s) Kellogg workshops (NSF 1984, 1990) Minimum standards (Kirchner et al. 1994) ESA/FLED (Michener et al. 1997)  FGDC (Spatial metadata) Dominant spatial standard  NBII (Non-geospatial and spatial metadata) Biological Data Profile  EML (Ecological Metadata Language) Implemented as modular XML documents

Structured Metadata - Webpage creation Distribution - Request tracking - Security Quality Assurance - Data validation - QA Reports - Metadata validation Data Production - Data entry system - Data import/export - Metadata export Metadata is more than just documentation: Generic Tools

EML Generation from a Relational Database Management System

Data Distribution Entity_Attribute field_order key_field Attribute Entity data_restriction_rule

Data Distribution Entity_Attribute field_order key_field Attribute Entity data_restriction_rule

Data Distribution Entity_Attribute field_order key_field Attribute Entity data_restriction_rule

Spatial Management Strategy  Provide access to spatial information Andrews Forest and surrounding areas (regional context)  Support researcher and student GIS efforts Provide GIS analysis Training on GIS / metadata tools  Provide linkages to scientific data Integrated data catalog  Investigate new technology Internet map server (searches for study sites and data sets) Visualization

Study Site Inventory Control area Small watersheds Study locations

Internet Mapping Applications

LTER Network–Level Data Activities  Network Information System (NIS) All-site personnel directory All-site bibliography Distributed Table of Contents (DTOC) Prototype Site Description Database (SiteDB) Climate/Hydrology Data Projects (ClimDB/HydroDB) Ecological Metadata Language (EML) harvester

Review Criteria for LTER Information Management Systems Key points:  Consistent standards across network  Focus on information management system  Focus on functionality  Set the bar high  Compliance levels: shall, should, encouraged  References to policy & best practices documents April 2005

DiscoveryAccessUsability Tier 1  Unstructured, online site catalog with minimal metadata  Data discovery through manual searches  Establish data access policy  Data and metadata access requires human intervention  Unstructured, machine- readable metadata and data  Data use requires human intervention Tier 2  Online, enhanced metadata with consistent internal structure  Data discovery through machine search  Automated access to data  Access to site data and metadata does not require human intervention  Structured, comprehensive metadata and data  Data use does not require human intervention Tier 3  Discovery-enabling metadata structured in EML  Data discovery integrated across network  Access-enabling metadata structured in EML  Data access is integrated across network  Complete validated EML  Data analysis is integrated across the network Future Outcome Semantic-based discovery through machine-based searches Data access through a knowledge-based query process Semi-automated knowledge extraction Tiered Trajectory Metadata completeness Metadata structure

Lessons and Summary  Increasing informational needs necessitate information management policies and systems  Metadata serves an important role in integrating and locating information products  Research community involvement is critical to success Offer incentives: i.e., data quality assurance, data archiving, data and metadata entry tools, web publishing