Data Management Needs and Challenges for Telemetry Scientists Josh M London Wildlife Biologist, Polar Ecosystems Program National Marine Mammal Laboratory.

Slides:



Advertisements
Similar presentations
A distributed architecture for crystallography data, metadata, and applications John C. Bollinger Indiana University Molecular Structure Center, Bloomington,
Advertisements

Knowledge Management at the Gordon – Staff Portal Project Presented by Deirdre Carmichael 12 September 2008.
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
The North American Carbon Program Google Earth Collection Peter C. Griffith, NACP Coordinator; Lisa E. Wilcox; Amy L. Morrell, NACP Web Group Organization:
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Astrophysics, Biology, Climate, Combustion, Fusion, Nanoscience Working Group on Simulation-Driven Applications 10 CS, 10 Sim, 1 VR.
Systems Oceanography: Observing System Design. Why not hard-wire the system? Efficiency of interface management –Hard-wire when component number small,
Chapter 2: IS Building Blocks Objectives
Business Intelligence Technology and Career Options Paul Boal Director - Data Management Mercy ( April 7, 2014.
The Future of Psychology
LEVERAGING THE ENTERPRISE INFORMATION ENVIRONMENT Louise Edmonds Senior Manager Information Management ACT Health.
Virtual Geophysics Laboratory (VGL) VGL v1.2 NeCTAR Project Close R.Fraser, T.Rankine, J.Vote, L.Wyborn, B.Evans, R.Woodcock, C.Kemp July 2013 CSIRO |
Status of ICT structure, infrastructure and applications existed to manage and disseminate information and knowledge of Agricultural Biotechnology Innovations.
ArcGIS Workflow Manager An Introduction
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
CIS 321—IS Analysis & Design Chapter 1: The World of the Modern Systems Analyst.
The Natural Resources Digital Library Needs, Partners, and Challenges Bonnie Avery, Janine Salwasser, & Janet Webster Oregon State University.
OPC Database.NET. OPC Systems.NET What is OPC Systems.NET? OPC Systems.NET is a suite of.NET and HTML5 products for SCADA, HMI, Data Historian, and live.
AIRNow-International The future of the United States real-time air quality reporting and forecasting program and GEOSS participation John E. White U.S.
SednaSpace A software development platform for all delivers SOA and BPM.
C Copyright © 2009, Oracle. All rights reserved. Appendix C: Service-Oriented Architectures.
WPS Application Patterns at the Workshop “Models For Scientific Exploitation Of EO Data” ESRIN, October 2012 Albert Remke & Daniel Nüst 52°North Initiative.
CALIFORNIA DEPARTMENT OF WATER RESOURCES GEOSPATIAL TECHNICAL SUPPORT MODULE 2 ARCHITECTURE OVERVIEW AND DATA PROMOTION FEBRUARY 20, 2013.
George Washington Birthplace NM: Geodatabase Development for Resource Management and Planning Bill Slocumb GIS Specialist and Research Associate North.
MIS3300_Team8 Service Aron Allen Angela Chong Cameron Sutherland Edment Thai Nakyung Kim.
U.S. Department of the Interior U.S. Geological Survey Next Generation Data Integration Challenges National Workshop on Large Landscape Conservation Sean.
Funding Opportunities for GI Science at National Science Foundation Nina Lam 02/04/00.
NEPTUNE Canada Workshop Oceans 2.0 Project Environment NEPTUNE Canada DMAS Team Victoria, BC February 16, 2009.
Geospatial Technical Support Module 2 California Department of Water Resources Geospatial Technical Support Module 2 Architecture overview and Data Promotion.
Planning for Arctic GIS and Geographic Information Infrastructure Sponsored by the Arctic Research Support and Logistics Program 30 October 2003 Seattle,
Latrobe.edu.au CRICOS Provider 00115M Panel discussion: Support needs of academics, roles for librarians and skill sets needed Simon Huggard, Digital Infrastructure.
1 Implementing Portal Technology To Support Data/Information Management - Portal Prototype Dashboard for Fisheries Information System (FIS) November 3,
From FAUST to VOYAGER efforts to maintain map and geodata stocks 17th Conference of the LIBER Groupe des Cartothécaires TALLINN, Estonia June 2010.
Billy Gellepis Administrative Services Associate NCURA Region VI Meeting Waikoloa, HI – April 2012 Developing an OPEN HOUSE Collaboration Between Principal.
A Biodiversity Content Management System for Research, Education, and Outreach Cynthia Sims Parr University of Maryland, College Park Co-authors Roger.
Update on e-Placement at Aon Ian Summers. Aon Limited is authorised and regulated by the Financial Services Authority in respect of insurance mediation.
Software Engineering Committee Status Report: Preliminary Findings and Recommendations Richard Loft and Gerry Wiener SE Committee Co-chairs National Center.
Trimble Connected Community Overview Craig Muir. TCC Mission Enable the secure sharing and aggregated viewing of data with authorized people and devices.
Pascucci-1 Valerio Pascucci Director, CEDMAV Professor, SCI Institute & School of Computing Laboratory Fellow, PNNL Massive Data Management, Analysis,
Data Integration and Management A PDB Perspective.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 2 Information System Building Blocks.
2-1 A Federation of Information Systems. 2-2 Information System Applications.
Agricultural Knowledge Management in IPMS KM4Dev Addis, April, 05, 2013 Fanos Mekonnen, Knowledge Management Expert, LIVES.
The US Long Term Ecological Research (LTER) Network: Site and Network Level Information Management Kristin Vanderbilt Department of Biology University.
TopCAT Use Cases Priorities User Interface 1 ICAT developer workshop, August 2009 Laurent Lerusse – STFC
AN ORGANISATION FOR A NATIONAL EARTH SCIENCE INFRASTRUCTURE PROGRAM Virtual Geophysics Laboratory (VGL): Scientific workflows Exploiting the Cloud Josh.
| nectar.org.au NECTAR TRAINING Module 2 Virtual Laboratories and eResearch Tools.
U.S. Department of the Interior U.S. Geological Survey Decision Support Tools and USGS Data Management Best Practices Cassandra Ladino USGS Chesapeake.
The 4 Capital Approach: A Framework for Thinking about Sustainable Community Development.
CUAHSI HIS: Science Challenges Linking small integrated research sites (
Copyright (c) 2014 Pearson Education, Inc. Introduction to DBMS.
Natura 2000 System Alberto Telletxea Bilbomática under EEA Contractor.
Barrow Area Spatial Data Infrastructure A collaborative effort of the Barrow Arctic Science Consortium Digital Working Group Funding for this effort has.
Preliminary Findings Baseline Assessment of Scientists’ Data Sharing Practices Carol Tenopir, University of Tennessee
Institutional data curation implementation 1st African Digital Curation Conference 12 February 2008.
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
William Perry U.S. Geological Survey Western Ecological Research Center Geography 375 Final Project May 22, 2013.
BG 5+6 How do we get to the Ideal World? Tuesday afternoon What gaps, challenges, obstacles prevent us from attaining the vision now? What new research.
Comments on SPI. General remarks Essentially all goals set out in the RTAG report have been achieved. However, the roles defined (Section 9) have not.
CyVerse Data Store Managing Your ‘Big’ Data. Welcome to the Data Store Manage and share your data across all CyVerse platforms.
Enhancements to Galaxy for delivering on NIH Commons
Accessing the VI-SEEM infrastructure
Challenges of open science
Lecture 8 Database Implementation
Design and realization of Payload Operation and Application system of China’s Space Station Wang HongFei 首页.
Problem: Ecological data needed to address critical questions are dispersed, heterogeneous, and complex Solution: An internet-based mechanism to discover,
About Thetus Thetus develops knowledge discovery and modeling infrastructure software for customers who: Have high value data that does not neatly fit.
Presentation transcript:

Data Management Needs and Challenges for Telemetry Scientists Josh M London Wildlife Biologist, Polar Ecosystems Program National Marine Mammal Laboratory NOAA NMFS Alaska Fisheries Science Center

Temptation to identify biologists as the source for the raw data

The Tip of a Complex Iceberg hypothesisagency needs/mandatesfunding initiatives opportunistic vs. planned tag design/vendortag programming Deployment of tags (location, age/sex, time) Data Management data quality control synthesis movement model Publications Contract reports Status/Listing Review derived products Field Work and Study Design Narrowing Bottleneck Many biologists lack the skills and training for effective, scalable database design and data management practices

Field Work & Tag Deployment  When? Where?  Which Tag/Vendor?  Which Age? Which Sex? (Do we have a choice?)  Tag Programming  Deployment Length (attachment type)

Limited Tools for Managing Raw Telemetry Data ‘raw’ data  via Argos as CSV/Text  Process w/ Vendor Software (behavior data)  Typically output as CSV  Field data about animal (e.g. ID, species, sex, age, health) needs  Explore ‘raw’ data  Address hypotheses  Visualize movement/use  Synthesize w/ dependent (e.g. health, age) and independent data (e.g. other animals, remote sensed)

Biologists Not Trained in Large Scale Data Management Biologists  Excel and/or Access  ESRI ArcMap (shapefiles)  Google Earth  Mouse Click Interaction  Programming (visual basic, R, python) recipe driven … not developers Data Manager  Postgres/PostGIS, Oracle, MySQL, SQL Server  Normalization and Efficient Design  Scripting, Jobs, Transactions  Data Integrity  Automation, Reproducible

My Perspective To address complex questions related to marine mammal telemetry and understanding animal ecology, I had to become more of a data manager …And, in the process, I’ve become less of a biologist Start (2006)  Argos Monthly CDs  SatPack Access Database  Excel Files (limited to 56k)  Large, Flat Tables  No Central Repository Current System  Nightly FTP Argos Push  Nightly Data Processing  CSV/External Oracle Table  PL/SQL Procedures  Developed/Designed with Training via Google Search

My Perspective Current Limitations  Data access requires a minimum level of technical skills (basic SQL, Oracle framework, Oracle APEX, R spatial tools, ArcMap)  Single Point of Access/Failure (me)  Limited Documentation of Design  Design May Not be Optimal/Appropriate  Main Objective to Provide Data to Analysts – Not necessarily designed for providing data to public

My Perspective Greatest Needs – Research Program  Data Management and Design Consultation  Data Design & Documentation Portal (user-friendly metadata)  Low Tech Exploration Tools  Database and Application Developers (data flow and data input)  Training Opportunities

My Perspective Greatest Needs – External to Program?  Provide Meaningful Public Access to Data  A Clear Data Sharing Policy w/ Best Practices  Encourage/Facilitate Scientific Collaboration  Meet Agency Needs and Requirements  How to Communicate Scientific Knowledge in the Modern/Digital Age–sharing knowledge/expertise just as important as sharing data  Publish Data Once

My Perspective Challenges / Road Blocks  Limited Funds and Priorities – appropriate resources for doing the priority analysis and science not available, let alone the resources to distribute data responsibly  Database design/management often in the hands of the least skilled users  IT Policies, Investments, and Infrastructure Varied Across Institutions  No standard(s) for communicating and sharing ‘raw’ animal telemetry data. What is ‘raw’ data?