VegBank and the ESA Cyber-infrastructure for Vegetation Science Robert K. Peet & The Ecological Society of America Vegetation Panel.

Slides:



Advertisements
Similar presentations
Evolutionary biology Population genetics Systematics Paleontology Botany and Zoology Genomics Ecology Medicine Agriculture Anthropology Bioinformatics.
Advertisements

A centre of expertise in data curation and preservation DCC/NeSC eScience Workshop, June 2008 Working in partnership with the eScience community This work.
The VegBank taxonomic datamodel Robert K. Peet Sponsored by: The Ecological Society of America US National Science Foundation Produced at: The National.
V Alyssa Rosemartin 1, Lee Marsh 1, Ellen Denny 1, Bruce Wilson USA National Phenology Network, Tucson, AZ; 2 - Oak Ridge National Laboratory, Oak.
OVERVIEW OF DATA FLOW IN NVC PROCESS Field sheets NVC Proceedings.
Taxonomic data issues: An ecologist’s experience R.K. Peet The University of North Carolina Adapted by J Kennedy.
VegBank.org: a Permanent, Open-Access Archive for Vegetation Plot Data. Michael T. Lee 1, Michael D. Jennings 2, Robert K. Peet 1. Interacting with the.
Scratchpads Publishing biodiversity: The interplay between Scratchpads and the Biodiversity Data Journal Dr Dimitrios Koureas Biodiversity Informatics.
Vegetation databases Lessons from VegBank, SEEK, TDWG, IAVS, & NCEAS Robert Peet University of North Carolina.
Plant Systematics databases: Users perspectives Robert K. Peet, University of North Carolina In collaboration with The National Center for Ecological Analysis.
The MetaDater Model and the formation of a GRID for the support of social research John Kallas Greek Social Data Bank National Center for Social Research.
Names are not sufficient: the challenge of documenting organism identity R.K. Peet, J.B.Kennedy, and N.M. Franz and The Ecological Society of America Vegetation.
Data models for Community information Robert K. Peet, University of North Carolina John Harris, Nat. Center for Ecol. Analysis & Synthesis Michael D. Jennings,
VegBank A vegetation field plot archive Sponsored by: The Ecological Society of America - Vegetation Classification Panel Produced at: The National Center.
EcoInformatics & Vegetation Science. The symposium message Plant community ecology is on the brink of a dramatic transformation that will be made possible.
North American initiatives in Ecoinformatics: Vegbank and SEEK Robert K. Peet and The Ecological Society of America Vegetation Panel The SEEK development.
The VegBank taxonomic datamodel Robert K. Peet Sponsored by: The Ecological Society of America US National Science Foundation Produced at: The National.
n U.S. Department of Agriculture Natural Resources Conservation Service National Plant Data Team (NPDT) NRCS: A repository of plant data P lant L ist.
Managing the Record of Research At the Smithsonian Using SIdora SAA Research Forum August 12, 2014.
Use case lessons: Components of the SEEK architecture Robert K. Peet University of North Carolina.
A new floristic atlas for the Southeast based on taxon concept relationships Robert K. Peet 1, Alan S. Weakley 1,2 & Xianhua Liu 1,3 1 The University of.
Introduction to OBIS-USA Biological Data, Applications, & Relationships March 14, 2011.
Training course on biodiversity data publishing and fitness-for-use in the GBIF Network, 2011 edition How Darwin Core Archives have changed the landscape.
Elements of a Data Management Plan Bill Michener University Libraries University of New Mexico Data Management Practices for.
The National Park Service's Information Management Strategy, Infrastructure, and Software Applications.
[] Where Did Those GBIF Occurrences Come From? Providing Digital Access to NatureServe's Reference Database: Report on a Project in the Early Stages of.
Standards and tools for publishing biodiversity data Yu-Huang Wang June 25, 2012.
12/04/07FGDC Vegetation Subcommittee Briefing Federal Geographic Data Committee Vegetation Subcommittee Briefing for the FGDC Coordination Group December.
Component 11/Unit 8b Data Dictionary Understanding and Development.
Extensible Markup Language (XML) Extensible Markup Language (XML) is a simple, very flexible text format derived from SGML (ISO 8879).ISO 8879 XML is a.
Digitization of Natural History Collections (DIGIT) Larry Speers Program Officer Digitization of Natural History Collections Data TDWG Annual Meeting Oct.
Overview of progress in Ecoinformatics Susan Wiser Landcare Research, Lincoln New Zealand.
Scratchpads The virtual research environment for biodiversity data Simon Rycroft, Dave Roberts, Vince Smith, Alice Heaton, Katherine Bouton, Laurence Livermore,
7/11/2006FGDC Vegetation Subcommittee Briefing Federal Geographic Data Committee Vegetation Subcommittee Coordination Group Briefing July 11, 2006 Ralph.
The VegBank taxonomic datamodel Sponsored by: The Ecological Society of America - Vegetation Classification Panel Produced at: The National Center for.
From Small to Big… Gail Kampmeier Illinois Natural History Survey University of Illinois
Collections. Vegetation sampling We observe and collect data on soil.
LTER Data Management Margaret O’Brien Santa Barbara Coastal Long Term Ecological Research (LTER) Project Santa Barbara Channel Biodiversity Observation.
Digital Accountability: The Line Between Producing and Preserving Digital Government Information Mary Alice Baish Superintendent of Documents Indiana State.
Laura Russell Programmer VertNet Buenos Aires (Argentina) 28 September 2011 Training course on biodiversity data publishing and.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Multi-institutional collaborative program. Established in 1988 to document the composition and status of natural vegetation of the Carolinas. Provides.
Current and planned tools and resources. Multi-institutional collaborative program Established in 1988 to document the composition and status of natural.
1 The National Biological Information Infrastructure and Biodiversity Collections Annette Olson BCI meeting, Washington DC, January 28-29th, 2008.
The VegBank Data Model. Biodiversity data structure Taxonomic database Plot/Inventory database Occurrence database Plot Observation/ Collection Event.
Open access- a funders perspective (or “What we want from institutions”) CRC/RLUK/ARMA/SCONUL meeting 27 th January 2011 Robert Kiley, Head Digital Services,
The challenge of biodiversity: Plot, organism and taxonomic databases Robert K. Peet University of North Carolina The National Plots Database Committee.
Acronym Soup GBIF, TDWG & GUIDs Jerry Cooper. Global Biodiversity Information Facility (GBIF) Established in 2000 through non-binding MOU (25 countries.
April 14, 2005MIT Libraries Visiting Committee Libraries Strategic Plan Theme III Work to shape the future MacKenzie Smith Associate Director for Technology.
Transition to taxon concepts from a world of legacy data --- R.K. Peet 1, A.S. Weakley 1,2, X. Liu 1,3, & N. Franz 4,5 1 The University of North Carolina.
VegBank A vegetation field plot archive Produced at: The National Center for Ecological Analysis and Synthesis Principal Investigators: Robert K. Peet,
Riccardi: DIALOGUE Workshop August 1, 2005 Supported by NSF BDI 1 Representing and Using Phylogenetic Characters in Morphbank Greg Riccardi, David Gaitros,
California Digital Library Managing and Federating e-Print Repositories: UC’s eScholarship Initiatives CNI Fall Task Force Meeting December 1999 John Ober.
The challenge of organism identity --- The flora of the Southeast The flora of the Southeast as a case study Robert K. Peet University of North Carolina.
U.S. Department of the Interior U.S. Geological Survey Records Management Practices: Doing Right by the Records John Faundeen ASPRS May 1, 2008 Portland,
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
VegBank and the ESA Cyber-infrastructure for Vegetation Science R.K. Peet, Don Faber-Langendoen, Michael Jennings, & Michael Lee Ecological Society of.
The challenge of biodiversity: Plot, organism and taxonomic databases Robert K. Peet University of North Carolina The National Plots Database Committee.
Laura Russell VertNet Meherzad Romer NatureServe Canada John Wieczorek
A vision for community involvement and integration Robert K. Peet & Alan S. Weakley Alan S. Weakley.
VegBank A vegetation field plot archive Produced at: The National Center for Ecological Analysis and Synthesis Principal Investigators: Robert K. Peet,
Geog. 377: Introduction to GIS - Lecture 16 Overheads 1 5. Metadata 6. Summary of Database Creation 7. Data Standards 8. NSDI Topics Lecture 16: GIS Database.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
Data sharing and exchange: Experiences within the
The CVS-EEP Partnership
Training course on biodiversity data publishing and fitness-for-use in the GBIF Network, 2011 edition How Darwin Core Archives have changed the landscape.
Taxonomic and Community Classification Resources and Standards
An ecosystem of contributions
Bird of Feather Session
Presentation transcript:

VegBank and the ESA Cyber-infrastructure for Vegetation Science Robert K. Peet & The Ecological Society of America Vegetation Panel

The ESA Vegetation Classification Panel was established in 1993 with a mandate to support the emerging U.S. Vegetation Classification. Background

Vegetation field plots.Vegetation field plots. Documentation & description of floristic types.Documentation & description of floristic types. Submission & peer review of proposed types.Submission & peer review of proposed types. Management, citation, & archiving of vegetation data.Management, citation, & archiving of vegetation data. ESA Guidelines for vegetation classification The ESA Vegetation Panel has developed guidelines for vegetation classification covering requirements for:

North American Vegetation Classification Ecological Society of America – Standards, peer review & publication.Ecological Society of America – Standards, peer review & publication. US Federal Geographic Data Committee – US government standards.US Federal Geographic Data Committee – US government standards. NatureServe – Maintenance and distribution of the Classification.NatureServe – Maintenance and distribution of the Classification. USDA & ITIS – Taxonomic standards for organismsUSDA & ITIS – Taxonomic standards for organisms

The new community ecology Intersection of 3 data types Site data: climate, soils, topography, etc.Site data: climate, soils, topography, etc. Taxon attribute data: identification, phylogeny, distribution, life-history, functional attributes...Taxon attribute data: identification, phylogeny, distribution, life-history, functional attributes... Co-occurrence data: attributes of individuals (e.g., size, age, growth rate) and taxa (e.g., cover, biomass) that co-occur at a site.Co-occurrence data: attributes of individuals (e.g., size, age, growth rate) and taxa (e.g., cover, biomass) that co-occur at a site.

The Vegetation Plot The primary unit of vegetation observation. Universal attributes: date, location, area, species list, species importanceUniversal attributes: date, location, area, species list, species importance Optional attributes: environment, soil, disturbanceOptional attributes: environment, soil, disturbance Protocols and formats: many & flexibleProtocols and formats: many & flexible Available data: > 10 6 plot records containing > 5x10 7 species occurrence records.Available data: > 10 6 plot records containing > 5x10 7 species occurrence records.

VegBank VegBank – a public archive for vegetation plot observations ( – a public archive for vegetation plot observations ( VegBank functions in a manner analogous to GenBank.VegBank functions in a manner analogous to GenBank. Plot data can be deposited, discovered, viewed, cited, annotated, & downloaded.Plot data can be deposited, discovered, viewed, cited, annotated, & downloaded. Plot data can be used for documentation validation and reanalysis.Plot data can be used for documentation validation and reanalysis.

VegBank strategies Standard exchange format – established through IAVSStandard exchange format – established through IAVS Supports multiple protocols.Supports multiple protocols. Flexible and expandableFlexible and expandable Tools for data discovery, integration, and summarization.Tools for data discovery, integration, and summarization. Generalizable to most types of species co-occurrence data.Generalizable to most types of species co-occurrence data. Incentives to participate.Incentives to participate.

NatureServe Biotics Classification Mgt. US-NVC Panel Proposal submission Analysis & Synthesis VegBank & other plot archives US-NVC --- Proposed data flow Extraction NatureServe Explorer Peer Review NVC Proceedings Legend External Action Internal Action Software Entity

Biodiversity data structure Taxonomic database Observation database Occurrence database Observation/ Collection Event Specimen or Object Bio-Taxon Locality Observation or Community Type Observation type database

Project Plot Observation Taxon / Individual Observation Taxon Interpretation Plot Interpretation Core elements of VegBank

Taxonomic database challenge: Standardizing organisms and communities The problem: Integration of data potentially representing different times, places, investigators and taxonomic standards. The traditional solution: A standard list of organisms / communities.

USDA Plants & ITIS Abies lasiocarpa var. lasiocarpa var. arizonica One concept ofAbies lasiocarpa

Flora North America Abies lasiocarpa Abies bifolia A narrow concept of Abies lasiocarpa Partnership with USDA plants to provide plant concepts for data integration

High-elevation fir trees of western US AZ NM CO WY MT AB eBC wBC WA OR var. arizonica Abies lasiocarpa Distribution USDA & ITIS Flora North America Abies bifoliaAbies lasiocarpa A. lasiocarpa sec USDA > A. lasiocarpa sec FNA A. lasiocarpa sec USDA >A. bifolia sec FNA A. lasiocarpa v. lasiocarpa sec USDA >A. lasiocarpa sec FNA A. lasiocarpa v. lasiocarpa sec USDA > <A. bifolia sec FNA A. lasiocarpa v. arizonica sec USDA <A. bifolia sec FNA var. lasiocarpa

T

T

T

T

T

T

VegBank Lessions for community participation Usability (testing, training)Usability (testing, training) IncentivesIncentives –Uploads (mandates, rewards, security) –Downloads (opportunity) –Citation (culture) –Tools and integration ConnectivityConnectivity

VegBank Lessions for data sharing Embrace established standards (e.g., TDWG, EML)Embrace established standards (e.g., TDWG, EML) Work to establish new standardsWork to establish new standards –Content, process, exchange Design for idiosyncratic dataDesign for idiosyncratic data Anticipate distributed systems and connectivityAnticipate distributed systems and connectivity Leverage agency mandatesLeverage agency mandates Repositories for archiving; databases for accessRepositories for archiving; databases for access Respect intellectual propertyRespect intellectual property –Embargos, licenses, confidentiality

Roles & Responsibilities Professional Societies Set standardsSet standards –Data content and exchange format –Data archiving and access (discrete, well-circumscribed elements) Assure quality control (peer review)Assure quality control (peer review) –One of the 4 functions of publication: Certification, Validation, Awareness, Archiving

Roles & Responsibilities Digital repositories and libraries Archive and provide access to publications and dataArchive and provide access to publications and data Institutional responsibilities to granting agencies. Driven by lawyers and paid for by overheadInstitutional responsibilities to granting agencies. Driven by lawyers and paid for by overhead Potential security for databasesPotential security for databases

Roles & Responsibilities Granting Agencies Set requirements for data archiving and sharing, but perhaps delegate implementation to societiesSet requirements for data archiving and sharing, but perhaps delegate implementation to societies Pay for archiving and publication, directly or through overheadPay for archiving and publication, directly or through overhead

Roles & Responsibilities Data Centers Maintain a portfolio of critical, discipline-specific database systemsMaintain a portfolio of critical, discipline-specific database systems Maintain key infrastructure contentMaintain key infrastructure content –Digital identifiers –Common objects (e.g. taxa, publications) –Data registries

Roles & Responsibilities Publishers Require that specific types of data be archived (e.g., GenBank, VegBank)Require that specific types of data be archived (e.g., GenBank, VegBank) Imbed deep links as a form of citation for standard elements such as taxon concepts and data elementsImbed deep links as a form of citation for standard elements such as taxon concepts and data elements Provide archives for and links to supporting documentationProvide archives for and links to supporting documentation

Roles & Responsibilities Government Agencies Formulate federal standards and policies (in context of disciplinary standards).Formulate federal standards and policies (in context of disciplinary standards). Mandate and implement federal standards (e.g., FGDC standards)Mandate and implement federal standards (e.g., FGDC standards) Assure critical infrastructure existsAssure critical infrastructure exists

Conclusions VegBank is an example of a discipline- specific but public data archive (functions for deposit, discovery, withdrawal, citation, annotation)VegBank is an example of a discipline- specific but public data archive (functions for deposit, discovery, withdrawal, citation, annotation) Standards for data content, data exchange, and archivingStandards for data content, data exchange, and archiving Standards for reference to standard elements such as taxonomic dataStandards for reference to standard elements such as taxonomic data Work with a broader community to avoid institutional fragility.Work with a broader community to avoid institutional fragility.