The VegBank taxonomic datamodel Robert K. Peet Sponsored by: The Ecological Society of America US National Science Foundation Produced at: The National.

Presentation transcript:

The VegBank taxonomic datamodel Robert K. Peet Sponsored by: The Ecological Society of America US National Science Foundation Produced at: The National Center for Ecological Analysis and Synthesis

1. The Intended User Groups. Specifically: “Ecologists” who record species composition of vegetation; “Ecologists” who use records of vegetation composition. In general: creators and distributors of data wherein organisms need to be labeled; users and consolidators of data wherein organisms are labeled.

VegBank. The ESA Vegetation Panel is currently developing VegBank (www.vegbank.org) as a public vegetation plot archive. VegBank is expected to function for vegetation plot data in a manner analogous to GenBank. Primary data will be deposited for reference, novel synthesis, and reanalysis, particularly for classification.

2. Intended Functionality. Ecologists, like most users of organism names, care about how to label organisms, and how to interpret labels others have placed on organisms. We wish for ease in combining datasets. We abhor name changes and ambiguity.

Core elements of VegBank. [Diagram: Plot Observation, Taxon Observation, Taxon Interpretation, Plot Interpretation, Taxon Assignment, Plot Assignment.]

Taxonomic database challenge: Standardizing organisms and communities. The problem: integration of data potentially representing different times, places, investigators and taxonomic standards. The traditional solution: a standard checklist of organisms.

Standard lists are available for taxa. Representative examples for higher plants in North America / US: USDA Plants, ITIS, NatureServe, BONAP, Flora North America. These are intended to be checklists wherein the taxa recognized perfectly partition all plants. The lists can be dynamic.

Most taxon checklists fail to allow effective dataset integration The reasons include: The user cannot reconstruct the database as viewed at an arbitrary time in the past, Taxonomic concepts are not defined (just lists), Multiple party perspectives on taxonomic concepts and names cannot be supported or reconciled.

Intended functionality. Organisms are labeled by reference to concept (name-reference combination). Party perspectives on concepts and names can be dynamic. User can select which party perspective to follow. Party perspectives are perfectly archived. Different name systems are supported. Enhanced stability in recognized concepts by separating name assignment and rank from concept.

3. Taxonomic theory. [Diagram: Name, Reference, Concept.] A taxon concept represents a unique combination of a name and a reference. “Taxon concept” is roughly equivalent to “Potential taxon” & “assertion”.

[Diagram: Name, Concept, Usage.] A usage represents a unique association of a concept with a name. Usage does not appear in the IOPI model, but instead is a special case of concept. Usage can be used to apply multiple name systems to a concept. Desirable for stability in recognized concepts.
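The two definitions above can be sketched as a pair of record types. This is only an illustration of the idea, not the VegBank schema: the class names, field names, and the name-system labels `scientific` and `english_common` are assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Concept:
    """A taxon concept: a unique combination of a name and a reference."""
    name: str       # name as it appears in the reference
    reference: str  # publication that circumscribes the taxon

    def label(self) -> str:
        return f"{self.name} sec {self.reference}"


@dataclass(frozen=True)
class Usage:
    """A unique association of a concept with a name in some name system."""
    concept: Concept
    name: str
    name_system: str  # hypothetical labels, e.g. "scientific"


def names_for(concept, usages):
    """Return {name_system: name} for every usage attached to the concept."""
    return {u.name_system: u.name for u in usages if u.concept == concept}


# Same name, different references: two distinct concepts.
radford = Concept("Carya ovata", "Radford 1968")
gleason = Concept("Carya ovata", "Gleason 1952")

# One concept can carry names from several name systems (illustrative data).
usages = [
    Usage(radford, "Carya ovata", "scientific"),
    Usage(radford, "northern shagbark", "english_common"),
]

print(radford.label())
print(names_for(radford, usages))
```

Because the name lives in the usage rather than in the concept's identity, a name can change without disturbing records that point at the concept.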

VegBank taxonomic data model: single party, dynamic perspective. [Diagram of data relationships among Name, Concept, Usage, and Reference. Attribute groups shown: Start, Stop, ConceptStatus, Level, Parent; and, for Usage, Start, Stop, NameStatus, Name system.]

Party Perspective. The Party Perspective on a concept includes: Status (Standard, Nonstandard, Undetermined); Correlation with other concepts (Equal, Greater, Lesser, Overlap, Undetermined); Lineage (Predecessor and Successor concepts); Start & Stop dates.
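One way correlations might support dataset integration is sketched below. The five correlation values come from the slide; everything else is an assumption for illustration, including the function name `can_relabel`, the concept identifiers `narrow` and `broad`, and the reading that an entry `(a, b)` describes concept `a` relative to concept `b`.

```python
from enum import Enum


class Correlation(Enum):
    EQUAL = "equal"
    GREATER = "greater"
    LESSER = "lesser"
    OVERLAP = "overlap"
    UNDETERMINED = "undetermined"


def can_relabel(a, b, correlations):
    """Can a record labeled with concept a be safely relabeled as concept b?

    Assumed convention: correlations[(a, b)] describes a relative to b.
    Relabeling is treated as safe only when a equals b or a is narrower
    than (LESSER) b; a missing entry counts as UNDETERMINED.
    """
    if a == b:
        return True
    corr = correlations.get((a, b), Correlation.UNDETERMINED)
    return corr in (Correlation.EQUAL, Correlation.LESSER)


# Hypothetical party perspective: "narrow" is contained in "broad".
correlations = {
    ("narrow", "broad"): Correlation.LESSER,
    ("broad", "narrow"): Correlation.GREATER,
}

print(can_relabel("narrow", "broad", correlations))  # narrow records can be lumped
print(can_relabel("broad", "narrow", correlations))  # broad records cannot be split
```

Under this reading, two datasets can be combined at the level of the broader concept, while Overlap and Undetermined correlations leave the records unmerged.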

VegBank taxonomic data model: multiple parties, dynamic perspectives. [Diagram of data relationships among Name, Concept, Party, Usage, Status, and Reference. Usage carries Start, Stop, NameStatus, Name system; Status carries Start, Stop, ConceptStatus, Level, Parent.]

VegBank taxonomic data model: with party correlations and lineages. [Diagram of data relationships among Name, Concept, Party, Usage, Status, Correlation, Lineage, and Reference. Usage carries Start, Stop, NameStatus, Name system; Status carries Start, Stop, ConceptStatus, Level, Parent.]

Primary differences between the VegBank and IOPI models The IOPI model is optimized for describing taxonomic decisions represented in literature. The VB model is optimized for stability in accepted concepts (super concepts), support of multiple dynamic party perspectives, support of multiple name systems.

4. State of Development. 1. VegBank. 2. Collaborators: NatureServe Biotics 4; USDA PLANTS & ITIS.

Status of VegBank. Working prototype; open for deposit Nov 1. Production version July 2004. Access version on VegBranch today. Efficient data exchange by July 2004. Functionality mandated in draft FGDC standards. IAVS working group for data exchange standards established 2003.

VegBank data content. Prototype populated with USDA PLANTS lists and synonyms = weak concepts. Contract with NatureServe and John Kartesz: Develop reference-based concepts for by July 2004 of the ~32000 vascular plant taxa at species level and below. List of unambiguous taxa (~6000?). Treatment of most ambiguous taxa. Demonstration mapping to FNA. A few demonstration groups in depth.

Concept workbench Concept workbench for both plant concepts and community concepts is planned.

NatureServe Biotics 4. In production. Loading the same initial concepts. Already using concepts where strong differences between state programs. Routine data exchange with VegBank under development.

PLANTS & ITIS. Frequent communication during redesign. Design documents for conversion to concept-based system complete. Collaboration to assure that the concepts we develop will be employed in the new system. Uncertainties about funding.

Six shagbark hickory concepts (possible synonyms are listed together)
Names: Carya ovata; Carya carolinae-septentrionalis; Carya ovata v. ovata; Carya ovata v. australis.
Concepts:
(One shagbark) C. ovata sec Gleason ’52; C. ovata sec FNA ’97
(Southern shagbark) C. carolinae-s. sec Radford ’68; C. ovata v. australis sec FNA ’97
(Northern shagbark) C. ovata sec Radford ’68; C. ovata (v. ovata) sec FNA ’97
References: Gleason, Britton & Brown; Radford et al., Flora Carolinas; Stone, Flora North America.
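The shagbark example shows why a bare name is an ambiguous label. The short sketch below encodes the six concepts from the slide and lists the names that fall into more than one synonym group. The tuple layout and the function name `ambiguous_names` are assumptions for illustration; the years are expanded from the slide's ’52, ’68 and ’97.

```python
from collections import defaultdict

# (name, reference, synonym group), transcribed from the slide.
CONCEPTS = [
    ("Carya ovata", "Gleason 1952", "one shagbark"),
    ("Carya ovata", "FNA 1997", "one shagbark"),
    ("Carya carolinae-septentrionalis", "Radford 1968", "southern shagbark"),
    ("Carya ovata v. australis", "FNA 1997", "southern shagbark"),
    ("Carya ovata", "Radford 1968", "northern shagbark"),
    ("Carya ovata v. ovata", "FNA 1997", "northern shagbark"),
]


def ambiguous_names(concepts):
    """Return names that are applied to more than one synonym group."""
    groups_by_name = defaultdict(set)
    for name, _reference, group in concepts:
        groups_by_name[name].add(group)
    return sorted(n for n, groups in groups_by_name.items() if len(groups) > 1)


print(ambiguous_names(CONCEPTS))
```

A plot record labeled only "Carya ovata" could mean the single broad shagbark or the northern segregate; adding the reference ("sec Radford 1968") removes that ambiguity.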

Application of Party Perspective
Parties: ITIS; FNA Committee; NatureServe.
Concepts: Carya ovata sec Gleason 1952; Carya ovata sec FNA 1997; Carya ovata sec Radford 1968; Carya carolinae sec Radford 1968; Carya ovata (ovata) sec FNA 1997; Carya ovata australis sec FNA 1997.
Status (St = Standard, NS = Nonstandard):
Party   Concept              Status   Start   Usage: SciName
ITIS    ovata – G52          NS       1996
ITIS    ovata – R68          St       1996    C. ovata
ITIS    carolinae-s – R68    St       1996    C. carolinae-sept.
ITIS    carolinae-s – R68    NS       2000
ITIS    ovata aust – FNA     St       2000    C. carolinae-sept.
ITIS    ovata – R68          NS       2000
ITIS    ovata ovata – FNA    St       2000    C. ovata
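The status records on this slide can be read as a dated log from which a party's view at any past time is recoverable. The sketch below is one possible reading, not VegBank code: it assumes each record is (concept, status, start year, scientific name), that "St" and "NS" mean Standard and Nonstandard, and that a later record for the same concept supersedes the earlier one, since the slide shows no stop dates. The function name `standard_concepts` is hypothetical.

```python
# ITIS status records transcribed from the slide; None means no name shown.
ROWS = [
    ("ovata - G52", "NS", 1996, None),
    ("ovata - R68", "St", 1996, "C. ovata"),
    ("carolinae-s - R68", "St", 1996, "C. carolinae-sept."),
    ("carolinae-s - R68", "NS", 2000, None),
    ("ovata aust - FNA", "St", 2000, "C. carolinae-sept."),
    ("ovata - R68", "NS", 2000, None),
    ("ovata ovata - FNA", "St", 2000, "C. ovata"),
]


def standard_concepts(rows, year):
    """Return {concept: scientific name} for concepts Standard in the given year.

    Assumption: for each concept, the most recent record that started on or
    before `year` is the one in force.
    """
    in_force = {}
    for concept, status, start, sci_name in sorted(rows, key=lambda r: r[2]):
        if start <= year:
            in_force[concept] = (status, sci_name)
    return {c: name for c, (status, name) in in_force.items() if status == "St"}


print(standard_concepts(ROWS, 1998))
print(standard_concepts(ROWS, 2001))
```

Under this reading the name "C. ovata" is in use in both years, but it points at the Radford 1968 concept in 1998 and at the FNA concept in 2001, which is the kind of shift a name-only checklist cannot record.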

VegBank taxonomic data model: with party correlations and lineages. [Diagram of data relationships among Name, Concept, Party, Usage, Status, Correlation, Lineage, and Reference. Usage carries Start, Stop, NameStatus, Name system; Status carries Start, Stop, ConceptStatus, Level, Parent.]

Core elements of the IOPI model. [Diagram: Name, Concept, Source (Type: Lit, Org), Correlation, Status, Reference; relationship labels: “Is author”, “Assigns status”.]