Cybertaxonomy and revisionary systematics Dmitry Dmitriev Illinois Natural History Survey, USA


Taxonomy In the 255 years since Linnaeus, about 1,800,000 species have been described. Descriptive taxonomy remains a very slow and labor-intensive process.

Species accumulation curve Catalogue of Life provides records for ~1,350,000 species names. The total is estimated at 1,800,000 species.

Number of species described in 10-year periods An estimated 10,000-15,000 species are described each year.

Species accumulation curve in Cicadellidae

Species accumulation curve in Cicadellidae: Typhlocybinae

Number of species of Typhlocybinae described in 10-year periods Authors labeled on the chart: Beamer, Edwards, Osborn, Ribaut, Matsumura, Knull, Zachvatkin, Ross, Young, Dworakowska, Anufriev, Ross, Dworakowska, Dietrich, Dmitriev

Taxonomic impediment Despite our best efforts, the vast majority (perhaps 90% or more) of the species remain undocumented. Taxonomists currently describe 15,000 new species per year. Recent estimates suggest that between 27,000 and 130,000 species are being lost each year to extinction.
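A back-of-the-envelope calculation, using only the figures quoted on these slides, gives a sense of the scale of the gap. It is a rough sketch resting on two assumptions that the talk does not state: that the ~1,800,000 described species are the documented 10%, and that the description rate stays constant.

```python
# Rough arithmetic on the figures quoted in the talk.
# Assumption: the 1,800,000 described species are the documented 10%.
described = 1_800_000
undocumented_percent = 90           # "perhaps 90% or more" remain undocumented
described_per_year = 15_000
lost_per_year = (27_000, 130_000)   # range of extinction estimates

# Integer arithmetic avoids floating-point noise in the totals.
estimated_total = described * 100 // (100 - undocumented_percent)
undocumented = estimated_total - described
years_to_finish = undocumented / described_per_year

print(f"Estimated total species: {estimated_total:,}")
print(f"Undocumented species:    {undocumented:,}")
print(f"Years at current rate:   {years_to_finish:,.0f}")
for lost in lost_per_year:
    ratio = lost / described_per_year
    print(f"{lost:,} lost per year = {ratio:.1f}x the description rate")
```

Under these assumptions the backlog would take roughly a thousand years to clear, while even the lowest extinction estimate exceeds the yearly description rate.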

Taxonomic revision challenges A taxonomic revision summarizes knowledge about a group of organisms (morphology, distribution patterns, ecological preferences, bioacoustics, molecular variation, synonyms, new species, identification tools, etc.). Efficient management and synthesis of large amounts of nomenclatural, morphological, and distributional data are required.

Taxonomic revision challenges When published, a revision provides a snapshot of current knowledge of a group of organisms. It stimulates further study and species discovery. It quickly becomes outdated. 2006: first record of the tribe Erythroneurini from South America; description of the genus Zyginama. 2008: revision of the genus Zyginama (70 species). 2013: three new species of the genus Zyginama from Argentina.

What is cybertaxonomy? Technological advances, including relational databases, digital imaging, and Internet dissemination, provide taxonomists with tools to increase both the quality and quantity of such studies. Cybertaxonomy aims to develop information processing tools that enable taxonomists both to produce traditional taxonomic revisions more rapidly and to develop new models for managing and disseminating taxonomic information.

Available applications 3i, SpeciesFile, MX, Scratchpads

3i - Interactive keys

3i – Pictorial keys

3i - Dichotomous Keys

3i - Taxonomic pages

Publications based on 3i

3i - Data Sharing Catalogue of Life (CoL), Encyclopedia of Life (EoL), Global Biodiversity Information Facility (GBIF), Discover Life, Global Names Architecture (GNA), Biodiversity Heritage Library

SpeciesFile Designed by David Eades and colleagues at the Illinois Natural History Survey (USA). Supported by the International Orthoptera Society. Online application based on a SQL Server database and Visual Basic .NET. Originally designed for the insect order Orthoptera, but later adopted by researchers working on other insect groups.

The Species File system allows storage and retrieval of taxonomic and nomenclatural information with associated images, distribution, and bibliographic information. The system strictly enforces the rules of the International Code of Zoological Nomenclature. Data are shared with Catalogue of Life and GBIF.
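To illustrate what enforcing nomenclatural rules at data entry can look like, here is a minimal sketch. It is not the Species File implementation: the function name `check_name` is hypothetical, and it covers only two formal conventions of the zoological Code, namely the mandatory family-group suffixes (Art. 29.2) and the single capitalized word used for genus- and family-group names.

```python
import re

# Mandatory family-group suffixes in zoology (ICZN Art. 29.2).
RANK_SUFFIX = {"family": "idae", "subfamily": "inae", "tribe": "ini"}


def check_name(name: str, rank: str) -> list[str]:
    """Return the problems found; an empty list means none were detected."""
    problems = []
    # Genus- and family-group names are one capitalized word in Latin letters.
    if not re.fullmatch(r"[A-Z][a-z]+", name):
        problems.append("name must be one capitalized word in Latin letters")
    # Family-group ranks additionally require a fixed suffix.
    suffix = RANK_SUFFIX.get(rank)
    if suffix and not name.endswith(suffix):
        problems.append(f"{rank} name must end in -{suffix}")
    return problems


print(check_name("Cicadellidae", "family"))      # no problems expected
print(check_name("Typhlocybinae", "family"))     # wrong suffix for the rank
print(check_name("zyginama", "genus"))           # missing capital letter
```

A production system would check far more (homonymy, priority, gender agreement, type designations); the point here is only that such rules can be applied automatically when a name is entered.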

MX (MatriX) Designed by Matt Yoder at Texas A&M University (USA). Online application based on MySQL and Ruby on Rails. Originally designed for the insect order Hymenoptera, but later adopted by researchers working on other insect groups.

MX (MatriX) The system can be used to store and manipulate different types of data: bibliographies, images, specimen records, distribution, molecular and morphological information. It supports dichotomous and matrix-based keys. Images can be linked to MorphBank.

MX (MatriX) A significant part of MX is an integrated morphological ontology builder, which allows vocabulary terms to be linked to each other as well as to their definitions and illustrations.

MX (MatriX) Dichotomous and matrix-based keys.
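The idea behind a matrix-based (interactive) key can be sketched in a few lines: taxa are scored for character states in a matrix, and each observation the user enters eliminates the taxa that do not match. The example below is a generic illustration, not MX or 3i code; the taxa, characters, and the `remaining_taxa` helper are all hypothetical placeholders.

```python
# Hypothetical taxon-by-character matrix; taxa and states are placeholders.
MATRIX = {
    "Taxon A": {"wing spots": "present", "head shape": "rounded"},
    "Taxon B": {"wing spots": "absent", "head shape": "rounded"},
    "Taxon C": {"wing spots": "present", "head shape": "pointed"},
}


def remaining_taxa(matrix, observations):
    """Keep the taxa whose scored states match every observed state."""
    return [
        taxon
        for taxon, states in matrix.items()
        if all(states.get(char) == state for char, state in observations.items())
    ]


print(remaining_taxa(MATRIX, {"wing spots": "present"}))
# → ['Taxon A', 'Taxon C']
print(remaining_taxa(MATRIX, {"wing spots": "present", "head shape": "pointed"}))
# → ['Taxon C']
```

Unlike a dichotomous key, which fixes the order of the questions, this approach lets the user score characters in any order and skip those that cannot be observed on the specimen at hand.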

Scratchpads Based on the Drupal content management system. An online application that provides users with templates for entering taxonomy-related information in a uniform way. Supports classifications, taxon profiles, specimens, literature, images, maps, phenotypic, genotypic & morphometric datasets, keys, and phylogenies. Data can be exported in various formats.

Publishing observations and taxon data Specimen records & species pages on Scratchpads. Pushed to GBIF & EOL (requires site registration with GBIF & EOL). Darwin Core Archive (DwCA).
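For readers unfamiliar with the format, the core of a Darwin Core Archive is a delimited text file whose columns are named after Darwin Core terms. The sketch below writes such an occurrence table with Python's standard `csv` module. It is not the Scratchpads export code: the column names are genuine Darwin Core terms, but the record (identifier, coordinates, date) is a made-up placeholder and `to_occurrence_csv` is a hypothetical helper.

```python
import csv
import io

# Column names are Darwin Core terms; the record itself is a placeholder.
FIELDS = [
    "occurrenceID", "basisOfRecord", "scientificName", "country",
    "decimalLatitude", "decimalLongitude", "eventDate",
]

records = [{
    "occurrenceID": "example:specimen:0001",   # hypothetical identifier
    "basisOfRecord": "PreservedSpecimen",
    "scientificName": "Zyginama",
    "country": "Argentina",
    "decimalLatitude": "-27.0",                # placeholder coordinates
    "decimalLongitude": "-55.0",
    "eventDate": "2013-01-15",                 # placeholder date
}]


def to_occurrence_csv(rows):
    """Serialize rows to CSV text with a Darwin Core header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


print(to_occurrence_csv(records))
```

A complete archive also bundles a `meta.xml` descriptor that maps each column to its term, and usually a metadata document describing the dataset, all zipped together; those parts are omitted here.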

Article publishing Paper assembled from the Scratchpad database. XML submission, peer review & marked-up publication by Pensoft. Published in ZooKeys & PhytoKeys. PDF, HTML, XML. doi: /zookeys

TaxonWorks: new development

Diagram (labels only): Taxon Name, Relationships, Name Statuses, Alternative Classification, Taxon Concepts, Specimens, Collecting Events, Georeferences, Sources, Media, People, Interactive Keys, Matrices

TaxonWorks: Nomen Ontology

TaxonWorks: Nomen Ontology

TaxonWorks: new development

Acknowledgements Collaborators: Christopher Dietrich, Roman Rakitov, Daniela Takia, Sindhu Krishnankutti, Doris Lagos, David Eades, Matt Yoder, Edward DeWalt, Alexey Solodovnikov, Yalin Zhang, Richard Pile, and many others. Grant support: NSF, EoL.