A curation interface for reconciliation of species names for India. Thomas Vattakaven and R. Prabhakar, India Biodiversity Portal, Strand Life Sciences,

Slides:



Advertisements
Similar presentations
How can the ALA help BIGnet? Citizen Science at work Piers Higgs Citizen Science Team Lead Sydney, 3 rd April, 2011 The Atlas.
Advertisements

To share data, all providers must agree upon a data standard.
Diana Hernandez Integrating the catalogue of Mexican biota: different approaches for different client perspectives.
Virtualizing Entomology Collection Student: Di Wang (Alan) Sponsors: John Marris: Curator, Entomology Research Museum Stuart Charters: Department of Applied.
The Naturalist Fredrik Ronquist Swedish Museum of Natural History.
The Species 2000 Protocols for a Distributed System by Yuri Roskov, Species 2000 Secretariat 20th International CODATA Conference, Session K2, 25 October.
Integrated Taxonomic Information System Janet Gomon, Deputy Director, ITIS Smithsonian Institution Museum of Natural History The.
Create new database Create staging table Import new taxonomy Index new taxonomy Load new taxonomy to core db New TNRS DB New taxonomic source More taxonomic.
Arthur ChapmanData Quality Training SABIF June 2012 Taxonomic and Nomenclature Data A. D. Chapman Data Quality.
Next Steps in the Catalogue of Life Frank Bisby, Sp2000 and Thomas Orrell, ITIS Catalogue of Life Partnership.
Ocean Biodiversity Information – 29/11-1/12/20041 European Register of Marine Species version 2.0 data management, current status and plans for the future.
Controlled Vocabularies (CVs) Development Why do we need CVs? Select CV domains needed Identify applicable CVs to evaluate Evaluate content of CVs –Scientific.
Cynthia Parr Species Pages Group GBIF Briefing 11 Aug 2010.
BIS TDWG Conference 28 October 2013, Florence Documenting data quality in a global network: the challenge for GBIF Éamonn Ó Tuama, Andrea Hahn, Markus.
Biodiversity Information Systems Prasanna J. Kolte Ashoka Trust for Research in Ecology and the Environment, Bangalore E-Forests Information Systems for.
The EDIT Platform for Cybertaxonomy as an information broker in name infrastructures Andreas Kohlbecker 1, Yde de Jong 2, Cherian Mathew 1, Lorna Morris.
BIOPAMA Regional Reference Information System and the Digital Observatory of Protected Areas Steve Peedell European Commission Joint Research Centre BIOPAMA.
Species Banks a GBIF mechanism to provide electronic access to quality species information Peter H. Schalk, Marc Brugman ETI, University of Amsterdam Tinde.
U.S. Department of the Interior U.S. Geological Survey Biodiversity Information Serving Our Nation (BISON): A National Resource for Species Occurrence.
Richard White Biodiversity Data. Outline Biodiversity: what is it? – Definitions: is biodiversity: A resource? Something which can be measured? How to.
The Encyclopedia of Life: A Web Site for Every Species James Edwards Executive Director, EOL Barcode of Life Conference Taipei 20 September 2007.
Scratchpads Publication Module - A paradigm shift in publishing RBG Kew, Seminar,
Indexing the Species Names of the World - for the World Frank Bisby (Species 2000), Michael Ruggiero (ITIS) Per de Place Bjørn (GBIF - ECAT)
Internet Gateway for Delivering Biodiversity Data ESRI User Conference July 2005.
The variety of living organisms
1 Enhancing Organism Based Disease Knowledge Using Biological Taxonomy, and Environmental Ontologies Ken Baclawski Northeastern University Neil Sarkar.
OBIS Portal Architecture Concepts plus potential for utilization as a basis for Regional OBIS Nodes Tony Rees, CSIRO Marine Research, Hobart (and OBIS.
Animal Species Database of China JI, Li-Qiang Institute of Zoology, CAS Beijing, China CODATA, 2006, Beijing.
The National Park Service's Information Management Strategy, Infrastructure, and Software Applications.
Semantic Species Taxonomy Shima Izadpanahi CMPE583 Term project.
Online Data Flanders Marine Data & Information Centre InnovOcean site SeadataNet Annual Meeting, Madrid 2009.
Citizen Science and the India Biodiversity Portal Thomas Vattakaven, Strand Life Sciences, Bangalore, India.
GLOBAL BIODIVERSITY INFORMATION FACILITY Cataloging and using Taxonomic Data The Global Names Architecture David Remsen Senior Programme Officer, ECAT.
[] Where Did Those GBIF Occurrences Come From? Providing Digital Access to NatureServe's Reference Database: Report on a Project in the Early Stages of.
GLOBAL BIODIVERSITY INFORMATION FACILITY ECAT Programme Update David Remsen & Markus Döring.
Standards and tools for publishing biodiversity data Yu-Huang Wang June 25, 2012.
A Biodiversity Content Management System for Research, Education, and Outreach Cynthia Sims Parr University of Maryland, College Park Co-authors Roger.
Global Working Checklist of Compositae A TICA Project Seed Funded by GBIF ECAT.
Scratchpads The virtual research environment for biodiversity data Simon Rycroft, Dave Roberts, Vince Smith, Alice Heaton, Katherine Bouton, Laurence Livermore,
Encyclopedia of Life Established May 2007 First version of portal went online Feb year goals –Assemble infinitely expandable web pages for all.
Christina Flann Species 2000 October 2014 Catalogue of Life Indexing The World’s Known Species Connecting the taxonomic community and the names infrastructure.
1 Advanced Semantic Technologies Prof. Deborah McGuinness and Dr. Patrice Seyed CSCI CSCI ITWS ITWS TA: Justin.
GBIF Mid Term Meetings 2011 Biodiversity Data Portals for GBIF Participants: The NPT Global Biodiversity Information Facility (GBIF) 3 rd May 2011.
An Introduction to Scratchpads: Making your data work for you Laurence Livermore Natural History Museum, London Joinville, Brazil.
The CSA/NBII Biocomplexity Thesaurus: Current Initiatives, Future Directions CENDI Terminologies Workshop Washington, DC 16 September 2004 Lisa Zolly NBII.
Global Biodiversity Information Facility GLOBAL BIODIVERSITY INFORMATION FACILITY Meredith A. Lane CODATA/ERPANET Workshop: Scientific Data Selection &
Plankton Web Application Project for AIP-7 By Lawrence E. McGovern, DSC International Council on System Engineering/WYLE Aerospace.
Taxonomic & data-quality challenges of large-scale citizen science: Examples from Marshall J. Iliff, eBird Project Leader.
NeMys: an evolving biological information system, a state of art Deprez, Tim (UGent) Vincx, Magda (UGent) Vanden Berghe, Edward (VLIZ) Mees, Jan (VLIZ)
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Taxonomic verification: Species 2000 and the Catalogue of Life Frank Bisby.
Coreoidea Species File Online Laurence Livermore 5 th IHS Quadrennial Meeting – July 2014 Lessons Learned in Creating a Comprehensive Taxonomic Inventory.
Progress Alastair Culham. i4Life – the BIG aim To move Catalogue of Life from a research project to a sustainable service 1.To enhance the content 2.To.
CAAB - Codes for Australian Aquatic Biota Tony Rees Divisional Data Centre CSIRO Marine Research, Hobart
The New GBIF Data Portal Web Services and Tools Donald Hobern GBIF Deputy Director for Informatics October 2006.
Darrell Siebert The MOA Programme: Did we really do that?
GBIFS Seminar with the Science Committee and the Nodes Strategy Group Analysis of the content published by the GBIF network – Better understanding what’s.
GBIF - ECAT  Electronic Catalogue of Names of Known Organisms  Program Officer;  Per de Place Bjørn 
GLOBAL BIODIVERSITY INFORMATION FACILITY David Remsen Senior Programme Officer, ECAT 3 Oct th Nodes Meeting.
Centre for Environmental Data and Recording - CEDaR Established in 1995 to collect, collate and disseminate all biodiversity and geodiversity records for.
Charles Copp, Neil Caithness & Richard White.  Evaluation, selection and acquisition of existing thesauri  Thesaurus modelling - logical and physical.
GBIF Governing Board 20 Module 6B: New GBIF Tools II 2013 Portal and NPT Startup Daniel Amariles IT Leader, National Biodiversity Information System of.
African Register of Marine Species AfReMas Leen Vandepitte On behalf of WoRMS data management team.
Inspiring and Engaging the Public Towards a Shared Understanding and Sense of Ownership of Freshwater Ecosystems A. Mauroner a, I.J. Harrison ab, & M.
 1) Linnaean system of groups  2) Cladistics Chapter 17.1, 17.2.
EASIN European Alien Species Information Network GBIF
Flanders Marine Institute (VLIZ)
Brief introduction to the project
GBIF Strategic Plan Alberto González-Talaván
Big Data Needs Little CRUD:
Presentation transcript:

A curation interface for reconciliation of species names for India. Thomas Vattakaven and R. Prabhakar, India Biodiversity Portal, Strand Life Sciences, Bangalore, India

Aims to aggregate information on the biodiversity of India and make it openly accessible to all. All data is put out under Creative Commons Licences.

Information modules Species pages – descriptive content on species (Crowd sourced from verified users) Observations – species sighting records with media- (Crowd sourced – Citizen science) Lists – species records from a locality (Crowd sourced – Citizen science) Maps – map layers containing ecological information. Documents – publications on a species. (Crowd sourced)

India species name lists There is no definitive name list for all the species in India There is scattered information for Indian species across different global and regional databases. None are comprehensive. Both ZSI and BSI do not provide a complete name list There is a need for a comprehensive name resolution service to resolve all Indian names to create a species name list for India’s biodiversity.

Compilation of original species list for IBP Database species name-list

Species Pages Observations Species listsMap layers Documents names Database species name-list

Name resolution Accepted Name Synonym Common name Misspelt name Resolve all available scientific names against a single reference taxonomy

What properties of a name do we need? Accepted nameSynonymCommon name Rank Name Status AuthorString Classification Kingdom Phylum Class Order Family Genus Species Accepted name Synonym, References Language Transliteration

How do we do it? The names need to be reorganized based on some sort of consensus taxonomy. 100,000 names (scientific and common) spread across different taxa and little or no taxonomic resources. A massive one-time exercise will not do, new names will continually feed in to the portal. Can we feed off an existing service that already handles these issues and adapt it to our needs?

The Catalogue of Life is the most comprehensive and authoritative global index of species currently available. It consists of a single integrated species checklist and taxonomic hierarchy. From many databases - (142 databases with information on 1,583,924 species, 146,175 infra specific taxa and also includes 1,285,745 synonyms and 390,258 common names) many databases CoL contains substantial contributions of taxonomic expertise from more than fifty organizations around the world, integrated into a single work by the ongoing work of the CoLP partners. It has a dynamic list (constantly evolving) and an annual list that is published and archived and can be referenced.

However, CoL does not have all Indian species. In the Indian context, classification systems for certain groups may be more recent/relevant and we need flexibility to choose such a classification system. eg: butterflies of India. CoL provides a dynamically updating taxonomic list that covers all taxa and resolves all names along a consensus taxonomy It is accepted and used by other major global initiatives

Mammals Arthropods Fishes Plants Amphibians Birds Coleoptera Butterflies IBP additions not matched on CoL but curated by curators Aves CoL Catalogue CoL-IBP Hybrid Catalogue Additions and Substitutions

Clean list CoL Ubio GBIF TNRS EoL Name No match Match Curation interface User input 1 O Reference2 O Reference3 O Reference Non-editableEditable Working list Curators (group specific) Any User AutomatedManual Namelist for India which is also the taxonomic backbone of the portal Master Curator (group specific) Dirty List Dirty List

Demo version of the name curation interface

Acknowledgments Portal Team Support

Contributors