Taxonomic Publications: Past and Future
Donat Agosti (AMNH and NHMB), Andrew Polaszek (ICZN), Klemens Böhm and Guido Sautter (Uni Karlsruhe)

Taxonomists at work … (T. E. Lawrence, Seven Pillars of Wisdom: A Triumph, first published for general circulation 1935, p. 535)

The traditional flux of information … a more or less closed system

Successful scientists at work: the group that found the top quark at Fermilab in Chicago in 1998

The staff of The Natural History Museum, London, 1993
Aren't we doing big science too?
More than 6,000 taxonomists worldwide, major institutions (herbaria, natural history museums)

The staff of Entomology at The Natural History Museum, London, 1993
Aren't we doing big science too?
More than 6,000 taxonomists worldwide, major institutions (herbaria, natural history museums)
Hawkmoths, Curculionids, Ants, Psyllids, Chalcids, Bugs

Global Biodiversity
The staff of Entomology at The Natural History Museum, London, 1993
Aren't we doing big science too?
More than 6,000 taxonomists worldwide, major institutions (herbaria, natural history museums)

Aren't we doing big science?
- More than 6,000 taxonomists worldwide, major institutions (herbaria, natural history museums)
- 1.5 million known taxa and about 10 million to go
- More than 2 billion specimens in our collections
- Increasing amounts of DNA sequences and whole genomes
- More than 1,000 journals covering systematics
If all were connected …

Finland: "Structure of the World Wide Web in Finland. Circles denote sites and lines denote connecting links." Courtesy of Bernardo Huberman (HP Labs, Palo Alto), from B. Huberman, The Laws of the Web, Cambridge, MIT Press, 2001

Why aren't we recognized as "big science"?
It has a lot to do with the way we are currently organized: whose data in this room can be accessed right now over the Internet?
But our sheer numbers and knowledge offer the potential to change this situation.
What needs to be done?

What ought to be changed:
Culture, probably the most difficult to change:
- The way we collaborate (the social aspects)
- The way we exchange and provide access to data
- The way we look at the Internet (Semantic Web)

Scanning, PDF conversion (WWW): Electronic revolution? Not yet.

From text document to XML document, or the deconstruction of documents (Taxon-x schema)

[Diagram: Information Retrieval for Biodiversity Information (Guido Sautter)]
- Document Analyzer: analyzes documents (NLP), stores documents, creates indices from the analysis results. Functionality in GATE and plugins (NLP analysis pipeline of pre-plugins and analyzer plugins 1 … n).
- RDBMS: holds the documents and indices 1 … n.
- Retrieval Engine: analyzes queries, uses indices for SE and result improvement, retrieves documents. Functionality in Query Executor and plugins (query pipeline of retrieval plugins 1 … n and measure plugins).
- User: sends queries, receives result documents, and returns questions / feedback.
- Legend: Document / Query, Meta Data, Information.
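The two halves of the diagram (analyze and index documents, then answer queries from the indices) can be illustrated with a very small sketch. This is not the system shown on the slide: the class names `DocumentAnalyzer` and `RetrievalEngine` only borrow the diagram's labels, the "NLP analysis" is reduced to plain word splitting, an in-memory dictionary stands in for the RDBMS, and the sample document texts are invented for illustration.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into word tokens.

    A crude stand-in for the NLP analysis step in the diagram.
    """
    return re.findall(r"[a-z]+", text.lower())


class DocumentAnalyzer:
    """Stores documents and builds an inverted index from their tokens."""

    def __init__(self):
        self.documents = {}             # doc_id -> original text
        self.index = defaultdict(set)   # token -> set of doc_ids

    def add(self, doc_id, text):
        self.documents[doc_id] = text
        for token in tokenize(text):
            self.index[token].add(doc_id)


class RetrievalEngine:
    """Answers queries by intersecting the index entries of the query tokens."""

    def __init__(self, analyzer):
        self.analyzer = analyzer

    def search(self, query):
        tokens = tokenize(query)
        if not tokens:
            return []
        # .get() avoids creating empty index entries for unknown tokens.
        postings = [self.analyzer.index.get(token, set()) for token in tokens]
        return sorted(set.intersection(*postings))


# Invented sample documents, for illustration only.
analyzer = DocumentAnalyzer()
analyzer.add("doc1", "Description of a new ant species from Madagascar")
analyzer.add("doc2", "Revision of the ant genus Pheidole")
analyzer.add("doc3", "Hawkmoths of Madagascar")

engine = RetrievalEngine(analyzer)
print(engine.search("ant Madagascar"))  # ['doc1']
```

A query only matches documents that contain every query word. A real pipeline would add the plugin stages from the diagram, such as linguistic analysis before indexing and ranking measures after retrieval.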

What ought to be changed:
Culture, probably the most difficult to change:
- The way we collaborate (the social aspects)
- The way we exchange and provide access to data
- The way we look at the Internet (Semantic Web)
Access: We face an aggressive publishing industry (and a few colleagues) who, out of commercial interest, disrupt our still free(ish), if slightly anarchic and fledgling, flow of information over the Internet.

Indirect effect, due to huge increases in the cost of serials (Fyffe, 2005)

Direct effect, through enforcement of copyright: access to ant taxonomic publications through antbase.org / Smithsonian Institution currently includes the entire body of non-copyrighted publications since 1758 (more than 4,000 publications or 85,000 pages). Source: Agosti 2005 and antbase.org

What can and should we do to enhance access to our data and knowledge?
A case needs to be made that non-systematists do not want to miss our information:
Make your information accessible
- at small scale: all ant literature is online (4,000 publications)
- at larger scale: Biodiversity Heritage Literature Project (all systematics literature published in the English language), hopefully followed by a European initiative
Provide name servers
- Registration of new names as a prerequisite for making them valid, in exchange for an up-to-date list of all (animal) names (i.e. ZooBank at ICZN), but mainly through a federation of taxon-specific name servers (e.g. Hymenoptera Name Server / antbase) linked together through global tools such as GBIF, ITIS, Species 2000 or uBio
Standard access
- Apply and, if necessary, develop data standards and exchange protocols (e.g. Darwin Core or ABCD, or DiGIR as used at GBIF)
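As a rough illustration of what applying a data standard means in practice, the sketch below writes a specimen record as a CSV table whose column names are Darwin Core terms. The term names (`occurrenceID`, `basisOfRecord`, `institutionCode`, `catalogNumber`, `scientificName`, `country`, `eventDate`) come from Darwin Core. The function name `to_darwin_core_csv`, the choice of columns, and every value in the sample record are assumptions made up for this example.

```python
import csv
import io

# A small, illustrative selection of Darwin Core terms used as column names.
DWC_FIELDS = [
    "occurrenceID",
    "basisOfRecord",
    "institutionCode",
    "catalogNumber",
    "scientificName",
    "country",
    "eventDate",
]


def to_darwin_core_csv(records):
    """Serialize records (dicts keyed by Darwin Core terms) as CSV text.

    Raises ValueError if a record lacks one of the expected terms, so that
    incomplete records are caught before they are shared.
    """
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=DWC_FIELDS, lineterminator="\n")
    writer.writeheader()
    for record in records:
        missing = [field for field in DWC_FIELDS if field not in record]
        if missing:
            raise ValueError(f"record is missing terms: {missing}")
        writer.writerow({field: record[field] for field in DWC_FIELDS})
    return buffer.getvalue()


# Invented sample record, for illustration only.
sample_records = [
    {
        "occurrenceID": "example:ant:0001",
        "basisOfRecord": "PreservedSpecimen",
        "institutionCode": "EXAMPLE",
        "catalogNumber": "0001",
        "scientificName": "Formica rufa",
        "country": "Switzerland",
        "eventDate": "2005-06-01",
    }
]

csv_text = to_darwin_core_csv(sample_records)
print(csv_text)
```

Because every provider uses the same column names, an aggregator can merge such tables from many collections without per-source mapping.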

What can and should we do to enhance access to our data and knowledge?
A case needs to be made that non-systematists do not want to miss our information:
Open flow of information
- Support Open Access, and publish in journals allowing open access
- Adopt the principles of the Conservation Commons, that is, making data and information accessible for science, education and conservation use
- Urge your publishers and societies to warrant open access
BUT: Can descriptions and monographs be copyrighted anyway?
- Descriptions are "factual knowledge", that is, knowledge based on direct observation
- Increasingly, descriptions are machine output from data matrices (e.g. DELTA, Lucid)
As an alternative, why not change the function of a publication from a terminal product to a version control instrument?

From the traditional flux of information … in a more or less closed system …

… to the Future of Publication
[Diagram: publication workflow]
- Manuscript stages shown: analysis & ms preparation; ms submission ("Taxon-x-version"); new ms alert; posting for review; edited ms; revised ms; accepted ms; new taxon alert.
- Publication outputs: pdf; hard copy; publication database ("taxon-x-version").
- Connected resources: ontology, bibliography, ZooBank / NS, Taxon DB, Character DB, Specimen DB, Description DB, Distribution DB, Char. Matrix DB, Phyl. Tree DB, Char-state Im., Specimen Im., Habitat Image, Leg. Publicat.
- Other labels: New Data, feedback.