The XML mark up process from the viewpoint of a biodiversity publisher Lyubomir Penev, Donat Agosti, Teodor Georgiev, Terry Catapano, Vladimir Blagoderov,

Slides:



Advertisements
Similar presentations
Incentivising Biodiversity Data Publishing: GBIF-Pensoft Partnership Vishwas Chavan 1, Lyubomir Penev 2,3, Teodor Georgiev 3 1 Global Biodiversity Information.
Advertisements

How to publish genomic Data papers based on BOL data - Biodiversity Data Journal Lyubomir Penev Bulgarian Academy of Sciences & Pensoft Publishers ViBRANT.
Don’t make me think Biodiversity data publishing made easy Vince Smith, Alice Heaton, Laurence Livermore, Simon Rycroft, Ben Scott & Lyubomir Penev* The.
Pensoft Writing Tool (PWT) Lyubomir Penev ViBRANT Tools for DNA taxonomists, 11 June 2013, Brussles ViBRANT.
Making small data big! The Biodiversity Data Journal (BDJ) Lyubomir Penev, Jordan Biserkov, Teodor Georgiev, Pavel Stoev, David Roberts, Vincent Smith.
A common XML query/response model for automated publication- to-registration pipeline Lyubomir Penev, Jordan Biserkov, Teodor Georgiev, Pavel Stoev Pro-iBiosphere.
Trends in Scientific Publishing Guenther Eichhorn DirectorAbstracting & Indexing Cambridge, MA April 2010.
Publish or perish? Linking Scratchpads and the new Biodiversity Data Journal for streamlining publication of botanical data D.N Koureas 1, L. Penev 2 &
Making small data big! The Biodiversity Data Journal (BDJ) Lyubomir Penev, Teodor Georgiev, Pavel Stoev, David Roberts, Vincent Smith ViBRANT.
PubMed Central ANCHASL Spring Meeting April 1, 2005 Robert James Associate Director of Public Services Duke University.
Taxonomic Literature Standards and Synergies TDWG 2006 Anna L. Weitzman & Christopher H. C. Lyal.
OPEN ACCESS Your Publisher of Choice DE GRUYTER OPEN Society-Pays Publishing Program.
Virtual Biodiversity ViBRANT 3 in 1: The Pensoft Writing Tool (PWT) Lyubomir Penev, Pavel Stoev, Teofor Georgiev Pensoft Publishers ViBRANT.
Service activities ViBRANT Project Year 3/Final Review Meeting – Brussels Description & Objectives WP Description WP Objectives WP partners.
Scratchpads Publishing biodiversity: The interplay between Scratchpads and the Biodiversity Data Journal Dr Dimitrios Koureas Biodiversity Informatics.
Implementation of TaxPub, a JATS extension for domain-specific markup in taxonomy: the experience of a biodiversity publisher Lyubomir Penev, Terry Catapano,
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
Streamlining the registration- to-publication pipeline Lyubomir Penev, Teodor Georgiev, Pavel Stoev Sherborn Meeting, NHM London, 28 Oct 2011 ViBRANT.
Link yourself or perish? PhytoKeys, the next generation journal in systematic botany Lyubomir Penev 1, W. John Kress 2, Sandra Knapp 3, De-Zhu Li 4, Susanne.
GLOBAL BIODIVERSITY INFORMATION FACILITY David Remsen ECAT Program Officer September G A Darwin-Core Archive solution to publishing and.
Open access journals Pensoft Journal Ststem PJS 2.0 Lyubomir Penev Bulgarian Academy of Sciences & Pensoft Publishers ViBRANT ViBRANT Tools for DNA taxonomists,
Making small data big: The Biodiversity Data Journal (BDJ) Lyubomir D. Penev 1,3, Teodor A. Georgiev 3, Pavel E. Stoev 2,3, David M. Roberts 4 & Vincent.
Global & Regional Initiatives on Information Management Eero Mikkola(IUFRO) Joris Siermann (CIFOR) Global Forest.
Managing journals: challenges and opportunities How to get started (with OJS) Jackie Proven.
To be Published for free or to be Read for free: OA publishing from an Easterneuropean perspective Lyubomir Penev Pensoft Publishers, Sofia APE 2011 Berlin.
Scratchpads Publication Module - A paradigm shift in publishing RBG Kew, Seminar,
Collaborative Approach to Open Access: Experience from Bioline International Leslie Chan Associate Director Bioline International University of Toronto.
Virtual Biodiversity ViBRANT Vince Smith & Dave Roberts Natural History Museum, London ViBRANT Virtual Biodiversity.
Virtual Biodiversity ViBRANT Literature Mining and Mark-up ViBRANT’s text processing tools David Morse, The Open University, UK,
At the frontline of publishing in systematic zoology: A presentation of ZooKeys Lyubomir Penev 1, Terry Erwin 2, Jeremy Miller 3 1 Pensoft Publishers,
The Pensoft Journal System and XML-based workflow Lyubomir Penev Life and Literature Conference, Chicago 2011 ViBRANT Virtual Biodversity.
Training course on biodiversity data publishing and fitness-for-use in the GBIF Network, 2011 edition How Darwin Core Archives have changed the landscape.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
Thomson Scientific October 2006 ISI Web of Knowledge Autumn updates.
Publisher’s Perspective: Digitization of print resources, and archiving of digital resources Judy Best, June 13, 2006.
The Global Names Architecture: Integration In Action (NOT “Inaction”) 1.Overview of GNA, GNI & GNUB (15 mins) 2.Questions, Elaborations & Clarifications.
J-STAGE, NOW NEXT STAGE large scale scholarly e-journal platform of Japan.
The DiVA System: Current Status and Ongoing Development Uwe Klosa Electronic Publishing Centre, Uppsala University, Sweden Eva Müller.
A paradigm shift in biodiversity publishing: mobilization, mark up, reuse and integration of small data Lyubomir D. Penev 1,3, Teodor A. Georgiev 3, Pavel.
Biodiversity Data Journal: mobilization, reuse and integration of small data Lyubomir D. Penev 1,3, Teodor A. Georgiev 3, Pavel E. Stoev 2,3, Jordan Bisserkov.
A Biodiversity Content Management System for Research, Education, and Outreach Cynthia Sims Parr University of Maryland, College Park Co-authors Roger.
Resolving the publishing bottleneck and increasing data interoperability in biodiversity science Lyubomir Penev, Teodor Georgiev, Pavel Stoev, David Roberts,
TDWG 2006 Conference, St Louis Digitizing the legacy literature of biodiversity An introduction to the Biodiversity Heritage Library (BHL) Neil Thomson.
Scratchpads The virtual research environment for biodiversity data Simon Rycroft, Dave Roberts, Vince Smith, Alice Heaton, Katherine Bouton, Laurence Livermore,
Scientific Data and Electronic Publishing Renze Brandsma, Head, Digital Production Centre University of Amsterdam Maarten Hoogerwerf, Project Manager,
TaxonX : A mark-up schema and approach for systematics literature American Museum of Natural History and University of Karlsruhe in collaboration with.
Jeremy Miller 1,2, Donat Agosti 2,3, Guido Sautter 2, Terry Catapano 2,4, David King 5, Serrano Pereira 1, Rutger Vos 1, Soraya Sierra 1 Unlocking the.
Experts Workshop on the IPT, v. 2, Copenhagen, Denmark The Pathway to the Integrated Publishing Toolkit version 2 Tim Robertson Systems Architect Global.
The Future of Informatics in Digital Literature – or Literature and it’s (Digital) Future Donat Agosti and Terrance Catapano Plazi TDWG, Woods Hole, September.
The PLAZI Markup System Donat Agosti Terry Catapano Robert “Bob“ Morris Guido Sautter Universität Karlsruhe (TH) Research University – founded 1825.
Don’t make me think Biodiversity Data Publishing Made Easy Laurence Livermore, Vince Smith, Alice Heaton, Simon Rycroft, Ed Baker, Ben Scott & Lyubomir.
Scratchpads and the new Biodiversity Data Journal Biodiversity Data Publishing made… easier Dimitris Koureas Natural History Museum London.
Incentives for Biodiversity Data Publishing June 2011.
Literature & interoperability: a working example using ants Donat Agosti, Terry Catapano, Guido Sautter, Christiana Klingenberg & Christie Stephenson TDWG.
The New GBIF Data Portal Web Services and Tools Donald Hobern GBIF Deputy Director for Informatics October 2006.
DE GRUYTER OPEN PUBLISHING PROPOSAL 2January 2016Publishing Proposal.
Plazi: Prospects for Markup of Legacy and New Taxonomic Literature Terry Catapano TDWG Fremantle, WA October 21, 2008.
Scratchpads An online platform for biodiversity data Laurence Livermore Biodiversity Informatics | Department of Life Sciences Natural History Museum London.
GLOBAL BIODIVERSITY INFORMATION FACILITY David Remsen Senior Programme Officer, ECAT 3 Oct th Nodes Meeting.
Coordination and Policy Development in Preparation for a European Open Biodiversity Knowledge Management System Supported by the European Commission through.
Coordination and Policy Development in Preparation for a European Open Biodiversity Knowledge Management System Supported by the European Commission through.
GB22 TRAINING EVENT FOR NODES – 4 OCTOBER 2015 Session 02: 2015 Data Publishing Landscape Laura Russell.
ZooBank: Scope of Registry
Gain Global Exposure: Partner with EBSCO to Promote your Scholarship
International Congress of Entomology, Orlando
Flanders Marine Institute (VLIZ)
Data publishing from the viewpoint of a biodiversity publisher
VI-SEEM Data Repository
GLOBAL BIODIVERSITY INFORMATION FACILITY
Publishing and Mark-up of Collection Data
Presentation transcript:

The XML mark up process from the viewpoint of a biodiversity publisher Lyubomir Penev, Donat Agosti, Teodor Georgiev, Terry Catapano, Vladimir Blagoderov, David Roberts, Vincent S. Smith, Norman F. Johnson, Guido Sautter, Robert A. Morris, Vishwas Chavan, Tim Robertson, Pavel Stoev, Jeremy Miller, Sandra Knapp, Cynthia Parr, W. John Kress, Terry Erwin

The four stages of the XML- based editorial workflow S UBMISSION – tagged or non-tagged manuscripts? S UBMISSION – tagged or non-tagged manuscripts? PEER-REVIEW/EDITORIAL – the technical challenges of the mark up process PEER-REVIEW/EDITORIAL – the technical challenges of the mark up process PUBLICATION – different publishing formats and to whom they are addressed? PUBLICATION – different publishing formats and to whom they are addressed? DISSEMINATION and USE – Link yourself or perish! DISSEMINATION and USE – Link yourself or perish!

Quick facts about ZooKeys Launched on 4 th of July 2008; published 60 issues and >11,000 pages until now, 4,400 registered users; The first mandatory Open Access journal in taxonomy ZooKeys registers all new taxa in ZooBank (mandatory from the 1st issue); all new taxa descriptions are supplied through XML to EOL; all taxon treatments are supplied to Plazi; Wikispecies registers all new taxa as well Since July 2010, ZooKeys implements XML, TaxPub- based editorial wokflow; the journals partners with GBIF, EOL, BHL, NLM, NHM, Plazi and others in various innovative publishing and dissemination projects CrossRef member, ISI ans Scopus covered, indexed in Zoological Record, DOAJ, CABI Abstracts, Google Scholar; approved for archiving in PubMedCentral Impact Factor in the 3rd year of existence

Semantic tagging: What we have currently at disposal? Plazi’s TaxonX (mainly for legacy literature) and NLM TaxPub (for prospective publishing) as published working XML schemas; TaXMLit is being developed as a promised TDWG standard Plazi’s TaxonX (mainly for legacy literature) and NLM TaxPub (for prospective publishing) as published working XML schemas; TaXMLit is being developed as a promised TDWG standard Domain-specific Mark up tools: Golden Gate (TaxonX), and PMT (Pensoft Mark Up Tool) (based on NLM TaxPub, any other schema) Domain-specific Mark up tools: Golden Gate (TaxonX), and PMT (Pensoft Mark Up Tool) (based on NLM TaxPub, any other schema) TDWG standards and vocabularies approved (DarwinCore) or being under discussion (TaxonConcept, SPM, TaXMLit, etc.) TDWG standards and vocabularies approved (DarwinCore) or being under discussion (TaxonConcept, SPM, TaXMLit, etc.) RDF/XML and OWL links to external resources (ontologies), LSIDs, GUIDSs, etc., quickly gaining popularity and acceptance! RDF/XML and OWL links to external resources (ontologies), LSIDs, GUIDSs, etc., quickly gaining popularity and acceptance! ECAT, GNA, GNI, GNUB, promise an exciting infrastructure for taxon names at global level ECAT, GNA, GNI, GNUB, promise an exciting infrastructure for taxon names at global level BHL CiteBank promises a similarly exciting infrastructure for biodiversity literature references BHL CiteBank promises a similarly exciting infrastructure for biodiversity literature references Semantic Web enhancements to taxonomic papers promote connections; Data publication seems to soon become an indispensable part of taxonomic papers Semantic Web enhancements to taxonomic papers promote connections; Data publication seems to soon become an indispensable part of taxonomic papers Dissemination of published results becomes at least as important as the publication itself! Dissemination of published results becomes at least as important as the publication itself!

bn Four stages of an XML-based publication and dissemination work flow Scratchpads and Lifedesk-generated manuscripts GBIF IPT-generated manuscripts from metadata descriptions PENSOFT MARK UP TOOL (PMT), InDesign layout Manuscripts generated from authors’ databases Marked up final publication in PDF, HTML and XML formats Post-publication mark up of legacy literature Manuscripts marked up with MS Word & Open Office plugins Upfront pre-submission mark up (tagged manuscripts) Non-tagged manuscripts Legacy publications PDF, HTML, OCR Scans PLAZI’s GOLDEN GATE (GG) Marked up publication and treatments in HTML and XML formats Mark up integrated simultaneously with the peer-review, editorial and publication process ISI, Zoological Record Indexing (GBIF, GNA, etc.) Aggregators (EOL, WIKI, etc.) Dissemination, archiving, indexing, harvesting PubMedCentral & other archives

Pensoft Mark Up Tool (PMT) work flow

But why to mark up? Who will be using that? What does it give more than the usual PDF?

The four publication formats and their targets Print (high-resolution, full-color, identical to the PDF): to provide paper archiving and to satisfy the requirements of the Biological Codes Print (high-resolution, full-color, identical to the PDF): to provide paper archiving and to satisfy the requirements of the Biological Codes PDF: electronic copy of the printed version; for e-archiving (personal and institutional libraries; BHL) easy to read, browse and search PDF: electronic copy of the printed version; for e-archiving (personal and institutional libraries; BHL) easy to read, browse and search HTML: addressed to individual users to provide interactive reading and semantic enhancements; saves time and efforts to the readers through cross-referencing, Web harvesting, linking to external resources, etc.; HTML: addressed to individual users to provide interactive reading and semantic enhancements; saves time and efforts to the readers through cross-referencing, Web harvesting, linking to external resources, etc.; XML: to provide a format for archiving, data mining, data use/re-use to institutions (repositories, e-archives, aggregators, indexers, taxon-oriented web platforms, etc.) XML: to provide a format for archiving, data mining, data use/re-use to institutions (repositories, e-archives, aggregators, indexers, taxon-oriented web platforms, etc.)

Semantic enhancements to published texts

The occurrence dataset in Google Earth

Automated dissemination of published contents to GBIF, EOL, Plazi, etc.

Archiving in PubMedCentral (as TaxPub XML, PDF and separate images files)

The lessons learned The main difficulties are caused by: The specificity of the domain (e.g., taxon names, synonyms, instability of nomenclature, lack of global LSID infrastructure, etc.) Mark up of occurrence data (certainly a great challenge) Cost efficiency Sociological barriers: the majority of authors are not willing to change their writing habits; most are still not aware about the tremendous advantages of the Web 2.0 technologies Most small taxonomy publishers (and some bigger ones) do not understand the semantic tagging or just cannot afford it Semantic tagging of and semantic enhancements to biodiversity papers are publishers’ care; publishers should better present and disseminate the published contents to the benefit of their authors and society It is not easy, but it is exciting!

In 2010, Pensoft is committed to: Participate in all stages of extension, testing and implementation of the NLM TaxPub schema Extend the list of semantic enhancements and links to external resources with some 30 % Implement all these practices routinely also in botany and ecology through: