LINKED LITERATURE BHL DEVELOPMENTS CITEBANK Chris Freeland Technical Director, BHL.

Presentation transcript:

Biodiversity Heritage Library: BHL Members

Biodiversity Heritage Library: BHL Members: US/UK
- Academy of Natural Sciences (Philadelphia, PA)
- American Museum of Natural History (New York, NY)
- California Academy of Sciences (San Francisco, CA)
- The Field Museum (Chicago, IL)
- Harvard University Botany Libraries (Cambridge, MA)
- Harvard University, Ernst Mayr Library of the Museum of Comparative Zoology (Cambridge, MA)
- Marine Biological Laboratory / Woods Hole Oceanographic Institution (Woods Hole, MA)
- Missouri Botanical Garden (St. Louis, MO)
- Natural History Museum (London, UK)
- The New York Botanical Garden (New York, NY)
- Royal Botanic Gardens, Kew (Richmond, UK)
- Smithsonian Institution Libraries (Washington, DC)

Biodiversity Heritage Library: BHL Members: BHL-Europe
- Museum für Naturkunde - Leibniz-Institut für Evolutions- und Biodiversitätsforschung an der Humboldt-Universität zu Berlin
- Natural History Museum, UK
- Narodni muzeum NMP CZ
- Angewandte Informationstechnik Forschungsgesellschaft mbH
- Freie Universität Berlin FUBBGBM
- Georg-August-Universität Göttingen Stiftung Öffentlichen Rechts
- Naturhistorisches Museum Wien
- Hungarian Natural History Museum
- Museum and Institute of Zoology, Polish Academy of Sciences
- University of Copenhagen
- Stichting Nationaal Natuurhistorisch Museum, Naturalis
- National Botanic Garden of Belgium
- Royal Museum for Central Africa
- Royal Belgian Institute of Natural Sciences
- Bibliothèque nationale de France
- Museum national d’histoire naturelle
- Consejo Superior de Investigaciones Cientificas
- Università degli Studi di Firenze
- Royal Botanic Garden, Edinburgh
- Species 2000
- John Wiley & Sons limited
- Helsingin yliopisto UH-Viikki

Biodiversity Heritage Library: Stats: Now Online
- 15,000 titles
- 40,000 volumes
- 16.4 million pages
- Soon:
  - 34,000 titles
  - 65,000 volumes
  - 24 million pages
Oldest book: Schöffer’s Herbarius, 1484.

Biodiversity Heritage Library: Stats: Usage
- Jan – Sep 2009
  - 266,000 visitors
  - 436,000 visits
  - 2.1 million pageviews
- Daily average
  - 970 visitors / day
  - 1,600 visits / day
  - 7,700 pageviews / day
[Charts: Jan – Sep 2009; Launch to 30 Sep 2009]
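The daily averages on the usage slide can be checked against the period totals. The sketch below assumes the period runs from 1 January through 30 September 2009 inclusive (the slide only says "Jan – Sep 2009"); the unrounded results come out close to the rounded figures on the slide.

```python
from datetime import date

# Totals as stated on the slide for Jan - Sep 2009.
totals = {"visitors": 266_000, "visits": 436_000, "pageviews": 2_100_000}

# Assumption: the period is 1 Jan 2009 through 30 Sep 2009, both days included.
days = (date(2009, 9, 30) - date(2009, 1, 1)).days + 1

# Divide each total by the number of days to get a per-day average.
daily = {name: total / days for name, total in totals.items()}

for name, value in daily.items():
    print(f"{name}: {value:.0f} per day")
```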

Biodiversity Heritage Library: Cloud storage & computing
- DuraCloud Pilot
  - Test of cloud storage & computing with DuraSpace Foundation & New York Public Library
  - Early alpha stage
  - More info:
- In-house solution demo’d by MOBOT & MBL
  - Redundant high-performance storage on commodity hardware
- Cloud computing pub crawl working group tonight

Biodiversity Heritage Library: Global, coordinated development
- New functionality from BHL-Europe
  - Improved deduplication tools
  - Semantic interface
  - OAIS-compliant preservation infrastructure
- Building a community of developers
  - Funded & volunteer
  - RubyBHL:
  - PyBHL:
- New partners, new content

Biodiversity Heritage Library: Open Source Pageturning UI

Biodiversity Heritage Library: Open Software & Development
- BHL Bits:
  - Portal code, utilities, services
- Taxonomic Literature Group
  - Google Group for discussion of “taxonomic literature & the services required to make literature interoperable within biodiversity research and biodiversity informatics.”

Biodiversity Heritage Library: Open Data
- Downloads
  - Simple tab-delimited exports of core data
- Data model
  - DB schema as ERD
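A tab-delimited export with a header row can be read with nothing beyond the Python standard library. This is a minimal sketch: the column names and sample rows are hypothetical, since the slides do not list the actual export files or their fields.

```python
import csv
import io

# Hypothetical sample data; the real exports and their column names
# are not given in the slides.
SAMPLE = (
    "TitleID\tShortTitle\tStartYear\n"
    "1\tExample title A\t1850\n"
    "2\tExample title B\t1901\n"
)

def read_export(handle):
    """Yield one dict per data row of a tab-delimited file with a header line."""
    yield from csv.DictReader(handle, delimiter="\t")

rows = list(read_export(io.StringIO(SAMPLE)))
for row in rows:
    print(row["TitleID"], row["ShortTitle"], row["StartYear"])
```

For a downloaded file, the same function can be given an open file object instead of the in-memory sample, for example `open(path, newline="", encoding="utf-8")`.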

Biodiversity Heritage Library: Services
- Names Service
  - Return all occurrences of a name throughout BHL digitized corpus
  - Documentation:
  - Access to 51 million name strings using TaxonFinder; 1.4 million unique names
  - Working out a strategy for obscure species
  - Algorithm improvements to detect nomenclatural & taxonomic acts
- OpenURL
  - Facilitate links to citations: protologues, articles, references
  - Documentation:
  - Useful to nomenclators, reference systems: IPNI, Tropicos
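The idea behind the Names Service, returning every occurrence of a name across the corpus, can be illustrated with a small inverted index. This is only a sketch of the lookup structure, not BHL's implementation: the page IDs and names are made up, and the name-finding step itself (TaxonFinder in the slides) is assumed to have already produced the per-page name lists.

```python
from collections import defaultdict

def build_name_index(names_by_page):
    """Map each name string to the set of page IDs on which it was found."""
    index = defaultdict(set)
    for page_id, names in names_by_page.items():
        for name in names:
            index[name].add(page_id)
    return index

def occurrences(index, name):
    """Return the sorted page IDs where the name occurs (empty if unknown)."""
    return sorted(index.get(name, ()))

# Hypothetical output of a name-finding pass over three pages.
names_by_page = {
    101: ["Quercus alba", "Acer rubrum"],
    102: ["Quercus alba"],
    103: ["Acer rubrum"],
}

index = build_name_index(names_by_page)
total_strings = sum(len(names) for names in names_by_page.values())
unique_names = len(index)
print(occurrences(index, "Quercus alba"), total_strings, unique_names)
```

The two counts mirror the distinction on the slide between name strings found (every occurrence) and unique names (distinct keys in the index).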

Biodiversity Heritage Library: Services: OpenURL
pid=title:3934&volume=14&issue=&spage=301&date=1879
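The query string on this slide can be assembled from citation fields as sketched below. The parameter names (pid, volume, issue, spage, date) are taken from the slide; the resolver base URL is a placeholder because the transcript does not preserve it, and the function name is hypothetical.

```python
from urllib.parse import urlencode

def build_openurl(base_url, title_id, volume="", issue="", spage="", date=""):
    """Build a resolver query in the parameter order shown on the slide."""
    params = [
        ("pid", f"title:{title_id}"),
        ("volume", volume),
        ("issue", issue),
        ("spage", spage),
        ("date", date),
    ]
    # safe=":" leaves the colon in "title:3934" unescaped, as on the slide.
    return f"{base_url}?{urlencode(params, safe=':')}"

# Placeholder base URL (assumption); substitute the real resolver address.
RESOLVER = "https://resolver.example/openurl"

url = build_openurl(RESOLVER, 3934, volume=14, spage=301, date=1879)
print(url)
```

Empty fields such as `issue` are still sent as `issue=`, matching the example on the slide.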

Biodiversity Heritage Library: Services: OpenURL Disambiguation
- Looking for:
- BHL returns:

Biodiversity Heritage Library: Services: OpenURL Results

Biodiversity Heritage Library: How?
- Tropicos maintains internal authority list of publications:
- Each protologue/reference tied to authority:
- Matched Tropicos TitleIDs to BHL TitleIDs:
- Throw citations at resolver at regular intervals & cache data in Tropicos
pid=title:3934&volume=14&issue=&spage=301&date=1879
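The workflow on this slide (map Tropicos titles to BHL titles, send citations to the resolver periodically, cache what comes back) might look roughly like the following. Everything here is an illustrative assumption rather than the Tropicos code: the `Citation` fields, the `refresh_cache` function, the Tropicos TitleID 9001, and the stand-in resolver are all hypothetical; only the BHL title ID 3934 and the query parameters come from the slides.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Citation:
    tropicos_title_id: int
    volume: str
    start_page: str
    year: str

def refresh_cache(citations, title_map, resolve, cache):
    """Resolve each not-yet-cached citation and store successful results.

    title_map: Tropicos TitleID -> BHL TitleID (the matched authority lists).
    resolve:   callable taking a query dict and returning a link or None;
               it stands in for a request to the OpenURL resolver.
    cache:     dict updated in place, keyed by Citation.
    """
    for citation in citations:
        if citation in cache:
            continue  # already resolved on an earlier run
        bhl_id = title_map.get(citation.tropicos_title_id)
        if bhl_id is None:
            continue  # title not matched to BHL, nothing to ask
        query = {
            "pid": f"title:{bhl_id}",
            "volume": citation.volume,
            "spage": citation.start_page,
            "date": citation.year,
        }
        link = resolve(query)
        if link is not None:
            cache[citation] = link  # misses are retried on the next run
    return cache

def fake_resolve(query):
    """Stand-in resolver that counts calls instead of using the network."""
    fake_resolve.calls += 1
    return f"resolved:{query['pid']}:{query['spage']}"
fake_resolve.calls = 0

title_map = {9001: 3934}  # hypothetical Tropicos TitleID -> BHL TitleID
citations = [Citation(9001, "14", "301", "1879"), Citation(9002, "2", "10", "1900")]
cache = {}
refresh_cache(citations, title_map, fake_resolve, cache)
refresh_cache(citations, title_map, fake_resolve, cache)  # second run uses the cache
```

Not caching misses is a design choice for this sketch: as more content is scanned, a citation that failed to resolve on one run may succeed on a later one.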

Biodiversity Heritage Library: Encyclopedia of Life
- 522,000 species pages linked to BHL
- #1 referring site

Biodiversity Heritage Library: Other Consumers
- EarthCape Labs
  - Sort/Search capabilities with harvested names
  - YouTube demo:
- BioGUID / iPhylo
  - BHL Name Timeline & Comparison
  - New Viewer
  - Tagging
- So much cool stuff we can’t keep up!

Biodiversity Heritage Library: Crowdsourced Articles
- Demo:

Biodiversity Heritage Library: Crowdsourced Articles
- 12,000 PDFs generated through September 2009
- 4,900 submitted with article metadata
- Analysis:

Biodiversity Heritage Library: Great, but how to…
- display / manage?
- meet community demands for bibliography / citation management?
- build from more open source tools?

Biodiversity Heritage Library: Development goals re: citations
- Create a repository for community-vetted taxonomic bibliographies.
- Ability to ingest, display, download, and index articles so that the BHL can operate as an article repository.
- Build from existing community of work around Drupal / Biblio.
  - In use by collaborators

“something like GenBank or NameBank for citations…”
So, CitationBank…or CiteBank (saves chars)
Need…

Biodiversity Heritage Library: Crowdsourced Articles
- PDFs from BHL pushed into Drupal/Biblio:

Biodiversity Heritage Library: PDF

Biodiversity Heritage Library: CiteBank boundaries
[Diagram labels: Book; Citation; Pageturning UI; PDF; OCR; eBook/Kindle; Stored *somewhere* & retrievable via HTTP URI; Citation; Bibliography; CiteBank]

BHL Data Flow – Sep 2009 CiteBank

Biodiversity Heritage Library: Copyright
- Bold statements that need some good legal counsel:
  - Citations don’t have copyright
    - Unless you get them from OCLC, other services
  - Bibliographies have copyright
    - They’re a scholarly work
  - Underlying content has copyright
    - Except when it doesn’t

Up for discussion…

Biodiversity Heritage Library: Who can upload & edit?
- Trusted repositories?
- Approved specialists?
- BHL Librarians?
- People in this session?
- Citizen scientists?
- 6th graders?
- Rod Page?
Discussion: Session participants thought it important that BHL get as many citations as possible, then find ways of implementing trust mechanisms for users such as iSpot (Drupal module), ratings systems, ways of tagging inappropriate materials.

Biodiversity Heritage Library: What about duplicates?
- 3 bibliographies had Syst. Nat.
  - All 3 in different reference manager formats
  - All 3 had variant forms of title:
    - Syst. Nat.
    - Systema Naturae
    - Systema naturae per regna tria naturae
  - Library catalogues: Caroli Linnaei...Systema naturae per regna tria naturae :secundum classes, ordines, genera, species, cum characteribus, differentiis, synonymis, locis.
Discussion: Important to have all the ways in which materials have been referred to over time, then have algorithms & people aggregate titles/articles (translations) into reconciliation groups, resulting in a master index.
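One simple way an algorithm could start aggregating variant titles into reconciliation groups is to treat two titles as variants when their leading words are abbreviations or prefixes of each other. The sketch below is a deliberately naive illustration under that assumption, not a method described in the session; the function names are hypothetical, and "Species Plantarum" is added only as a contrasting example.

```python
import re

def words(title):
    """Lower-case the title and keep only its alphabetic words."""
    return re.findall(r"[^\W\d_]+", title.lower())

def is_variant(a, b):
    """True if, word by word, one title's leading words abbreviate the other's."""
    wa, wb = words(a), words(b)
    if not wa or not wb:
        return False
    return all(x.startswith(y) or y.startswith(x) for x, y in zip(wa, wb))

def reconciliation_groups(titles):
    """Greedily place each title in the first group where it matches every member."""
    groups = []
    for title in titles:
        for group in groups:
            if all(is_variant(title, member) for member in group):
                group.append(title)
                break
        else:
            groups.append([title])
    return groups

titles = [
    "Syst. Nat.",
    "Systema Naturae",
    "Systema naturae per regna tria naturae",
    "Species Plantarum",
]
groups = reconciliation_groups(titles)
for group in groups:
    print(group)
```

A rule this simple will not catch the library catalogue form, which begins with the author's name ("Caroli Linnaei...") rather than the title words; cases like that are where the people in "algorithms & people" would come in.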

Biodiversity Heritage Library: Accuracy
- How clean is clean?
- How dirty is dirty?
- What’s good enough?
- How to Rank
  - Gold/Platinum?
  - Dirty Bucket/Clean Bucket?
Discussion: Let users decide which is the “right” form for use; may differ from project to project. BHL should take it all in, then refine using our libraries’ collected knowledge + involvement from domain specialists.

Biodiversity Heritage Library: Right technologies?
- “But Drupal’s awful…just ask ___ for their bad experience.”
- “Drupal’s great!”
- “MySQL won’t scale”
- “MySQL’s great!”
Discussion: Drupal has limitations, but a large community of developers & implementers. There may be a “Montpellier Declaration” to centralize efforts within biodiversity informatics around the framework. Drupal/Biblio is a good starting point for CiteBank, needs further evaluation after more data are loaded & site is used.

Biodiversity Heritage Library: Next steps
- Bring hardware online at MBL
  - Have one point of redundancy
  - By Q
- Bring BHL-Europe & other nodes online
  - In conjunction with DuraCloud & other solutions
- Release CiteBank for beta & sandbox testing
  - Beta at
  - Sandbox at
  - Production release by Q
- Integration of BHL-Europe tools & content

Biodiversity Heritage Library: Coming soon
- Darwin’s Library
  - AMNH, NHM, CUL, BHL (MOBOT)
  - Funded by NEH/JISC
  - Digitization of Darwin’s personal library, with annotations
  - New interfaces for recording, indexing, displaying annotations
- In-house scanning from partners/contributors

Biodiversity Heritage Library: Fun: BHL In Your Pocket!
- Content now available in EPUB format
  - Used by Stanza, transferable to Kindle
- Blog post by John Mignault (NYBG):

Biodiversity Heritage Library: Links & such
- Biodiversity Heritage Library
- CiteBank beta
- CiteBank sandbox
Go play!
Follow BHL on

Biodiversity Heritage Library: Thanks!
Chris Freeland
Technical Director, BHL
Director, Center for Biodiversity Informatics, Missouri Botanical Garden
- Presentation online through TDWG & at