Hiberlink is funded by the Andrew W. Mellon Foundation Investigating Reference Rot in Web-Based Scholarly Communication Martin Klein Los Alamos National.

Slides:



Advertisements
Similar presentations
A Web-Based Resource Model for eScience: Object Reuse & Exchange 2008 Microsoft eScience Conference Indianapolis, December 8, 2008.
Advertisements

50 Years of Experience in Making Grey Literature Available Matching the Expectations of the Particle Physics Community Carmen ODell.
NATIONAL LIBRARY OF MEDICINE PubMed Central Edwin Sequeira National Library of Medicine May 26, 2004.
Publisher: Name of service: License in place: within Service Type:
Whats Different about the Digital: Community Action via UK LOCKSS Alliance Adam Rusbridge UK LOCKSS Alliance Coordinator EDINA, University of Edinburgh.
UKOLN is supported by: JISC Information Environment update Repositories and Preservation Programme meeting, October 24-25, 2006 Rachel Heery UKOLN
Hiberlink is funded by the Andrew W. Mellon Foundation Investigating Reference Rot in Web-Based Scholarly Communication Martin Klein Los Alamos National.
Its all about sharing Sylvia van Peteghem, Ghent University.
Herbert Van de Sompel OCLC ESR, Evanston, IL, March Archiving the Evolving Scholarly Record: A Perspective Herbert Van de Los Alamos.
Electronic publishing: issues and future trends Anne Bell.
PUBMED CENTRAL (PMC). HOMESCREEN SUBJECT: MEDICAL PMC is a free full-text archive of biomedical and life sciences journal literature at the U.S. National.
PubMed Central Mahyar Ahmadpour-B. Kowsar Publicatin Corp. Kowsar Editorial Meeting 1 September 19th, 2013 Tehran, Iran.
NIH Public Access Policy What it means to OHSU Researchers Presented by: Andrew Hamilton Date: 10/22/2009.
NATIONAL LIBRARY OF MEDICINE PubMed Central Brooke Dine National Library of Medicine Medical Library Association Conference May 2004.
Digital Preservation and Portico: An Overview Eileen Fenton Executive Director, Portico Council on Libraries Dartmouth February 1, 2007.
Where data and journal content collide what does it mean to ‘publish your data’? Peter Burnhill, Muriel Mewissen & Adam Rusbridge EDINA, Information Services.
Object Re-Use and Exchange Mellon Retreat, Nassau Inn, Princeton, NJ, March Herbert Van de Sompel, Carl Lagoze The OAI Object Re-Use & Exchange.
UKOLN is supported by: A non-technical introduction to: OAI-ORE ( Defining Image Access project meeting.
Open Annotation Collaboration Rob Sanderson, Herbert Van de Sompel DMSS Meeting, May 14-15, Stanford, CA Robert Sanderson –
UKOLN is supported by: OAI-ORE : Object Reuse and Exchange an introduction ( UKOLN staff seminar UKOLN,
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
The Open Archives Initiative Simeon Warner (Cornell University) Open Archives seminar “Facilitating Free and Efficient Scientific.
Prototypes of pro-active approaches to support the archiving of web references for scholarly communications Richard Wincewicz 1, Peter Burnhill 1 & Herbert.
NIH Public Access Policy What it means to OHSU Researchers Presented by: Andrew Hamilton Date: 3/18/2007.
Digital | Curation | Centre Reflections on Daedalus:etc End of Phase 2... Chris Rusbridge, Digital Curation Centre Funded by:
1 Archiving and Preserving the Web Kristine Hanna Internet Archive April 2006.
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
DIGITAL MANUSCRIPT INTEROPERABILITY SharedCanvas and IIIF in Practice Benjamin Albritton Digital Manuscript Product
Digital | Curation | Centre The UK Digital Curation Centre Michael Day UKOLN, University of Bath (with thanks to Peter Burnhill, Chris Rusbridge, et al.)
Collaborative Approach to Open Access: Experience from Bioline International Leslie Chan Associate Director Bioline International University of Toronto.
Archival HTTP Redirection Retrieval Policies Temporal Web Analytics Workshop 2013, Rio De Janiro Ahmed AlSum, Michael L. Nelson Old Dominion University.
Can’t I Just Get This Online Somewhere? User issues with electronic journals Sarah Beasley Portland State Univeristy Library.
Interoperability through Library APIs Library Technology Services Open House 7/30/15.
ResourceSync was funded by the Sloan Foundation & JISC A Modular Framework for Web-Based Resource Synchronization Martin Klein Los Alamos National Laboratory.
EBank UK: linking scientific data, scholarly communication and learning Michael Day and Rachel Heery UKOLN, University of Bath
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Collection Management Strategies in a Digital Environment Cecily Johns CMI Project Director August 2001.
Russell McDonald 21st September Discussion Points What is SFX and the OpenURL? Recent developments Business issues associated with SFX –Breakdown.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Kurt Maly Department of Computer Science Old Dominion University Norfolk, Virginia 23529, USA Digital Libraries, OAI and Free Software.
The OAI: overview and historical context OAI Open Meeting – Washington DC – January 23 rd 2001 Herbert Van de Sompel & Carl Lagoze Cornell University --
Hiberlink – Towards Time Travel for the Scholarly Web July 25 th 2013, Indianapolis, IN, USA 1 Hiberlink – Towards Time Travel for the Scholarly Web Martin.
From Document Data Model to XML Trade Documents From Document Data Model to XML Trade Documents Electronic trade documents Document Data Model Supply Chain.
Open access- a funders perspective (or “What we want from institutions”) CRC/RLUK/ARMA/SCONUL meeting 27 th January 2011 Robert Kiley, Head Digital Services,
Mathematics & UHM Library Sara Rutter Spring 2008.
OAI Object Reuse & Exchange: Atom Serialization Nordbib Workshop, September , Stockholm, Sweden OAI-ORE: Atom Serialization The ORE Editors are:
Traditional Distribution Electronic Distribution User Florida Entomologist Issues Reprints FTP.
Open Archives Initiative Gail McMillan Digital Library and Archives, Virginia Tech Society for Scholarly Publishing: June 1, 2000.
OAI-PMH for Resource Harvesting Tutorial OAI4, October 20 th 2005, CERN, Geneva, Switzerland The American Physical Society Project: Standards-based Mirroring.
1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
Hiberlink is funded by the Andrew W. Mellon Foundation The Missing Link Proposal #hiberlink #memento Herbert.
Open Archives Initiative CNI Phoenix December 13, 1999 Dale Flecker, Harvard Carl Lagoze, Cornell John Ober, CDL Don Waters, Mellon.
Carl Lagoze Digital Library Service Registry Workshop Services in a Scholarly Communication Framework.
CNRS Documentation project : CCSD (Center for Direct Scientific Communication ) Htask meeting (Madrid) 06/12/ Lyon Daniel Charnay / Hélène Jamet.
CNI Task Force Meeting April 7, 2008 OAI-ORE Project Briefing David Reynolds Tim DiLauro Sayeed Choudhury Library Digital Programs Sheridan Libraries Johns.
Herbert Van de Sompel OCLC ESR, Washington, DC, December Archiving the Evolving Scholarly Record: A Perspective Herbert Van de
An Application Profile and Prototype Metadata Management System for Licensed Electronic Resources Adam Chandler Information Technology Librarian Central.
World Sustainable Development Web Archive: Preserving and disseminating knowledge for sustainable growth Steve Witt and Lynne Rudasill, University of Illinois.
Mod_oai: Metadata Harvesting for Everyone Michael L. Nelson, Herbert Van de Sompel, Xiaoming Liu, Aravind Elango
Big Data, Little Data, No Data – Who is in Charge of Data Quality? World Data Systems Webinar #9 9 May 2016 Christine L. Borgman Distinguished Professor.
The Multi-Faceted Use of the OAI-PMH in the LANL Repository Written By: Henry, Xiaoming,Patrick Henry, Xiaoming,Patrick and Herbert. Presented By: Shashi.
Reference Rot and E-Theses: Threat and Remedy Hiberlink ETD2014, Leicester UK July 25th 2014 Funded by the Andrew W. Mellon Foundation Peter Burnhill EDINA,
Why We Need Multiple Archives
Digital Education Manager, EDINA
DART: Drivers, Design, Dimensions, Demonstrators and Deliverables
Signposting the Scholarly Web: An Overview
Introduction to Digital Libraries Week 13: Reference Linking & OpenURL
Interoperable Repository Statistics
Welcome to HRA Open Maryrose Franko Executive Director, HRA
Presentation transcript:

Hiberlink is funded by the Andrew W. Mellon Foundation Investigating Reference Rot in Web-Based Scholarly Communication Martin Klein Los Alamos National #hiberlink #memento Herbert Van de Sompel Los Alamos National

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 Hiberlink Team Work Los Alamos National Laboratory: Research Library: Martin Klein, Herbert Van de Sompel, Harihar Shankar University of Edinburgh: Edina: Peter Burnhill, Neil Mayo, Muriel Mewissen, Christine Rees, Tim Stickland, Riachard Wincewicz Language Technology Group: Beatrice Alex, Claire Grover, Richard Tobin, Ke “Adam” Zhou Funding: Andrew W. Mellon Foundation

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 Reference Rot

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 Link Rot

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 Content Drift

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 Content Drift

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 Content Drift

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 Content Drift

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 The New York Times Cares Links in Supreme Court decisions: -Link rot: 29% -Reference rot: 49.9%

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 These resources: Are not necessarily under the custodianship of parties that care about long term integrity, access Do not necessarily have the same sense of fixity that e.g. journal articles have Links to these resources are subject to Reference Rot: Link Rot: Link stops working, e.g. HTTP 404 Content Drift: Linked content changes over time Entrance Hiberlink

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 !Exist Archived !Archived ExistArchived ExistArchived

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 Articles Increasingly Link to Web Resources URIs extracted from PubMed Central papers

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 Quantifying Reference Rot

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 Study Parameters Time frame of publications: Jan 1997 – Dec 2012 Articles in XML and PDF format Convert PDF to XML URI extraction Challenge: URI broken up by newline; underscore as image Store publication date URI live web test URI archive lookup via Memento infrastructure

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 Link Rot in arXiv

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 arXiv Elsevier PMC

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 Content Drift in arXiv Archived within 14 days of publication

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 arXiv 1 Month14 Days24 Hours Elsevier 1 Month14 Days24 Hours PMC 1 Month 14 Days 24 Hours

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 Solving Reference Rot

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 Link by means of the original URI Augment the link with temporal context aimed at increasing link robustness o Date of linking o URI of archived snapshot(s) 404-No-More collaboration aims at standardizing an approach for HTML o Harvard Law Library (perma.cc) o Harvard Berkman Center for Internet & Security o Los Alamos National Laboratory o Old Dominion University Linking to Archived Resources

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 mset – Augmenting Links

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 mset – Augmenting Links

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 mset – Augmenting Links

Hiberlink - Martin Klein IIPC GA, Paris, France, May 19th 2014 mset – Augmenting Links

Hiberlink is funded by the Andrew W. Mellon Foundation Investigating Reference Rot in Web-Based Scholarly Communication Martin Klein Los Alamos National #hiberlink #memento Herbert Van de Sompel Los Alamos National