Hiberlink – Towards Time Travel for the Scholarly Web. July 25th 2013, Indianapolis, IN, USA.

Presentation transcript:

Slide 1: Hiberlink – Towards Time Travel for the Scholarly Web
July 25th 2013, Indianapolis, IN, USA
Martin Klein, Robert Sanderson, Herbert Van de Sompel
The Hiberlink Project is supported by the Andrew W. Mellon Foundation.

Slide 2: Hiberlink Project and Partners
LANL: Herbert Van de Sompel, Rob Sanderson, Martin Klein
U. Edinburgh: Claire Grover, Beatrix Alex, Richard Tobin, Adam Zhou
EDINA: Peter Burnhill, Christine Rees, Muriel Mewissen, Tim Strickland, Neil Mayo
Two-year project funded by the Andrew W. Mellon Foundation.

Slide 3: Problem Statement
Preservation of formal scholarly output is (relatively) well understood. Preservation of the resources that make up the context for that research is not:
- Datasets
- Software
- Workflows
- Videos, slides
- Project and demonstration web sites
- AJAX
- …

Slide 4: Pilot Study
To what extent are web resources that are referenced from works in repositories still available at their original URL … or from archives of web resources?
Participants: LANL, UNT, arXiv
Paper:
Contributions:
- Much larger scale than any previous study: 162,052 unique URLs
- Automatically searched multiple archives for all URLs, rather than manually for a small subset

Slide 5: Pilot Study: Method
[Figure: processing pipeline. Link steps: Extract Links, Normalize Links, Filter Links. Metadata steps: Extract Metadata, Normalize Metadata. Intermediate records: (URL, Paper, Time, Subject)*. Results: (URL, Time, Memento-Time, Paper, Subject).]
* We filtered broken and intra/inter-repository links.
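The link side of the pipeline on this slide could look roughly like the following. This is a minimal sketch and not the project's actual code: the regular expression, the normalization rules, and the names `extract_links`, `normalize_link`, `filter_links`, and `link_records` are all assumptions made for illustration. "Broken" is read here as "has no usable host", which is also an assumption.

```python
import re
from urllib.parse import urlsplit, urlunsplit

# Assumed pattern: http(s) URLs up to whitespace or a closing delimiter.
URL_PATTERN = re.compile(r"https?://[^\s<>\"')\]]+", re.IGNORECASE)


def extract_links(text):
    """Extract Links: pull candidate URLs out of the paper's text."""
    return URL_PATTERN.findall(text)


def normalize_link(url):
    """Normalize Links: trim trailing punctuation, lowercase scheme and host,
    and drop the fragment (assumed rules, for illustration only)."""
    parts = urlsplit(url.rstrip(".,;:"))
    return urlunsplit(
        (parts.scheme.lower(), parts.netloc.lower(), parts.path or "/", parts.query, "")
    )


def filter_links(urls, repository_hosts):
    """Filter Links: drop URLs without a host, URLs pointing into the
    repositories themselves, and duplicates."""
    seen, kept = set(), []
    for url in urls:
        host = urlsplit(url).hostname or ""
        if not host:
            continue  # treated as a broken link in this sketch
        if any(host == h or host.endswith("." + h) for h in repository_hosts):
            continue  # intra/inter-repository link
        if url not in seen:
            seen.add(url)
            kept.append(url)
    return kept


def link_records(paper_id, published, subject, text, repository_hosts):
    """Build (URL, Paper, Time, Subject) records for one paper."""
    urls = [normalize_link(u) for u in extract_links(text)]
    return [
        (u, paper_id, published, subject)
        for u in filter_links(urls, repository_hosts)
    ]
```

Each record carries the paper's publication time, which a later step can compare against the time of an archived copy.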

Slide 6: Memento
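The slide gives only the protocol's name. As a hedged illustration of the idea, Memento (RFC 7089) lets a client ask for a resource as it existed at a given time by sending an `Accept-Datetime` request header, and archived copies report their capture time in a `Memento-Datetime` response header. The sketch below handles only those header values and does no network access. The helper names are hypothetical, and choosing the capture nearest to the paper's date is an assumption about how a study might use the protocol.

```python
from datetime import datetime, timezone
from email.utils import format_datetime, parsedate_to_datetime


def accept_datetime_header(moment):
    """Build the Accept-Datetime request header for a timezone-aware datetime."""
    return {"Accept-Datetime": format_datetime(moment.astimezone(timezone.utc), usegmt=True)}


def parse_memento_datetime(value):
    """Parse a Memento-Datetime response header value into a datetime."""
    return parsedate_to_datetime(value)


def closest_memento(target, memento_times):
    """Pick the capture time nearest to the target (e.g. the paper's date)."""
    return min(memento_times, key=lambda t: abs(t - target))
```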

Slide 7: Pilot Study: Results
UNT: 72% in archives and/or still exist. High proportion of archived URLs, possibly due to academic level and general disciplines.
arXiv: 78% in archives and/or still exist. 45% still exist, but not archived! Possibly due to high value, but very discipline-specific references.

Slide 8: Hiberlink: Quantify Full Extent of the Problem
To what extent are web resources that are referenced from works in repositories still available at their original URL … or from archives of web resources?
Redo the same experiment with:
- Even larger dataset with millions of papers and URLs
- Text mining processes for URL extraction
- Track location of URL (citations, footnote, text, etc.)
- Evaluation of extraction via gold standard dataset
- Determine type of resource referenced
- Track type of publication (journal, thesis, report, etc.)

Slide 9: Hiberlink: Propose Solutions (1)
We propose two active archiving solutions for resources referenced from scholarly papers, to ensure that the scholarly record remains unbroken.
1. Active Crawling:
- Run extraction routines at repositories, publishers, or third parties via text mining agreements or open access publications
- Feed the URL seed list to existing web crawlers, such as the Internet Archive
- IA (and others) already Memento compliant

Slide 10: Hiberlink: Propose Solutions (2)
2. Transactional Archiving: a willing server forks its responses for resources and sends them both to the browser and to an archive for preservation.
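One way to picture the forking described on this slide is a small WSGI middleware that passes each response through to the client while keeping a copy for an archive. This is only a sketch under stated assumptions: the class name `TransactionalArchiver` and the `archive` callable with signature `(url, status, headers, body)` are hypothetical, and the slide does not describe how the project's own implementation works.

```python
from wsgiref.util import request_uri


class TransactionalArchiver:
    """WSGI middleware sketch: forward the response and copy it to an archive."""

    def __init__(self, app, archive):
        self.app = app
        # Hypothetical callable: archive(url, status, headers, body)
        self.archive = archive

    def __call__(self, environ, start_response):
        captured = {}

        def capturing_start_response(status, headers, exc_info=None):
            # Remember status and headers, then hand them on unchanged.
            captured["status"] = status
            captured["headers"] = list(headers)
            return start_response(status, headers, exc_info)

        chunks = []
        result = self.app(environ, capturing_start_response)
        try:
            for chunk in result:
                chunks.append(chunk)  # copy kept for the archive
                yield chunk           # original goes to the browser
        finally:
            if hasattr(result, "close"):
                result.close()
        # Reached only when the full response was delivered.
        self.archive(
            request_uri(environ),
            captured.get("status"),
            captured.get("headers", []),
            b"".join(chunks),
        )
```

Buffering the whole body in memory keeps the sketch short. A real deployment would more likely stream the copy to the archive and decide which responses are worth preserving.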

Slide 11: Summary
Pilot study showed:
- Significant problem!
- Random archiving by web crawlers is not enough
Hiberlink project will:
- Fully quantify the extent to which web resources that form the context of scholarly output are available and archived
- Propose active solutions to prevent the loss of further resources
- Use Memento for both research and access
