2006-12-09 The Evolution of Mathematical Communication 1 Using Metadata for the Interlinking of Digitized Mathematics Thomas Fischer State and University.

Slides:



Advertisements
Similar presentations
Why Should Publishers Implement the OpenURL Framework? Andrew K. Pace Head, Systems NCSU Libraries
Advertisements

Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
Distributed Service Registries Workshop, July 2005 Slide 1 NISO Metasearch Initiative Registries Robert Sanderson Dept. of Computer Science University.
UKOLN is supported by: An overview of the OpenURL UKOLN/JIBS OpenURL Meeting London, September 2003 Andy Powell, UKOLN, University of Bath
Andy Powell, Eduserv Foundation July 2006 Repository Roadmap – technical issues.
Andy Powell, Eduserv Foundation Feb 2007 The Dublin Core Abstract Model – a packaging standard?
A centre of expertise in digital information management UKOLN is supported by: The Dublin Core Application Profile for Scholarly Works.
JISC CETIS Metadata and Digital Repository SIG meeting, Manchester 16 April 2007 A Dublin Core Application Profile for Scholarly Works (eprints) ‏ Julie.
International Conference on Dublin Core and Metadata Applications DC-Scholar, 24 th September /10/2014 Scholarly Works Application.
A centre of expertise in digital information management UKOLN is supported by: The Dublin Core Application Profile for Scholarly Works.
Open Repositories 2007 Eprints Application Profile The Eprints Application Profile: a FRBR approach to modelling repository metadata Julie Allinson, UKOLN,
A centre of expertise in digital information management UKOLN is.
A centre of expertise in digital information management UKOLN is.
Developing a Metadata Exchange Format for Mathematical Literature David Ruddy Project Euclid Cornell University Library DML 2010 Paris 7 July 2010.
Journals.
The Library behind the scene How does it work ? The Library behind the scenes 1 JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot.
Images Application Profile meeting 29th October 2007, London Julie Allinson Digital Library Manager Library & Archives, University of York SWAP a Dublin.
A centre of expertise in digital information management UKOLN is supported by: Eprints Application Profile UK Repositories Search Project.
William Y. Arms Corporation for National Research Initiatives March 22, 1999 Object models, overlay journals, and virtual collections.
Challenges for the DL and the Standards to solve them Alan Hopkinson Technical Manager (Library Systems) Learning Resources Middlesex University.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
The Open Archives Initiative Simeon Warner (Cornell University) Open Archives seminar “Facilitating Free and Efficient Scientific.
© 2006 DCMI DC-2006 – International Conference on Dublin Core and Metadata Applications 3-6 October 2006 Thomas Baker Dublin Core Metadata Initiative.
EdReNe Workshop London, 8th – 9th January 2008 Enhancing the LOM application profiles using the DOI AIE – Italian Publishers Association.
OpenURL: Linking LC’s E-Resources Ardie Bausenbach Automated Planning and Liaison Office Library of Congress November 24, 2003.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
OpenURL and Canonical Citation Linking in Classics A Collaborative Project at Cornell between Classics and the University Library Metadata Working Group.
Linking resources Praha, June 2001 Ole Husby, BIBSYS
What is an Open URL? It is a draft National Information Standards Organization standard: NISO Z the OpenURL Framework for Context-Sensitive.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
1 CrossRef - a DOI Implementation for Journal Publishers January 29, 2003 CENDI Workshop.
Lecture Four: Steps 3 and 4 INST 250/4.  Does one look for facts, or opinions, or both when conducting a literature search?  What is the difference.
SWAP FOR DUMMIES. Scholarly Works Application Profile a Dublin Core Application Profile for describing scholarly works (eprints) held in institutional.
Linking Courseware to Library Resources Using OpenURL The Missing Link? CNI April 30, 2003 Oren Beit-Arie Linking Courseware to.
Text linking in the humanities: citing canonical works using OpenURL CNI Spring 2009 Task Force Meeting Eric Rebillard Departments of Classics and History.
The DNER - a national digital library Andy Powell ZIG Meeting, York October 2001 UKOLN, University of Bath UKOLN is funded by Resource:
Linking electronic documents and standardisation of URL’s What can libraries do to enhance dynamic linking and bring related information within a distance.
European Endeavor Users Group Meeting Helsinki, Sept Esa-Pekka Keskitalo, System Analyst Helsinki University Library OpenURL 1.0.
1 Reference Linking in Project Euclid …with some thoughts on the preservation of digital collections. A presentation at the Workshop on Linking and searching.
A centre of expertise in digital information management UKOLN is supported by: FRBR and Metadata Application Profiles Peter Cliff, Research.
1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
Emerging Uses for the OpenURL Framework Ann Apps and Ross MacIntyre MIMAS, The University of Manchester.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
DNER Architecture Andy Powell 6 March 2001 UKOLN, University of Bath UKOLN is funded by Resource: The Council for.
Accessing a national digital library: an architecture for the UK DNER Andy Powell ELAG 2001, Prague 7 June 2001 UKOLN, University of Bath
Metadata for the Web Andy Powell UKOLN University of Bath
Evidence from Metadata INST 734 Doug Oard Module 8.
RDA DAY 1 – part 2 web version 1. 2 When you catalog a “book” in hand: You are working with a FRBR Group 1 Item The bibliographic record you create will.
1 Dublin Core & DCMI – an introduction Some slides are from DCMI Training Resources at:
Linking Resources in the Humanities: Using OpenURL to Cite Canonical Works 2009 DLF Spring Forum David Ruddy Cornell University Library.
Pete Johnston, Eduserv Foundation 16 April 2007 An Introduction to the DCMI Abstract Model JISC.
A centre of expertise in digital information management UKOLN is supported by: Functional Requirements Eprints Application Profile Working.
The Akoma Ntoso Naming Convention Fabio Vitali University of Bologna.
Smart Linking With SFX SFX Training, Intranet Internet range of authorities, technologies A&I e-print FTXT OPAC FTXT A&I Electronic Scholarly Information.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
A centre of expertise in digital information management 10 minute practical guide to the JISC Information Environment (for publishers!)
Networked Information Resources Federated search, link server, e-books.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
Web Services Overview Thomas Hickey. 2 What are Web Services? Machine-to-machine communication Run over standard Web protocols –XML syntax, HTTP packaging.
Session 3 Metadata & Workflow
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Accessing a national digital library: an architecture for the UK DNER
Link Resolver and Knowledge Base in Discovery Services
Metadata - Catalogues and Digitised works
Prepared by Elena Escolano
Attributes and Values Describing Entities.
Ann Arbor, March 19, 2002 Masakazu Suzuki (Kyushu University)
Presentation transcript:

The Evolution of Mathematical Communication 1 Using Metadata for the Interlinking of Digitized Mathematics Thomas Fischer State and University Library Göttingen, Germany

The Evolution of Mathematical Communication 2 Overview The Problem: Search vs. Access The Situation: Players and Communication Suggested Solutions: –Metadata Standards –Resolving Services –Registries

The Evolution of Mathematical Communication 3 Description of the Problem Two kind of “searches”: Trying to find the answer to some question: What are the different differential structures on a 4-manifold? Trying to find an article: Where is “Characteristic numbers of 3-manifolds” by Milnor and Thurston available? The first kind requires well formulated searches against some appropriate databases. The second is in some sense trivial. The latter question is the one this talk is dealing with: If you know what you want, how do you get it?

The Evolution of Mathematical Communication 4 Players: Sources of electronic literature Publishers –Publishers usually produce electronic versions of mathematical journals –Many publishers produce “backfiles”: retrodigitized versions of printed journals Preprint servers –Preprint servers collect electronic versions of mathematical papers since the early 90’s –There are several different of them with no unified structure Digitization centres –There are several (mostly national) initiatives to produce retro- digitized versions of historical and recent mathematics

The Evolution of Mathematical Communication 5 Sources of printed mathematics literature Local libraries –Provide more or less extensive collections of published journals and books –Provide additional services if the desired material is not available Authors –May provide copies of their papers to interested researchers Remote libraries –Provide articles and books through inter-library loan –Some provide remote scanning services, delivering the scanned object to the user’s desktop electronically Publishers and bookstores –Provide options to buy books

The Evolution of Mathematical Communication 6 Researchers’ interest Immediate access, if possible Most authoritative version No or no additional or lowest possible cost High quality of presentation (order of importance may depend on institutional or personal preferences)

The Evolution of Mathematical Communication 7 Searching for Milnor/Thurston: Try Google Try Google Scholar Try SpringerLink or ScienceDirect Zentralblatt Math: Enseign. Math., II. Sér. 23, (1977) [ISSN ] Google: “Enseign. Math” (presented by seals: swiss electronic academic library service, with some problems …)

The Evolution of Mathematical Communication 8 Problems Background idea: World Digital Mathematics Library But now: No unified discovery or access scheme No uniform standards of reference No uniform quality standards

The Evolution of Mathematical Communication 9 Not solved or not solvable? Quality standards for retrodigitization seem to be hard to enforce: Different players: publishers, scientific communities, digitization centres Money involved in quality of scanning and administration of complex metadata Different scopes and long term orientation

The Evolution of Mathematical Communication 10 Available building blocks Review journals (Mathematical Reviews, Zentralblatt MATH) Metadata standards (Dublin Core) Communication protocol (OAI-PMH) Reference standard (OpenURL) Willingness to co-operate

The Evolution of Mathematical Communication 11 Goal: unified access Look up the requested paper in Zentralblatt or MathReviews (they might cover preprints and other ‘gray’ literature for that purpose) Receive a link to a resolving service Obtain the appropriate copy (digital or printed) Necessary: communication network with sufficient precision

The Evolution of Mathematical Communication 12 New developments in the world of metadata Dublin Core: Abstract Model Dublin Core Application Profile: the ePrint Application Profile minidml, a DC-based custom format Dublin Core Simple, enhanced by best practises

The Evolution of Mathematical Communication 13 Dublin Core Abstract Model 1 New: Description sets contain several related descriptions Allows distinct descriptions e.g. for Creator, Journal and Article With this, authority files become easier to build and manage model/ model/

The Evolution of Mathematical Communication 14 Dublin Core Abstract Model 2 A description set is a set of one or more descriptions about one or more resources. A description is made up of one or more statements (about one, and only one, resource) and zero or one resource URI (a URI reference that identifies the resource being described). Each statement instantiates a property/value pair and is made up of a property URI (a URI reference that identifies a property), zero or one value URI (a URI reference that identifies a value of the property), zero or one vocabulary encoding scheme URI (a URI reference that identifies the class of the value) and zero or more value representations of the value.

The Evolution of Mathematical Communication 15 Constructing Smooth Loop Spaces Differential Geometry; Algebraic Topology Andrew Stacey University of Sheffield

The Evolution of Mathematical Communication 16 Eprints Application Profile 1 Eprint: a scientific or scholarly research text (as defined by the Budapest Open Access Initiative) a DC Application Profile for describing an eprint Each description set describes only one eprint (i.e. one ScholarlyWork entity). Extends the Dublin Core set by numerous additional fields related to scholarly work Uses concept of related description for entities that allow a distinctive separate description, e.g. Creator, Funder, Affiliated Institution Incorporates parts of the “Functional Requirements for Bibliographic Records” (FRBR), in particular the distinction between Work, Expression, Manifestation and Item. _Profilehttp:// _Profile

The Evolution of Mathematical Communication 17 Why FRBR? FRBR disentangles some problems with bibliographic references: The Work is the abstraction of the original product, the ideas. An Expression is a version of this, e.g. the original one, a translation, the third revision… A Manifestation is a preprint or a printed and/or digital version in a journal or book. An Item (copy) is the actual physical object: the article in the journal on the shelf, the specific digital copy on a particular server. Different properties refer to different levels, think e.g. of a title and a title of the translation, page numbers, publisher, URL…

The Evolution of Mathematical Communication 18 Eprints Application Profile 2 Provides as comprehensive a format for describing a scholarly work as desired Is well adapted to electronic sources, not based on printed matter Can be mapped to Dublin Core simple (with some losses) Probably the optimal data format available, but not easy to implement

The Evolution of Mathematical Communication 19 The minidml format A DC-based metadata format enriched by the separation of some elements Beyond DC simple, minidml provides references: –different identifier schemes, e.g. – Ann. Inst. Fourier 1, 1-4 (1949) –plus separate subfields for citation – giving MR and Zbl numbers With 20+ elements plus some schemes the most relevant information on a scholarly article can be captured

The Evolution of Mathematical Communication 20 DC Simple enriched by “Best Practices” 1 “Recommended Practice for Creating Unqualified Dublin Core Records, for Mathematical Literature” (unwieldy title!) in preparation by –Thierry Bouche, NUMDAM (Numérisation de documents anciens mathématiques) –Thomas Fischer, Staats- und Universitätsbibliothek (SUB), Göttingen –Claude Goutorbe, Cellule MathDoc, Grenoble –David Ruddy, Project Euclid, Cornell University Library Based on experience with and analysis of different OAI metadata schemes used by digitization centres Goal: provide rules for DC simple that make the OAI metadata most useful

The Evolution of Mathematical Communication 21 DC Simple enriched by “Best Practices” 2 Some suggestions: Use UTF-8 for special characters outside of mathematical formulas, not the TeX encoding Use prefixes to clarify the meaning of data, e.g. “isbn:”, “msc:”, “bibliographicCitation:” Use last name first rule for names; use only one version of the name for each author Link to reference journals in the element, using appropriate prefix, e.g. “mr:”, “zbl:”, “jfm:” Give full bibliographic citation information in identifier field

The Evolution of Mathematical Communication 22 Bibliographic Citation and OpenURL OpenURL: NISO standard Z39-88(2004) Basically of the form (without the line break): ctx_ver=Z &rft_val_fmt=info:ofi/fmt:kev:mtx:journal& = & = &...& = Essentially a (search) request with standardized fields Used for –Encoding Bibliographic Citation Information in Dublin Core Metadata ( –ContextObject in SPAN (COinS): Embedding Citation Metadata in HTML ( –SFX: context-sensitive link server from Ex Libris ( –crossRef: an infrastructure for linking citations across publishers (

The Evolution of Mathematical Communication 23 Build a registry! OAI Data providers: Provide data with sufficient information, using a full format like the ePrint application profile or enriched format like minidml and modify the required DC Simple format according to the recommendations OAI Service providers: Collect data from the available sources and provide an OpenURL service to retrieve the documents Review journals: Enrich the review data with an appropriate OpenURL, directed to a resolver at one of the registries

The Evolution of Mathematical Communication 24 A possible communication scheme

The Evolution of Mathematical Communication 25 Some basic tasks ahead: Digitization centres: Get the pages straight and the page numbers right Get full and correct data OAI Service providers: Collect data and analyze and organize appropriately, in particular match different versions of the same article Review journals: Unify the references to journals

The Evolution of Mathematical Communication 26 Some references Andy Powell, Mikael Nilsson, Ambjörn Naeve, Pete Johnston: Dublin Core Abstract Model ( Pete Johnston, Andy Powell: Expressing Dublin Core metadata using XML ( ) ( Digital Library Federation and the National Science Digital Library: Best Practices for Shareable Metadata (August 2005) ( es.doc) es.doc JISC, UKOLN, cetis: EPrint Application Profile (September 2006) ( _Profile) _Profile ViFa Math /VLib Math: (English version in preparation) Digitization Registry: (recommendations for OAI data to appear here)

The Evolution of Mathematical Communication 27 Thank you for your attention! Thomas Fischer State and University Library Göttingen, Germany