Accessing treasure on lands and peoples Peter Burnhill Director, EDINA, University of Edinburgh.

Presentation transcript:

Accessing treasure on lands and peoples Peter Burnhill Director, EDINA, University of Edinburgh

Inspired by a Keynote remark by Professor Gillies …

Credits: who planned the dive & dived the wreck
The team within EDINA:
– Des Reid, Senior Software Engineer
– Dimitrios Sferopolous, Software Engineer
– Neil Mayo, Software Engineer
– Jackie Clark, Web Designer
– led by Christine Rees, Head of Bibliographic & Multimedia
And those to whom we all owe lots (in Centre for Research Collections, IS Library & Collections):
– Kirsty Stewart (project manager and archivist)
– Lesley Bryson née Doig (initial project manager)
– Grant Buttars, Deputy University Archivist
– Andrew Wiseman, Researcher, TEI expert
– Donald William Stewart, Senior Project Researcher
– led by Arnott Wilson (University Archivist) & John Scally (University Collections)

A treasure to be unlocked

Digital Library has mixed parentage: a re-mix of the document tradition & the computation tradition
– approaches based on a concern with documents, with signifying records: archives, bibliography, documentation, librarianship, records management, and the like … [Domain knowledge speak]
– approaches based on uses of formal techniques, whether mechanical (such as punch cards and data-processing equipment) or mathematical/computational (as in algorithmic procedures). [Software engineer speak]
Prof. Michael Buckland, Presidential Address, American Society for Information Science, JASISs 50th (1998)
Languages & Perspectives

Heard report on work from the Dive Team
On from the marvels of reading and interpreting the marks on paper of the notebook entries …
… and the meticulous transcription into machine-readable text
… and their tagging using Encoded Archival Description (EAD), with text in XML format*
* mark-up that software can process more easily

Example of the XML EAD data (1)
GB 237 Coll-97/CW114/42
CW
Song about Uamh-an-Oir, accompanying story and notes
1867
Edinburgh University Library, Special Collections
folio 67v, line 17 to folio 68r, line 4
<!--Replace language code if other than English with ISO three letter language code. Add further language tags if necessary-->
Gaelic

Work at the Refactory
The XML files were passed to engineers at EDINA … import script in Perl that parses the XML and constructs the relational structure, with reference to an existing database schema as shown.
The green boxes indicate high-level entities: catalogue entry, its transcript and images.
The pink cat_* boxes are links from catalogue entries to such things as places, people and subjects.
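The import step described on this slide can be sketched roughly as follows. This is a minimal illustration in Python rather than the Perl script the EDINA engineers wrote, and it is not their schema: the table names (`catalogue_entry`, `subject`, `cat_subject`) are hypothetical, chosen only to echo the "cat_*" link tables mentioned above, and the XML fragment is a heavily simplified stand-in that borrows a few EAD element names (`did`, `unitid`, `unittitle`, `unitdate`, `controlaccess`, `subject`) and values from the example slides.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical, simplified EAD-style fragment; the real files are far richer.
SAMPLE_EAD = """
<c level="item">
  <did>
    <unitid>Coll-97/CW114/42</unitid>
    <unittitle>Song about Uamh-an-Oir, accompanying story and notes</unittitle>
    <unitdate>1867</unitdate>
  </did>
  <controlaccess>
    <subject>Caves</subject>
    <subject>Dogs</subject>
  </controlaccess>
</c>
""".strip()

# Assumed schema: one high-level entity table, one authority table,
# and a cat_* table linking the two (all names are illustrative).
SCHEMA = """
CREATE TABLE catalogue_entry (
    id INTEGER PRIMARY KEY, unitid TEXT UNIQUE, title TEXT, date TEXT);
CREATE TABLE subject (id INTEGER PRIMARY KEY, name TEXT UNIQUE);
CREATE TABLE cat_subject (
    entry_id INTEGER REFERENCES catalogue_entry(id),
    subject_id INTEGER REFERENCES subject(id),
    PRIMARY KEY (entry_id, subject_id));
"""


def import_entry(conn, xml_text):
    """Parse one catalogue entry and store it with its subject links."""
    root = ET.fromstring(xml_text)
    did = root.find("did")
    cur = conn.execute(
        "INSERT INTO catalogue_entry (unitid, title, date) VALUES (?, ?, ?)",
        (did.findtext("unitid"), did.findtext("unittitle"),
         did.findtext("unitdate")),
    )
    entry_id = cur.lastrowid
    for node in root.iterfind("controlaccess/subject"):
        name = (node.text or "").strip()
        # Reuse an existing subject row if the term was seen before.
        conn.execute("INSERT OR IGNORE INTO subject (name) VALUES (?)", (name,))
        (subject_id,) = conn.execute(
            "SELECT id FROM subject WHERE name = ?", (name,)).fetchone()
        conn.execute("INSERT OR IGNORE INTO cat_subject VALUES (?, ?)",
                     (entry_id, subject_id))
    return entry_id


def subjects_for(conn, unitid):
    """Follow the link table from a catalogue entry to its subject terms."""
    rows = conn.execute(
        "SELECT s.name FROM subject s "
        "JOIN cat_subject cs ON cs.subject_id = s.id "
        "JOIN catalogue_entry e ON e.id = cs.entry_id "
        "WHERE e.unitid = ? ORDER BY s.name",
        (unitid,),
    )
    return [row[0] for row in rows]


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
import_entry(conn, SAMPLE_EAD)
print(subjects_for(conn, "Coll-97/CW114/42"))  # ['Caves', 'Dogs']
```

Keeping subjects in their own table and joining through a link table means a term such as "Caves" is stored once and can be shared by many catalogue entries, which is the role the slide gives to the cat_* boxes.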

Example of the XML EAD data (2)
<!--Insert controlaccess index terms here if needed-->
Index
Subjects: Caves, Dogs, Hair, Loss (of people or things), Men, Rescues, Waulking songs
People:
| Mor Iain ic Dhòmhnaill Bhàin | fl1867 | Isle of Barra | Inverness-shire
MacNeil | Roderick | c | Ruaraidh an Rùma | crofter | Mingulay

Example of the XML EAD data (3)
English
Alexander Carmichael
Scope and Content
Song about Uamh-an-Oir probably collected from Roderick MacNeil, aged 88, crofter, Miùghlaigh/Mingulay beginning 'Na minn bheaga na minn bheaga/theaga, Dol eir creagan dol sna creag' composed of thirteen lines. Uamh-an-Oir is described as starting at Cliata cliff and going under Barra to Gearragaal east of Orasay [Uamh an Òir, Cliaid, Orasaigh, Barraigh/Isle of Barra]. The story tells how five men went into the cave with dogs but only the dogs returned and they were hairless. 'The smith of Loch an Duin [Loch an Dùin] put out the torches. Great men sent them in against their will.' Carmichael writes a note to himself to see Mor Iain ic Dhonuil Bhain [Mòr Iain ic Dhòmhnaill Bhàin] for the 'oran sith sung here at the luadh...She Knows all about the songs made'. A vocabulary note reads ' "Fiallan fiadhaich" An insect on the brain &c!' Written transversely over the text in ink is 'Transcribed Book No III page 62 A[lexander] C[armichael]'.

Work at the Refactory
This structure is imported into Solr – software used …
– to control searching copies of the text (which have been normalised for more effective searching) …
– and for retrieval of text and images to be rendered on the website
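The idea of searching a normalised copy of the text can be illustrated with a small sketch. It is a plain Python stand-in, not Solr and not the project's actual configuration: the normalisation rules (case folding plus stripping of accents) are an assumption about what "normalised" might involve, and the document identifiers `scope-1` and `scope-2` are hypothetical labels for two snippets taken from the Scope and Content example.

```python
import re
import unicodedata


def normalise(text):
    """Fold case and drop combining accents, so 'Òir' becomes 'oir'."""
    decomposed = unicodedata.normalize("NFD", text)
    stripped = "".join(ch for ch in decomposed
                       if not unicodedata.combining(ch))
    return stripped.casefold()


def tokenise(text):
    """Split the normalised text into word tokens, ignoring punctuation."""
    return re.findall(r"\w+", normalise(text))


def build_index(docs):
    """Map each normalised token to the set of documents containing it."""
    index = {}
    for doc_id, text in docs.items():
        for token in tokenise(text):
            index.setdefault(token, set()).add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every query token."""
    tokens = tokenise(query)
    if not tokens:
        return []
    hits = set.intersection(*(index.get(t, set()) for t in tokens))
    return sorted(hits)


# Snippets from the example record; the ids are illustrative only.
docs = {
    "scope-1": "Uamh an Òir, Cliaid, Orasaigh, Barraigh/Isle of Barra",
    "scope-2": "Carmichael writes a note to himself to see "
               "Mòr Iain ic Dhòmhnaill Bhàin",
}
index = build_index(docs)

print(search(index, "uamh an oir"))  # ['scope-1']
print(search(index, "DHOMHNAILL"))   # ['scope-2']
```

Because queries and stored text pass through the same `normalise` function, a reader who types "oir" without the grave accent still finds "Òir", while the original accented text remains available for display.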

Tobar an Dualchais
6,000 new items now available to search & play
Over 24,000 tracks of oral recordings, including stories, songs, music, poetry and factual information, recorded in Scotland and further afield from the 1930s onwards
HLF funding
Joint project: Sabhal Mòr Ostaig, University of Edinburgh, BBC Scotland, National Trust for Scotland

Early work between EDINA & Special Collections
SCIMSS: Special Collections Index of Manuscripts, 1995/96
– Once an advanced Web service, now retired: Wayback Machine

The web index was created from the Special Collections departmental sets of 180 binders comprising, in alphabetical order, about 54,000 loose-leaf slips containing varied typescript dating from the 1930s.


Early work between EDINA & Special Collections
SCIMSS: Special Collections Index of Manuscripts, 1995/96
– Once an advanced service, now retired
Statistical Accounts of Scotland: the Sinclair 1790 & 1840 statistical reports on parishes of Scotland, 1999/2001
– a service with Editorial Committee chaired by Dr Ann Matheson
NAHSTE / GASHE, 2000/03
– Navigational Aids for the History of Science, Technology and the Environment: collections of archives & manuscripts held in Edinburgh, Glasgow & Heriot-Watt Universities
– Gateway to Archives of Scottish Higher Education: descriptions of archives dating from 1215 to present day
– Gavin Inglis at EDINA created free-text and index-controlled searching facilities based on the underlying structure of the XML-formatted data, with simple navigation between data components and an easy means of updating existing data

Launched in 2003, no subscription fee
Now used by 379 licensed institutions
Several hundred hours of film, across a wide range of subject areas and topics
Collections include:
– Imperial War Museum, Films of Scotland, Royal Mail Film Classics, Digital Himalaya, Culverhouse Classical Music, Logic Lane, Wellcome Film, Biochemical Society, Healthcare Productions, St Georges Medical School Collection, Education & Television Films Ltd, Amber Films, Performance Shakespeare
Followed BUFVC/OU project for metadata, digitisation & rights clearance

3,000 hours of video footage
Collections include: Gaumont Newsreels, News at Ten, ITN News Reports, Channel 4 News, Reuters archives, Roving Report
60,000 news stories + 25,000 ITN programme scripts + unreleased footage
Launched in 2008, no subscription fee; uptake now grown to 344 universities & colleges
Worked with BUFVC, who led the project for metadata, digitisation and rights clearance

Initially launched in 2004
Getty Images to Sept 2010
Digital Images for Education from Oct 2010
Schools service started in 2008 and ran until Sept 2010
Engaging 88 subscribing universities & colleges

1 million image, video and sound resources to discover & use
45 Collections so far
8 Collections so far
British Library Archival Sound Recordings

Future activities?


… a rich ecosystem … from food delivered to a one-time wreck Thank you