Crowdsourcing manuscript transcription: the Transcribe Bentham project Martin Moyle, Justin Tonra, Valerie Wallace UCL (University College London)

Crowdsourcing manuscript transcription: the Transcribe Bentham project Martin Moyle, Justin Tonra, Valerie Wallace UCL (University College London) LIBER 2010, Aarhus, 29 June – 01 July 2010

Overview
About Transcribe Bentham
The transcription interface
Sourcing crowds
Expected outcomes
Next steps

Transcribe Bentham
A 1-year project (from April 2010) harnessing the power of crowdsourcing to facilitate the transcription of 12,500 Jeremy Bentham manuscripts.
Crowdsourcing: Taking tasks traditionally performed by an employee or contractor, and outsourcing them to a group of people or community, through an "open call" to a large group of people (a crowd) asking for contributions. [Wikipedia]

Project origins
60,000 manuscripts of the philosopher and jurist Jeremy Bentham (1748–1832) held in UCL Library
– Fully catalogued
UCL Bentham Project
– Producing a complete scholarly edition of Bentham
  Began 1959; 26 volumes now published, from a projected 68
– 20,000 Bentham manuscripts previously transcribed
  To varying degrees of quality; no standard markup
The majority of the manuscripts are untranscribed and unstudied

Project aims (1)
Digitise 12,500 previously unread Bentham manuscripts
Create a public transcription interface, with appropriate training tools, enabling crowdsourced TEI-encoded transcription
Promote the project to specific target communities of volunteer transcribers
Retrospectively convert existing transcripts to TEI

Project aims (2)
Develop a web-based ‘Ideas Bank’, based on the transcripts
Carry out log analysis and a user study on public interaction with the project
Roll out a generic TEI transcription tool, for use by other transcription projects and services
Long-term digital curation of digitised MSS and TEI transcripts in the UCL Library Services repository

Project partners
UCL Bentham Project
UCL Centre for Digital Humanities
UCL Library Services
University of London Computing Centre
Arts and Humanities Research Council
Jeremy Bentham
– “present, not voting...”

Project components – overview...

[Diagram: project components. Main blocks: SOURCES, PROJECT WEBSITE, TRANSCRIPTION WIKI, PROJECT EDITORS, DIGITAL REPOSITORY, COLLECTED WORKS. Other labels: Manuscripts, Folio catalogue, Legacy transcripts, Images, Metadata, TEI transcripts, Retro-conversion to TEI, Quality assurance, Training materials, Registration, Discussion forum, Transcription tool, Ideas bank, Blog, Web pages.]

Interface design: some challenges
Transcription is hard!
– Legibility; additions, deletions, marginal notes...
TEI markup is complex for beginners
Quality assurance is expensive, but to demand high quality from volunteers would be unrealistic
Wiki environment may alienate some participants

Technical challenges: steps taken
Help and guidance in different formats (web pages, video tutorials), and aimed at beginners
Users shielded from the underlying complexity
Accurate transcription, even with no markup, is welcomed
– Users can begin to add markup as confidence grows
Site is being user-tested and soft-launched
Digitisation focusing on earlier, more legible MSS

The ‘Transcription Desk’ (beta)

Transcription window

Toolbar... Magnifying viewer

Hiding complexity
TEI Toolbar: Line Break - Paragraph - Addition - Deletion - Unclear Reading - Illegible Text - Note - Underline - Unusual Spelling - Foreign Language - Ampersand - Em Dash - User Comment
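
As an illustration of how a toolbar like this can shield transcribers from raw markup, here is a minimal Python sketch. The button labels come from the slide; the pairing of each button with a TEI element (`<lb/>`, `<add>`, `<del>`, `<unclear>`, `<gap/>`, `<sic>` and so on) and the `apply_button` helper are assumptions made for the example, not the project's actual configuration.

```python
# Hypothetical mapping from toolbar button labels (taken from the slide)
# to (opening, closing) markup. The element names are standard TEI, but
# which element each button really inserts is an assumption.
TOOLBAR = {
    "Line Break": ("<lb/>", ""),
    "Paragraph": ("<p>", "</p>"),
    "Addition": ("<add>", "</add>"),
    "Deletion": ("<del>", "</del>"),
    "Unclear Reading": ("<unclear>", "</unclear>"),
    "Illegible Text": ("<gap/>", ""),
    "Note": ("<note>", "</note>"),
    "Underline": ('<hi rend="underline">', "</hi>"),
    "Unusual Spelling": ("<sic>", "</sic>"),
    "Foreign Language": ("<foreign>", "</foreign>"),
    "Ampersand": ("&amp;", ""),
    "Em Dash": ("&#x2014;", ""),
    "User Comment": ("<!-- ", " -->"),
}


def apply_button(text, start, end, button):
    """Return `text` with the markup for `button` applied to text[start:end].

    Buttons with no closing part (empty elements and entities) are
    inserted at `start` and leave the selection untouched.
    """
    open_tag, close_tag = TOOLBAR[button]
    if not close_tag:
        return text[:start] + open_tag + text[start:]
    return text[:start] + open_tag + text[start:end] + close_tag + text[end:]


if __name__ == "__main__":
    line = "the greatest happyness"
    # Mark the last word as an unusual spelling without typing any tags.
    print(apply_button(line, 13, 22, "Unusual Spelling"))
```

The volunteer only selects text and presses a button; the tag syntax stays out of sight, which is the design goal the slide describes.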

Completed transcript... TEI code rendered as HTML
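
To make "TEI code rendered as HTML" concrete, the following Python sketch converts a small TEI-style fragment into HTML using only the standard library. The display rules in `RULES`, the `[illegible]` placeholder, and the function names are assumptions for illustration; the slides do not say how the Transcription Desk performs its rendering, and real TEI documents also carry an XML namespace that this sketch ignores.

```python
import xml.etree.ElementTree as ET
from html import escape

# Assumed display rules: TEI element -> (HTML tag, optional CSS class).
RULES = {
    "p": ("p", None),
    "add": ("ins", None),
    "del": ("del", None),
    "unclear": ("span", "unclear"),
    "sic": ("span", "sic"),
    "foreign": ("i", None),
    "note": ("span", "note"),
    "hi": ("u", None),  # treats every <hi> as underline, ignoring @rend
}


def render(node):
    """Render one element, its children, and the text that follows it."""
    if node.tag == "lb":
        body = "<br/>"
    elif node.tag == "gap":
        body = '<span class="gap">[illegible]</span>'
    else:
        inner = escape(node.text or "") + "".join(render(c) for c in node)
        if node.tag in RULES:
            tag, cls = RULES[node.tag]
            attr = f' class="{cls}"' if cls else ""
            body = f"<{tag}{attr}>{inner}</{tag}>"
        else:
            body = inner  # unknown elements are unwrapped, text is kept
    return body + escape(node.tail or "")


def tei_to_html(fragment):
    """Wrap the fragment in a throwaway root so mixed content parses."""
    root = ET.fromstring(f"<div>{fragment}</div>")
    return render(root)


if __name__ == "__main__":
    tei = "<p>the <del>good</del> <add>greatest</add> happiness<lb/>of all</p>"
    print(tei_to_html(tei))
```

Unknown elements are unwrapped rather than rejected, so a transcript with unexpected markup still displays its text.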

Help pages...

“Profiles” for registered users

Long-term access/preservation via Library repository

Successful crowdsourcing Rose Holley's checklist for crowdsourcing:

Encouraging participation
Three target audiences
– Schools
  Teachers nationally, especially year-old level
  Local schools, building on UCL’s outreach links
– Academics
  Educators in palaeography, research methods etc
  Scholars in economic and social history, digital humanities, etc
– Amateur historians, enthusiasts, and general public
Different communications strategies in place for each group

Encouraging participation
Targeting each group involves a combination of activities
– Workshops, classes and presentations; paid-for advertisements in relevant print publications (eg History Today); approaches to disciplinary and professional bodies (eg IHR); press releases...
Careful planning required
– Publication lead times; academic cycle; short project!
Web 2.0 activity...

Outcomes and impact
Stimulation of public engagement with scholarly archives and manuscript transcription
Opening up Bentham’s thought to new audiences
– Policy makers, media, public
Creation of an open access, digitally-preserved resource for scholars
Availability of a re-usable, user-tested transcription tool for future projects and services
How do users interact with digital resources?
– Quantitative and qualitative data to help best practice

Progress / next steps
Digitisation began April 2010
Transcription Desk (beta) in user testing
Soft launch, ~20 testers, July 2010
Official launch August 2010
Publicity campaigns begin August 2010
Final report and user study May 2011

Thank you