The Europeana Newspapers Project A Gateway to European Newspapers Online.

Slides:



Advertisements
Similar presentations
State University – Higher School of Economics
Advertisements

Short overview of the SCORE project
A Gateway to European Newspapers Online Building Common History and Identity Around Digital Materials INFORUM, Prague, May 21-24, 2012 Vesna Vuksan, University.
Why does ERA Need to Flourish
THE PROJECT LIFECYCLE National Contact Point
Ute Schwens, Die Deutsche Bibliothek, IFLA Sattelite Meeting Information Technology and DCMI, Goettingen 12/08/03, 1/19 Ute Schwens, Die Deutsche Bibliothek.
EU-Bureau of the Federal Ministry of Education and Research PT-DLR Joseph-Schumpeter-Allee Bonn Tel: 0228 / Fax: 0228 /
06/02/2014 NORDUnet 2000 : Renardus – The Clever Route to Information Renardus – The Clever Route to Information Janne Kanner CSC – Scientific Computing.
Israel, 10th and 11th of December 2003 Italy Israel Bi-national Seminar on Digital Access to Scientific and Cultural Heritage Antonella Fresa MINERVA Technical.
Antonella Fresa Vilnius, 4th October 2007 Antonella Fresa Technical Coordinator MinervaEC MInisterial NEtwoRk for Valorising Activities in digitisation,
CrossAsia at the Staatsbibliothek zu Berlin an approach to organise access to research material in the field of Asian studies.
E-Content Service Group Virtual Meeting Digital Preservation: How to Get Started.
WP7 Internal Evaluation & Quality Assurance Green Employability Project StudioCentroVeneto - Toni Brunello and Paolo Zaramella Vienna, 10th January 2012.
ICT PSP Infoday Luxembourg Call 2011 – 2.4 eLearning ICT-PSP Call Objective eLearning Marc Röder Infso E6/eContent and Safer Internet Luxembourg,
February 16, 2014Ministry of Regional Development - 2 Mid-term assessment of information and publicity measures Commission Regulation (EC) No 1828/2006.
Information and Communication Technologies (ICT) in the Seventh Framework Programme Coordination actions ICT Calls Jan- March 2012.
The NATURE–SDIplus project Best Practice Network for SDI in Nature Conservation Co-funded by the Community Programme eContentplus ECP-2007-GEO Co-funded.
DRIVER Step One towards a Pan-European Digital Repository Infrastructure Norbert Lossau Bielefeld University, Germany Scientific coordinator of the Project.
Introduction to Planets Hans Hofman Nationaal Archief Netherlands Prague, 17 October 2008.
LIBER, Europeana and the Europeana Newspapers Project Dresden, Aleš Pekárek, Association of European Research Libraries, Den Haag, NL.
The European(a) Newspapers Project A Gateway to European Newspapers Online Paris, Thorsten Siegmann, Staatsbibliothek zu Berlin, Germany.
Europeana Newspapers Project A Gateway to European Newspapers Online.
PwC SCHEMAS Forum for metadata schema implementers The SCHEMAS project and metadata ETB Workshop, London, 9-10 January 2001 Michael Day,
Collection-level description & collection management: tool for the trade or information trade-off? Collection Description Focus Workshop 4 Newcastle, 8.
NRG, Bristol, November 17, 2005 Hans Petschar The European Library Vision for European Digital Library.
Collection-level description & the Information Landscape: users evaluate strategies for resource discovery Collection Description Focus Workshop 5 Cambridge,
1 Welcome to KTH! KTH, the Royal Institute of Technology Excellence in Education, Research and Entrepreneurship Victor Kordas, KTH Grants Office.
WP3. Evaluation, Monitoring and Quality Plan Dr. Luis Sobrado 27 th May 2011.
The Europeana ecosystem and the role of libraries Jill Cousins The Researcher of Tomorrow, Europeana Libraries Final Conference, Madrid, December 4, 2012.
1 Dissemination to Policy and Decision Makers and a Wider Audience Peter J. Bates pjb Associates
The European Activities of BR Communication e-CODEX e-Justice Communication via Online Data Exchange Bucharest, June 14 th 2013.
WP2 – communication and dissemination
Co-funded by the European Union Semantic CMS Community Content Management From free text input to automatic entity enrichment Copyright IKS Consortium.
FPS HEALTH, FOOD CHAIN SAFETY AND ENVIRONMENT 1 Joint Action on Health Workforce Planning and Forecasting Zuzana Matlonova Brussels – April 11th, 2013.
Testing and Evaluation in Digital Preservation Projects: the case of KEEP Milena Dobreva Janet Delve, David Anderson, Leo Konstantelos.
Europeana Collections Remembering the First World War Rotterdam, Thorsten Siegmann, Staatsbibliothek zu Berlin – Preußischer Kulturbesitz.
CIRAS PROJECT OVERVIEW
On the Two Sides of the Pond By Hans-Jörg Lieder, Head of the Department of Bibliographic Services – Union Catalogue of Serials Staatsbibliothek zu Berlin.
The view from Europe Paola Gargiulo – CASPUR (and Valentina Comba University of Bologna – Italy) Fiesole Collection Development Retreats Fiesole 2004 March.
Converging parallel universes Library services as building blocks of digital humanities research 42nd LIBER Annual Conference Munich June 2013 Gregor Horstkemper.
OpenUp! A New Project on Opening up the European Natural History Heritage for EUROPEANA W. G. Berendsohn, A. K. Michel, A. Güntsch, W.-H. Kusber (2011)
1 EuropeanaLocal- Europeana Knowledge Sharing Workshop EuropeanaLocal- Europeana Knowledge Sharing Workshop 13/14 January 2009 Rob Davies, Scientific Co-ordinator.
The German Union Catalogue of Serials and its interlibrary services Hans-Jörg Lieder Head of the Department of Bibliographic Services Staatsbibliothek.
nd Management Meeting Project Overview & Progress EUN, Brussels, 30 September 2010 Caroline Kearney Knowledge-Building.
Mary Rowlatt AccessIT Project Coordinator MDR Partners
Exploring Europe's Television Heritage in Changing Contexts Connected to: Funded by the European Commission within the eContentplus programme
Europeana Sounds – Uniting the sounds of Europe Richard Ranft (British Library) Zane Grosa (National Library of Latvia) IASA Nordic conference, 26 May.
Enhancing the Culture of Reading and Books in the Digital Age - ARROW Olav Stokkmo, Chief Executive, IFRRO 13 October 2009IFLA-IFRRO-WIPO-IPA-EWC Conference;
Ingenious-science.eu ECB National Needs Analysis Workshop – Israel Charmaine Kerr ECB Project Coordination.
ICT PSP Infoday Brussels Call 2011 – Theme 2 Digital Content ICT-PSP Call Theme 2: Digital Content Federico Milani, Marc Röder Infso E6/eContent.
Consolidating the European Library Space Luxembourg November 1999.
Cross-domain access to Europe’s heritage Jon Purday Senior Communications Advisor, Europeana Doom or Bloom: reinventing the library in the digital age.
Local content in a Europeana cloud Rob Davies, MDR Partners, Project Manager.
Mary Rowlatt AccessIT Project Coordinator MDR Partners
Andreas Juffinger 14 June, 2012, Washington DC Europeana Research Opening Up Europeana for Research.
Europeana Libraries: building a pan-European aggregator Wouter Schallier, LIBER Executive Director Eva/Minerva 15/11/2011.
National Library of Estonia in the TEL-ME-MOR project IST4Balt workshop in Estonia June 2006 Baltic ICT Community.
IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
Cultural Heritage Projects Bundesamtsgebäude Wien, June 30, 2000 Cultural Heritage Projects: Renardus Dr. Heike Neuroth Lower Saxony State and University.
Participation in 7FP Anna Pikalova National Research University “Higher School of Economics” National Contact Points “Mobility” & “INCO”
EuroRoadS A pan-European Road Data Solution Project within the eContent programme.
National Library of Finland Strategic, Systematic and Holistic Approach in Digitisation Cultural unity and diversity of the Baltic Sea Region – common.
EDLproject WP3 “Developing the European Digital Library” LIBER – EBLIDA workshop Digitisation of Library Material in Europe Copenhagen, October.
Creating Access to Europe’s Television Heritage Vienna, EDL Workshop November Dr. Alexander Hecht (Austrian Broadcasting Corporation ORF) Johan.
Matthias Wentzlaff-Eggebert, BZgA This work is part of the Joint Action on Improving Quality in HIV Prevention (Quality Action), which has received funding.
TELplus WP 1 “Making searchable digitised images via OCR” TELplus Kick-off Meeting Tallinn, 15./16. October 2007 Max Kaiser / Joachim Korb Austrian National.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
eContentplus 2008 Work Programme
DRIVER Digital Repository Infrastructure Vision for European Research
APENet and EUROPEANA: Digitization Issues in the European Context
Presentation transcript:

The Europeana Newspapers Project A Gateway to European Newspapers Online

2 Content Aims Consortium Structure Areas of activity

3 Why newspapers? Zeitungen sind die Sekundenzeiger der Geschichte (Newspapers are the sweep hands of history) Arthur Schopenhauer Relevant to all citizens Highly relevant to European policies incl. Europeana Newspapers in libraries – between Heaven = solid and complete originals, excellent microfilm copies and Hell = frail and crumbly originals, missing editions, incomplete supplements, poor microfilm copies, legal uncertainties with contemporary material

4 Aims & Objectives 1) Selection, Refinement & Aggregation of content Make Europeana the largest provider of pan-European newspaper collections Provision of more than 18 million newspaper pages to Europeana, many of those with full-texts 2) Analysis of existing newspaper collections Survey of newspaper holdings in Europe 3) Quality Assurance & Best practice recommendations Contribute to optimised workflows and data aggregation infrastructures Provide best practice recommendations for digitization, refinement, workflows, metadata etc. and evaluation tools 4) Presentation and full-text search Improve access to newspaper collections within Europeana

5 Consortium & Stakeholders 17 partners from 12 countries within the consortium National libraries University libraries SME External partners and stakeholders: Involvement of libraries outside the project consortium Framework: funded as a Best Practice Network in the ICT-PSP programme of the European Commission Project Duration: February 2012 – January 2015

Europeana Newspapers Consortium NLF SBB ONB NLP BnF NLE SUB HH USAL NLL KB LIBER CCS NLT UB UIBK LFT BL TEL

Consortium Partners 9. University of Salford 10. CCS Content Conversion Specialists GmbH 11. Stichting LIBER 12. National Library of Latvia 13. National Library of Turkey 14. University Library of Belgrade 15. University of Innsbruck 16. Landesbibliothek Dr. Friedrich Tessmann 17. The British Library 1. Staatsbibliothek zu Berlin (project co-ordinator) 2. National Library of the Netherlands 3. National Library of Estonia 4. Österreichische Nationalbibliothek 5. National Library of Finland 6. Staats- und Universitätsbibliothek Hamburg 7. Bibliothèque nationale de France 8. National Library of Poland

Project Structure Work Package 1: Coordination and Management Berlin State Library (SBB) Work Package 2: Refinement of digitised newspapers National Library of the Netherlands (KB) Work Package 3: Evaluation and Quality Assessment University of Salford (USAL) Work Package 4: Aggregation and presentation of digitised newspapers for Europeana The European Library (TEL) Work Package 5: Metadata best practice recommendations University of Innsbruck (UIBK) Work Package 6: Dissemination and Exploitation Association of European Research Libraries (LIBER)

WP 1: Coordination and Management Project administration management of all financial and organisational commitments Financial control Project communication provide infrastructure for internal communication Project quality assurance monitoring, evaluation, and reporting of results based on defined criteria Risk management avoid conflicts inside the Consortium

WP 2: Refinement of digitised newspapers Analyse and select available digital newspaper collections Define digitisation requirements and minimum quality of newspapers Coordinate refinement of selected content provided by libraries Provide recommendations on best practices for refinement of digitised newspaper collections

WP2: Refinement of digitised newspapers – OCR and OLR 8 million pages as is 10 million refined pages: OCR (UIBK, Austria) 2 million refined pages: OCR/OLR (article segmentation) (CCS, Germany) UIBK enriches the OCR with structural information from their Document Understanding Platform CCS produces OCR and verification of column recognition, zoning, article segmentation, and page class recognition CCS provides libraries with a client technology for manual correction of recognition and segmentation results CCS: Column recognition, article segmentation UIBK: Detection of headings, footnotes, etc. Table of contents extraction

WP 2: Refinement – Named Entity Recognition KB provides named entities recognition (NER) for material from up to three languages (Dutch, English, and German)

WP 3: Evaluation and Quality Assessment Use scenarios with evaluation profiles, datasets, ground truth, and evaluation tools Overview of usability, limitations and potential of existing material Identification of bottlenecks and recommendations for improvements Evaluation of refinement processes carried out in WP2 Recommendations for best practice in digitisation projects

WP 4: Aggregation and presentation for Europeana Identification and analysis of public and private digital newspaper collections across Europe Establish a realistic schedule for aggregation Creation of a European registry for digitized newspapers Recommendations how to align newspaper metadata to EDM Aggregate newspaper metadata from content providers Creation of a full-text index of newspaper content Development of a newspaper content browser

15 WP 4: Aggregation of content Aggregation of 18 million pages of digitised newspapers to Europeana and to The European Library Metadata transformation to meet the requirements of the Europeana Data Model (EDM) Distribution of data to Europeana

WP 4: Survey on existing digitised newspaper collections Project partners and others will be contacted until summer 2012 to analyse the extent of digitised newspapers collections at their institutions Embedding of results in Zeitschriftendatenbank of Staatsbibliothek zu Berlin (Union Catalogue of Serials) Identification of potential new partners for the extension of the network If you hold digital newspaper collections and like to participate in the survey please contact:

WP 4: Presentation & Access to full-texts Within the lifetime of the project, a content browser will be built within TEL portal so that users can … Search full text, e.g. by search term, by named entities by collections of newspapers by date …. See newspaper images Be linked to relevant library sources This browser will be built in TEL during the project; and exported to Europeana after the project

WP 5: Metadata best practice recommendations Analysis of metadata formats in use by libraries Align metadata models with the METS/ALTO standard and release best practise recommendations Usability of the recommendation will be tested through an evaluation cycle Provide recommendations on best practices for refinement of digitized newspaper collections for Europeana

19 WP 6: Dissemination Objectives Establishment of publicity Increasing usage of Europeana Awareness raising among target groups Tasks 1. Media Communication 2. Workshops and conferences Three main dissemination workshops National information days Network extension 3. Exploitation

Thank you for your attention! Contact