Cezary Mazurek, Marcin Werla, Poznań Supercomputing and Networking Center (Poznań, Poland), 2009-09-30, ECDL.


1 Cezary Mazurek, Marcin Werla, Poznań Supercomputing and Networking Center (Poznań, Poland). ECDL 2009, Corfu, Greece

3 Main organizational models
- Regional digital libraries
  - Created and maintained by several institutions from a particular region
  - Gather mostly resources related to the region, its history and culture, but also academic educational materials and national cultural heritage
- Institutional digital libraries
  - Created and maintained by single institutions (like universities)
  - Gather mostly resources related to present activities (like institutional repositories) and the history of the institution
- In many cases the technical base and support for digital libraries is provided by local computing or networking centres (like PSNC)

4 Regional digital libraries, institutional digital libraries
- Overall number of digital objects: 285 thousand
- Number of active digital libraries: 19 regional, 21 institutional
- Number of cooperating institutions: several hundred libraries, museums and archives
- Plus several other digital libraries in the phase of planning, configuration or initial content uploading

5 Main aims
- To facilitate the use of resources from Polish digital libraries
- To increase the visibility of these resources on the Internet
- To create new, advanced network services, both for end users and for digital library creators, on the basis of these resources

6 Basic assumptions
- No need or requirement to move resources to the DLF (Digital Libraries Federation)
- No fees for using the DLF or for being a part of it
- Open standards are the basis for cooperation
- Particular digital libraries can use different technological platforms

7 Basic functions
- Search in the available publications
  - Simple
  - Advanced
- Digitization plans
  - Searchable
  - Report
  - API for the prevention of duplicated digitization
- Location of digital objects on the basis of their OAI identifiers
- Database of Polish digital libraries
- Statistics and reports
- Information in the DLF is updated on a daily (nightly) basis
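Locating an object from its OAI identifier could be sketched roughly as follows. This is only an illustration, assuming identifiers follow the common `oai:<namespace>:<local-id>` pattern; the `REGISTRY` dictionary, the example domains and the function names are hypothetical and do not describe the DLF's actual implementation.

```python
from typing import Optional, Tuple

# Hypothetical registry: OAI namespace -> base URL of the digital library.
# The real DLF database of libraries is not shown in the slides.
REGISTRY = {
    "library-a.example.org": "https://library-a.example.org/",
    "library-b.example.org": "https://library-b.example.org/",
}


def split_oai_identifier(identifier: str) -> Tuple[str, str]:
    """Split 'oai:<namespace>:<local-id>' into (namespace, local id)."""
    scheme, sep, rest = identifier.partition(":")
    if scheme != "oai" or not sep:
        raise ValueError(f"not an OAI identifier: {identifier!r}")
    namespace, sep, local_id = rest.partition(":")
    if not namespace or not sep or not local_id:
        raise ValueError(f"malformed OAI identifier: {identifier!r}")
    return namespace, local_id


def locate(identifier: str) -> Optional[str]:
    """Return the base URL of the library that owns the identifier, if known."""
    namespace, _local_id = split_oai_identifier(identifier)
    return REGISTRY.get(namespace)
```

Keeping the lookup keyed on the namespace part means the local part of the identifier never has to be interpreted by the aggregator.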

8 See it:

9 Digital Libraries Federation search plugin

10 Digital Libraries Federation
[Diagram labels: Institutional, Regional, Libraries, Archives, Museums, …, National (exclude??), Other; Institutions, Digital libraries, Metadata aggregator]

11 We gather information about content providers and their information systems
- Database of Polish Digital Libraries in the DLF

12 We gather the metadata of objects that should be visible in Europeana
- Done with OAI-PMH
  - In most cases we require an OAI-PMH interface
  - In really special cases we can do it in a different way (e.g. Polish Internet Library)
- Now we harvest only simple Dublin Core
- Work on a new national metadata schema started in September 2009
  - Approximate time of development: 3 months
  - Approximate time of deployment: ???
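A harvest of simple Dublin Core over OAI-PMH can be sketched as below. This is a minimal illustration, not the DLF harvester: the base URL and the embedded `SAMPLE` response are made up, and only the `ListRecords` verb, the `oai_dc` metadata prefix and the `resumptionToken` mechanism come from the OAI-PMH protocol itself.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}
DC = "{http://purl.org/dc/elements/1.1/}"

# Made-up response fragment, used here instead of a real network call.
SAMPLE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:library.example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example title</dc:title>
          <dc:language>pol</dc:language>
          <dc:type>czasopismo</dc:type>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>token-2</resumptionToken>
  </ListRecords>
</OAI-PMH>
"""


def list_records_url(base_url, resumption_token=None):
    """Build a ListRecords request; a resumption token replaces metadataPrefix."""
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": "oai_dc"}
    return base_url + "?" + urlencode(params)


def parse_records(xml_text):
    """Return (records, resumption_token) from one ListRecords response."""
    root = ET.fromstring(xml_text.strip())
    records = []
    for rec in root.iterfind(".//oai:record", NS):
        identifier = rec.findtext("oai:header/oai:identifier", default="", namespaces=NS)
        fields = {}
        for el in rec.iter():
            # Collect every Dublin Core element; each one may repeat.
            if el.tag.startswith(DC) and el.text and el.text.strip():
                fields.setdefault(el.tag[len(DC):], []).append(el.text.strip())
        records.append({"identifier": identifier, "fields": fields})
    token = root.findtext(".//oai:resumptionToken", default="", namespaces=NS).strip()
    return records, (token or None)


records, token = parse_records(SAMPLE)
```

A real harvester would fetch `list_records_url(...)` repeatedly, passing the returned token back in until none is returned.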

13 We will try to clean up, normalize and enrich the metadata
- At the DLF level, dictionaries are built automatically from the aggregated metadata
  - Separately for each metadata element
  - Separately for each metadata language
- Differences between the metadata from various digital libraries have a negative impact on the search possibilities of end users
  - That is why metadata normalization is so important
- A basic analysis shows which elements are crucial and which should be easy to clean up
  - The analysis was done in April 2009 on the metadata of aggregated objects
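The per-element, per-language dictionaries and the kind of statistics used in the analysis (unique values, total uses, average uses per value) could be computed along these lines. This is a sketch under assumptions: the input format of `(element, language, value)` triples, the function names and the sample rows are all invented for illustration.

```python
from collections import Counter, defaultdict


def build_dictionaries(triples):
    """Count value usage separately for each (element, language) pair."""
    dictionaries = defaultdict(Counter)
    for element, language, value in triples:
        dictionaries[(element, language)][value] += 1
    return dictionaries


def element_stats(counter):
    """Return (unique values, total uses, average uses per value)."""
    unique = len(counter)
    uses = sum(counter.values())
    average = uses / unique if unique else 0.0
    return unique, uses, average


# Invented sample rows, not real DLF data.
sample = [
    ("type", "pl", "czasopismo"),
    ("type", "pl", "Czasopismo"),
    ("type", "pl", "czasopismo"),
    ("type", "en", "periodical"),
    ("language", "pl", "pol"),
]
dictionaries = build_dictionaries(sample)
stats = element_stats(dictionaries[("type", "pl")])
```

A high average number of uses per value suggests a small, controlled vocabulary that should be easy to clean up, while an average close to 1 suggests free text.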

14 DC element | Number of unique values | How many times values were used in metadata | Average number of uses per value
format | … | … | …,2
language | … | … | …,6
type | … | … | …,7
rights | … | … | …,5
coverage | … | … | …,2
publisher | … | … | …,3
contributor | … | … | …,4
subject | … | … | …,6
relation | … | … | …,2
date | … | … | …,4
identifier | … | … | …,3
description | … | … | …,1
source | … | … | …,1
creator | … | … | …,1
title | … | … | …

15 Format
- In 99% of descriptions: MIME type (e.g. text/html, image/x.djvu)
Language
- In most cases: three-letter ISO codes (pol, ger, lat, fre etc.)
- Sometimes a single value "pol, ger" instead of two separate values "pol" and "ger"
Rights
- Name of the institution which holds the original object
Type
- …
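The language problem mentioned above, one combined value where several separate codes were meant, could be repaired with a small splitting step like the one below. The separator set (comma, semicolon, slash) and the function name are assumptions made for this sketch.

```python
import re


def split_language_values(values):
    """Split combined values like 'pol, ger' into separate, de-duplicated codes."""
    result = []
    for value in values:
        for part in re.split(r"[,;/]", value):
            code = part.strip().lower()
            if code and code not in result:
                result.append(code)
    return result
```

For example, `split_language_values(["pol, ger"])` yields two values, `pol` and `ger`, in their original order.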

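16 Values for Type (top 20)
Value | Number of objects with the value | % of aggregated objects | % of aggregated objects (after clean-up)
czasopismo (periodical) | … | …,9% | 33,8%
gazeta (newspaper) | … | …,4% | 31,3%
gazety (newspapers) | … | …,8% |
Czasopismo | … | …,8% |
książka (book) | … | …,8% |
Gazeta | … | …,2% |
pocztówka (postcard) | … | …,7% |
czasopisma (periodicals) | … | …,3% |
text | … | …,1% |
grafika (graphic) | … | …,8% |
fotografia (photograph) | … | …,7% |
artykuł z czasopisma (journal article) | … | …,5% | 2,6%
artykuł (article) | … | …,1% |
Czasopisma | … | …,8% |
dzienniki urzędowe (official journals) | … | …,7% |
stary druk (old print) | … | …,6% | 1,1%
starodruk (old print) | … | …,6% |
rysunek (drawing) | … | …,5% |
rękopis (manuscript) | … | …,5% |
mapa (map) | … | …,5% |
Sum | | 85,1% | 68,9%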
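A clean-up of Type values in the spirit of this table might merge spelling variants before counting, as sketched below. The `VARIANTS` mapping (plural to singular, `starodruk` to `stary druk`) is a guess at what such a clean-up could contain, and the counts in the usage line are invented; neither comes from the DLF.

```python
from collections import Counter

# Guessed variant mapping, applied after case folding.
VARIANTS = {
    "czasopisma": "czasopismo",
    "gazety": "gazeta",
    "starodruk": "stary druk",
}


def normalize_type(value):
    """Collapse whitespace, lowercase, then map known variants to one form."""
    key = " ".join(value.split()).lower()
    return VARIANTS.get(key, key)


def merge_counts(counts):
    """Sum object counts over all variants of the same normalized value."""
    merged = Counter()
    for value, n in counts.items():
        merged[normalize_type(value)] += n
    return merged


# Invented counts for illustration only.
merged = merge_counts({"czasopismo": 5, "Czasopismo": 2, "czasopisma": 1,
                       "gazeta": 4, "Gazeta": 1, "gazety": 3})
```

Merging variants this way is why a cleaned-up value can account for a much larger share of objects than any single raw spelling.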

17 DC element | Number of unique values | How many times values were used in metadata | Average number of uses per value (same table as on slide 14)

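18 (Polish version of objects description)
Value | No. of associations | % of all associations
gazety regionalne (regional newspapers) | 12214 | 2,56%
czasopisma (periodicals) | 7716 | 1,62%
prasa polska (Polish press) | 5424 | 1,14%
czasopisma niemieckie (German periodicals) | 5009 | 1,05%
gazety sublokalne (sub-local newspapers) | 4968 | 1,04%
Grodków | 4962 | 1,04%
Grottkau | 4961 | 1,04%
Wielkopolska | 4422 | 0,93%
19 w. (19th century) | 4249 | 0,89%
Prusy (Prussia) | 4164 | 0,87%
Czasopisma regionalne i lokalne polskie -19 w. (Polish regional and local periodicals, 19th century) | 4140 | 0,87%
wiadomości polityczne (political news) | 4094 | 0,86%
Gazety polskie r. | 4077 | 0,85%
kultura (culture) | 4071 | 0,85%
czasopisma sublokalne (sub-local periodicals) | 3813 | 0,80%
Górny Śląsk (Upper Silesia) | 3731 | 0,78%
architektura (architecture) | 3566 | 0,75%
Wrocław | 3515 | 0,74%
Śląsk (Silesia) | 3448 | 0,72%
budownictwo (construction) | 3388 | 0,71%
Annotation: Confused with coverage: temporal, spatial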
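Values in this list that really describe coverage (a period such as "19 w." or a place such as "Grottkau") could be flagged with a simple heuristic like the one below. The century pattern and the tiny `PLACES` set are assumptions made for this sketch; a real clean-up would need a proper gazetteer and human review.

```python
import re

# "19 w." means "19th century" ("w." abbreviates the Polish "wiek").
CENTURY = re.compile(r"^\d{1,2}\s*w\.$")

# Hypothetical mini-gazetteer built from place names seen in the table.
PLACES = {"Grodków", "Grottkau", "Wielkopolska", "Prusy",
          "Górny Śląsk", "Wrocław", "Śląsk"}


def classify_subject(value):
    """Guess whether a subject value is really temporal or spatial coverage."""
    v = value.strip()
    if CENTURY.match(v):
        return "coverage:temporal"
    if v in PLACES:
        return "coverage:spatial"
    return "subject"
```

The pattern is anchored on purpose, so a genuine subject heading that merely ends with a century, such as "Czasopisma regionalne i lokalne polskie -19 w.", stays classified as a subject.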

19 (Polish version of objects description)
Value | No. of associations | % of all associations
Poznań | … | …,62%
Telecomp Service na zlecenie PBI | 22310 | 5,12%
Kraków | 13662 | 3,14%
Warszawa | 11245 | 2,58%
Toruń | 11221 | 2,58%
Katowice | 8187 | 1,88%
Drukarnia Polska | 7998 | 1,84%
Drukarnia Dziennika Poznańskiego T.A. | 6828 | 1,57%
Warszawa : Telecomp Service na zlecenie PBI | 6824 | 1,57%
Drukarnia Dziennika Poznańskiego S.A. | 5785 | 1,33%
Nakładem F[ranciszka] T[adeusza] Rakowicza | 5406 | 1,24%
Kielce | 5292 | 1,22%
Krakowskie Wydawnictwo Prasowe RSW "Prasa" | 5137 | 1,18%
Breslau | 5130 | 1,18%
E. Neugebauer | 4959 | 1,14%
Wangefield | 4959 | 1,14%
Grottkau | 4959 | 1,14%
Bydgoszcz | 4752 | 1,09%
Drukarnia Dziennika Poznańskiego | 3923 | 0,90%
Drukarnia J. I. Kraszewskiego | 3869 | 0,89%
Annotation: Geographical location…

20 We have over 40 digital libraries in Poland, filled with content and metadata coming from hundreds of institutions from different domains
- We harvest the metadata and provide a single point of access to it
  - The PIONIER Network Digital Libraries Federation (http://fbc.pionier.net.pl/)
  - The software used for this service will be released as open source by the end of this year
- Cooperation with Europeana (but not only this) requires cleaning up and normalizing the metadata
  - This is currently our biggest challenge
  - But we do not want to solve it only by technical means at the level of our aggregator
  - Close cooperation with content providers, and some organizational changes prepared by them, should result in a more efficient and sustainable metadata improvement process than a purely technical solution

21 Thank you for your attention. Any questions?
Cezary Mazurek, Marcin Werla, Poznań Supercomputing and Networking Center (Poznań, Poland)

