
etb.eun.org ETB IST 1999 - 11781 IuK 2001 Metadata + Heterogeneity in ETB 12.03.2001 Kluck (HUB/IZ)
1 Metadata and Handling of Heterogeneity as Central Means for the Development of an European School Portal - The Project European Schools Treasury Browser (ETB)
Presentation at the 7th Annual Meeting of the IuK Initiative, Trier, 11.-14.03.2001
Michael Kluck
Humboldt University Berlin, Computer Uses in Education (HUB)
Social Sciences Information Centre Bonn (IZ)

2 Introduction (I)
The ETB project is embedded in the context of the European Schoolnet (EUN), www.eun.org
The European Schoolnet is the new framework for co-operation between the European Ministries of Education on Information and Communication Technology in education.
EUN builds a European network of national and regional computer networks of repositories on schools.

3 BUILD THE SCHOOLNET INFORMATION SPACE

4 Introduction (II)
ETB works out the technological and structural prerequisites for this network of networks.
Building on a preceding project, ETB is to realise the technical infrastructure and the content-based integration of the different services and of their cultural and linguistic contexts.
This presentation concentrates on the content integration of the participating networks and repositories.
The main user groups will be teachers and pupils.

5 Developing a Common Metadata Set
Context and general purpose:
Get similarly structured information
Facilitate targeted search
Avoid a mismatch between the specific search and the unstructured universe of the Internet:
-Topic versus person (e.g. Ohm, Kierkegaard)
-Different domain-specific meanings (e.g. the German terms Leistung, Disziplin)
-Domain-specific meaning versus general meaning (e.g. the German term Lehre; services)

6 ETB Metadata
Derived from the Dublin Core metadata elements and the EUN Metadata Element Set (developed in the preceding EUN project)
Quite minimalised, but with obligation types:
-M = mandatory
-O = optional
Using RDF syntax
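
A minimal sketch of what a Dublin Core based record in RDF/XML syntax could look like when generated programmatically. The two namespace URIs are the standard RDF and Dublin Core 1.1 ones; the function name `record_to_rdf`, the example URL and the field values are illustrative assumptions and are not taken from the ETB specification.

```python
import xml.etree.ElementTree as ET

# Standard namespace URIs for RDF and Dublin Core 1.1 elements.
RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC_NS = "http://purl.org/dc/elements/1.1/"


def record_to_rdf(resource_url, fields):
    """Serialise a flat {element: value} record as an RDF/XML string.

    Hypothetical helper: element names are used as-is as Dublin Core
    element names (for example "title", "creator", "language").
    """
    ET.register_namespace("rdf", RDF_NS)
    ET.register_namespace("dc", DC_NS)
    root = ET.Element(f"{{{RDF_NS}}}RDF")
    description = ET.SubElement(
        root,
        f"{{{RDF_NS}}}Description",
        {f"{{{RDF_NS}}}about": resource_url},
    )
    for name, value in fields.items():
        ET.SubElement(description, f"{{{DC_NS}}}{name}").text = value
    return ET.tostring(root, encoding="unicode")


# Invented example record, for illustration only.
example = record_to_rdf(
    "http://example.org/resource/1",
    {
        "title": "Worksheet on electrical resistance",
        "creator": "A. Teacher",
        "language": "en",
    },
)
print(example)
```

The sketch covers only plain literal values; elements that ETB adds beyond Dublin Core (such as EUN User Level) would need their own namespace, which is not given in the slides.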

7 ETB Metadata Elements (I)
Element       Obligation
Title         M
Creator       M
Subject       O or M?!
Description   M
Publisher     O
Contributor   O
Date          O
Type          O

8 ETB Metadata Elements (II)
Element             Obligation
Format              O
Identifier          M
Source              O
Language            M
Relation            O
Coverage            O
Rights Management   O
Audience            O
EUN User Level      O
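
The obligation types listed on the two element slides can be checked mechanically. The following is a sketch under stated assumptions: the table is copied from the slides, Subject is treated as optional because the slides leave its status open ("O or M?!"), and the function names and the sample record are hypothetical.

```python
# Obligation table as given on the slides: M = mandatory, O = optional.
# Assumption: "Subject" is marked "O or M?!" in the source; it is treated
# as optional here.
OBLIGATION = {
    "Title": "M", "Creator": "M", "Subject": "O", "Description": "M",
    "Publisher": "O", "Contributor": "O", "Date": "O", "Type": "O",
    "Format": "O", "Identifier": "M", "Source": "O", "Language": "M",
    "Relation": "O", "Coverage": "O", "Rights Management": "O",
    "Audience": "O", "EUN User Level": "O",
}


def missing_mandatory(record):
    """Return the mandatory elements that are absent or empty, sorted."""
    return sorted(
        name for name, kind in OBLIGATION.items()
        if kind == "M" and not record.get(name)
    )


def unknown_elements(record):
    """Return element names that are not part of the element set, sorted."""
    return sorted(set(record) - set(OBLIGATION))


# Invented sample record, for illustration only.
record = {"Title": "Ohm and resistance", "Creator": "A. Teacher",
          "Language": "de", "Keywords": "physics"}
print(missing_mandatory(record))
print(unknown_elements(record))
```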

9 ETB Metadata Elements (III)
Element Subject
Besides freely chosen keywords:
-ETB thesaurus terms
-Sound or video clip representing the content of an audio, audiovisual, visual or multimedia resource

10 ETB Metadata Elements (IV)
Element EUN User Level
-School level or age group
-Pre-school (education)
-Primary (education)
-AdultEducation
-Secondary (education)
-Vocational (education and training)
-HigherEducation
-Juvenile (material for children and adolescents in general)
-Adult (material for adults in general)

11 Producing Metadata
Direct entry by authors (adapting given rules/definitions or using an online template)
Generation by repositories during input
Extraction from existing un-coded data by defining extraction rules

12 Metadata Extraction and Mapping
For repositories with differing metadata structures, mapping schemes into the ETB Metadata Element Set will be set up.
For repositories without metadata schemes, metadata will be extracted from the entries, as far as structured elements of the resources can be detected and an algorithm for converting them into metadata fields can be applied.
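
Both approaches on this slide can be sketched in a few lines. Everything specific here is an assumption made for illustration: the local field names in `MAPPING_SCHEME`, the helper names, and the single extraction rule (taking the HTML title as the Title element). The real ETB mapping schemes and extraction rules are not given in the slides.

```python
import re

# Hypothetical mapping scheme: local repository field -> ETB element.
MAPPING_SCHEME = {
    "titel": "Title",
    "autor": "Creator",
    "kurzbeschreibung": "Description",
    "url": "Identifier",
    "sprache": "Language",
}


def map_record(local_record, scheme):
    """Split a local record into mapped ETB elements and unmapped leftovers."""
    mapped, unmapped = {}, {}
    for field, value in local_record.items():
        target = scheme.get(field.lower())
        if target:
            mapped[target] = value
        else:
            unmapped[field] = value
    return mapped, unmapped


def extract_title(html):
    """Example extraction rule: use the HTML <title> as the Title element.

    Returns None when no title element can be detected.
    """
    match = re.search(r"<title>(.*?)</title>", html, re.IGNORECASE | re.DOTALL)
    return match.group(1).strip() if match else None


mapped, unmapped = map_record({"Titel": "Der Stromkreis", "Fach": "Physik"},
                              MAPPING_SCHEME)
print(mapped, unmapped)
print(extract_title("<html><head><title> Der Stromkreis </title></head></html>"))
```

Keeping the unmapped fields separate makes it visible which parts of a repository's own structure the common element set does not cover.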

13 Metadata Exchange via NNTP

15 Technical Goals of ETB
A new approach for a European network of repositories
Network based on Publish not Pull
Added value to users from a thesaurus
Retain full local editorial policy
High quality control tools
Wider outreach
Support of multilinguality

16 ETB Thesaurus (I)
Search problems
Natural language problems:
-Synonymy, homonymy, polysemy, phrases, compounds, spelling variations
Lack of relevance control
Multilinguality

17 ETB Thesaurus (II)
Thesaurus benefits
Effective control of indexing language (preferred terms, inter-language equivalence)
Systematic display of descriptors (ease of navigation through the terminology)
Indexing and searching by using post-coordination
Following recommendations of Dublin Core
Basics for solving heterogeneity

18 ETB Thesaurus (III)
The content of the repositories in the EUN context (= multimedia material, teaching material, school projects), with schools as the target area and teachers and pupils as the main target groups, needs a specific terminology.
Only a few repositories have developed their own terminology.

19 Handling Heterogeneity (I)
Making use of existing content descriptions
Dealing with heterogeneity on the content level means:
The same words or phrases may indicate different meanings in different environments (e.g. education, or class):
-Occurring anywhere in the full text of an Internet resource
-Being the code of a classification scheme assigned to a document
-Being an indexing term taken from a specific thesaurus

20 Handling Heterogeneity (II)
Use of existing intellectual work done by the different repositories or resource authors: indexing or classifying documents even with different schemes or terminologies
Use of existing terminologies or classification schemes for automatic processing of transfer relations

21 Handling Heterogeneity (III)
Methods for solving heterogeneity problems
Intellectual building of cross-concordances between relevant terminologies and classification schemes and between different languages, and automatic (statistical) building of transfer components
Developing transfer components between those terminologies and schemes, and between those and the words occurring in the full texts (co-occurrence analysis, fuzzy methods, neural networks etc.)
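
Of the statistical methods named on this slide, co-occurrence analysis is the simplest to illustrate. The sketch below assumes a set of documents that have been indexed with two different vocabularies and derives weighted transfer relations from how often terms occur together. The conditional-frequency weighting, the threshold `min_weight`, the function name and the sample data are assumptions for illustration, not the project's actual transfer component.

```python
from collections import Counter, defaultdict


def build_transfer(docs, min_weight=0.5):
    """Derive transfer relations from vocabulary A to vocabulary B.

    docs: iterable of (terms_a, terms_b) pairs, one pair per document that
    carries index terms from both vocabularies.
    The weight of a -> b is the share of documents indexed with a that are
    also indexed with b; relations below min_weight are dropped.
    """
    a_count = Counter()
    pair_count = defaultdict(Counter)
    for terms_a, terms_b in docs:
        for a in set(terms_a):
            a_count[a] += 1
            for b in set(terms_b):
                pair_count[a][b] += 1

    transfer = {}
    for a, counts in pair_count.items():
        weighted = {b: n / a_count[a] for b, n in counts.items()}
        kept = [(b, w) for b, w in weighted.items() if w >= min_weight]
        # Strongest relation first; ties broken alphabetically.
        transfer[a] = sorted(kept, key=lambda item: (-item[1], item[0]))
    return transfer


# Invented doubly indexed sample documents (German terms, English terms).
docs = [
    ({"Unterricht"}, {"teaching"}),
    ({"Unterricht"}, {"teaching", "school"}),
    ({"Unterricht", "Physik"}, {"physics", "teaching"}),
    ({"Physik"}, {"physics"}),
]
transfer = build_transfer(docs)
print(transfer["Unterricht"])
print(transfer["Physik"])
```

A statistically derived relation such as Physik -> teaching shows why the slide pairs automatic methods with intellectually built cross-concordances: co-occurrence yields plausible but noisy relations.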

22 Multilingual Access
Using ETB thesaurus and heterogeneity handling
The ETB thesaurus allows indexing or searching in any covered language, and results can automatically be retrieved in all other languages.
Heterogeneity handling (intellectually or automatically processed) allows the use of any (language-specific) scheme: results can also be retrieved in other schemes or languages.
Integration of results in the area of cross-language information retrieval and its evaluation (see: CLEF = Cross-Language Evaluation Forum at www.clef-campaign.org)
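
The thesaurus-based part of this idea can be sketched as a lookup through language-independent concepts: a query term in any covered language is resolved to a concept, and every document indexed with that concept is returned regardless of its language. The concept identifiers, labels, document names and function names below are invented for illustration and do not come from the ETB thesaurus.

```python
# Hypothetical multilingual thesaurus: concept id -> label per language.
THESAURUS = {
    "c1": {"en": "teaching", "de": "Unterricht", "fr": "enseignement"},
    "c2": {"en": "physics", "de": "Physik", "fr": "physique"},
}

# Hypothetical index: document id -> concepts assigned during indexing.
DOCUMENTS = {
    "doc-de-1": {"c1"},
    "doc-fr-1": {"c1", "c2"},
    "doc-en-1": {"c2"},
}


def build_label_index(thesaurus):
    """Map every lower-cased label to its concept id.

    Assumption: labels are unique across languages; a real thesaurus would
    have to handle the same string denoting different concepts.
    """
    index = {}
    for concept_id, labels in thesaurus.items():
        for label in labels.values():
            index[label.lower()] = concept_id
    return index


def search(term, label_index, documents):
    """Return all documents indexed with the concept behind the query term."""
    concept_id = label_index.get(term.lower())
    if concept_id is None:
        return []
    return sorted(doc for doc, concepts in documents.items()
                  if concept_id in concepts)


label_index = build_label_index(THESAURUS)
print(search("Unterricht", label_index, DOCUMENTS))
print(search("physique", label_index, DOCUMENTS))
```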

23 Conclusion
ETB is strongly integrated in an existing and rapidly developing application for practitioners (teachers and pupils) with good political support for handling ICT in education.
ETB is strongly integrated into top-level research on distributed networking, metadata, (cross-language) information retrieval, multilingual thesauri, and heterogeneity handling.

24 Thank you for your attention!
Further information
On the multilingual ETB thesaurus: http://www.en.eun.org/eun.org2/eun/en/etb/content_frame.cfm?lang=en&ov=3813
On other aspects of the ETB Project (collection description, quality management, technical solutions): http://www.en.eun.org/eun.org2/eun/en/etb/sub_area_frame.cfm?sa=195&row=1
Michael Kluck's publications: http://www.educat.hu-berlin.de/~kluck/kl-personal.html


