Presentation is loading. Please wait.

Presentation is loading. Please wait.

Knowledge Creation for an Educational Use of Digital Libraries across Language Boundaries US-Korea Joint Workshop on Digital Libraries August 10-11, 2000.

Similar presentations


Presentation on theme: "Knowledge Creation for an Educational Use of Digital Libraries across Language Boundaries US-Korea Joint Workshop on Digital Libraries August 10-11, 2000."— Presentation transcript:

1 Knowledge Creation for an Educational Use of Digital Libraries across Language Boundaries US-Korea Joint Workshop on Digital Libraries August 10-11, 2000 Sung Hyon Myaeng Division of Information & Communication Chungnam National University KOREA http://ir.cnu.ac.kr/

2 Myaeng @ Chungnam National University 2 Outline General Goal Project Goals Virtual Document Concept & MIRAGE-III Cross-Language IR Work Project Goals Revisited

3 Myaeng @ Chungnam National University 3 General Goal To develop core technology for providing an educational environment where research and educational materials can be searched, shared, and used creatively through internationally interoperable digital libraries

4 Myaeng @ Chungnam National University 4 Project Goal Develop techniques for federated searching in multi- lingual, heterogeneous digital library (DL) environment A digital library environment for active learning with distributed, multilingual materials (OAI-compliant) Develop a technical basis for an OAI-compliant information gathering environment using the federated searching techniques Extend the Virtual Document work to incorporate OAI and integrate the federated searching capability US & Korea USKorea

5 Myaeng @ Chungnam National University 5 Virtual Document Concept & MIRAGE-III

6 Myaeng @ Chungnam National University 6 Motivation DL as a Dynamic Knowledge Space Dynamic: creation of new materials Knowledge: inter-connection using links Space: “distance” among objects (retrievable) Virtual Document (vs. physical document) A document virtually exists over existing digital resources on the Internet A way of sharing and exploiting existing information to create knowledge

7 Myaeng @ Chungnam National University 7 Virtual Document: Example Van Gogh's: Masterpieces from the Van Gogh Museum, Amsterdam, will be based entirely on the holdings of the Van Gogh Museum. The exhibition will illustrate Van Gogh's entire career, from the Potato Eaters of 1885 through Wheatfield of Crows of 1890, the year of his death. It will include such famous works as the Self Portrait as an Artist (1888) The Zouav (1888), The Bedroom (1888e,) and The Harvest (1888). Embedding, Total Link Embedding, Partial Links The exhibition will illustrate Van Gogh's entire career, from the Potato Eaters of 1885 through Wheatfield of Crows of 1890, the year of his death. It will include such famous works as the Self Portrait as an Artist (1888) The Zouav (1888), The Bedroom (1888e,) and The Harvest (1888). Van Gogh VDoc Instantiation Referential Link

8 Myaeng @ Chungnam National University 8 Virtual Document: Concept Document consisting of links only. Types of links embedding / referential one-to-one / one-to-many / many-to-one specific / generic total / partial etc.

9 Myaeng @ Chungnam National University 9 One-to-One vs One-to-Many Depending on the cardinality of the destination Hamlet, Prince of Denmark 1. Shakespear 2. Hamlet - A Note on Sources The text of this play was acquired via Gopher from iretap.spies.com ………… EnglishKorean Translated by C. Park Translated by C. Park Translated by J. Lee Translated by J. Lee Translated by Y. Kim Translated by Y. Kim The infant William Shakespeare may have been born on this day in 1564. And since no other day seems a likelier candidate,..

10 Myaeng @ Chungnam National University 10 Generic vs Specific Depending on the condition of the source The infant William Shakespeare may have been born on this day in 1564. And since no other day seems a likelier candidate, …… It is hard to believe, but once again they are new and improved.My motive in publishing these pages remains to help and stimulate others in Shakespeare studies There are also links to the Lambs' Tales From Shakespeare (an orignal html edition mounted at this site). Near the bottom of the page I have placed …… The Complete Works of William Shakespeare Welcome to the Web's first edition of the Complete Works of William Shakespeare. The original electronic source for this server is the Complete Moby(tm) Shakespeare, which is freely available online. There may be differences between a copy of a play ……which is freely available online Generic Link About the categories Shakespeare's plays are often arranged in three categories: tragedy, comedy, or history. …… About the categories Shakespeare's plays are often arranged in three categories: tragedy, comedy, or history. …… Shakespeare Discussion Area Welcome to the discussion pages to discuss Shakespeare and his work, to ask and answer questions, and to enliven the site. Shakespeare Discussion Area Welcome to the discussion pages to discuss Shakespeare and his work, to ask and answer questions, and to enliven the site. Site 1

11 Myaeng @ Chungnam National University 11 Total vs Partial - Image Total Link Partial Link

12 Myaeng @ Chungnam National University 12 Total vs Partial - Video Star Wars: Episode I Snapshots set7 The Trade Federation army marches towards Theed Star Wars: Episode I Snapshots set7 The Trade Federation army marches towards Theed Starwars.mov Total Link Partial Link Time(min,sec)

13 Myaeng @ Chungnam National University 13 Virtual Document: Definition A hub & a style sheet A hub consists of R-links (referential links) E-links (embedding links) Metadata (Dublin Core + index terms)

14 Myaeng @ Chungnam National University 14 Virtual Document: Benefits Easy creation of composite documents links on read-only documents links on semantics Retrieval of composite and component documents Savings in storage (and network traffic) unnecessary to copy/store large documents linking a part of a large document

15 Myaeng @ Chungnam National University 15 Virtual Document: Benefits (Cont’d) Handling multiple versions & representations with one-to-many links Annotating to documents with metadata “community document” & support for collaborative work Automatic reflection of changes in participating documents

16 Myaeng @ Chungnam National University 16 The Architecture of MIRAGE-III Other MIRAGE-REGULAR RS LS SS VDocPDoc ` Link Server Link DB Index Meta Searcher Retrieval Server Storage Server MIRAGE-REGULAR (Public DL) MIRAGE-LITE (Personal DL) RS LS SS User Agent Authoring Tool Client User Agent

17 Myaeng @ Chungnam National University 17 Authoring Panel Image Panel Text Panel Virtual Document Authoring Tool Partial Embedding Link Partial Embedding Link

18 Myaeng @ Chungnam National University 18 Retrieval Interface (in an ordinary way) Input Bar Result List Panel PDoc Browser Selection Retrieval Command

19 Myaeng @ Chungnam National University 19 Link Condition Input Bar Metadata Condition Result List Panel Retrieval Command Selection VDoc Browser Retrieval Interface (Link-based)

20 Myaeng @ Chungnam National University 20 Summary Virtual Document Concept Link-based Retrieval Retrieval of Composite/Component Documents An education and KM tool for new functionality Personal DL / Public DL

21 Myaeng @ Chungnam National University 21 Further Research Multimedia: Audio & Video Federated Searching Diverse formats of documents Copyright/Ownership Issues Right management based on DOI (Digital Object Identifier) Envisioned

22 Myaeng @ Chungnam National University 22 Cross-Language IR (1) Research Goal How far can we go with most readily available resources? Development of a practical CLIR system Query translation with a bilingual dictionary Disambiguation with co-occurrence statistics mutual information statistics in the target corpus only selection of one or more terms + term weighting Experiments with TREC-6

23 Myaeng @ Chungnam National University 23 Cross-Language IR (2) Intermediate conclusions: Using a target corpus for disambiguation can give a reasonable performance. We haven’t reached the upper limit. Ongoing work Different disambiguation & weighting methods? Ways to accurately translate phrases?

24 Myaeng @ Chungnam National University 24 Project Goal - Revisited Develop techniques for federated searching in multi- lingual, heterogeneous digital library (DL) environment A digital library environment for active learning with distributed, multilingual materials (OAI-compliant and others) Develop a technical basis for an OAI-compliant information gathering environment using the federated searching techniques Extend the Virtual Document work to incorporate OAI and integrate the federated searching capability US & Korea USKorea


Download ppt "Knowledge Creation for an Educational Use of Digital Libraries across Language Boundaries US-Korea Joint Workshop on Digital Libraries August 10-11, 2000."

Similar presentations


Ads by Google