ICS-FORTH March 30, 2006 1 Waking from a Dogmatic Slumber - A Different View on Knowledge Management for DLs Martin Doerr Alicante, Spain September 21,

Slides:



Advertisements
Similar presentations
OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
Advertisements

CoAKTing IFD Dave in Hawaii. 2 CoAKTing IFD n Objective is to advance the state of the art in collaborative mediated spaces for distributed e- Science.
DRIVER Long Term Preservation for Enhanced Publications in the DRIVER Infrastructure 1 WePreserve Workshop, October 2008 Dale Peters, Scientific Technical.
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
DELOS WP5 Workshop: Semantic Interoperability in DL systems, 17 th September 2004, Bath, UK Semantic Interoperability in Digital Library Systems Task 3:
Joint Information Systems Committee Digital Library Services BL/JISC Workshop Rachel Bruce JISC Programme Director The Digital Library and its Services,
UKOLN is supported by: Put functionality Augmenting interoperability across scholarly repositories 20/21 April 2006 Rachel Heery, UKOLN, University of.
A centre of expertise in data curation and preservation DigCCur2007 Symposium, Chapel Hill, N.C., April 18-20, 2007 Co-operation for digital preservation.
Preserving and Sharing Digital Data Greg Colati, Director, Archives and Special Collections May 11, 2012.
1 Computational Asset Description for Cyber Experiment Support using OWL Telcordia Contact: Marian Nodine Telcordia Technologies Applied Research
Provenance-Aware Storage Systems Margo Seltzer April 29, 2005.
1 A Case Study in E- Science: Building Ecological Informatics Solutions for Multi-Decadal Research ARL/CNI 2008 Conference Washington, DC 16 October 2008.
1 NEST New and emerging science and technology EUROPEAN COMMISSION - 6th Framework programme : Anticipating Scientific and Technological Needs.
Distributed search for complex heterogeneous media Werner Bailer, José-Manuel López-Cobo, Guillermo Álvaro, Georg Thallinger Search Computing Workshop.
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
A Prototype Implementation of a Framework for Organising Virtual Exhibitions over the Web Ali Elbekai, Nick Rossiter School of Computing, Engineering and.
DIGITAL HUMANITIES SUMMER SCHOOL 2011 DIGITAL LIBRARY TECHNOLOGIES AND BEST PRACTICE, PART 1: DECONSTRUCTING DIGITAL LIBRARIES Christine Madsen R&D Project.
A Virtual Research Environment for the Study of Documents and Manuscripts 1 1 Research administration Resource discovery Data creation, use and analysis.
Who are the Experts?Simon KampaSlide 1 Who are the Experts? Simon Kampa IAM Group University of Southampton
Interoperability Scenarios All Working Groups Meeting May, Rome, Italy.
CSTA K-12 Computer Science Standards (rev 2011)
CRMarchaeo CRMarchaeo v1.2.1
1 ICS –FORTH, Oct.30-Nov.4,2006, Cyprus Documenting Events in Metadata Martin Doerr, Athina Kritsotaki Center for Cultural Informatics Institute of Computer.
The Dream of a Global Network of Knowledge
ICS-FORTH May 23, An Ontological Approach to Digital Preservation Metadata Martin Doerr Foundation for Research and Technology - Hellas Institute.
1 CIDOC CRM + FRBR ER = FRBR OO … an equation for a harmonised view of museum information and bibliographic information Martin Doerr First CASPAR Seminar.
KOS and the Conduct of Science© Straits Knowledge 2011 Knowledge Organisation Systems as Enablers to the Conduct of Science Patrick Lambe.
ICS-FORTH March 30, Waking from a Dogmatic Slumber - A Different View on Knowledge Management for DL’s Martin Doerr London, UK March 30, 2006 Center.
Melbourne, October 13, Electronic Communication on Diverse Data - The Role of the oo CIDOC Reference Model - Martin Doerr (ICS-FORTH, Crete, Greece)
Galia Angelova Institute for Parallel Processing, Bulgarian Academy of Sciences Visualisation and Semantic Structuring of Content (some.
Planning for Flexible Integration via Service-Oriented Architecture (SOA) APSR Forum – The Well-Integrated Repository Sydney, Australia February 2006 Sandy.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
Metadata : Setting the Scene or a Basic Introduction Wendy Duff University of Toronto, Faculty of Information Studies.
A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints Edoardo Pignotti, Peter Edwards, Alun Preece, Nick Gotts and Gary Polhill School.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
The NSDL Registry Jon Phipps Stuart Sutton Diane Hillmann Ryan Laundry Cornell U. U. of Washington.
Carlos Lamsfus. ISWDS 2005 Galway, November 7th 2005 CENTRO DE TECNOLOGÍAS DE INTERACCIÓN VISUAL Y COMUNICACIONES VISUAL INTERACTION AND COMMUNICATIONS.
ICS – FORTH, August 31, 2000 Why do we need an “Object Oriented Model” ? Martin Doerr Atlanta, August 31, 2000 Foundation for Research and Technology -
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Idea-garden.org SOCIAL SEMANTIC INFORMATION SPACE An Interactive Learning Environment Fostering Creativity Grant agreement no: nd CIDOC CRM-SIG.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
Harmonising without Harm: towards an object-oriented formulation of FRBR aligned on the CIDOC CRM ontology Maja Žumer (University of Ljubljana) & Patrick.
Interoperable Digitised Content “Discover, search, extract, link, associate, and view digitised content” Les Carr.
Managing the Record of Research At the Smithsonian Using SIdora SAA Research Forum August 12, 2014.
Linking resources Praha, June 2001 Ole Husby, BIBSYS
Research Data Management At the Smithsonian Using SIdora Nano Tech Working Group May 15, 2014.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
Topic Rathachai Chawuthai Information Management CSIM / AIT Review Draft/Issued document 0.1.
Smithsonian, March 26, International Symposium “Sharing the Knowledge” Martin Doerr Smithsonian, Washington DC March 26, 2003 FORTH, Greece Chair,
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
Freelib: A Self-sustainable Digital Library for Education Community Ashraf Amrou, Kurt Maly, Mohammad Zubair Computer Science Dept., Old Dominion University.
10/24/09CK The Open Ontology Repository Initiative: Requirements and Research Challenges Ken Baclawski Todd Schneider.
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Research Data Management At the Smithsonian Using Sidora CNI December 10, 2013.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
SEEK Science Environment for Ecological Knowledge l EcoGrid l Ecological, biodiversity and environmental data l Computational access l Standardized, open.
ISO TC 37/CLARIN DISCUSSION UTRECHT, DECEMBER 9/ Thinning Down a Bloated Cat SUE ELLEN WRIGHT DECEMBER 2013.
DELOS Network of Excellence on Digital Libraries Yannis Ioannidis University of Athens, Hellas Digital Libraries: Future Research Directions for a European.
CIMA and Semantic Interoperability for Networked Instruments and Sensors Donald F. (Rick) McMullen Pervasive Technology Labs at Indiana University
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
International Workshop 28 Jan – 2 Feb 2011 Phoenix, AZ, USA Ontology in Model-Based Systems Engineering Henson Graves 29 January 2011.
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Workshop on Semantic Interoperability in e-Science Martin Doerr
Metadata in Digital Preservation: Setting the Scene
Semantic Interoperability in Digital Library Systems
Presentation transcript:

ICS-FORTH March 30, Waking from a Dogmatic Slumber - A Different View on Knowledge Management for DLs Martin Doerr Alicante, Spain September 21, 2006 Center for Cultural Informatics Institute of Computer Science Foundation for Research and Technology - Hellas NKOS Workshop

ICS-FORTH March 30, There are no new research challenges in DL. There are only the ones from 30 years ago we still have not solved (anonymous, ECDL2005) Apologies: Ill be deliberately provocative and possibly incomplete. Dont take me too serious. What are Digital Libraries (or more generally Digital Memories )? Information systems preserving and providing access to source material, scientific and scholarly information, such as libraries of publications, experimental data collections, scholarly and scientific encyclopedic or thematic databases or knowledge bases. Knowledge Management for DLs Traditional Use Cases

ICS-FORTH March 30, The traditional library task: Collect and preserve documents and provide finding aids The job is solved, when the (one, best) document is handed out. All you want is in this document. Implementing the finding aids: Assumption: User knows a topic, characterized by a noun, or knows associations of the topic uncorrelated to the problem to be solved (e.g. organic farming for host-parasite studies.) Semantic interoperability is limited to the aggregation task: Metadata are mainly homogeneous (DC, MARC etc.), challenge is the matching of terminology (KOS). Knowledge Management for DLs Traditional Use Cases

ICS-FORTH March 30, Knowledge Management for DLs Problems No support to solve a problem, e.g., what species is this object? No support to learn from the aggregated source, to retrieve by contexts, e.g., Which professions had the relatives of van Gogh? e.g., Which excavation drawings show the finding of this object? e.g., Which resolution had Galileos telescope when he observed... (in general how reliable was a scientific observation, can we correct the values found?). No support to integrate complementary information in multiple sources into new insight, e.g., Which where the clients of van Goghs paintings? No support for cross-disciplinary search. e.g. Ecology, ethnology and biodiversity. Biology and archaeology.

ICS-FORTH March 30, Knowledge Management for DLs Grand Challenge DLs should become integral parts of work environments as sources to find integrated knowledge and produce new knowledge. But How ? Employing global networks of knowledge…. Is that a dream ? Isnt Digital information and human knowledge is too diverse, fuzzy, case-dependent? Is the Semantic Web much further than AI decades before?

ICS-FORTH March 30, Knowledge Management for DLs Grand Challenge We regard suitable knowledge management as the key. We distinguish: 1. Core ontologies for schema semantics, such as: part-of,located at,used for, made from. They are small and rich in relationships that structure information and relate content. 2. Ontologies that are used as categorical data for reference and agreement on sets of things, rather than as means of reasoning, such as: basket ball shoe, whiskey tumbler, burma cat, terramycine. They do not structure information. They aggregate, more than integrate. 3. Factual background knowledge for reference and agreement as objects of discourse, such as particular persons, places, material and immaterial objects, events, periods, names.

ICS-FORTH March 30, Knowledge Management for DLs Preconceptions and Solutions Libraries should not depend on domain specific needs. Domains are too many and too diverse. DLs need a generic approach. This seduces us to only employ intuitive top-down approaches for generic metadata schemata. As a result, when the fantasy is exhausted, research stops. We need deep knowledge engineering, generalizing in a bottom-up manner from real, specific cases to find the true generic structures across multiple domains. We need interdisciplinary work on real research scenarios. Ontologies are huge, messy, idiosyncratic and domain dependent. Mapping is the only generic thing we can do We are transfixed with ontologies used as categorical data (term lists), for which this statement is mainly true. We oversee the different character of ontologies describing schema semantics. They may pertain to generic classes of discourse. We need interdisciplinary work.

ICS-FORTH March 30, Knowledge Management for DLs Preconceptions and Solutions Queries are mainly about classes. The main challenge of information integration is the integration of classes (terms). We believe this is not sufficiently supported by empirical studies. Query parameters pertain to universals and particulars and relationships. We need to systematically analyze original research questions. Manual work is not scalable or affordable. Only fully automated methods have a chance This seduces us to discard the quality of manual, intellectual decisions. Yet billions of people produce content manually. Wikipedia demonstrates, that the above is not true. We need to design the interactive processes and the awarding of users to massively involve Virtual Communities / Organisations in cataloguing, data cleaning and ontology development. We need semiautomatic, highly distributed algorithms. We need interdisciplinary work.

ICS-FORTH March 30, Knowledge Management for DLs Do we talk about the same thing? We need more reasoning! This is true. But what sort of reasoning? And before any reasoning can be done, data must be connected, in a global network of knowledge. We must first clarify: Do we talk about the same thing? Requisites for a global network of knowledge: 1. A sufficiently generic global model (core ontology with the revelant relationships). 2. Methods to populate the network: knowledge extraction / data transformation. 3. Massive, distributed, semiautomatic detection of co-reference relations (data cleaning ) across contexts and to 4. Curate referential integrity of co-reference in order to create, maintain and improve the consistency of global networks of knowledge as a continuous process (not making yet another database). And only then we can do advanced reasoning and intelligent query processing...

ICS-FORTH March 30, Knowledge Management for DLs A nearly global model: ISO21127 The CIDOC Conceptual Reference Model (ISO/FDIS 21127) is a core ontology describing the underlying semantics of data schemata and structures from all museum disciplines and archives. Now being merged with IFLA FRBR concepts. It is result of long-term interdisciplinary work and agreement. In essence, it is a generic model of recording of what has happened in human scale, i.e. a class of discourse. It can generate huge, meaningful networks of knowledge by a simple abstraction: history as meetings of people, things and information. It bears surprise: more effective metadata structures, and linking schemes can be created from it.

ICS-FORTH March 30, P14 performed P11 participated in P94 has created E31 Document Yalta Agreement E7 Activity Crimea Conference E65 Creation Event * E38 Image P86 falls within P7 took place at P67 is referred to by E52 Time-Span February 1945 P81 ongoing throughout P82 at some time within E39 Actor E53 Place E52 Time-Span Knowledge Management for DLs Example: The ISO21127 Solution

ICS-FORTH March 30, Integration by Factual Relations Ethiopia Johanson's Expedition CIDOC CRM Core Ontology Documents in Digital Libraries Hadar Discovery of Lucy AL Lucy Deductions Linking documents by co-reference Primary link corresponding to one document Donald Johanson Cleveland Museum of Natural History Instance of real world nodes (KOS) Knowledge Management for DLs Hypertext is wrong: Documents contain links!

ICS-FORTH March 30, Content Source 1 Source 2 Query Friends of a Friend 1. query Knowledge Management for DLs Identifier Equivalence input: Martin Read output: find Kostas, guess Κώστας 2. query input: Κώστας output: George

ICS-FORTH March 30, match Authority service local ids Content LinktableLinktable find co- referen ce find co-reference match Source 1 Source 2 local ids id Dyn amic li nk Join Join across sources by transitivity of co-reference query Knowledge Management for DLs Co-reference via Authority input: Martin output: George Κώστας / Kostas

ICS-FORTH March 30, match local ids Content make a co-reference Source 1 Source 2 local ids Dyn amic li nk Join Join across sources by transitivity of co-reference query Knowledge Management for DLs Curating Co-reference without Authority input: Martin output: George local ids make a co-reference

ICS-FORTH March 30, Knowledge Management for DLs Conclusions It is feasible to create effective, sustainable, large-scale networks of knowledge: The CRM and its extensions seems to have the power to integrate historical knowledge in Archives, Libraries and Museums. Even e-Science applications have been tested. The CRM is a model of factual relationships at first. Humanities collect factual knowledge. Sciences collect categorical knowledge. But we oversee the record of experimental data, which justifies this knowledge and is by far larger than the resulting categorical knowledge. Descriptive sciences already produce both categorical and factual knowledge. Thesis: Once there is a global model, we must invest in managing and preserving co- reference. Else no large-scale networks of knowledge will ever emerge. Co-reference clusters can be distributed and are scalable.

ICS-FORTH March 30, Knowledge Management for DLs Conclusions If we rethink old positions, we will find surprising new answers to..an information model for digital libraries that intentionally moves 'beyond search and access, without ignoring these functions, and facilitates the creation of collaborative and contextual knowledge environments. (C.Lagoze, D-Lib Magazine 2005) But: We need a massive investment in understanding and generalizing the intellectual processes and original research questions in interdisciplinary work. We have to do research in dynamic collaborative knowledge organization forms, formal processes and algorithms that converge to higher stages of knowledge integration via co-reference management. The large networks of integrated knowledge to come will need continuous maintenance with new, specific social organisation forms and GRID-like resource access, and they may look very different from our current systems… (This is again a 30 years old challenge, are we closer now?)