LRI Université Paris-Sud ORSAY Nicolas Spyratos Philippe Rigaux.



Université Paris-Sud
- One of the largest scientific universities in France
- Five campuses
- Scientific campus located at Orsay (about 25 km south of Paris)
- Students
- Over ten departments (physics, mathematics, computer science…)

Department of Computer Science
- 250 members (researchers, teachers)
- Currently offering 16 programs
- Two laboratories:
  - LRI (11 research groups)
  - LIMSI (8 research groups)
- Funding: government, CNRS, European projects

SeLeNe related activities
- Nicolas
  - Databases
  - Conceptual modeling
  - Information integration
- Philippe
  - Databases (including spatial DB)
  - A strong practical experience in Web environments based on XML
- Nicolas + Philippe: document integration and restructuring

Motivation
- In a nutshell: collaborative production of [e-Learning] documents
- Some preliminary ideas …
  - Authors produce documents
  - A system manages the set of documents
  - Users create new documents by assembling/restructuring existing ones
- A scenario based on a cooperative, distributed e-learning system
- … and many questions

Preliminary ideas: authors
- Author = content producer
  - Uses his own structure and vocabulary
  - Stores his documents in his own repository
- Author = a conscious part of a collaborative system
  - Provides a description of his documents to the system
  - Commits to maintaining an up-to-date and available version of each document

Preliminary ideas: the system
- The system enables cooperation between authors
  - It knows the description provided by each author
  - It can access (and possibly store locally) the documents
- The system acts as a mediator for users
  - It defines a uniform view for all the documents
  - It provides querying and restructuring services to create new documents

Preliminary ideas: the user
- The user publishes documents
  - In a specific form (a book, a portal, a set of slides)
  - Using specific choices for the content and the structure
- The user creates new (derived) documents by
  - Extracting fragments from the documents managed by the system
  - Authoring his own fragments, then integrating them with the extracted ones
  - Materializing the result at will

Keywords
- Content management
  - How to structure (e-Learning) content and how to describe this structure
- Content integration
  - How to provide a uniform “view” to query documents and extract fragments
- Deriving and restructuring documents
  - How to create new documents by assembling fragments of existing ones

A simple scenario
- Three authors, A, B and C, cooperate to produce a course on database systems
- Author A produces content on data modeling
  - An introduction to the topic
  - Chapters on database design, the relational model and SQL
- Author B produces content on system aspects
  - Database indexing
  - Query processing and optimization

Contents description
- Each author uses his own terminology to describe his documents
- A fragment is any identifiable subset of a document
- Any fragment must be indexed under some term
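The indexing rule above can be pictured with a small sketch. It is not the authors' design: the `Fragment` and `AuthorIndex` classes, the file names and the terms are all hypothetical, chosen only to show one author's fragments being indexed under terms of his own terminology.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Fragment:
    """An identifiable subset of a document (hypothetical representation)."""
    doc_id: str   # document the fragment belongs to
    frag_id: str  # identifier of the fragment inside that document
    text: str


class AuthorIndex:
    """Maps each term of one author's terminology to the fragments indexed under it."""

    def __init__(self, author):
        self.author = author
        self._by_term = defaultdict(list)

    def index(self, term, fragment):
        # Every fragment enters the index under some term of the author's choosing.
        self._by_term[term].append(fragment)

    def lookup(self, term):
        # Return a copy so callers cannot modify the index by accident.
        return list(self._by_term.get(term, []))


# Illustrative data for author A (terms and file names are assumptions).
index_a = AuthorIndex("A")
index_a.index("ER modelling", Fragment("design.xml", "sec1", "Entities and relationships"))
index_a.index("SQL", Fragment("sql.xml", "sec2", "SELECT ... FROM ... WHERE"))
```

Here `index_a.lookup("SQL")` returns the single fragment `sec2` of `sql.xml`, and a term the author never used returns an empty list.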

The system
- We assume a commonly agreed structure for the area of databases
- Each author must provide a mapping between his terminology and the system’s terminology
- The system provides query facilities
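A minimal sketch of how such a mapping could support queries, under stated assumptions: the common structure is reduced to a flat set of terms, each mapping is a plain dictionary, and every name below (`SYSTEM_TERMS`, `MAPPINGS`, `AUTHOR_INDEXES`, `validate`, `query`) is hypothetical rather than taken from the slides.

```python
# Hypothetical common terms agreed on for the area of databases.
SYSTEM_TERMS = {"data modeling", "relational model", "SQL", "indexing", "query processing"}

# Each author's mapping: own term -> system term (illustrative values).
MAPPINGS = {
    "A": {"ER modelling": "data modeling", "tables": "relational model"},
    "B": {"B-trees": "indexing", "optimizer": "query processing"},
}

# Each author's own index: own term -> fragment identifiers (illustrative values).
AUTHOR_INDEXES = {
    "A": {"ER modelling": ["A/design#1"], "tables": ["A/relational#1", "A/relational#2"]},
    "B": {"B-trees": ["B/index#1"], "optimizer": ["B/query#1"]},
}


def validate(mappings, system_terms):
    """Return the (author, own term) pairs whose target is not a system term."""
    return [(author, own)
            for author, mapping in mappings.items()
            for own, target in mapping.items()
            if target not in system_terms]


def query(system_term):
    """Return every fragment whose author term maps to the given system term."""
    result = []
    for author, mapping in MAPPINGS.items():
        for own_term, target in mapping.items():
            if target == system_term:
                result.extend(AUTHOR_INDEXES[author].get(own_term, []))
    return result
```

A user then queries with system terms only, for instance `query("indexing")`, and never needs to know that author B calls the topic "B-trees".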

Deriving new documents
- Structure: the user is free to choose the structure of the document he composes
- Composition: each fragment is
  - Either directly provided by the user
  - Or chosen from the answer to a query addressed to the system
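One way to read the composition rule is as a list of parts, each either a fragment written by the user or a query to be evaluated by the system. The sketch below assumes that reading; `OwnFragment`, `QueryPart`, `materialize` and the toy query function are hypothetical names, not part of the original proposal.

```python
from dataclasses import dataclass


@dataclass
class OwnFragment:
    """A fragment provided directly by the user."""
    text: str


@dataclass
class QueryPart:
    """A placeholder filled from the answer to a query addressed to the system."""
    term: str


def materialize(parts, run_query):
    """Expand a derived document into a flat list of fragment texts.

    The order of `parts` is the structure chosen by the user;
    `run_query` stands in for the system's query facility.
    """
    out = []
    for part in parts:
        if isinstance(part, OwnFragment):
            out.append(part.text)
        else:
            out.extend(run_query(part.term))
    return out


def toy_query(term):
    # Stand-in for the system: a fixed answer per term (illustrative data).
    store = {"database design": ["A: ER diagrams", "A: normalization"]}
    return store.get(term, [])


course = [OwnFragment("C: general introduction"), QueryPart("database design")]
```

Calling `materialize(course, toy_query)` yields C's own introduction followed by the two fragments retrieved from A. Keeping the query unevaluated in the document leaves open whether the result is materialized once or recomputed on demand.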

Query refinement
- A multi-step process:
  - The initial query shows all the relevant fragments known to the system
  - Subsequent steps restrict the fragments to those considered relevant by the user
- Ideally: the refined query delivers exactly the relevant fragments, in the right order
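The multi-step process can be sketched as successive filters over an initial result, followed by an ordering. This is an illustration only: the fragment dictionaries, their fields and the `refine` helper are assumptions, and a real system would more likely refine the query itself than filter a materialized answer.

```python
def refine(fragments, steps, order_key=None):
    """Apply each restriction step in turn, then optionally order the result."""
    current = list(fragments)  # leave the initial answer untouched
    for keep in steps:
        current = [f for f in current if keep(f)]
    if order_key is not None:
        current.sort(key=order_key)
    return current


# Initial answer: everything the system knows for the query (illustrative data).
initial = [
    {"id": "B/query#2", "author": "B", "kind": "figure", "position": 2},
    {"id": "B/query#1", "author": "B", "kind": "text", "position": 1},
    {"id": "B/query#3", "author": "B", "kind": "figure", "position": 3},
    {"id": "A/sql#1", "author": "A", "kind": "figure", "position": 1},
]

# Two refinement steps: keep B's fragments, then keep only figures.
steps = [lambda f: f["author"] == "B", lambda f: f["kind"] == "figure"]
refined = refine(initial, steps, order_key=lambda f: f["position"])
```

After both steps `refined` holds the two figures from B's documents, ordered by their position.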

Example (user/teacher)
- Author C is now a teacher, creating an introduction to DB. It contains
  - A general introduction (written by C)
  - A query retrieving the introduction written by A
  - A query selecting fragments on database design (retrieved from A’s documents)
  - An introduction to query processing, with queries retrieving figures from B’s documents
- Question: assuming a query returns a set of fragments, how can we make a sub-selection?

Example (user/learner)
- Author C is now a learner. He will create a document summarizing the courses he is interested in, namely
  - A query retrieving the general introduction to DB (written by C)
  - His own annotations
  - Several queries, whose results will be mixed with the annotations
- Questions: how can we make queries “user-friendly”? E.g., as a “path” to the relevant fragment? Relying on “metadata”?

Example: personalized documents
- Author C is now a learner
- The system knows the courses followed by C, maybe with other information (frequency, success, whatever)
  - => relates to “knowledge trajectories”?
- => The system maintains and automatically updates the document summarizing the course’s material
  - => instance of the “learning trail” concept?

Questions
- Primitive versus derived documents (problem of cycles)?
- How can we select a subpart of a result set?
- Should we allow users to browse the sources directly?
- What is the granularity of documents?
- Is there a need for user views?
- Should we introduce replication of content, and how?
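The problem of cycles raised in the first question can be made concrete. Assuming the system records, for each derived document, the documents it draws fragments from, a standard depth-first search detects circular derivations. The `has_cycle` function and the sample graphs are hypothetical and only illustrate the check; the slides leave the question open.

```python
def has_cycle(derived_from):
    """Detect circular derivations.

    `derived_from` maps each derived document to the documents it draws
    fragments from; documents that never appear as keys are primitive.
    """
    WHITE, GREY, BLACK = 0, 1, 2  # unvisited, on the current path, finished
    state = {}

    def visit(doc):
        state[doc] = GREY
        for src in derived_from.get(doc, []):
            s = state.get(src, WHITE)
            if s == GREY:  # back edge: src is already on the current path
                return True
            if s == WHITE and visit(src):
                return True
        state[doc] = BLACK
        return False

    return any(state.get(d, WHITE) == WHITE and visit(d) for d in derived_from)


# Illustrative derivation graphs.
acyclic = {"course": ["A/intro", "B/index"], "summary": ["course"]}
cyclic = {"course": ["summary"], "summary": ["course"]}
```

With these graphs, `has_cycle(acyclic)` is false and `has_cycle(cyclic)` is true. A system could run such a check before accepting a new derived document.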