RCDL 2007, Pereslavl-Zalessky, 15 - 18 Oct 2007 Converting Desktop into a Personal Activity Dataset Sergey Chernov, Enrico Minack, and Pavel Serdyukov.

Slides:



Advertisements
Similar presentations
Semantically Enhanced Desktop Search Using Directory-Based Clustering and Wordnet Knowledge Ştefania GHIŢĂ
Advertisements

1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
Reference Management Software Tools Mendeley. Table of Contents: Part A Background/Location Signup/Login Import References Organize (Manage) References.
Query Chains: Learning to Rank from Implicit Feedback Paper Authors: Filip Radlinski Thorsten Joachims Presented By: Steven Carr.
AVG Internet Security 7.5 Product presentation.
1 DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen, Germany.
George Lee User Context-based Service Control Group
Elsweiler, D. and Ruthven, I. and Jones, C. Dealing with fragmented recollection of context in information management. In: Context- Based Information Retrieval.
Information Retrieval in Practice
Search Engines and Information Retrieval
Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan Susan T.Dumains Eric Horvitz MIT,CSAILMicrosoft Researcher Microsoft.
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
The Web is perhaps the single largest data source in the world. Due to the heterogeneity and lack of structure, mining and integration are challenging.
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
Web queries classification Nguyen Viet Bang WING group meeting June 9 th 2006.
V0.01 © 2009 Research In Motion Limited Introduction to Java Application Development for the BlackBerry Smartphone Trainer name Date.
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Retrieval Evaluation. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
University of Kansas Department of Electrical Engineering and Computer Science Dr. Susan Gauch April 2005 I T T C Dr. Susan Gauch Personalized Search Based.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Overview of Search Engines
Web Information Retrieval Projects Ida Mele. Rules Students can work in teams (max 3 people) The project must be delivered by the deadline that will be.
New “Collaborate” Button Integrate UI directly into the browser. Preferred target: Firefox Easiest browser to extend in terms of UI.
1 What do People Recall about their Documents? Implications for Desktop Search Tools Tristan Blanc-Brude and Dominique L. Scapin INRIA ACM IUI 2007 (22%)
Introduction to EndNote Web Margaret Forrest Academic Liaison Librarian.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
9/19/2015Michał Kopycki1 Exploiting Implicit Feedback to Identify Usage Patterns on the Desktop Bachelor Thesis Leibniz University of Hanover Michał Kopycki.
APPLYING EPSILON-DIFFERENTIAL PRIVATE QUERY LOG RELEASING SCHEME TO DOCUMENT RETRIEVAL Sicong Zhang, Hui Yang, Lisa Singh Georgetown University August.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
Tag Data and Personalized Information Retrieval 1.
 Search Engine Search Engine  Steps to Search for webpages pertaining to a specific information Steps to Search for webpages pertaining to a specific.
Microsoft ® Outlook ® 2010 Outlook How-Tos. Course Contents Learn how to perform several daily Outlook tasks using Calendars, Clipart, and features.
New Features v1.4. CPO New Features  Send File Link Improvements  Send File Link via mobile phone  “All Plus Share” permission  Desktop Sync (DS)
25/10/20151Gianluca Demartini Desktop Search Evaluation Sergey Chernov and Gianluca Demartini TREC 2006, 16th November 2006 Pre-Track Workshop.
D IRECTIONS FOR R AISING P RIVACY A WARENESS IN SNS P LATFORMS Konstantina Vemou, Maria Karyda, Spyros Kokolakis 18th Panhellenic Conference on Informatics.
--He Xiangnan PhD student Importance Estimation of User-generated Data.
Contextual Ranking of Keywords Using Click Data ICDE`09 Utku Irmak Vadim von Brzeski Vadim von Brzeski Reiner Kraft.
Using automation to enhance the process of Digital Forensic analysis Daniel Walton School of Computer and Information Science
ESWC 2009 Research IX: Evaluation and Benchmarking Benchmarking Fulltext Search Performance of RDF Stores Enrico Minack, Wolf Siberski, Wolfgang Nejdl.
FACEBOOK Material adapted from
MS 365 Student Accounts Orientation Beavercreek High School.
CS798: Information Retrieval Charlie Clarke Information retrieval is concerned with representing, searching, and manipulating.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Office automation Office automation has changed the equipments and work habits of today’s end users and work groups None will be interested to work in.
Personalizing Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.
1 DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen, Germany.
23. Juli deskWeb2.0: Combining Desktop and Social Search Sergej Zerr, Elena Demidova, Sergej Chernov L3S Research Center Hannover, Germany
CiteData: A New Multi-Faceted Dataset for Evaluating Personalized Search Performance CIKM’10 Advisor : Jia-Ling, Koh Speaker : Po-Hsien, Shih.
Chapter 2 – Introduction to Windows Operating System II Manipulating Windows GUI 1CMPF112 Computing Skills for Engineers.
Information Retrieval in Practice
WP5: Semantic Multimedia
Search Engine Architecture
CARA 3.10 Major New Features
Guangbing Yang Presentation for Xerox Docushare Symposium in 2011
Bell Work: Go to Chrome Click the ‘Home’ button (the little house)
SIS: A system for Personal Information Retrieval and Re-Use
Microsoft Windows 7 Basics
Introduction to EBSCOhost
Data Mining Chapter 6 Search Engines
April 06 Lab The ICG Toolkit
Web Mining Department of Computer Science and Engg.
Information Technology Services
Task Detection for Activity-Based Desktop Search
Reference Management Software Tools Mendeley (Part A)
Web Mining Research: A Survey
INSTRUCTOR NOTES/LINKS
Presentation transcript:

RCDL 2007, Pereslavl-Zalessky, Oct 2007 Converting Desktop into a Personal Activity Dataset Sergey Chernov, Enrico Minack, and Pavel Serdyukov L3S Research Center, Hannover, Germany University of Twente, The Netherlands

RCDL 2007, Pereslavl-Zalessky, Oct 2007 Motivation - Over documents on a single PC, including s, text files and photos - Increasing need in PIM evaluation - Desktop usage data analysis

RCDL 2007, Pereslavl-Zalessky, Oct 2007 Potential Applications for Activity Data Implicit links between documents can be applied in following: Ranking opportunities – link-based ranking (PageRank, HITS, etc.) Recommendation features – suggestions of related documents Interface improvement – virtual folders, navigational dialogs, task detection User profiling – popular documents are used for profile creation

RCDL 2007, Pereslavl-Zalessky, Oct 2007 Logger

RCDL 2007, Pereslavl-Zalessky, Oct 2007 Logger Framework

RCDL 2007, Pereslavl-Zalessky, Oct 2007 Privacy Concerns for Main Logger and Firefox Plugin

RCDL 2007, Pereslavl-Zalessky, Oct 2007 L3S Desktop Data Collection Privacy Guarantees - I will not redistribute the data you provided me to people outside L3S. Anybody from L3S whom I give access to the data will be required to sign this privacy statement. - The data you provided me will be automatically processed. I will not look at it manually (e.g. reading the s from a specific person). During the experiment, if I want to look at one specific data item or a group of files/data items, I will ask permission to the owner of the data to look at it. In this context, if I discover possibly sensitive data items, I will remove them from the collection. - Permissions of all files and directories will be set such that only the l3s-experiments-group and the super-user has access to these files, and that all those will be required to sign this privacy statement.

RCDL 2007, Pereslavl-Zalessky, Oct 2007 Logged Information

RCDL 2007, Pereslavl-Zalessky, Oct 2007 Timeline and Permanent Logged Information

RCDL 2007, Pereslavl-Zalessky, Oct 2007 Generic and resource-specific data collected

RCDL 2007, Pereslavl-Zalessky, Oct data collected

RCDL 2007, Pereslavl-Zalessky, Oct 2007 Types of notifications

RCDL 2007, Pereslavl-Zalessky, Oct 2007 Collected Documents

RCDL 2007, Pereslavl-Zalessky, Oct 2007 Resource distribution over the users

RCDL 2007, Pereslavl-Zalessky, Oct 2007 More statistics of collected data Total desktop items: 48,068 (including pictures and web-pages) Total size: 8.1GB User queries: 88 (e.g. “information retrieval author:smith”) Relevance judgments for top-5 results of 3 ranking algorithms

RCDL 2007, Pereslavl-Zalessky, Oct 2007 Conclusions

RCDL 2007, Pereslavl-Zalessky, Oct 2007 Conclusions -Methodology for desktop data collection was proposed -Logging application is implemented -First dataset has been created

RCDL 2007, Pereslavl-Zalessky, Oct 2007 Future Work -Implement automatic tools for data analysis on user side -Test algorithms for activity-data enhanced search and document recommendation -Improve logger to support more applications

RCDL 2007, Pereslavl-Zalessky, Oct 2007 Спасибо!