The Use of Provenance in Information Retrieval. Simone Stumpf, Erin Fitzhenry, Tom Dietterich

Presentation transcript:

1 The Use of Provenance in Information Retrieval
Simone Stumpf, Erin Fitzhenry, Tom Dietterich

2 Defining Provenance
To us, provenance concerns:
- The origin of content within documents
- The relationships between documents
(Diagram labels: Attachment, Save, SaveAs)

3 Why focus on Provenance for Information Retrieval?
- People remember the relationships between documents!
  - Episodic vs. semantic memory
  - Studies: Blanc-Brude & Scapin (2007); Gonçalves & Jorge (2004)
- No need to formulate keyword queries
- Other common document attributes are often inaccurately remembered (Blanc-Brude & Scapin 2007):
  - Title (20% false recall)
  - Size (53.8% false recall)
  - Time (47.6% false recall)

4 Example Use Case: “Where did I save that again?”
- I got an email from Tom…
- I saved the attachment…
- And I pasted some information from the attachment into a PowerPoint document…
- Where did that presentation go??

5 Requirements for Tracking and Visualizing Provenance
- Instrument all important document provenance events
  - Provenance events are NOT automatically captured by Windows
- Develop a UI enabling users to locate documents via the provenance relationships they remember
- Integrate the UI into the Windows Desktop

6 Capturing Provenance Events with TaskTracer
- TaskTracer is a Personal Information Management system
- User defines a hierarchy of Projects or Activities
- As the user works, TaskTracer automatically tags (according to task/project): files, folders, email messages, contacts, and web pages

7 Instrumenting TaskTracer to Capture Provenance Events
- TaskTracer already instruments many desktop events:
  - File events: Open, Save, SaveAs, Close
  - Email events: Arrived, Open, Close
  - Web events: Open URL, Close URL, Follow Hyperlink
- Idea: extend existing instrumentation to cover key provenance events:
  - CopyPaste, SaveAs, FileCopy/Rename
  - AttachmentAdd, AttachmentOpen, AttachmentSave, Forward*, Reply*
  - FileDownload, FileUpload*
* Coming soon
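
The split between ordinary desktop events and provenance events can be sketched as a simple lookup. This is only an illustration: the event names come from the slide, but the two sets, the reading of "FileCopy/Rename" as two separate events, and the helper `is_provenance_event` are assumptions and not part of TaskTracer.

```python
# Event names are taken from the slide; everything else here is hypothetical.
# "FileCopy/Rename" is read as two events, which is an assumption.
PROVENANCE_EVENTS = {
    "CopyPaste", "SaveAs", "FileCopy", "Rename",
    "AttachmentAdd", "AttachmentOpen", "AttachmentSave",
    "FileDownload",
}

# Events the slide marks with an asterisk ("coming soon").
PLANNED_EVENTS = {"Forward", "Reply", "FileUpload"}


def is_provenance_event(event_type: str, include_planned: bool = False) -> bool:
    """Return True if the named event relates one resource to another."""
    if event_type in PROVENANCE_EVENTS:
        return True
    return include_planned and event_type in PLANNED_EVENTS
```

Note that SaveAs appears in both lists on the slide: it is already instrumented as a desktop event and is also treated as a provenance event.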

8 Instrumenting TaskTracer to Capture Provenance Events (cont.)
A provenance event links a “From” resource to a “To” resource. Example:
- Event: Event_id 10233, Event_type SaveAs, Event_time Jan 12
- “From” resource: oldFile.doc, Id: 1768, etc.
- “To” resource: newFile.doc, Id: 1923, etc.
Events are stored in a database of document-to-document provenance relationships.
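
A database of document-to-document relationships like the one on this slide could be laid out as follows. This is a minimal sketch using Python's built-in `sqlite3` module, not TaskTracer's actual storage. The table names, column names, and the `derived_from` helper are assumptions. Only the example values (SaveAs, 10233, Jan 12, oldFile.doc, newFile.doc and their ids) come from the slide.

```python
import sqlite3

# In-memory database; the schema below is hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE resource (resource_id INTEGER PRIMARY KEY, name TEXT NOT NULL)"
)
conn.execute(
    """CREATE TABLE provenance_event (
           event_id INTEGER PRIMARY KEY,
           event_type TEXT NOT NULL,
           event_time TEXT NOT NULL,
           from_resource_id INTEGER NOT NULL REFERENCES resource(resource_id),
           to_resource_id INTEGER NOT NULL REFERENCES resource(resource_id)
       )"""
)

# The example row shown on the slide.
conn.executemany(
    "INSERT INTO resource VALUES (?, ?)",
    [(1768, "oldFile.doc"), (1923, "newFile.doc")],
)
conn.execute(
    "INSERT INTO provenance_event VALUES (?, ?, ?, ?, ?)",
    (10233, "SaveAs", "Jan 12", 1768, 1923),
)


def derived_from(conn, resource_id):
    """List (name, event_type) for resources created from the given one."""
    return conn.execute(
        """SELECT r.name, e.event_type
           FROM provenance_event AS e
           JOIN resource AS r ON r.resource_id = e.to_resource_id
           WHERE e.from_resource_id = ?
           ORDER BY e.event_id""",
        (resource_id,),
    ).fetchall()
```

Calling `derived_from(conn, 1768)` returns the single row for newFile.doc, reached through the SaveAs event.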

9 A Tool for Visualizing Provenance
Developing a User Interface: TaskTrail
Screenshot callouts: User’s Query; Click to Expand; Mouse over details; Double-click to open

10 Integrating TaskTrail into the Windows UI
- Launch a query by right-clicking on an item within Windows Explorer, Outlook, or TaskExplorer
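
One way to think about a query launched from a single item is as a breadth-first walk over recorded provenance relationships, starting at that item. The sketch below is an illustration of that idea, not TaskTrail's implementation. The function `provenance_trail`, the `max_hops` limit, and the file names are all hypothetical, and following relationships in both directions is an assumption.

```python
from collections import defaultdict, deque

# Hypothetical events mirroring the earlier use case:
# (from resource, event type, to resource)
edges = [
    ("email from Tom", "AttachmentSave", "report.doc"),
    ("report.doc", "CopyPaste", "talk.ppt"),
    ("talk.ppt", "SaveAs", "talk-final.ppt"),
]


def provenance_trail(edges, start, max_hops=3):
    """Map each resource reachable from `start` to its distance in hops.

    Relationships are followed in both directions, so a query can start
    from either the source or the result of an event.
    """
    neighbours = defaultdict(list)
    for src, event_type, dst in edges:
        neighbours[src].append((event_type, dst))
        neighbours[dst].append((event_type, src))

    hops = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if hops[node] == max_hops:
            continue  # do not expand beyond the hop limit
        for _, other in neighbours[node]:
            if other not in hops:
                hops[other] = hops[node] + 1
                queue.append(other)

    del hops[start]  # the starting item is not a result
    return hops
```

Starting from "email from Tom", the walk reaches report.doc after one hop and talk.ppt after two, which is the path the user in the example remembers.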

11

12

13

14 Research Questions
- Does TaskTrail help users find documents more quickly than other methods?
- How should the provenance graph be laid out?
- What kinds of provenance events do users accurately recall?
- How large are the provenance graphs?
- What patterns, if any, exist in the succession of provenance events?

15 User Studies: Formative Observational Study (planned)
- What provenance-related actions do users perform? Which of those do they remember?
- Observe 12 participants in their workplaces
- Record provenance-related actions performed
- Interview participants after 1 week to see what they remember:
  - Free recall
  - Cued recall
- How do users lay out their documents according to what they remember?

16 User Studies: Summative TaskTrail Study at Intel (in progress)
- 4 participants (so far) are using TaskTracer for at least 1 month each
- Then they will use TaskTrail to locate their own documents
- Measures of success:
  - Do users locate more documents using TaskTrail?
  - Do users locate documents more quickly using TaskTrail?
  - Do users prefer using TaskTrail?

17 Provenance-related User Studies are Hard!
- Must be done “in the wild”
- Involves:
  - Long time-scales, which increase chances that:
    - Participants will drop out
    - Situation on site will change
  - Potentially sensitive information:
    - Emails to/from users not participating in the study
    - Documents regarding trade secrets
  - Installation of some event-tracking software:
    - Software installation/maintenance can introduce compatibility, scheduling and other problems

18 Summary: TaskTrail
TaskTrail:
- Instruments desktop provenance relationships
- Allows user to query by right-clicking objects
- User can browse visualization of provenance relationships to find desired documents
- Exploits human episodic memory to help users find documents