Empowering users to access information in the Digital Library Corin Anderson University of Washington.

Slides:



Advertisements
Similar presentations
Web Usage Mining Web Usage Mining (Clickstream Analysis) Mark Levene (Follow the links to learn more!)
Advertisements

Getting Your Web Site Found. Meta Tags Description Tag This allows you to influence the description of your page with the web crawlers.
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
The Semantic Web-Week 22 Information Extraction and Integration (continued) Module Website: Practical this week:
Web Mining Research: A Survey Authors: Raymond Kosala & Hendrik Blockeel Presenter: Ryan Patterson April 23rd 2014 CS332 Data Mining pg 01.
Chapter 12: Web Usage Mining - An introduction
Semantic Search Jiawei Rong Authors Semantic Search, in Proc. Of WWW Author R. Guhua (IBM) Rob McCool (Stanford University) Eric Miller.
Top 10 Blackboard Tips for Students. 1. Search Blackboard Blackboard has a search tool, which allows you to search through all the course web sites and.
Web as Graph – Empirical Studies The Structure and Dynamics of Networks.
Web Projections Learning from Contextual Subgraphs of the Web Jure Leskovec, CMU Susan Dumais, MSR Eric Horvitz, MSR.
1 of 6 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
1 of 4 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
Information Retrieval
Algorithms for Data Mining and Querying with Graphs Investigators: Padhraic Smyth, Sharad Mehrotra University of California, Irvine Students: Joshua O’
Overview of Web Data Mining and Applications Part I
Hierarchical Summaries By: Dawn J. Lawrie University of Massachusetts, Amherst for Search.
Automating Practice Partner with Macro Express Donald T. Stewart, MD 2007User Group Meeting.
Company/Product Overview. You have lots of files all over the place.
1 Web Developer Foundations: Using XHTML Chapter 11 Web Page Promotion Concepts.
1 Web Developer & Design Foundations with XHTML Chapter 13 Key Concepts.
Dreamweaver – Setting up a Site and Page Layouts Web Design Section 7-2 Part or all of this lesson was adapted from the University of Washington’s “Web.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
INFO 344 Web Tools And Development CK Wang University of Washington Spring 2014.
On the Internet, an intelligent agent (or simply an agent ) is a program that gathers information or performs some other service (typically search queries.
ITIS 1210 Introduction to Web-Based Information Systems Chapter 27 How Internet Searching Works.
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
Instructors begin using McGraw-Hill’s Homework Manager by creating a unique class Web site in the system. The Class Homepage becomes the entry point for.
1 Discovering Authorities in Question Answer Communities by Using Link Analysis Pawel Jurczyk, Eugene Agichtein (CIKM 2007)
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
The Teacher Is In Charge There are dozens of free services, but Gaggle.Net is the only service designed specifically for classroom use. The biggest.
The Internet 8th Edition Tutorial 4 Searching the Web.
Personalized Search Xiao Liu
Introducing HingX now with Capacity Development Network.
Chapter 12: Web Usage Mining - An introduction Chapter written by Bamshad Mobasher Many slides are from a tutorial given by B. Berendt, B. Mobasher, M.
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Individualized Knowledge Access David Karger Lynn Andrea Stein Mark Ackerman Ralph Swick.
1 International Christian Retail Show 2007 Atlanta, GA July 2007 Custom Reports and Returns.
Chapter 9: Structured Data Extraction Supervised and unsupervised wrapper generation.
Order the featured book of the day Estimated effort: 2.
Web Design and Development. World Wide Web  World Wide Web (WWW or W3), collection of globally distributed text and multimedia documents and files 
Web Information Retrieval Prof. Alessandro Agostini 1 Context in Web Search Steve Lawrence Speaker: Antonella Delmestri IEEE Data Engineering Bulletin.
Newberry Middle School April 12, A system that allows teachers to assess students in an electronic and interactive format. 
1 SEARCHING FOR TRUTH Locating Information on the WWW chapter 5.
Adaptive Web Navigation for Wireless Devices Corin Anderson Pedro Domingos Dan Weld.
Optimizing today's websites using tomorrow's technologies.
Information Retrieval and Web Search Link analysis Instructor: Rada Mihalcea (Note: This slide set was adapted from an IR course taught by Prof. Chris.
Facebook for Beginners One Session Class. What will you learn today? What can you do on Facebook? Creating a profile Privacy Connecting with friends Sending.
University of Malta CSA4080: Topic 7 © Chris Staff 1 of 15 CSA4080: Adaptive Hypertext Systems II Dr. Christopher Staff Department.
Refined Online Citation Matching and Adaptive Canonical Metadata Construction CSE 598B Course Project Report Huajing Li.
What is ? ●It is a Halton District School Board term ●It is HDSB use of Google Apps for education. It includes: ■ Google Docs ■ Google Drive ■ Google Applications.
Personalizing the Web for Mobile Users Corin Anderson Pedro Domingos Dan Weld.
Wikis. What are Wikis? Could this be a Wiki? MoT0Ehttp:// MoT0E.
Dreamweaver – Setting up a Site and Page Layouts Web Design Section 7-2 Part or all of this lesson was adapted from the University of Washington’s “Web.
Location-based Social Networks 6/11/20161 CENG 770.
CS 540 Database Management Systems Web Data Management some slides are due to Kevin Chang 1.
Mathematics of the Web Prof. Sara Billey University of Washington.
1 CS 430 / INFO 430: Information Retrieval Lecture 20 Web Search 2.
Data mining in web applications
B OOST W EBSITE P ERFORMANCE WITH T HE C USTOM W ORDPRESS P LUG -I N D EVELOPMENT
Getting High on Search Engines with WebPositionGold
Naming and Directories
Naming and Directories
Naming and Directories
Information Retrieval
PageRank algorithm based on Eigenvectors
Data Mining Chapter 6 Search Engines
CSA3212: User Adaptive Systems
Information Retrieval and Web Design
Naming and Directories
Presentation transcript:

Empowering users to access information in the Digital Library Corin Anderson University of Washington

2 Empowering users DLs provide information to users Tricky: Not all users will be programmers –Non-programmer Web surfer –5 th grade student –Your grandmother How to cater to non-programming masses?

3 The Digital Library Many specialized DLs exist today –Medicine, literature, etc. Eventually, all DLs will be integrated: The DL Until then, use the Web as approximation

4 DL research at the UW Improving web search using popularity Automatic question answering Extracting information by demonstration Adaptive web sites

5 Popularity-based web search Want authoritative pages vs. Approximate authority by freq. of web visits –Data gathered from NCSA web proxies Rank query results based on popularity 1,000 hits 3 hits

6 Automatic question answering Return answers, not web pages, to queries “What’s the tallest mountain in the world?” vs. “Mount Everest” Search web for pages that contain answers –“the tallest mountain is” Heuristics for yes/no, “which is” questions

7 Information extraction Info from DL usually used elsewhere –Query stock history to build a graph in Excel – a list of current movies to a friend Extracting info is tricky –Special file formats (XML,.csv) arcane –Custom built wrappers to select tuples –Building wrappers isn’t easy, either! Solution: demonstrate a wrapper

8 ICE-9 Wrapper generation User demonstrates extracting info ICE-9 learns generalized program Demonstrate on very few instances

9 ICE-9 in action User demonstrated two instances ICE-9 steps through program correctly

10 ICE-9 – current work Collaborative demonstration ICE-9 predicts each step, asks for confirmation –If can’t predict with confidence, just ask user Active learning –ICE-9 suggests which example the user should demonstrate

11 Adaptive web sites Different users have different goals –But traditional web sites treat everyone the same –Everyone sees the same start page, query page, etc. Personalized sites can be customized –Customization is manual, tedious Want a site to learn users’ interest –Based on observed behavior, similarity to others –Adapt to individuals accordingly

12 Adaptations – structural Add link, remove link Add page (index page synthesis)

13 Adaptations – presentational Highlight link, content

14 From users to adaptations Users are clustered to find related visitors Models are fit to clusters to predict behavior Adaptation space is searched for best changes

15 AWS – current work Building, clustering user models Hierarchical user clustering –Users are leaf nodes, related groups interior –Influence of parent nodes decreases with distance Selecting adaptations from models –Choosing structural changes –Defining, selecting presentational changes

16 Summary Successful DLs cater to their users UW research concentrating on connecting users with information Look for us at IUI, KDD, ICML, AAAI, IJCAI, and elsewhere

17 ICE-9 in action ICE-9 learns from subsequent instances –Probabilities now 100%

18 ICE-9 – Version space algebra

19 Selecting adaptations Cluster models analyzed to determine interests “The user has an interest in the page” “The user visits the page by starting at the page – add a link between the two.”

20 User’s computerDate and time of visitRequested pageReferring page some-pc.cs22/Feb/ :49:13/- some-pc.cs22/Feb/ :49:23/edu/ some-pc.cs22/Feb/ :49:34/edu/courses/ some-other-pc.cs22/Feb/ :49:55/- some-pc.cs22/Feb/ :50:08/574http:// some-other-pc.cs22/Feb/ :50:20/info/current/