Individualized Knowledge Access David Karger Lynn Andrea Stein Mark Ackerman Ralph Swick.

Slides:

Advertisements

Similar presentations

© 2011 Delmar, Cengage Learning Chapter 1 Getting Started with Dreamweaver.

Advertisements

Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.

WEB DESIGN TABLES, PAGE LAYOUT AND FORMS. Page Layout Page Layout is an important part of web design Why do you think your page layout is important?

Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.

Reference Management Software Tools Mendeley. Table of Contents: Part A Background/Location Signup/Login Import References Organize (Manage) References.

Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.

CPSC 203 Introduction to Computers Tutorial 59 & 64 By Jie (Jeff) Gao.

© Prof David J Harper 2004 The Challenge of Finding Information in (Long) e-Theses David J Harper The Robert Gordon University Smart Web Technologies Centre.

An Agent Capable of Learning to Create and Maintain Websites Anthony Tomasic, Ravi Mosur Alex Rudnicky, Raj Reddy, John Zimmerman Carnegie Mellon University.

1 Adaptive Management Portal April

L C SL C S Haystack: Per-User Information Environments David Karger.

Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.

Semantic Search Jiawei Rong Authors Semantic Search, in Proc. Of WWW Author R. Guhua (IBM) Rob McCool (Stanford University) Eric Miller.

Enterprise Search With SharePoint Portal Server V2 Steve Tullis, Program Manager, Business Portal Group 3/5/2003.

ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.

DAVID KARGER. Checkered Past Core Algorithms –graph algorithms, randomization, combinatorial optimization –min-cuts, max-flows, shortest paths, minimum.

Searching and Researching the World Wide: Emphasis on Christian Websites Developed from the book: Searching and Researching on the Internet and World Wide.

The Social Web: A laboratory for studying s ocial networks, tagging and beyond Kristina Lerman USC Information Sciences Institute.

Overview of Search Engines

Domain Modeling (with Objects). Motivation Programming classes teach – What an object is – How to create objects What is missing – Finding/determining.

1 Internet Search Tools Adapted from Kathy Schrock’s PowerPoint entitled “Successful Web Search Strategies” Kathy Schrock’s complete PowerPoint available.

Website Content, Forms and Dynamic Web Pages. Electronic Portfolios Portfolio: – A collection of work that clearly illustrates effort, progress, knowledge,

Automated Tracking of Online Service Policies J. Trent Adams 1 Kevin Bauer 2 Asa Hardcastle 3 Dirk Grunwald 2 Douglas Sicker 2 1 The Internet Society 2.

Section 13.1 Add a hit counter to a Web page Identify the limitations of hit counters Describe the information gathered by tracking systems Create a guest.

1 Introduction to Web Development. Web Basics The Web consists of computers on the Internet connected to each other in a specific way Used in all levels.

Lecturer: Ghadah Aldehim

The Perfect Search Engine Is Not Enough Jaime Teevan †, Christine Alvarado †, Mark S. Ackerman ‡ and David R. Karger † † MIT, CSAIL ‡ University of Michigan.

Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.

CPSC 203 Introduction to Computers Lab 21, 22 By Jie Gao.

Search Engines and Information Retrieval Chapter 1.

Lesson 1 -What is a Database? -Fields and Records

Copyright © 2008 Pearson Prentice Hall. All rights reserved. 1 Exploring Microsoft Office Word 2007 Chapter 8 Word and the Internet Robert Grauer, Keith.

CPSC 203 Introduction to Computers Lab 23 By Jie Gao.

User’s guide. Compare features:EndNote WebEndNote Save references++ Organize & edit references++ Storage capacity (number of references)10,000unlimited.

Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.

Introduction to Nutch CSCI 572: Information Retrieval and Search Engines Summer 2010.

Personal Information Management Vitor R. Carvalho : Personalized Information Retrieval Carnegie Mellon University February 8 th 2005.

Haystack: Per-User Information Environments David Karger.

Research skills David Godfrey Farnborough 5 th December 2011.

Personalized Search Xiao Liu

FlexElink Winter presentation 26 February 2002 Flexible linking (and formatting) management software Hector Sanchez Universitat Jaume I Ing. Informatica.

With Windows 7 Introductory© 2011 Pearson Education, Inc. Publishing as Prentice Hall1 Windows 7 Introductory Chapter 3 Advanced File Management and Advanced.

Evaluating Web Pages Techniques to apply and questions to ask.

Internet for Teaching and Learning. Understanding the Web The Web is A collection of publicly accessible pages (web sites) on the Internet All use the.

Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI

WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.

Knowledge Management Platform Communities of Practice User Guide for CoP users Copyright © 2010 Group Technology Solutions. All Rights Reserved.

Tutorial support.ebsco.com Core Collections Complete.

Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.

Web-Mining …searching for the knowledge on the Internet… Marko Grobelnik Institut Jožef Stefan.

ICT-enabled Agricultural Science for Development Scenarios, Opportunities, Issues by ICTs transforming agricultural science, research & technology generation.

Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.

Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)

Visualization in Text Information Retrieval Ben Houston Exocortex Technologies Zack Jacobson CAC.

Individualized Knowledge Access David Karger Lynn Andrea Stein.

WIRED Future Quick review of Everything What I do when searching, seeking and retrieving Questions? Projects and Courses in the Fall Course Evaluation.

Evaluating Web Pages Techniques to apply and questions to ask.

Research Skills for Your Essay Where to begin…. Starting the search task for real Finding and selecting the best resources are the key to any project.

Chapter 8: Web Analytics, Web Mining, and Social Analytics

Analyzing Requirements IMT 589 January 21, /21/2006IMT589- Applied and Structural Metadata2 Scoping Decide who your metadata beneficiaries (customers)

Haystack: Per-User Information Environments David Karger.

Conceptual Overview For Understanding the New Paradigm Provided by: Web Services Section.

Third Edition Discovering the Internet Discovering the Internet Complete Concepts and Techniques, Second Edition Chapter 3 Searching the Web.

Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.

Microsoft FrontPage 2003 Illustrated Complete Creating a Web Site.

Information Organization: Overview

UNIT 15 Webpage Creator.

Creating a Successful Web Presence

Haystack: an Adaptive Personalized Information Retrieval System

Information Organization: Overview

Presentation transcript:

Individualized Knowledge Access David Karger Lynn Andrea Stein Mark Ackerman Ralph Swick

Information Access A key task in Oxygen: help people manage and retrieve information Three overlapping projects: l Haystack:  information storage and retrieval  application clients l Semantic Web: next-generation metadata l Volt: collaborative access

Presentation Overview Motivation l Information access behavior and goals System Design & Architecture l Data Model l Interacting data and UI components Working applications l Base haystack l Frontpage l Volt

Motivation

Problem Scenario I try solving problems using my data: l Information gathered personally l High quality, easy for me to understand l Not limited to publicly available content My organization: l Personal annotations and meta-data l Choose own subject arrangement l Optimize for my kind of searching Adapts to my needs

Then Turn to a Friend Leverage l They organize information for their own use l Let them find things for me too Shared vocabulary l They know me and what I want Personal expertise l They know things not in any library Trust l Their recommendations are good

Last to Library/web Answer usually there l But hard to find l Wish: rearrange to suit my needs l Wish: help from my friends in looking

Lessons Individualized access l Best tools adapt to individual ways of organizing and seeking data Individualized knowledge l People know more than they publish l That knowledge is useful to them and others Collaborative use l Right incentives lead to sharing and joint use

Haystack Individualized access l My data collection, organization l Search tools tuned for me Collaborate to leverage individual knowledge l Access unpublished information in others’ haystacks l Self interest public benefit Lens to personalize access to the world library l Rearrange presentation to suit my personal needs

Example Info on probabilistic models in data mining l My haystack doesn’t know, but “probability” is in lots of I got from Tommi Jaakola l Tommi told his haystack that “Bayesian” refers to “probability models” l Tommi has read several papers on Bayesian methods in data mining l Some are by Daphne Koller l I read/liked other work by Koller l My Haystack queries “Daphne Koller Bayes” on Yahoo l Tommi’s haystack can rank the results for me…

System Design

Gathering Data Haystack archives anything l Web pages browsed, sent and received, address book, documents written And any properties, relationships l Text of object (for text search) l Author, title, color, citations, quotations, annotations, quality, last usage Users freely add types, relationships

Semantic Web Arbitrary objects, connected by named links No fixed schema l User extensible Sharable by any application l A new “file system”? DocD. KargerHaystack title author Outstanding quality says HTML type

Gathering Data Active user input l Interfaces let user add data, note relationships Mining data from prior data l Plug-in services opportunistically extract data Passive observation of user l Plug-ins to other interfaces record user actions Other Users

Data Extraction Services Web Observer Proxy Triple Store Mail Observer Proxy Machine Learning Services Web Viewer Volt Viewer/ Editor Spider

Sample Applications

Because everything uses the Semantic Web constructions, a variety of application clients can share information l Web Browser---data viewer l FrontPage---personalized information filter l Volt---collaboration tool

Haystack via Web Web server interface Basic operations: l Insert objects l View objects l Queries

Haystack via Web

Viewer shows one node and associated arrows Service notices we’ve archived a directory; so archives the objects it contains (and so on…)

Haystack via Web Services detect document type, extract relevant metadata Output can specialize by type of object

Mediation Haystack can be a lens for viewing data from the rest of the world l Stored content shows what user knows/likes l Selectively spider “good” sites l Filter results coming back  Compare to objects user has liked in the past l Can learn over time Example - personalized news service

News Service

Scavenges articles from your favorite news sources l Html parsing/extracting services Over time, learns types of articles that interest you l Prioritizes those for display Content provider no longer controls viewing experience l No more ads

Personalized News Service

Collaborative Access Want to leverage others’ work in organizing information l No need to “publish” expertise l Exposed automatically---without effort l Self interest helps others

Volt Volt is about collaboration between people l The Haystack architecture allows easy collaboration among individuals  semantic web references to Haystack objects l Individuals share parts of their Haystack l Group spaces and shared notebooks

Volt

Collaborators Those I interact with l Frequent mail contact l Frequent visits to their home page Those with shared content l And who have same opinions about content l Collaborative filtering techniques Referrals Expertise search engine

Expertise Beacon

Volt Expertise Beacons Group spaces and shared notebooks l Create individual and group profiles Profiles can be used to find other people l Allows targeted search l “Who else is working on this project?” User controls visibility/privacy

Summary Next generation information access Semantic Web l provides a language and capabilities for meta-data Haystack l teases out individual knowledge, l stores it in a coherent fashion, and l allows a variety of application clients to leverage individual meta-data Volt l turns individual knowledge into a community resource

More Info