Information retrieval wed sept 02 2015 data…. -start at 6.45.

Slides:



Advertisements
Similar presentations
R2 Library Features and Functionality Overview. The R2 Library  The R2 Library is an electronic database that enables access to digital book content.
Advertisements

UCLA : GSE&IS : Department of Information StudiesJF : 276lec1.ppt : 5/2/2015 : 1 I N F S I N F O R M A T I O N R E T R I E V A L S Y S T E M S Week.
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
Information retrieval mon jan data…. framework for today’s lecture…
Search Engines and Information Retrieval
PrasadL1IntroIR1 Information Retrieval Adapted from Lectures by Berthier Ribeiro-Neto (Brazil), Prabhakar Raghavan (Yahoo and Stanford) and Christopher.
Chapter 3 Database Management
Parametric search and zone weighting Lecture 6. Recap of lecture 4 Query expansion Index construction.
Database Management An Introduction.
Faceted Metadata for Site Navigation and Search Marti Hearst 12/17/2009.
Social Tagging and Search Marti Hearst UC Berkeley.
Information Retrieval in Practice
A metadata-based approach Marti Hearst Associate Professor BT Visit August 18, 2005.
Introduction to Databases CIS 5.2. Where would you find info about yourself stored in a computer? College Physician’s office Library Grocery Store Dentist’s.
Best Practices for Search for the Federal Government Marti Hearst Web Manager University November 10, 2009.
Faceted Metadata for Information Architecture and Search Marti Hearst, SIMS at UC Berkeley Preston Smalley & Corey Chandler, eBay User Experience & Design.
UIs for Faceted Navigation Recent Advances and Remaining Open Problems HCIR’08 Marti Hearst, UC Berkeley (including some slides from Corey Chandler of.
IMT530- Organization of Information Resources1 Feedback Like exercises –But want more instructions and feedback on them –Wondering about grading on these.
“DOK 322 DBMS” Y.T. Database Design Hacettepe University Department of Information Management DOK 322: Database Management Systems.
Using Social Care Online: an overview Version 1.0 April 2015.
An introduction to databases In this module, you will learn: What exactly a database is How a database differs from an internet search engine How to find.
Information retrieval thur jan data…. framework for today’s lecture…
Databases & Data Warehouses Chapter 3 Database Processing.
Modeling (Chap. 2) Modern Information Retrieval Spring 2000.
RESEARCHING TIPS & STRATEGIES Summer 2008 Melanie Wilson Academic Success Center MSC 207.
Search Engines and Information Retrieval Chapter 1.
Introduction: Databases and Database Users
Lecture Four: Steps 3 and 4 INST 250/4.  Does one look for facts, or opinions, or both when conducting a literature search?  What is the difference.
Bio-Medical Information Retrieval from Net By Sukhdev Singh.
Information Retrieval and Knowledge Organisation Knut Hinkelmann.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Library Home Page Find books & journals Search for articles Find fulltext articles Starting point to research a topic Find Galileo password.
Current Events and Issues Using Index Databases for Finding Answers.
Information Systems & Databases 2.2) Organisation methods.
How can Search Interfaces Enhance the Value of Semantic Annotations (and Vice Versa?) Keynote Talk ESAIR’13: Sixth International Workshop on Exploiting.
IT-522: Web Databases And Information Retrieval By Dr. Syed Noman Hasany.
Database Management Systems.  Database management system (DBMS)  Store large collections of data  Organize the data  Becomes a data storage system.
Introduction to metadata
Introduction to Information Retrieval Introduction to Information Retrieval CS276 Information Retrieval and Web Search Pandu Nayak and Prabhakar Raghavan.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Intellectual Works and their Manifestations Representation of Information Objects IR Systems & Information objects Spring January, 2006 Bharat.
Scalable Hybrid Keyword Search on Distributed Database Jungkee Kim Florida State University Community Grids Laboratory, Indiana University Workshop on.
Comparative Labor History Research Tools & Strategies.
1 Information Retrieval LECTURE 1 : Introduction.
Information Retrieval
IMT530- Organization of Information Resources1 Feedback Lectures –More practical examples –Like guest lecturers –Generally helpful in understanding concepts.
Software Reuse Course: # The Johns-Hopkins University Montgomery County Campus Fall 2000 Session 4 Lecture # 3 - September 28, 2004.
Copyright (c) 2014 Pearson Education, Inc. Introduction to DBMS.
Chapter Three Presentation: User interface How to Build a Digital Library Ian H. Witten and David Bainbridge.
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
SmartSearch. SmartSearch is the Library’s new improved Online Catalogue A single site searches all Library resources:  The Library Online Catalogue (ie,
FIND IT! USING LIBRARY CATALOGING CONCEPTS TO ORGANIZE AND MAKE RECORDS FINDABLE DIONNE L. MACK, INTERIM DIRECTOR OF QUALITY OF LIFE DEPARTMENTS.
Definition, purposes/functions, elements of IR systems Lesson 1.
Introduction: Databases and Database Systems Lecture # 1 June 19,2012 National University of Computer and Emerging Sciences.
CS315 Introduction to Information Retrieval Boolean Search 1.
Organization of Information LSIS Summer II (2005)
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
Information Retrieval in Practice
NLP Support for Faceted Navigation in Scholarly Collections
Using computers to search electronic databases
What is a Database and Why Use One?
Thanks to Bill Arms, Marti Hearst
Information Retrieval
Networked Information Resources
Introduction to Information Retrieval
Database Design Hacettepe University
Introduction to metadata for IDAH fellows
Information Retrieval and Web Design
Presentation transcript:

information retrieval wed sept data…

-start at 6.45

framework for today’s lecture… data organizing data retrieving data tools supporting the process

Structured Data information with a high degree of organization easy to put into a relational database search is simple and straightforward Unstructured data essentially the opposite of structured data natural language / free text

STRUCTURED vs unstructured data easy to envision structured data in terms of “tables” 5 EmployeeManagerSalary SmithJones ChangSmith IvySmith Typically allows numerical range and exact match (for text) queries, e.g., Salary < AND Manager = Smith.

Relational Databases Structured data Designed to provide search results with exact answers Queries built on schema of structured fields Lack of ranking mechanism (initially) We know the schema in advance, so semantic correlation between queries and data is clear We can get exact answers Information Retrieval Systems

tables in a MS Access relational database – defines each defining a social networking site

Data entry form in a MS Access relational database – create each record

Structured Data information with a high degree of organization easy to put into a relational database search is simple and straightforward Unstructured data essentially the opposite of structured data natural language / free text

typically refers to free text is a good example of unstructured data. it's indexed by date, time, sender, recipient, and subject, but the body of an remains unstructured other examples of unstructured data include books, documents, medical records, and social media posts structured vs UNSTRUCTURED data

magazine article is an example of unstructured data

Relational Databases Information Retrieval Systems Unstructured / semi- structured data Designed to support unstructured natural language full text search Ranking mechanism is very important – results must be sorted by relevance in order to satisfy user’s information need We get inexact, estimated answers

Document collection (corpus) Index Query Representation function Matching function Results CATEGORIES SUBJECT HEADINGS

KWIC Key word in context

KWIC Key word in context

metadata

What is Metadata? Classic definition: data about data Metadata is structured information that describes, explains, locates, or otherwise makes it easier to retrieve, use, or manage an information resource. (NISO) 3 primary “types”: – Descriptive – Structural – Administrative (rights management, preservation)

digital forensics

This reading really made me think about how easily accessible and organized information is today because of the implementation of metadata. It sparked a few questions: Without metadata, how would accessing data, resources and information be different in today’s society? -Chris

b More Metadata: A Cataloging Record

The Idea of Facets Facets are a way of labeling data – A kind of Metadata (data about data) – Can be thought of as properties of items Facets vs. Categories – Items are placed INTO a category system – Multiple facet labels are ASSIGNED TO items

Facets Epicurious example Create INDEPENDENT categories (facets) – Each facet has labels (sometimes arranged in a hierarchy) Assign labels from the facets to every item – Example: recipe collection Course Main Course Cooking Method Stir-fry Cuisine Thai Ingredient Bell Pepper Curry Chicken

The Idea of Facets Break out all the important concepts into their own facets Sometimes the facets are hierarchical – Assign labels to items from any level of the hierarchy Preparation Method Fry Saute Boil Bake Broil Freeze Desserts Cakes Cookies Dairy Ice Cream Sorbet Flan Fruits Cherries Berries Blueberries Strawberries Bananas Pineapple

Using Facets Now there are multiple ways to get to each item Preparation Method Fry Saute Boil Bake Broil Freeze Desserts Cakes Cookies Dairy Ice Cream Sherbet Flan Fruits Cherries Berries Blueberries Strawberries Bananas Pineapple Fruit > Pineapple Dessert > Cake Preparation > Bake Dessert > Dairy > Sherbet Fruit > Berries > Strawberries Preparation > Freeze

labor intensive? expensive?

UNC Libraries Online Catalog e.g. personal crisis

caveat: semi-structured data in fact almost no data is absolutely “unstructured” e.g., this slide has distinctly identified zones such as the title and bullets facilitates “semi-structured” search such as – title contains data and bullets contain structure

Let’s look at a database of magazine & journal articles… …Academic Search Complete >> UNC Libraries Homepage: >> E-Research by Discipline >> Frequently Used >> Academic Search Premier [off-campus log in with onyen/password]

Organization / Search We organize to enable retrieval The more effort we put into organizing information, the more effectively it can be retrieved The more effort we put into retrieving information, the less it needs to be organized first We need to think in terms of investment, allocation of costs and benefits between the organizer and retriever The allocation differs according to the relationship between them; who does the work and who gets the benefit?