Information Retrieval and Web Search Course overview Instructor: Rada Mihalcea.

Slides:



Advertisements
Similar presentations
Modern Information Retrieval Chapter 1: Introduction
Advertisements

Pemrosesan Teks Pendahuluan. Buku referensi [1]Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schutze Introduction to Information.
Web Search and Mining Course Overview 1 Wu-Jun Li Department of Computer Science and Engineering Shanghai Jiao Tong University Lecture 0: Course Overview.
An Introduction to Information Retrieval and Applications J. H. Wang Feb. 19, 2008.
Overview of Library Resources Chemistry
Web Search – Summer Term 2006 I. General Introduction (c) Wolfgang Hürst, Albert-Ludwigs-University.
Multimedia Systems Course Overview & Introduction Instructor: Leila Sharifi UUT Fall
Search Engines and Information Retrieval
ISP 433/533 Week 2 IR Models.
Modern Information Retrieval Chapter 1: Introduction
SLIDE 1IS 202 – FALL 2004 Lecture 13: Midterm Review Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am -
CS 331 / CMPE 334 – Intro to AI CS 531 / CMPE AI Course Outline.
Web Information Retrieval and Extraction Chia-Hui Chang, Associate Professor National Central University, Taiwan Sep. 16, 2005.
Introduction to Programming Environments for Secondary Education CS 1140 Dr. Ben Schafer Department of Computer Science.
An introduction to databases In this module, you will learn: What exactly a database is How a database differs from an internet search engine How to find.
1 Web Search and Advanced Internet Services 290N Class Introduction Tao Yang, 2014.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Syllabus CS 765: Introduction to Database Management Systems Fall 2008 Text Database Management Systems Ramakrishnan/Gehrke, 3rd.
CS223 Algorithms D-Term 2013 Instructor: Mohamed Eltabakh WPI, CS Introduction Slide 1.
1 Information Retrieval and Advanced Internet Services 290N Class Introduction Tao Yang, 2015
COMP Introduction to Programming Yi Hong May 13, 2015.
CS523 INFORMATION RETRIEVAL COURSE INTRODUCTION YÜCEL SAYGIN SABANCI UNIVERSITY.
Information Retrieval CENG 555 Spring Course Web Page Authoritative source of administrivia In-class announcements generally reflected on Web.
Information Retrieval and Web Search Lecture 1. Course overview Instructor: Rada Mihalcea Class web page:
Course Overview for Web Computing J. H. Wang Sep. 19, 2011.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
Proposal for Term Project J. H. Wang Mar. 2, 2015.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Autumn Web Information retrieval (Web IR) Handout #0: Introduction Ali Mohammad Zareh Bidoki ECE Department, Yazd University
ICS 6B Boolean Logic and Algebra Fall 2015
Overviews of ITCS 6161/8161: Advanced Topics on Database Systems Dr. Jianping Fan Department of Computer Science UNC-Charlotte
B. Prabhakaran1 Multimedia Systems Textbook Any/Most Multimedia Related Books Reference Papers: Appropriate reference papers discussed in class from time.
ICS 6B Boolean Algebra and Logic Winter 2015
Text Based Information Retrieval Text Based Information Retrieval H02C8A H02C8B Marie-Francine Moens Karl Gyllstrom Katholieke Universiteit Leuven.
CSM06 Information Retrieval Lecture 1a – Introduction Dr Andrew Salway
Course grading Project: 75% Broken into several incremental deliverables Paper appraisal/evaluation/project tool evaluation in earlier May: 25%
What else is there? CMPT 454: Database Systems II. – Transaction Management. – Query Planning. – Optional topics, e.g. data mining, information retrieval,
IT-522: Web Databases And Information Retrieval By Dr. Syed Noman Hasany.
Course Overview: An Introduction to Information Retrieval and Applications J. H. Wang Feb. 22, 2012.
Introduction to Information Retrieval Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
Modern Information Retrieval Presented by Miss Prattana Chanpolto Faculty of Information Technology.
Information Retrieval
ITIS 4510/5510 Web Mining Spring Overview Class hour 5:00 – 6:15pm, Tuesday & Thursday, Woodward Hall 135 Office hour 3:00 – 5:00pm, Tuesday, Woodward.
IR. SI 650/EECS 549 Information Retrieval People search the Web daily Search engines –Google –Bing –Baidu –Yandex Information Retrieval is about search.
1 Advanced Database System Design Instructor: Ruoming Jin Fall 2010.
Information Retrieval CIS-462 Dr. Samir Tartir 2013/2014 First Semester.
B. Prabhakaran1 Multimedia Systems Reference Text “Multimedia Database Management Systems” by B. Prabhakaran, Kluwer Academic Publishers. – Kluwer bought.
Definition, purposes/functions, elements of IR systems Lesson 1.
CSE6339 DATA MANAGEMENT AND ANALYSIS FOR COMPUTATIONAL JOURNALISM CSE6339, Spring 2012 Department of Computer Science and Engineering, University of Texas.
Information Retrieval in Practice
ICS 6D Discrete Mathematics for Computer Science Fall 2014
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
Information Retrieval (in Practice)
Proposal for Term Project
Course Overview - Database Systems
CSCE 561 Information Retrieval System Models
Thanks to Bill Arms, Marti Hearst
INFORMATION RETRIEVAL TECHNIQUES BY DR. ADNAN ABID
Information Retrieval Systems
CSE 635 Multimedia Information Retrieval
Introduction to Information Retrieval
Multimedia Systems Reference Text
Internet Basics and Information Literacy
Information Retrieval CIS-462
Information Retrieval and Web Design
Lecture 1a- Introduction
CS276 Information Retrieval and Web Search
ADVANCED TOPICS IN INFORMATION RETRIEVAL AND WEB SEARCH
Presentation transcript:

Information Retrieval and Web Search Course overview Instructor: Rada Mihalcea

What is this course about? Processing Indexing Retrieving … textual data (or audio, video, geo-spatial, …, data) Fits in four lines, but much more complex and interesting than that

Need for Information Retrieval With the advance of WWW - more than 20 Billion documents indexed on Yahoo, Google, Bing Various needs for information: –Search for documents that fall under a given topic –Search for an answer to a question –Search for information in a different language –Search for s –Search for patents –… –Search for images –Search for music –Search for a (candidate) friend

Definition of IR Salton (1989): “Information-retrieval systems process files of records and requests for information, and identify and retrieve from the files certain records in response to the information requests. The retrieval of particular records depends on the similarity between the records and the queries, which in turn is measured by comparing the values of certain attributes to records and information requests.”

Restated… Information Retrieval (IR) is finding material (usually documents) of an unstructured nature (usually text) that satisfies an information need from within large collections (usually stored on computers). These days we often think of Web search, but there are also other types of searches, e.g.: –Search your own computer –Search knowledge bases –Search the library catalogue –Search the deep Web (e.g., search for a certain car on a rental agency web page)

Examples of IR systems Conventional (library catalog) Search by keyword, title, author, etc. E.g. : You are probably familiar with mirlyn.lib.umich.edu Text-based (Lexis-Nexis, Google, Bing). Search by keywords. Some may use queries in natural language. Multimedia (YouTube, Flickr, Tineye) Search for/by visual appearance (shapes, colors,… ). Question answering systems (Ask, Start) Search in (restricted) natural language Other: cross language information retrieval, music retrieval

IR systems on the Web Search for Web pages Search for answers to questions Search for tweets Search for images Search using image queries Search for similar images Search for (image) colors Music retrieval Shazam apphttp://

Course information Instructor: Rada Mihalcea –Besyter 3769, GSI: Shibamouli Lahiri –Beyster 1695, Class meets MW, 12:00-1:30pm Office hours –Instructor: W 2:00-3:00pm –GSI: T 11:30-1:30pm, Th 11:30-1:30pm, F 12:30-2:30pm –Any time electronically

Course resources Class webpage: – –check periodically for updates, announcements, etc. Textbook: –Introduction to Information Retrieval Christopher D. Manning, Prabhakar Raghavan, Hinrich Schütze Recommended: – Readings in Information Retrieval K.Sparck Jones and P. Willett – Modern Information Retrieval Ricardo Baeza-Yates and Berthier Ribeiro-Neto Papers: –Several papers will be assigned throughout the semester

Course communication Use the Piazza forum for any technical communication related to the class –Likely to get a faster answer than if you the instructor or GSI individually –We will try to answer any question sent on the forum within 24 hours (but your peers may answer even faster!)

Grading (tentative) Four programming assignments: 35% –Start early! Some may be time consuming –3 days late policy Exam I: 20% Exam II: 20% Project: 25% No final – final is replaced by the project

Programming language All assignments / project will be in Python Makes life much much easier for text processing problems and for Web based applications Information Retrieval involves a lot of text processing, and often involves Web access – Code reusability Code must run on CAEN Do not use libraries that directly solve the assignment/project –If in doubt, ask the instructor/GSI

Tentative schedule Course Overview Introduction to IR models and methods Web crawling Text analysis and text properties Boolean model Vector-based model Probabilistic model; other IR models IR evaluation and IR test collections Relevance feedback, query expansion Web search: link based and content based Query-based and content sensitive link analysis

Tentative schedule Text classification and text clustering Question answering and information extraction Text summarization and keyword extraction Cross Language IR Social media, crowdsourcing Image retrieval Music retrieval Geospatial search Two guest lectures - TBA