CS523 INFORMATION RETRIEVAL COURSE INTRODUCTION YÜCEL SAYGIN SABANCI UNIVERSITY.

Slides:



Advertisements
Similar presentations
Chapter 5: Introduction to Information Retrieval
Advertisements

Web Search and Mining Course Overview 1 Wu-Jun Li Department of Computer Science and Engineering Shanghai Jiao Tong University Lecture 0: Course Overview.
Web Search – Summer Term 2006 VI. Web Search - Ranking (c) Wolfgang Hürst, Albert-Ludwigs-University.
Web Search - Summer Term 2006 III. Web Search - Introduction (Cont.) (c) Wolfgang Hürst, Albert-Ludwigs-University.
“ The Anatomy of a Large-Scale Hypertextual Web Search Engine ” Presented by Ahmed Khaled Al-Shantout ICS
Architecture of the 1st Google Search Engine SEARCHER URL SERVER CRAWLERS STORE SERVER REPOSITORY INDEXER D UMP L EXICON SORTERS ANCHORS URL RESOLVER (CF.
SLIDE 1IS 202 – FALL 2004 Lecture 13: Midterm Review Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am -
6/16/20151 Recent Results in Automatic Web Resource Discovery Soumen Chakrabartiv Presentation by Cui Tao.
Information Retrieval - Organization of the course Jian-Yun Nie 聂建云.
Web Information Retrieval and Extraction Chia-Hui Chang, Associate Professor National Central University, Taiwan Sep. 16, 2005.
The Anatomy of a Large-Scale Hypertextual Web Search Engine Sergey Brin and Lawrence Page Distributed Systems - Presentation 6/3/2002 Nancy Alexopoulou.
Information Retrieval
Internet Research Search Engines & Subject Directories.
INFORMATION RETRIEVAL VECTOR SPACE MODEL IN-DEPTH PART 3 Thomas Tiahrt, MA, PhD CSC492 – Advanced Text Analytics.
1 Web Search and Advanced Internet Services 290N Class Introduction Tao Yang, 2014.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
Page 1 WEB MINING by NINI P SURESH PROJECT CO-ORDINATOR Kavitha Murugeshan.
1 The BT Digital Library A case study in intelligent content management Paul Warren
1 Information Retrieval and Advanced Internet Services 290N Class Introduction Tao Yang, 2015
Searching the Web Dr. Frank McCown Intro to Web Science Harding University This work is licensed under Creative Commons Attribution-NonCommercial 3.0Attribution-NonCommercial.
How to get the most out of the survey task + suggested survey topics for CS512 Presented by Nikita Spirin.
Anatomy of a search engine Design criteria of a search engine Architecture Data structures.
Modern Information Retrieval Computer engineering department Fall 2005.
Information Retrieval and Web Search Lecture 1. Course overview Instructor: Rada Mihalcea Class web page:
CSM06 Information Retrieval Lecture 4: Web IR part 1 Dr Andrew Salway
CS525 DATA MINING COURSE INTRODUCTION YÜCEL SAYGIN SABANCI UNIVERSITY.
Proposal for Term Project J. H. Wang Mar. 2, 2015.
Autumn Web Information retrieval (Web IR) Handout #0: Introduction Ali Mohammad Zareh Bidoki ECE Department, Yazd University
Overviews of ITCS 6161/8161: Advanced Topics on Database Systems Dr. Jianping Fan Department of Computer Science UNC-Charlotte
The Anatomy of a Large-Scale Hypertextual Web Search Engine Sergey Brin & Lawrence Page Presented by: Siddharth Sriram & Joseph Xavier Department of Electrical.
Text Based Information Retrieval Text Based Information Retrieval H02C8A H02C8B Marie-Francine Moens Karl Gyllstrom Katholieke Universiteit Leuven.
Course grading Project: 75% Broken into several incremental deliverables Paper appraisal/evaluation/project tool evaluation in earlier May: 25%
Search Engine Architecture
The Anatomy of a Large-Scale Hyper textual Web Search Engine S. Brin, L. Page Presenter :- Abhishek Taneja.
CS315-Web Search & Data Mining. A Semester in 50 minutes or less The Web History Key technologies and developments Its future Information Retrieval (IR)
Lecture #10 PageRank CS492 Special Topics in Computer Science: Distributed Algorithms and Systems.
Course Overview: An Introduction to Information Retrieval and Applications J. H. Wang Feb. 22, 2012.
Search Engine and SEO Presented by Yanni Li. Various Components of Search Engine.
Information Retrieval and Web Search Course overview Instructor: Rada Mihalcea.
1 1 COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified based on the slides provided by Lawrence Page, Sergey Brin, Rajeev Motwani.
Web Search – Summer Term 2006 VII. Web Search - Indexing: Structure Index (c) Wolfgang Hürst, Albert-Ludwigs-University.
IR. SI 650/EECS 549 Information Retrieval People search the Web daily Search engines –Google –Bing –Baidu –Yandex Information Retrieval is about search.
CSCE 5073 Section 001: Data Mining Spring Overview Class hour 12:30 – 1:45pm, Tuesday & Thur, JBHT 239 Office hour 2:00 – 4:00pm, Tuesday & Thur,
Evaluation of Information Retrieval Systems Xiangming Mu.
Information Retrieval CIS-462 Dr. Samir Tartir 2013/2014 First Semester.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
The Anatomy of a Large-Scale Hypertextual Web Search Engine (The creation of Google)
Introduction to Information Retrieval. What is IR? Sit down before fact as a little child, be prepared to give up every conceived notion, follow humbly.
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
1 CS 430 / INFO 430: Information Retrieval Lecture 20 Web Search 2.
Term Project Proposal By J. H. Wang Apr. 7, 2017.
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
Proposal for Term Project
Search Engine Architecture
中国计算机学会学科前沿讲习班:信息检索 Course Overview
Course Summary (Lecture for CS410 Intro Text Info Systems)
Search Engines & Subject Directories
WIRED Week 2 Syllabus Update Readings Overview.
INFORMATION RETRIEVAL TECHNIQUES BY DR. ADNAN ABID
Data Mining Chapter 6 Search Engines
Introduction to Information Retrieval
Search Engines & Subject Directories
Search Engines & Subject Directories
Search Engine Architecture
CS246: Web Information Systems -- Introduction
CSCE 4143 Section 001: Data Mining Spring 2019.
Information Retrieval CIS-462
Web Search and Advanced Internet Services
ADVANCED TOPICS IN INFORMATION RETRIEVAL AND WEB SEARCH
Presentation transcript:

CS523 INFORMATION RETRIEVAL COURSE INTRODUCTION YÜCEL SAYGIN SABANCI UNIVERSITY

Contact Info Tel : 9576 No Specific office hours. You can drop by anytime you like. or call me to make sure I am at the office.

Course Info Reference Book: Introduction to Information Retrieval, Authors: Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze Publisher: Cambridge University Press

Course Info Grading:  Homework : 10%  Project : 40%  Paper presentation : 20%  Term Paper : 20%  Attendance during paper presentations: 10%

Topics that will be covered Document Retrieval Techniques Information Retrieval on the Web Data Mining for Information Retrieval

Aim of the course Knowledge: To introduce information retrieval techniques Skills: paper reading and presentation research and/or project work

A Rough Schedule October, November: Lectures on various information retrieval techniques Remaining weeks: Paper and research project presentations

What I will do Give the basics on information retrieval Project supervision Give directions and advise on the projects Coordination of the presentations

What I expect you to do Understand the basic concepts of Information Retrieval Choose a specific area and two related papers on the same topic for presentation in class Attendance is required for paper presentations and you will loose 2% of your overall grade for each presentation you missed. Write a term paper on the two papers presented. Do a project and a final report describing what you learned or achieved in the scope of the project.

Sources TREC Conference SIGIR Conference WWW Conference ACM TOIS Journal SIGMOD, VLDB, ICDE Conferences (database perspective) SIGKDD, ICDM Conferences (data mining perspective)

Tools SMART IR (Cornell Univ.) Glimpse from Univ. Arizona Google Altavista Yahoo

Information Retrieval Refers to the retrieval of any type of information such as Structured data (e.g. relational database) Text (We will focus on this) Video Image, sound DNA

Document Retrieval User Query Static Document Collection Ranked Result Document Collection is previously indexed User query is ad hoc Results are ranked wrt their similarity to the user query

Document Routing User profiles are set in advance Incoming documents are directed to relevant users Useful for redirecting corporate s to relevant departments (sales, marketing, support etc)

Performance Metrics for IR Precision Recall Not practical to have good precision and recall Whole Document Space Relevant Documents Retrieved Documents Relevant and Retrieved Documents

First Reading for Tomorrow The Anatomy of a Large-Scale Hypertextual Web Search Engine (WWW Conference 1998) paper by Sergey Brin and Lawrence Page www-db.stanford.edu/~backrub/google.html

Web Information Retrieval Two possible ways: Use the web structure starting from a location like yahoo where things are categorized Use search engines

Web Information Retrieval Challenges Scale:  Hundreds of millions of queries per day  Web grows, continuous crawling is needed  Obstacles due to OS, and disk seek time Google handles large data sets by indexing and compression Search quality is important Completeness of the index is important But ranking is also of utmost importance due to the size of the Web

Web Information Retrieval Ranking (of google) The idea is to give importance to pages that have a lot of back links Similar to the notion of citations in academia A link graph of the web was formed and maintained (518 million links in 1998 for the prototype)

Web Mining (focused) Crawling and Indexing Topic Directories Clustering and Classification Hyperlink Analysis Personalization (profiles, preferences)