INFORMATION RETRIEVAL TECHNIQUES BY DR. ADNAN ABID


1 INFORMATION RETRIEVAL TECHNIQUES BY DR. ADNAN ABID
Lecture # 45 Final Notes on Information Retrieval

2 ACKNOWLEDGEMENTS
The material in this lecture has been taken from the following sources:
“Introduction to Information Retrieval” by Prabhakar Raghavan, Christopher D. Manning, and Hinrich Schütze
“Managing Gigabytes” by Ian H. Witten, Alistair Moffat, and Timothy C. Bell
“Modern Information Retrieval” by Ricardo Baeza-Yates
“Web Information Retrieval” by Stefano Ceri, Alessandro Bozzon, and Marco Brambilla

3 Outline
Topics that we covered
Database Management Research

4 Topics that we covered
IR Models: Boolean / Vector Space / Probabilistic IR
IR System Implementation
    Inverted Index (different levels)
    Naïve implementation; scalable, realistic implementation
    Optimizations (query processing; index building: compression)
Types of queries and data handling
Deep Web Searching (Search Computing)
Web-Based IR: PageRank, Crawling
Classification / Clustering
Recommender Systems
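As a reminder of how two of the topics above fit together, here is a minimal sketch of a naïve inverted index answering a Boolean AND query. It is not taken from the lecture material: the function names, the whitespace tokenizer, and the three sample documents are all assumptions made for illustration, and a realistic system would add the optimizations listed above (compression, smarter query processing).

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each lowercased term to a sorted list of the doc IDs containing it.

    `docs` is a dict {doc_id: text}; tokenization is a plain whitespace split
    (an assumption for this sketch, not a realistic tokenizer).
    """
    postings = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            postings[term].add(doc_id)
    return {term: sorted(ids) for term, ids in postings.items()}


def intersect(p1, p2):
    """Intersect two sorted postings lists with a linear merge."""
    i = j = 0
    out = []
    while i < len(p1) and j < len(p2):
        if p1[i] == p2[j]:
            out.append(p1[i])
            i += 1
            j += 1
        elif p1[i] < p2[j]:
            i += 1
        else:
            j += 1
    return out


def boolean_and(index, terms):
    """Return the doc IDs that contain every term in `terms`."""
    # Process the shortest postings lists first to keep intermediate results small.
    lists = sorted((index.get(t.lower(), []) for t in terms), key=len)
    if not lists:
        return []
    result = lists[0]
    for postings in lists[1:]:
        result = intersect(result, postings)
    return result


# Hypothetical sample collection, for illustration only.
docs = {
    1: "web search and crawling",
    2: "inverted index for web search",
    3: "index compression",
}
index = build_inverted_index(docs)
print(boolean_and(index, ["web", "search"]))
```

Keeping each postings list sorted is what allows the intersection to run as a single linear merge instead of comparing every pair of document IDs.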

5 Database Management Research
How to improve web search and surfing
Personalization (user preferences, localization)
Social Data
    Leverage community interaction to create and refine content
    Experts, friends, sub-communities of shared interests
Collaborative Working
    Wisdom of the crowd (PageRank, reviews, feedback...) (crowdsourcing)
    Collaborative editing (Wikipedia); collaborative searching (crowd search)
    Harnessing the collaborative power (Amazon Mechanical Turk)
Data Quality
    Provenance / lineage / source; confidence in the source; correlation (did we agree with the source before?)
    What is the mileage of my Honda Civic? (so many sites...)

6 Database Management Research
Data Extraction
    Question answering: relevant information
    Knowledge vs. reasoning engines (Google vs. Wolfram Alpha)
    Domain-specific knowledge
Cross-Lingual
Document Summarization
Plagiarism Detection
Large-Scale Data Management
    Scalability
    Big data management

