C. Lee Giles David Reese Professor, College of Information Sciences and Technology Graduate Professor of Computer Science and Engineering Courtesy Professor.

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Knowledge Management and Engineering David Riaño.
“ Leveraging SharePoint 2010 Search Technologies ” With: Ivan Neganov.
1 Oct 30, 2006 LogicSQL-based Enterprise Archive and Search System How to organize the information and make it accessible and useful ? Li-Yan Yuan.
Web Information Retrieval and Extraction Chia-Hui Chang, Associate Professor National Central University, Taiwan
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
6/16/20151 Recent Results in Automatic Web Resource Discovery Soumen Chakrabartiv Presentation by Cui Tao.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Web Information Retrieval and Extraction Chia-Hui Chang, Associate Professor National Central University, Taiwan Sep. 16, 2005.
© Anselm SpoerriInfo + Web Tech Course Information Technologies Info + Web Tech Course Anselm Spoerri PhD (MIT) Rutgers University
1 Information Retrieval and Web Search Introduction.
Modern Information Retrieval Chapter 1 Introduction.
CS 345 Data Mining Lecture 1 Introduction to Web Mining.
COMPUTER APPLICATIONS TO BUSINESS ||
SEARCH ENGINES By, CH.KRISHNA MANOJ(Y5CS021), 3/4 B.TECH, VRSEC. 8/7/20151.
Web and Intranet Search: What‘s Next After Google* ? Moderator: Gerhard Weikum (Max-Planck Institute for CS) Panelists: Eric Brill (Microsoft Research)
Sinan Kanatsiz January 15, 2008 OCEN at The Pacific Club
Databases & Data Warehouses Chapter 3 Database Processing.
Web Search Engines and Information Retrieval on the World-Wide Web Torsten Suel CIS Department Overview: introduction.
1 Web Search and Advanced Internet Services 290N Class Introduction Tao Yang, 2014.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Enterprise & Intranet Search How Enterprise is different from Web search What to think about when evaluating Enterprise Search How Intranet use is different.
1 Information Retrieval and Advanced Internet Services 290N Class Introduction Tao Yang, 2015
Module 3: Business Information Systems Chapter 8: Electronic and Mobile Commerce.
Electronic CommerceNonhlanhla Shongwe  Introduction  Mission statement  Product  Business model  SWOT Analysis  Conclusion.
IST 441 Example Projects. Undergrad Project Find a customer – interest in xbox game forum Build a search engine for Xbox game forums etc. Compare two.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
Internet Information Retrieval Sun Wu. Course Goal To learn the basic concepts and techniques of internet search engines –How to use and evaluate search.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Xiaoying Sharon Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
SEO : Search Engine Optimization. SEO : How It Works Web is a Network of Links Search Engines use automated robots or crawlers to scour the Web for content.
Edinburg March 2001CROSSMARC Kick-off meetingICDC ICDC background and know-how and expectations from CROSSMARC CROSSMARC Project IST Kick-off.
Search Engine Marketing The Tools of Online Marketing and Thought Leadership.
CSM06 Information Retrieval Lecture 1a – Introduction Dr Andrew Salway
GUIDED BY DR. A. J. AGRAWAL Search Engine By Chetan R. Rathod.
Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.
LOGO A comparison of two web-based document management systems ShaoxinYu Columbia University March 31, 2009.
ACIS Introduction to Data Analytics & Business Intelligence Database s Benefits & Components.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Information Retrieval
Information Retrieval Systems Info624 – Week 1 Dr. Xia Lin Associate Professor College of Information Science and Technology Drexel University.
L&I SCI 110: Information science and information theory Instructor: Xiangming(Simon) Mu Sept. 9, 2004.
C. Lee Giles David Reese Professor, College of Information Sciences and Technology Graduate Professor of Computer Science and Engineering Courtesy Professor.
1 CS 430: Information Discovery Lecture 18 Web Search Engines: Google.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
Contextual Text Cube Model and Aggregation Operator for Text OLAP
Selected Semantic Web UMBC CoBrA – Context Broker Architecture  Using OWL to define ontologies for context modeling and reasoning  Taking.
93% of online activities begin with search billion+ searches conducted worldwide each month. 5 92% of internet users search. 2.
Search Engine and Optimization 1. Introduction to Web Search Engines 2.
Traffic Source Tell a Friend Send SMS Social Network Group chat Banners Advertisement.
Xiaoying Sharon Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
SEARCH ENGINE by: by: B.Anudeep B.Anudeep Y5CS016 Y5CS016.
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
Statistical Learning Methods for Natural Language Processing on the Internet 徐丹云.
Information Retrieval and Web Search
The Bing Search APIs in the Azure Marketplace Enable Primal to Deliver Personalized Content “Primal's patented AI provides a comprehensive understanding.
Global Enterprise Search
INFORMATION RETRIEVAL TECHNIQUES BY DR. ADNAN ABID
Data Mining Chapter 6 Search Engines
CSE 635 Multimedia Information Retrieval
Digital Marketing Company in Delhi NCR
Course Summary ChengXiang “Cheng” Zhai Department of Computer Science
Web Mining Department of Computer Science and Engg.
Agenda What is SEO ? How Do Search Engines Work? Measuring SEO success ? On Page SEO – Basic Practices? Technical SEO - Source Code. Off Page SEO – Social.
Information Retrieval and Web Search
Web Search and Advanced Internet Services
IST 511 Information Management: Information and Technology
Professor C. Lee Giles David Reese Professor – IST; graduate Professor - CSE Adjunct Professor – Princeton, Pennsylvania, Columbia, Pisa, Trento Graduated.
Presentation transcript:

C. Lee Giles David Reese Professor, College of Information Sciences and Technology Graduate Professor of Computer Science and Engineering Courtesy Professor of Supply Chain and Information Systems The Pennsylvania State University, University Park, PA, USA Information Retrieval and Search Engines IST 441 Introduction to course and search engines

Course homepage Everything you need to know about the course – –Or put IST441 into Google Project Exercises Readings Schedule Participation Exam

Professor C. Lee Giles Intelligent and specialty search engines; cyberinfrastructure for science, academia and government; big data –Modular, scalable, robust, automatic science and technology focused cyberinfrastructure and search engine creation and maintenance –Large heterogeneous data and information systems –Specialty science and technology search engines for knowledge discovery & integration CiteSeer x (all scholarly documents – focus on computer science) Chem X Seer (e-chemistry portal) CollabSeer (collaboration search) CSSeer (expert finding) Scalable intelligent tools/agents/methods/algorithms –Information, knowledge and data integration –Information and metadata extraction; entity recognition –Chemical formulae & names, tables, and figures –Unique search, knowledge discovery, information integration, data mining algorithms –Expert and collaboration recommendation –Research evaluation

What will be covered What is information –How much is there? Properties of text –Documents models Information retrieval (IR) systems and methods –Query structures –Evaluation and Relevance –Role of the user –Vector models –Inverted index

What will be covered Search engines as IR systems and how they work –Indexers –Crawlers –Ranking –Evaluation –SEO Internet and Web –Web structure Semantic search Google and link analysis Social networks

Approach Readings and Lectures –Exercises –One exam –Participation Projects –Build 2 specialty search engines for a customer Customer defines the project –Built with Nutch, YouSeer, Lucid Works (based on Solr/Lucene)Solr/Lucene –Who uses LuceneLucene –Build a Google Custom Search Engine »Comparison of these two –Customer receives (reviews) search engine at the end of the semester –Presentation on search engines built –Report on search engine due at end of semester Undergrads vs grads Guest seminars

manyeyes visualization

Search gains on July 2008 Pew Internet StudyPew Internet manyeyes visualization

Web Search Engine Use and Commerce Continues to Grow Pew Internet & American Life Internet Project Survey: Sept, Search Engine News: Search engine advertising revenues exceed TV networks Walmart and other retailers express concern over Google FOG replaces FOM

Web Search Engine Use and Commerce Continues to Grow Pew Internet & American Life Internet Project Survey: August, 2008 PewInternet

Web search engine use has new activities Pew Internet & American Life Internet Project Survey: 2009 PewInternet

Web Search Engine Use and Commerce Continues to Grow

Search Engine Market Share

Search Engine Market Share - US seoconsultants

2009 Search Engine Market Share - US seoconsultants

Search Engine Market Share - US

Search Engine Market Share

Dec billion internet users

Marketshare Search engine market share seems to be debatable ComScoreComScore global share

ComScoreComScore global share Number of search engine queries - US About 500M per day

2012

ComScoreComScore global share

ComScoreComScore global share Number of search engine queries - US About billion per day

Students who took this course Google Yahoo Microsoft Facebook RIT IBM Tencent Klout eBay Raytheon Lockheed Martin …