Rakesh Agrawal Technical Fellow Search Labs, Microsoft Research – Silicon Valley.

Presentation transcript:

Rakesh Agrawal Technical Fellow Search Labs, Microsoft Research – Silicon Valley

Current state of affairs
Evolving search
Search Labs projects

Navigational queries
Pseudo-navigational queries

Car GPS around $300
Four-day trip to Bhutan from Delhi to visit important Buddhist places

Game Consoles
Party Site

Search queries are not grammatically correct questions, but they are not bags of words either
Query terms are often more than strings of characters
Data often has structure, or structure can be derived
Search can span multiple sessions over several days
Search often provides an entry point for browsing, and search and browsing are intermixed
Expectations from search are increasing

Current state of affairs
Evolving search
Search Labs projects

Health
Education
“Humanity’s greatest advances are not in its discoveries – but in how those discoveries are applied to reduce inequity.”
Bill Gates. Harvard Commencement. June 7, 2007

“Is it right? Is it just? Is it in the interest of mankind?”
Woodrow Wilson. May 30,
Applications to benefit individuals and society

New challenge: chronic conditions
Illnesses and impairments that are expected to last a year or more, limit what one can do, and may require ongoing care
In 2005, 133 million Americans lived with a chronic condition (up from 118 million in 1995)
Deaths due to infectious diseases

Tremendous simplification in the technologies for effortlessly capturing useful personal information
Dramatic reduction in the cost and form factor for personal storage
Cloud computing

Charts for appropriate demographics?
  Optimum level for Asian Indians: 150 mg/dL (much lower than 200 mg/dL for Westerners)
  Due to elevated levels of lipoprotein(a)*
Distributed computation and selection across millions of nodes
Privacy and security
* Enas et al. Coronary Artery Disease In Asian Indians. Internet J. Cardiology

Significant achievements, but problems remain …

Poor performance
  39% dropouts in primary, additional 15.6% in secondary, additional 11.7% in higher secondary
  Pass-out ratio is 50% at Class X, and the majority of them pass in 3rd division
  Less than 8% finish all schooling to qualify for a college education
Poorly trained teachers
  51% of primary teachers are higher secondary or below
  Only 44% have received in-service training
  Absence of learning material for teachers to update their knowledge
Poor teacher-student ratios
  Ratio in primary is 1:43; in secondary and higher secondary it is 1:34
  About 9% of primary schools have a teacher-student ratio > 1: % of primary schools have no teachers, 19% have only one teacher for all classes
Poor quality of material
  Poor quality of textbooks, outdated curriculum
Source: IBM Report on Improving India’s Education System through Information Technology, 2005

Attacking Complex Problems*

Framework
1. Define goal.
2. Find the highest-leverage approach.
3. Discover the ideal technology for that approach.
4. In the meantime, make the smartest application of the technology on hand.

Application to Education
1. Quality education to all.
2. New pedagogy.
3. Individualized learning with teacher as a discussant.
4. Internet-based mass collaboration to help teachers teach better and improve the educational infrastructure.

* Bill Gates. Harvard Commencement. June 7, 2007

Participation of experts, teachers, parents and students in the development and revision of curricula
Sharing and collaborative development of lectures, assignments, tests, etc.
Tools for capturing feedback on textbooks (errors, better explanations, supplementary readings)
Collaborative translation and localization of educational material

Current state of affairs
Evolving search
Search Labs projects

Search Labs
Research areas: web mining, information retrieval, machine learning, data management, algorithms, privacy, inconsistent data, parallel mining, ranking, link analysis, query processing, computational economics, game theory, NLP
Invent next in Internet search and applications

Best car GPS around $300
Category = “Auto GPS” Price = approx(300) Order By ReviewRank
Structure and semantics in data and queries
Insights on user behavior from massive data mining
From ranking to decision making
Task-orientation
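The rewrite of the keyword query into the structured form above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the Search Labs implementation: the names `parse_query`, `approx`, `run`, and `CATEGORY_KEYWORDS`, the 20% tolerance used for `approx`, the rule that "best" means ordering by review rank, and the sample catalog are all hypothetical.

```python
import re
from dataclasses import dataclass
from typing import Dict, List, Optional

# Hypothetical keyword-to-category table; a real system would likely learn this mapping.
CATEGORY_KEYWORDS = {"car gps": "Auto GPS"}

@dataclass
class StructuredQuery:
    category: Optional[str]
    approx_price: Optional[float]
    order_by: Optional[str]

def parse_query(text: str) -> StructuredQuery:
    """Turn a keyword query into a category, an approximate price, and an ordering."""
    lowered = text.lower()
    category = next((c for k, c in CATEGORY_KEYWORDS.items() if k in lowered), None)
    price = re.search(r"\$(\d+(?:\.\d+)?)", text)
    # Assumption: the word "best" signals ordering by review rank.
    order_by = "ReviewRank" if re.search(r"\bbest\b", lowered) else None
    return StructuredQuery(category, float(price.group(1)) if price else None, order_by)

def approx(value: float, target: float, tolerance: float = 0.2) -> bool:
    """Assumed meaning of approx(300): within 20% of the target price."""
    return abs(value - target) <= tolerance * target

def run(query: StructuredQuery, items: List[Dict]) -> List[Dict]:
    """Filter items by category and approximate price, then apply the ordering."""
    hits = [
        item for item in items
        if (query.category is None or item["category"] == query.category)
        and (query.approx_price is None or approx(item["price"], query.approx_price))
    ]
    if query.order_by == "ReviewRank":
        hits.sort(key=lambda item: item["review_rank"])  # rank 1 is best
    return hits

# Made-up catalog, for illustration only.
CATALOG = [
    {"name": "A", "category": "Auto GPS", "price": 310.0, "review_rank": 2},
    {"name": "B", "category": "Auto GPS", "price": 280.0, "review_rank": 1},
    {"name": "C", "category": "Auto GPS", "price": 600.0, "review_rank": 3},
    {"name": "D", "category": "Camera", "price": 300.0, "review_rank": 1},
]

query = parse_query("Best car GPS around $300")
results = run(query, CATALOG)
```

Keeping the parsed query as a small record separates interpretation (what the user meant) from retrieval (which items match), which is the distinction the slide draws between strings of characters and structure.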

Symphony
Enable non-developers to create and monetize custom search applications that combine their data and knowledge with Search services.
[Architecture diagram, ShoeQueen example. Query: “Trail running shoes”. Components: Customer Data (ShoeQueen Data, ShoeQueen Config), Data Services, Semi-structured Query Processing, Symphony Runtime, Query/Click Logger (Query, Click, App-ID), Rev Sharing Component, Advertising. Runtime steps: 1. Collect initial results 2. Add additional data 3. Generate HTML. Sources: Web, Image, Video, News, Advertising, 3rd Party Proprietary (forum/review pages, images, ads).]
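The three runtime steps named in the diagram (collect initial results, add additional data, generate HTML) might look like this in outline. Everything here is a hypothetical sketch: the function names, the in-memory stand-ins for customer data and search services, and the ShoeQueen records are invented for illustration and do not describe Symphony's actual interfaces.

```python
from html import escape
from typing import Dict, List

# Hypothetical customer data, standing in for the "ShoeQueen Data" box.
INVENTORY = [
    {"item": "TrailBlazer 2", "keywords": "trail running shoes", "price": 89.0},
    {"item": "RoadRunner", "keywords": "road running shoes", "price": 75.0},
]

# Hypothetical stand-in for a search service that returns review pages per item.
REVIEWS = {"TrailBlazer 2": ["example.com/review/trailblazer-2"]}

def collect_initial_results(query: str) -> List[Dict]:
    """Step 1: match the query against the customer's own data."""
    terms = set(query.lower().split())
    return [dict(row) for row in INVENTORY if terms <= set(row["keywords"].split())]

def add_additional_data(results: List[Dict]) -> List[Dict]:
    """Step 2: attach supplementary content from search services."""
    for row in results:
        row["reviews"] = REVIEWS.get(row["item"], [])
    return results

def generate_html(results: List[Dict]) -> str:
    """Step 3: render the combined results as an HTML list."""
    lines = ["<ul>"]
    for row in results:
        links = ", ".join(escape(url) for url in row["reviews"]) or "no reviews"
        lines.append(f"  <li>{escape(row['item'])} (${row['price']:.2f}): {links}</li>")
    lines.append("</ul>")
    return "\n".join(lines)

page = generate_html(add_additional_data(collect_initial_results("Trail running shoes")))
```

Each step takes and returns plain data, so a non-developer-facing configuration could in principle choose which sources feed step 2 without touching steps 1 or 3.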

Tools for creating and updating content (Wikipedia++)
Trust and authoritativeness of content
Personalization of search to find the material suitable for one’s own style of teaching
Bootstrapping and incentives

Search is becoming an essential “utility”
Need to develop new foundations and abstractions to take search to the next level
Academia can (and must) play a leading role

Search Labs’ mission is to invent next in Internet search and applications