1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component.

Slides:



Advertisements
Similar presentations
Dissemination-based Data Delivery Using Broadcast Disks.
Advertisements

Content Interaction and Formatting, Tayeb LEMLOUMA & Nabil Layaïda. November Tayeb Lemlouma & Nabil Layaïda Presented by Sébastien Laborie November.
Panasonic Singapore Labs – Network Team QoS and Delivery Context in Rule-Based Edge Services Prepared for IWCW2002 By Ng Chan Wah
Differentiated Multimedia Web Services Using Quality Aware Transcoding Surendar Chandra, Carla Schlatter Ellis and Amin Vahdat Department of Computer Science,
Connecting Knowledge Silos using Federated Text Mining Guy Singh Senior Manager, Product & Strategic Alliances ©2014 Linguamatics Ltd.
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Web Mining Research: A Survey Authors: Raymond Kosala & Hendrik Blockeel Presenter: Ryan Patterson April 23rd 2014 CS332 Data Mining pg 01.
Team 7 / May 24, 2006 Web Based Automation & Security Client Capstone Design Advisor Prof. David Bourner Team Members Lloyd Emokpae (team Lead) Vikash.
ReTemp Design Review 09/09/04 By Kenny Chung Amish Rughoonundon Amish Rughoonundon.
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Adaptive Push-Pull: Disseminating Dynamic Web Data Pavan Deolasee, Amol Katkar, Krithi,Ramamritham Indian Institute of Technology Bombay Dept. of CS University.
Web Usage Mining: Processes and Applications
Clementine Server Clementine Server A data mining software for business solution.
Web Mining Research: A Survey
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Efficiently Maintaining Stock Portfolios Up-To-Date On The Web Prashant Shenoy Manish Bhide Krithi Ramamritham 2002 IEEE E-Commerce System Proceedings.
1 Web Content Delivery Reading: Section and COS 461: Computer Networks Spring 2007 (MW 1:30-2:50 in Friend 004) Ioannis Avramopoulos Instructor:
Capacity planning for web sites. Promoting a web site Thoughts on increasing web site traffic but… Two possible scenarios…
Knowledge Process Outsourcing1 Turning Information into Knowledge... for YOU The Gyaan Team.
Towards Autonomic Hosting of Multi-tier Internet Services Swaminathan Sivasubramanian, Guillaume Pierre and Maarten van Steen Vrije Universiteit, Amsterdam,
Presentation By: Brian Mais. What Is It? Content Management Systems(CMS) describes software that manage content, workflow, and collaboration online and.
FALL 2012 DSCI5240 Graduate Presentation By Xxxxxxx.
Web 2.0: Concepts and Applications 11 The Web Becomes 2.0.
Databases and the Internet. Lecture Objectives Databases and the Internet Characteristics and Benefits of Internet Server-Side vs. Client-Side Special.
Chapter 33 CGI Technology for Dynamic Web Documents There are two alternative forms of retrieving web documents. Instead of retrieving static HTML documents,
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
+ CS 325: CS Hardware and Software Organization and Architecture Cloud Architectures.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Content Management Systems Week 14 LBSC 671 Creating Information Infrastructures.
Dynamic Content On Edge Cache Server (using Microsoft.NET) Name: Aparna Yeddula CS – 522 Semester Project Project URL: cs.uccs.edu/~ayeddula/project.html.
Web Caching By Neeraj Agrawal. Caching Caching is widely used for improving performance in many context( e.g processor caches in hardware, buffer pool.
Campus Tour COMP 523 Midterm Presentation Justin, Paul, Florian.
Context-Aware Interactive Content Adaptation Iqbal Mohomed, Jim Cai, Sina Chavoshi, Eyal de Lara Department of Computer Science University of Toronto MobiSys2006.
CHAPTER TEN AUTHORING.
Sustainability: Web Site Statistics Marieke Napier UKOLN University of Bath Bath, BA2 7AY UKOLN is supported by: URL
TranService: Service and Media Translation System for Small Devices Graduate School of Media and Governance, Keio University Jun’ichi Yura
Community-Driven Adaptation Iqbal Mohomed Department of Computer Science University of Toronto.
Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data- Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan,
Mobile Technology By Devin Satterthwaite November 27, 2007.
Srivastava J., Cooley R., Deshpande M, Tan P.N.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Framework for Virtual Web Laboratory I. Petković M. Rajković.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
1 Shetal Shah, IITB Dissemination of Dynamic Data: Semantics, Algorithms, and Performance.
Dispatching Java agents to user for data extraction from third party web sites Alex Roque F.I.U. HPDRC.
Javascript JavaScript is what is called a client-side scripting language:  a programming language that runs inside an Internet browser (a browser is also.
Ad insertion at proxies to improve cache hit rates Amit Gupta and Geoffrey baehr, Sun Microsystems Laboratories 901 San Antonio Road Palo Alto,CA
Introduction to the World Wide Web & Internet CIS 101.
Introduction to ASP.NET development. Background ASP released in 1996 ASP supported for a minimum 10 years from Windows 8 release ASP.Net 1.0 released.
MICROSOFT AJAX CDN (CONTENT DELIVERY NETWORK) Make Your ASP.NET site faster to retrieve.
BUILD SECURE PRODUCTS AND SERVICES
Databases and DBMSs Todd S. Bacastow January 2005.
Network Infrastructure Services Supporting WAP Clients
DATA MINING © Prentice Hall.
E-commerce | WWW World Wide Web - Concepts
E-commerce | WWW World Wide Web - Concepts
The Improvement of PaaS Platform ZENG Shu-Qing, Xu Jie-Bin 2010 First International Conference on Networking and Distributed Computing SQUARE.
PHP / MySQL Introduction
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Database Driven Websites
Data, Databases, and DBMSs
Data Warehousing and Data Mining
A Component-based Architecture for Mobile Information Access
Web Mining Department of Computer Science and Engg.
Overview The World Wide Web has changed the way that people
Dissemination of Dynamic Data on the Internet
Overview The World Wide Web has changed the way that people
Web Mining Research: A Survey
Stream-Lined Data Management
Presentation transcript:

1 Data Mining at work Krithi Ramamritham

2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component Headline Component Headline Component Headline Component Personalized Component Navigation Component

3 1. What to deliver? Page content may be based on queries on dynamically changing data – e.g., sports scores, stock prices, environment type of access device time and location of access/user Existing sites may contain new information New sites (URLs) may come into being

4 2. How to deliver? Data sources Proxies /caches End-hosts servers sensors wired host mobile host Network

5 Keep Data Up-to-date Update Mumbai temperature every 2 degrees The proxy obtains data from the source(s) | U(t) - S(t) | <= 2Maintains | U(t) - S(t) | <= 2 Source S(t) Proxy / DB P(t) User U(t)

6 When to poll the source? After a specific interval Server Proxy User Pull Based on temporal data mining – time series analysis – and prediction of when change will exceed 2 degrees

7 Where to do the work? Diverse client devices –Differ in hardware, software, network connectivity, form factor Web content needs to be tailored for each client type Each response depends not only on the requested URL but also on the capabilities of the client

8 Transcoding Conversion of one data version to another –Decreasing Image Quality (JPEG quality level) and size - “convert” utility in Linux –Summarizing text transcode => Info extraction/ retrieval/ classification

9 Who should transcode? 1.Download desired version from server 2.Transcode higher version locally Factors influencing decision –Transcoding Complexity –Proxy-server network connection –Load on proxy (Multiple Linear) Regression Predict based on a (linear) model of overheads

10 What is new on the Web? How is the monsoon progressing? Time series analysis: Change prediction, pattern mining

‘Bhav Puchiye’ Interface for Bhav Puchiye

Inverted Pyramid Interfaces Inverted pyramid approach Conclusion Findings Discussions Conclusion Discussions Findings Background & related Information

Bhav Poochiye Pricing Module developed for selected commodities for selected markets for selected areas DEMO

14 Building Usage Profiles Estimate access probabilities based on: Current user/community navigational patterns over site contents (in the form of click streams) Historical user/community access patterns over site contents (in the form of association rules) Cluster needs based on location, income/age of user, time-of-day

15 Data Mining From data to information to knowledge to money!