Microsoft Research1 Characterizing Alert and Browse Services for Mobile Clients Atul Adya, Victor Bahl, Lili Qiu Microsoft Research USENIX Annual Technical.

Slides:



Advertisements
Similar presentations
1 Analyzing Browse Patterns of Mobile Clients Lili Qiu Joint work with Atul Adya and Victor Bahl Microsoft Research ACM.
Advertisements

Alex Cheung and Hans-Arno Jacobsen August, 14 th 2009 MIDDLEWARE SYSTEMS RESEARCH GROUP.
Networking Problems in Cloud Computing Projects. 2 Kickass: Implementation PROJECT 1.
Digital Marketing Analytics v10. Introduction  Name / job role  What company are you with  How much experience do you have using Webtrends  Create.
Ranking Web Sites with Real User Traffic Mark Meiss Filippo Menczer Santo Fortunato Alessandro Flammini Alessandro Vespignani Web Search and Data Mining.
Computer Science Generating Streaming Access Workload for Performance Evaluation Shudong Jin 3nd Year Ph.D. Student (Advisor: Azer Bestavros)
2005/2/23 HUT T Characterizing Web Workload of Mobile Clients Chuang Yu Juha Raitio.
GlobeTraff A traffic workload generator for the performance evaluation of ICN architectures K.V. Katsaros, G. Xylomenos, G.C. Polyzos A.U.E.B. (presented.
The Effect of First-Hop Wireless Bandwidth Allocation on End-to-End Network Performance Lili Qiu, Paramvir Bahl, Atul Adya Microsoft Research NOSSDAV’2002.
Multi-Layer Analysis of Web Browsing Performance for Wireless PDAs Adesola Omotayo & Carey Williamson June 1, 2015.
Small-World File-Sharing Communities Adriana Iamnitchi, Matei Ripeanu and Ian Foster,
SCAN: A Dynamic, Scalable, and Efficient Content Distribution Network Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy,
1 Clustering Web Content for Efficient Replication Yan Chen, Lili Qiu*, Weiyu Chen, Luan Nguyen, Randy H. Katz EECS Department UC Berkeley *Microsoft Research.
Amanda Spink : Analysis of Web Searching and Retrieval Larry Reeve INFO861 - Topics in Information Science Dr. McCain - Winter 2004.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
Improving Proxy Cache Performance: Analysis of Three Replacement Policies Dilley, J.; Arlitt, M. A journal paper of IEEE Internet Computing, Volume: 3.
1 The Content and Access Dynamics of a Busy Web Server: Findings and Implications Venkata N. Padmanabhan Microsoft Research Lili Qiu Cornell University.
Web Servers How do our requests for resources on the Internet get handled? Can they be located anywhere? Global?
1 Clustering Web Content for Efficient Replication Yan Chen, Lili Qiu*, Weiyu Chen, Luan Nguyen, Randy H. Katz EECS Department UC Berkeley *Microsoft Research.
1 Simulation Evaluation of a Heterogeneous Web Proxy Caching Hierarchy Mudashiru Busari Carey Williamson University of Saskatchewan University of Calgary.
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
Network Traffic Measurement and Modeling CSCI 780, Fall 2005.
A Hierarchical Characterization of a Live Streaming Media Workload IEEE/ACM Trans. Networking, Feb Eveline Veloso, Virg í lio Almeida, Wagner Meira,
Copyright © 2005 Department of Computer Science CPSC 641 Winter Network Traffic Measurement A focus of networking research for 20+ years Collect.
Web Caching Robert Grimm New York University. Before We Get Started  Illustrating Results  Type Theory 101.
Topic-Sensitive PageRank Taher H. Haveliwala. PageRank Importance is propagated A global ranking vector is pre-computed.
Clustering of Web Content for Efficient Replication Yan Chen, Lili Qiu, Wei Chen, Luan Nguyen and Randy H. Katz {yanchen, wychen, luann,
Sujit Dey Adaptive Applications for Wireless Information Technology Sujit Dey ECE Department University of California, San Diego
Consumers on the Web: Identification of usage patterns Consumers on the Web: Identification of usage patterns by Nina Koiso-Kanttila
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Niranjan Balasubramanian Aruna Balasubramanian Arun Venkataramani University of Massachusetts Amherst Energy Consumption in Mobile Phones: A Measurement.
By Huang et al., SOSP 2013 An Analysis of Facebook Photo Caching Presented by Phuong Nguyen Some animations and figures are borrowed from the original.
1 EERE Communications 4/16/2015 EERE Web Coordinators Meeting Conference line: +1 (415) Access Code:
Introduction The large amount of traffic nowadays in Internet comes from social video streams. Internet Service Providers can significantly enhance local.
Authors: Maryam Kamvar and Shumeet Baluja Date of Publication: August 2007 Name of Speaker: Venkatasomeswara Pawan Addanki.
Web Cache Replacement Policies: Properties, Limitations and Implications Fabrício Benevenuto, Fernando Duarte, Virgílio Almeida, Jussara Almeida Computer.
1 Pieter Meulenhoff KPN Research ROOT2002 I-Mode Performance Monitoring Use of ROOT in telecommunications at KPN Pieter Meulenhoff.
1 6th EC/GIS workshop - Lyon - June 2000 Easy and friendly access to geographic information for mobile users David HELLO (Matra.
Module 10: Monitoring ISA Server Overview Monitoring Overview Configuring Alerts Configuring Session Monitoring Configuring Logging Configuring.
1 On the Placement of Web Server Replicas Lili Qiu, Microsoft Research Venkata N. Padmanabhan, Microsoft Research Geoffrey M. Voelker, UCSD IEEE INFOCOM’2001,
Web Caching and Content Distribution: A View From the Interior Syam Gadde Jeff Chase Duke University Michael Rabinovich AT&T Labs - Research.
1 An Integrated Approach to Improving Web Performance Lili Qiu Cornell University.
Sustainability: Web Site Statistics Marieke Napier UKOLN University of Bath Bath, BA2 7AY UKOLN is supported by: URL
Energy Consumption in Mobile Phones: A Measurement Study and Implications for Network Applications REF:Balasubramanian, Niranjan, Aruna Balasubramanian,
Interception and Analysis Framework for Win32 Scripts (not for public release) Tim Hollebeek, Ph.D.
Improving Cloaking Detection Using Search Query Popularity and Monetizability Kumar Chellapilla and David M Chickering Live Labs, Microsoft.
1 On the Placement of Web Server Replicas Lili Qiu, Microsoft Research Venkata N. Padmanabhan, Microsoft Research Geoffrey M. Voelker, UCSD IEEE INFOCOM’2001,
1 Challenges in Scaling E-Business Sites  Menascé and Almeida. All Rights Reserved. Daniel A. Menascé Department of Computer Science George Mason.
Analysis of Topic Dynamics in Web Search Xuehua Shen (University of Illinois) Susan Dumais (Microsoft Research) Eric Horvitz (Microsoft Research) WWW 2005.
Unconstrained Endpoint Profiling Googling the Internet Ionut Trestian, Supranamaya Ranjan, Alekandar Kuzmanovic, Antonio Nucci Reviewed by Lee Young Soo.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Injecting Realistic Burstiness to.
Hot Systems, Volkmar Uhlig
NTMS 2012 GlobeTraff: a traffic workload generator for the performance evaluation of future Internet architectures K.V. Katsaros, G. Xylomenos, G.C. Polyzos.
Evaluation of a Novel Two-Step Server Selection Metric Presented by Karthik Lakshminarayanan
Jorke Odolphi Product Technology Specialist WebCentral Using Microsoft Operations Manager To Monitor And Maintain Your Farm.
MiddleMan: A Video Caching Proxy Server NOSSDAV 2000 Brian Smith Department of Computer Science Cornell University Ithaca, NY Soam Acharya Inktomi Corporation.
Web Prefetching Lili Qiu Microsoft Research March 27, 2003.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
30 Copyright © 2009, Oracle. All rights reserved. Using Oracle Business Intelligence Delivers.
On the Placement of Web Server Replicas Yu Cai. Paper On the Placement of Web Server Replicas Lili Qiu, Venkata N. Padmanabhan, Geoffrey M. Voelker Infocom.
1 Internet Traffic Measurement and Modeling Carey Williamson Department of Computer Science University of Calgary.
On the scale and performance of cooperative Web proxy caching 2/3/06.
INTERNET AND . WHAT IS INTERNET The Internet can be defined as the wired or wireless mode of communication through which one can receive, transmit.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
Google Analytics Graham Triggs Head of Repository Systems, Symplectic.
Whole Page Performance Leeann Bent and Geoffrey M. Voelker University of California, San Diego.
1 DATA-DRIVEN SOLUTIONS. 2 KEYWORD-LEVEL SEARCH RETARGETING TARGET USERS BASED ON THEIR RECENT SEARCH HISTORY AND SEARCH QUERIES. A user performs a search.
Does Internet media traffic really follow the Zipf-like distribution? Lei Guo 1, Enhua Tan 1, Songqing Chen 2, Zhen Xiao 3, and Xiaodong Zhang 1 1 Ohio.
Learn to use: Salesfloor Reporting.
Feifei Li, Ching Chang, George Kollios, Azer Bestavros
Presentation transcript:

Microsoft Research1 Characterizing Alert and Browse Services for Mobile Clients Atul Adya, Victor Bahl, Lili Qiu Microsoft Research USENIX Annual Technical Conference Monterey, CA, June 2002

Microsoft Research2 Outline Motivation Related Work Overview of Data Logs and Key Results Detailed Analysis Notification Services Browse Services Correlation between the Two Services Summary and Implications

Microsoft Research3 Motivation Wireless web services Becoming popular Crucial to understand usage pattern Few existing studies on how they are used

Microsoft Research4 Related Work Workload of clients at wireline networks Server-based studies NASA, ClarkNet, MSNBC, WorldCup, … Proxy-based studies NLANR, Digital, UW, … Client-based studies Boston Univ., WebTV, … Workload of wireless clients Kunz et. al Only 80K requests over seven months No existing study on notification usage

Microsoft Research5 Overview A popular commercial Web site for mobile clients Content news, weather, stock quotes, , yellow pages, travel reservations, entertainment etc. Services Notification Browse Period studied 3.25 million notifications in Aug. 20 – 26, million browse requests in Aug. 15 – 26, 2000

Microsoft Research6 Overview: User Categories Cellular users Browse the Web in real time using cellular technologies Offline users Download content onto their PDAs for later (offline) browsing, e.g. AvantGo Desktop users Signup services and specify preferences Notification log has 200,860 users (99% were wireless users) Browse log: User Type# Users# Requests Cellular 58,432 2,210,758 Offline 50,96820,508,272 Desktop639,971 7,342,206 Misc. 1,634 2,944,708

Microsoft Research7 Major Findings Notification Services Popularity of notification messages follows Zipf-like distribution Top 1% notification objects account for % of total messages Exhibits geographical locality Browse Services 0.1% - 0.5% urls account for 90% requests The set of popular urls remain stable Correlation between the two services Correlation is limited

Microsoft Research8 Outline Motivation Related Work Overview of Data Logs and Key Results Detailed Analysis Notification Services Browse Services Correlation between the Two Services Summary and Implications

Microsoft Research9 Notification Log Analysis Types of Analyses Content analysis Notification message popularity User behavior analysis Geographical locality

Microsoft Research10 Content Analysis Important to content providers and notification service designers Popular categories: weather, news, stock quotes, .

Microsoft Research11 Notification Message Popularity  Researchers have found Web accesses follow Zipf-like distribution (i.e., # request  1/ i  ) Notification message popularity follows Zipf-like distribution (   [1.1, 1.3])  generate synthetic traces

Microsoft Research12 Notification Msg Popularity (Cont.) Notification msgs are highly concentrated on a small number of documents Top 1% notification documents account for 54% - 64% of the total messages Application-level multicast would be an efficient way of delivering popular notifications.

Microsoft Research13 Geographical Locality Local sharing  2 users in the same cluster receive the msg Notification exhibits geographical locality.

Microsoft Research14 Outline Motivation Related Work Overview of Data Logs and Key Results Detailed Analysis Notification Services Browse Services Correlation between the Two Services Summary and Implications

Microsoft Research15 Browser Log Analysis Types of Analyses Content analysis Documents popularity User behavior analysis Temporal stability Geographical locality Load distribution of different users

Microsoft Research16 Content Analysis Important to content providers: what content is interesting to users Rank #1Rank #2Rank #3 NotificationWirelessNewsWeatherStock BrowseWirelessStock quotes NewsYellowPages OfflineHelpNewsStock DesktopSign-ups Sports Top three preferences for different kinds of users

Microsoft Research17 Document Popularity Two definitions of document Base URLs Full URLs: including parameters Document Popularity does not closely follow Zipf-like distribution.

Microsoft Research18 Document Popularity (Cont.) Requests are highly concentrated on a small number of documents 0.1% - 0.5% full urls (i.e., 112 – 442) account for 90% requests Very small amount of memory needed to cache popular query results if content doesn’t change.

Microsoft Research19 Temporal Stability Methodology Consider 2 days’ traces Pick the top n documents from each day Compute overlap Popular urls remain stable  cache popular query results or optimize performance based on stable workload

Microsoft Research20 Geographical Locality Limited geographical locality in users’ browse interest. Compare local sharing in geographical clusters vs. in random clusters

Microsoft Research21 Load Distribution of Users Offline users generate more bursty traffic  need to identify & properly handle such bursts

Microsoft Research22 Outline Motivation Related Work Overview of Data Logs and Key Results Detailed Analysis Notification Services Browse Services Correlation between the Two Services Summary and Implications

Microsoft Research23 Correlation between Notification and Browsing Correlation in the amount of usage Correlation in popular content categories

Microsoft Research24 Correlation in Amount of Usage Low correlation in usage. correlation coefficient is 0.26 for all users, and 0.12 for wireless users.

Microsoft Research25 Correlation in Content Categories Approach Classify notifications and browsing requests into content categories For each individual user, compare his/her top N notification categories with top N browsing categories Metric Average overlap Wireless users have moderate correlation in content. The correlation is much lower when considering all users.

Microsoft Research26 Summary & Implications ObservationsImplications Top 1% notification objects account for % of total messages. Delivering notifications via multicast would be effective. Notification exhibits geographical locality. Useful to provide localized notification services.

Microsoft Research27 Summary & Implications (Cont.) ObservationsImplications 0.1% - 0.5% full urls (i.e ) account for 90% requests. Caching the results of popular queries would be very effective. The set of popular urls remain stable. Cache a stable set of popular query results or optimize query performance based on a stable workload. Limited correlation between users’ browsing and notification pattern. Service providers cannot solely rely on users’ notification profile to predict how much & what they will browse.

Microsoft Research28 Comparison NotificationBrowsing Zipf-like popularity distribution YesNo High concentration of msgs/requests to documents Yes Spatial LocalitySignificantLittle CorrelationLimited correlation in both usage and content