Dr. Mike Lowndes, Interactive Media Manager, Natural History Museum, London – Houses 350-permanent scientific staff, plus postgraduate students; one of.

Slides:



Advertisements
Similar presentations
Protecting Browser State from Web Privacy Attacks Collin Jackson, Andrew Bortz, Dan Boneh, John Mitchell Stanford University.
Advertisements

Monitoring a web sites health. Web Analytics - Definition Measurement of the behavior of visitors to a website Which aspects of the website work towards.
Cookies, Sessions. Server Side Includes You can insert the content of one file into another file before the server executes it, with the require() function.
TCP/IP Protocol Suite 1 Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 22 World Wide Web and HTTP.
Copyright © 2012 Certification Partners, LLC -- All Rights Reserved Lesson 4: Web Browsing.
Digital Marketing Analytics v10. Introduction  Name / job role  What company are you with  How much experience do you have using Webtrends  Create.
Evaluation Workshop: Quantitative Evaluation Methods Peter Dowdell NOF-digitise Technical Advisory Service web:
Lesson 4: Web Browsing.
Measuring Success: SES London 2007 An Introduction to Web Analytics ● Types of Tracking ● Why You Need Analytics ● How to Employ Tracking Data ● Specific.
Copyright 2004 Monash University IMS5401 Web-based Systems Development Topic 2: Elements of the Web (g) Interactivity.
Chapter 12: Web Usage Mining - An introduction
1 Caching in HTTP Representation and Management of Data on the Internet.
Web applications. Javascript. Web 2.0: The dynamic, read-write web UC Santa Cruz CMPS 10 – Introduction to Computer Science
Web Metrics October 26, 2006 Steven Schwartz President, PowerWebResults.com Southeastern Massachusetts E-Commerce Network University of Massachusetts –
12/11/01 Matt Bridges Advisor: Ralph Morelli. What is Web Analytics? In traditional commerce, store owners can observe their customers habits: What time.
Introduction to eValid Presentation Outline What is eValid? About eValid, Inc. eValid Features System Architecture eValid Functional Design Script Log.
Application Layer  We will learn about protocols by examining popular application-level protocols  HTTP  FTP  SMTP / POP3 / IMAP  Focus on client-server.
1 The World Wide Web. 2  Web Fundamentals  Pages are defined by the Hypertext Markup Language (HTML) and contain text, graphics, audio, video and software.
Web Proxy Server Anagh Pathak Jesus Cervantes Henry Tjhen Luis Luna.
The easy way to a nice looking website design By a total non-designer (Me!)
By Raza / Faisal By: Raza Usmani Faisal Khan. What is SEO? It is the process of affecting the visibility of a website or a web page in a search engine's.
Evaluating Web Server Log Analysis Tools David Strom SD’98 2/13/98.
Alexander Hartmann.  Free service offered by Google that generates detailed statistics about the visitors to a website. A premium version is also available.
Web Analytics
Prof. Vishnuprasad Nagadevara Indian Institute of Management Bangalore
With Internet Explorer 9 Getting Started© 2013 Pearson Education, Inc. Publishing as Prentice Hall1 Exploring the World Wide Web with Internet Explorer.
1 Web Developer Foundations: Using XHTML Chapter 11 Web Page Promotion Concepts.
Server tools. Site server tools can be utilised to build, host, track and monitor transactions on a business site. There are a wide range of possibilities.
242/102/49 0/51/59 181/172/166 Primary colors 248/152/29 PMS 172 PMS 137 PMS 546 PMS /206/ /227/ /129/123 Secondary colors 114/181/204.
Jump to first page Tracking users Analyzing how people use your site by Dylan Tweney
XHTML Introductory1 Linking and Publishing Basic Web Pages Chapter 3.
5 Chapter Five Web Servers. 5 Chapter Objectives Learn about the Microsoft Personal Web Server Software Learn how to improve Web site performance Learn.
Using audience metrics to grow revenue January 2010.
University of Sunderland CDM105 Session 5 Web Authoring Tools The past and present A history of web authoring tools and an overview of Macromedia Dreamweaver.
NASRULLAH KHAN.  Lecturer : Nasrullah   Website :
1 Tradedoubler & Mobile Mobile web & app tracking technical overview.
Web Analytics Unit 4-1(2005 Fall) Managing the Digital Enterprise By Professor Michael Rappa.
Web Metrics Sean Fox Science Education Resource Center Carleton College
1 Lies, damn lies and Web statistics A brief introduction to using and abusing web statistics Paul Smith, ILRT July 2006.
Lecture 8 – Cookies & Sessions SFDV3011 – Advanced Web Development 1.
Sustainability: Web Site Statistics Marieke Napier UKOLN University of Bath Bath, BA2 7AY UKOLN is supported by: URL
Log files presented to : Sir Adnan presented by: SHAH RUKH.
Chapter 12: Web Usage Mining - An introduction Chapter written by Bamshad Mobasher Many slides are from a tutorial given by B. Berendt, B. Mobasher, M.
Web Site Statistics A Metric for Measuring Engagement.
Web Analytics Tom Wojciechowski DMS 446/546 - Interface Design.
Team Site Admin with SharePoint 2010 Gareth Johns IT Skills Development Advisor.
1 Emerging Technology Using RSS RSS and syndication By Steve Sloan RSS and syndication By Steve Sloan.
The Problem of State. We will look at… Sometimes web development is just plain weird! Internet / World Wide Web Aspects of their operation The role of.
NASRULLAH KHAN.  Lecturer : Nasrullah   Website :
Web Measurement. The Web is Different from other Commuication Media More precise measurement of activity on Web sites is available More precise measurement.
SPI NIGHTLIES Alex Hodgkins. SPI nightlies  Build and test various software projects each night  Provide a nightlies summary page that displays all.
ITM © Port,Kazman 1 ITM 352 Cookies. ITM © Port,Kazman 2 Problem… r How do you identify a particular user when they visit your site (or any.
May 6, 2009 Browser Compatibility Testing Definition It is a non functional type of testing where web based applications are tested on various browsers(IE.
Introduction Web analysis includes the study of users’ behavior on the web Traffic analysis – Usage analysis Behavior at particular website or across.
Web Analytics and Reporting Michal Neuwirth Product Manager – Kentico Software.
Whole Page Performance Leeann Bent and Geoffrey M. Voelker University of California, San Diego.
Windows Vista Configuration MCTS : Internet Explorer 7.0.
Sitecues Metrics David Young (617) Discussion Document.
Web Analytics Fundamentals Presented by Tejaswi, Chandrika, Sunil.
What is Staff Connect? When will Staff Connect launch?
Chapter 17 The Need for HTML 5.
What is Google Analytics?
Examining Resource Use on an Amateur Paleontology Website
PIWIK JUNIOR TIDAL ASSOCIATE PROF., WEB SERVICES & MULTIMEDIA LIBRARIAN NEW YORK CITY COLLEGE OF TECHNOLOGY, CUNY.
Lesson 4: Web Browsing.
“Real Simple Syndication” (RSS)
Unit 27 Web Server Scripting Extended Diploma in ICT
Building Web Applications
Lesson 4: Web Browsing.
Web Application Development Using PHP
Presentation transcript:

Dr. Mike Lowndes, Interactive Media Manager, Natural History Museum, London – Houses 350-permanent scientific staff, plus postgraduate students; one of the largest UK research institutes in the natural sciences. (Right-click or click-hold (Mac) and press k or select Speaker Notes) IWMW 2005: Who’s web is it anyway? Lies, Damn lies and Web Statistics

Contents Why bother? Issues with web logs Issues with analytic tools Browser tracking Comparison between approaches Known issues with browser tracking Nedstat input and findings from Newcastle University

Why bother? Web log analysis is currently the main method used to quantify web site usage for reporting. Results are used by the government as performance indicators for institutional websites. Not accurate or meaningful most of the time –no good for absolute measurement of usage. Can be used for: Trend analysis Content preferences ROI estimation Checking and fixing your site Understanding users behaviour Testing assumed pathways

Issues with server logs Dynamic IP –Many users using the same IP number over time. –Same user assigned many IP numbers over time. Proxies –Several or many users behind 1 IP number Caches (can be ‘in’ Proxies) –Commonly requested files cached closer to the users. – Can form the top hosts accessing sites. Robots and spiders –Few visits but lots of hits. –Analytic packages cannot keep up to date with all of them for exclusion. Syndication –RSS feeds generate huge logs, but are not ‘read’ by humans initially. –Click-through configuration. Reporting by analysis tools –Often weekly or monthly reports: realtime is very labour/server intensive –Reports often complex and techy.

Issues with log analysis tools Webtrends vs Summary.net 1. Natural History Museum –Summary SP (summary.net) Version 4.2.1, unregistered demo, default configuration 2. UKOLN (Bath) –WebTrends ( Version 5, default configuration Both tools were applied to the same log file Default configurations – not removing robots –Note: WebTrends documentation not clear on this point

Measurement discrepancies Summary SPWebtrends 7 Connections (hits)-+0.67% hits Page views (page hits)-+5.00% Visits (user sessions)-+0.07% Failed hits-+0.30% Average visit duration--30.0% (+250%) Browsers IE75%86% Netscape compatible2%4% Referrers Top Level DomainsUS UK AUSCAN NETHER CANAUS JAP

Comparison between tools Not a single measurement was identical. Most measurements were within 5% Visit duration measurement widely different, and can depend on configuration. Possible bug in WebTrends version 5. Page view measurements were quite different. Results broadly similar but direct comparisons, especially of Page Views, are not really justified.

Browser tracking Do they have fewer inaccuracies and distortions? Is it easier on the web team? Is it affordable? Does it give us more information / better information?

Browser tracking Requires code to be added to pages Uses an image, sourced from the tracking website. Also uses javascript and cookies for gathering extended and repeat-visit information Usually hosted services Provide near real-time tracking Few of the issues distorting logs affect these measurements (according to the blurb) Main players: Nedstat, Nielson/Netratings, WebSideStory

Comparison between tools Summary SP VS Nielson/Netratings Run on one section of a site over a month. ‘Visiting’ section of the Natural History Museum site – small but popular and easily tagged.

Results 1 – visits and visitors

Results 2 – pages viewed

Results 3 – country Depends on the quality of the geographical IP database, not the mode of tracking?

Conclusions regarding traditional Log analysis Assuming browser tracking is more accurate… We have fewer visit sessions than we thought, but more visitors –Fewer visits (sessions), possibly due to robot exclusion –More visitors (unique users), possibly due to the masking effect of proxies/caches and browser caches Visit duration is much shorter than thought –possibly due to robots/spiders and cache updating. Country information is roughly accurate so long as a geographical lookup is used. Activity of popular pages, which are often cached, will be underestimated

Browser tracking advantages Almost real-time analysis, incremental data. Better repeat user tracking and individual pathway analysis. Configurable, graphical reports for non-techies –Techie still needs to configure those reports however, as an understanding of web analytics is required Cut our monthly staff time down from 1.5 days to 1 hour Appear to be more accurate in describing the activity of real people, but we would like to see some independent research.

Issues with browser tracking Setup is not trivial: You need to add code to every page. –Multiple server / ownership issues. Does not always work (or get full user details) if Javascript is turned off or cookies disallowed. Does not work with text-only browsers. Unknown compatibility with PDAs, mobiles etc. Questions: Would we get different results with different hosted services? –ABCE: industry standards for measurement Cookies often deleted unless user is confident in the source? –This would affect the measurement of repeat visitors and behaviour Political issues: Issues with external hosting of institutional data Security of personal data issues with external hosting –E.g. measurements of student and staff use of a VLE.

Next steps Many private sector and public sector sites have already moved to browser tracking. About 6 National Museums are currently discussing hosted browser tracking. 5 Universities currently involved in a trial of NedStat.

Thank you