Copyright 2008 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute From Web 1.0 to Web 2.0 and Back – How did your Grandma Use to Tag? Sheila Kinsella, Adriana Budura, Gleb Skobeltsyn, Sebastian Michel, John G. Breslin, Karl Aberer
Digital Enterprise Research Institute From Web 1.0 – Anchortext… Users link to resources of interest from their webpages (implicit annotations) Limited to web authors
Digital Enterprise Research Institute …to Web Tagging Explicit annotations of resources Anyone can participate
Digital Enterprise Research Institute So has anything changed?
Digital Enterprise Research Institute Example results (1)
Digital Enterprise Research Institute Example results (2)
Digital Enterprise Research Institute Example results (3)
Digital Enterprise Research Institute Experimental study Datasets: Web: WEBSPAM-2007, 12M pages from.uk domain Del.icio.us: 2007 Crawl containing tags for 4.5M URLs Overlap between datasets: 192k URLs Goal: ? ≈
Digital Enterprise Research Institute Results
Digital Enterprise Research Institute Overlap between the tags Anchortags overlap with del.icio.us tags Two measures: how many anchortags are also among del.icio.us tags how many del.icio.us tags can be inferred from anchortext k=1k=2k=3k=4k=5
Digital Enterprise Research Institute User study Tags for 20 URLs: top-5 tags from del.icio.us top-5 anchortags 25 people score every tag: 0 (non-relevant), 1 (somewhat relevant), 2 (very relevant).
Digital Enterprise Research Institute But users don’t always agree Relevance of a tag is highly subjective Extent of AgreementFreqency All 3 scores agree35% All 3 scores almost agree50% All other cases15%
Digital Enterprise Research Institute Many new social tags
Digital Enterprise Research Institute Conclusions Substansial overlap between top tags derived from anchortext and del.icio.us tags Users rate anchortags at a similar quality to user- assigned tags – but this is subjective Del.icio.us still provides many new tags ≠ ≈