Milan Vojnovi ć MSRC, Systems and Networking Tagging done by YOU.

Slides:



Advertisements
Similar presentations
Numbers Treasure Hunt Following each question, click on the answer. If correct, the next page will load with a graphic first – these can be used to check.
Advertisements

EcoTherm Plus WGB-K 20 E 4,5 – 20 kW.
Números.
1 A B C
AGVISE Laboratories %Zone or Grid Samples – Northwood laboratory
Trend for Precision Soil Testing % Zone or Grid Samples Tested compared to Total Samples.
Variations of the Turing Machine
AP STUDY SESSION 2.
1
EuroCondens SGB E.
Select from the most commonly used minutes below.
Multiplication X 1 1 x 1 = 1 2 x 1 = 2 3 x 1 = 3 4 x 1 = 4 5 x 1 = 5 6 x 1 = 6 7 x 1 = 7 8 x 1 = 8 9 x 1 = 9 10 x 1 = x 1 = x 1 = 12 X 2 1.
Slide 1 FastFacts Feature Presentation October 16 th, 2008 We are using audio during this session, so please dial in to our conference line… Phone number:
ASTM Member Website Tools Jeff Adkins Diane Trinsey 1 September 2012 Officers Training Workshop.
David Burdett May 11, 2004 Package Binding for WS CDL.
Create an Application Title 1Y - Youth Chapter 5.
Add Governors Discretionary (1G) Grants Chapter 6.
CALENDAR.
CHAPTER 18 The Ankle and Lower Leg
Around the World AdditionSubtraction MultiplicationDivision AdditionSubtraction MultiplicationDivision.
1 Click here to End Presentation Software: Installation and Updates Internet Download CD release NACIS Updates.
The 5S numbers game..
突破信息检索壁垒 -SciFinder Scholar 介绍
A Fractional Order (Proportional and Derivative) Motion Controller Design for A Class of Second-order Systems Center for Self-Organizing Intelligent.
Numerical Analysis 1 EE, NCKU Tien-Hao Chang (Darby Chang)
Break Time Remaining 10:00.
The basics for simulations
EE, NCKU Tien-Hao Chang (Darby Chang)
Table 12.1: Cash Flows to a Cash and Carry Trading Strategy.
PP Test Review Sections 6-1 to 6-6
MM4A6c: Apply the law of sines and the law of cosines.
Operating Systems Operating Systems - Winter 2010 Chapter 3 – Input/Output Vrije Universiteit Amsterdam.
Exarte Bezoek aan de Mediacampus Bachelor in de grafische en digitale media April 2014.
Dynamic Access Control the file server, reimagined Presented by Mark on twitter 1 contents copyright 2013 Mark Minasi.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
Progressive Aerobic Cardiovascular Endurance Run
Adding Up In Chunks.
FAFSA on the Web Preview Presentation December 2013.
SLP – Endless Possibilities What can SLP do for your school? Everything you need to know about SLP – past, present and future.
MaK_Full ahead loaded 1 Alarm Page Directory (F11)
Facebook Pages 101: Your Organization’s Foothold on the Social Web A Volunteer Leader Webinar Sponsored by CACO December 1, 2010 Andrew Gossen, Senior.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt Synthetic.
Artificial Intelligence
When you see… Find the zeros You think….
2011 WINNISQUAM COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=1021.
Before Between After.
2011 FRANKLIN COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=332.
Subtraction: Adding UP
: 3 00.
5 minutes.
1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)
1 hi at no doifpi me be go we of at be do go hi if me no of pi we Inorder Traversal Inorder traversal. n Visit the left subtree. n Visit the node. n Visit.
Speak Up for Safety Dr. Susan Strauss Harassment & Bullying Consultant November 9, 2012.
1 Titre de la diapositive SDMO Industries – Training Département MICS KERYS 09- MICS KERYS – WEBSITE.
Static Equilibrium; Elasticity and Fracture
Converting a Fraction to %
Numerical Analysis 1 EE, NCKU Tien-Hao Chang (Darby Chang)
Clock will move after 1 minute
& dding ubtracting ractions.
Physics for Scientists & Engineers, 3rd Edition
Select a time to count down from the clock above
Copyright Tim Morris/St Stephen's School
1.step PMIT start + initial project data input Concept Concept.
Patient Survey Results 2013 Nicki Mott. Patient Survey 2013 Patient Survey conducted by IPOS Mori by posting questionnaires to random patients in the.
A Data Warehouse Mining Tool Stephen Turner Chris Frala
1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)
Introduction Embedded Universal Tools and Online Features 2.
Schutzvermerk nach DIN 34 beachten 05/04/15 Seite 1 Training EPAM and CANopen Basic Solution: Password * * Level 1 Level 2 * Level 3 Password2 IP-Adr.
Presentation transcript:

Milan Vojnovi ć MSRC, Systems and Networking Tagging done by YOU

Thanks TagBooster project Dinan Gunawardena James Cruise (U Cambridge) Peter Marbach (U Toronto) Fabian Suchanek (MPI) Product groups O14 Sharepoint Communities Tagspace Officelabs MSR Tagging Summit TagBooster User Study Nick Duffield John Mulgrew Andy Slowey This talk Abi Alex Chris Peter Stephen

Social tagging in web2.0

Why tag?

Tagging: what and why Tag suggestions Conclusion

In this talk, well find relations among the following f g x y

Discover, filter, share

Faceted browsing BBC news Michael Palin BBC radio BBC shop bbc BBC news Michael Palin BBC radio BBC shop bbc palin

Tagging vs. traditional classification Traditional classification – Pre-defined vocabulary – Structured – Done by authors/librarians – Non trivial task Social tagging – Use any words – No structure – Done by anyone – Easy

Systems with controlled vocabulary

Social tagging challenges Vocabulary evolution – Filtering tags, tag suggestions, tagging metaphors – Uncontrolled vocabulary: scalable, mitigate vocabulary problem, but tag noise User interface design – Tagcloud, tag clustering Cold start – Lack of prior knowledge about tags for an object – Participation incentive Scale – More tagging events, easier filtering Making use of tags – Related tags for navigation, expertise tracking, tag meta-data for search, scoped rankings of items, faceted browsing

TagBooster User Study Sept-Oct participants Tagging web pages Questionnaire

Tagging done by YOU

Analogous to voting music soul london music soul jazz london black artist british singer Feedback !

Positive Negative Why suggest tags? Hiding users true preference over tags I picked a suggested tag that now I cant remember I tend to overuse same tags all over again exploit vs. explore Less effort (cognitive, typing) Encourage users to use tags (cold start) Conformance in vocabulary

Top Popular: classical suggestion method # sel.tag 174music 110radio 96internet radio 77online radio 49last.fm 40online music 34fm 33streaming music 31streaming 28last fm 22web radio 19scrobbling 18lastfm 12listen 12new music 10mp3 10stream 9streaming radio Suggested tags: radio, music, online radio, internet radio

Users generation of tags singer music jazz Black british rehab London soul Set of all tags artist singer music jazz british singer soul Suggested tags artist

Simple user model music jazz Black british rehab London soul music jazz british singer soul Suggested tags Set of all tags singer artist singer 1-p p imitationnon imitation riri i riri i

Users tag selection affected by tag suggestions Conditional on that the tag was suggested Unconditional Frequency of tag selection tag: apollo

The imitation rate portion of tag selections not in S; suggestions not made portion of tag selections not in the suggestion set S Boes estimate:

Sel.Tag 174music 110radio 96internet radio 77online radio 49last.fm 40online music 34fm 33streaming music 31streaming 28last fm 22web radio 19scrobbling 18lastfm 12listen 12new music 10mp3 10stream 9streaming radio Move-to-Set: simple randomised rule Suggested tags: last.fm, music, online radio, web radio

? Sufficient for Under the user model, for any imitation probability p < 1, the long run frequency of tag selections induces the true popularity ranking Correctness of popularity order

Simple update rule Converges to sampling the suggestion set proportional to the product of true rank scores Suggested tags: last.fm, music, online radio, web radio Suggested tags: last.fm, music, radio, web radio radio Suggested tags: last.fm, music, online radio, web radio Same as show most recent item for suggestion set size 1

Analogous to exclusion process j i rjrj rjrj

Frequency Move-to-Set RankTag 174music 110radio 96internet radio 77online radio 49last.fm 40online music 34fm 33streaming music 31streaming 28last fm 22web radio 19scrobbling 18lastfm 12listen 12new music 10mp3 10stream 9streaming radio Suggested tags: radio, music, online radio, internet radio radio last.fm Rank(radio) remains unchanged (radio suggested) Rank(last.fm) ++ (last.fm NOT suggested)

Only sufficiently popular tags eventually suggested frequency of suggesting tag i competing set suggestion set size harmonic mean of r 1,..., r |C| Tag i in the competing set iff:

Suggestion methods in action Tag rank i Frequency of tag suggestion TOP FMTS MTS NONE Tag rank i Norm. frequency of tag selection

Suggestion methods in action (contd) TOP FMTS MTS NONE

How did users appreciate the suggested tags? Web pageMethodThey were confusing They were OK, but not very relevant They were generally helpful engadgetTOP35.00%15.00%50.00% FMTS25.93% 48.15% MTS22.22%25.93%51.85% lastfmTOP22.22%55.56%22.22% FMTS25.00%32.14%42.86% MTS27.59%24.14%48.28% startupTOP39.13%21.74%39.13% FMTS50.00%29.17%20.83% MTS30.30%24.24%45.46% mitTOP21.74%30.44%47.83% FMTS23.08% 53.85% MTS24.24%18.18%57.68%

Why did I select these tags? Tags: gadgets technology engadget blog 2 1 I thought these are keywords that I would likely use later to find this item I thought these are categories that best describe the object else

Why did I select these tags? (contd) YOU find, search, describe, categorise, identify, remember, organise, classify wikipedia tag (meta data) definition describing the item keyword-based classification search

Why did I select these tags (contd)? Semantic analysis of tags, search and content keywords – May 2007 popular Web searches + delicious tags Tags similar to categories Small overlap with search keywords

Summary Social tagging poses interesting research challenges – Space for innovation A mix of control theory, user behaviour, information retrieval, interface design Aim at best design of tagging systems to support particular users tasks

Sample of research challenges User model? Rate of convergence Asymptotically accurate algorithms Select from the list only (e.g. remote controller/mobile device) What does it mean a tag is relevant? Make suggestions to improve users task (e.g. search, faceted browsing)? Beyond popularity ranking: Ranking across multiple lists Faces project ongoing work Tag to attract

Familiarity with tagging domainUsers microsoft.com44% hotmail.com15% gmail.com11% other30% Tagging frequencyUsers daily15% weekly25% monthly17% less frequently40% still used infrequently by many