Wisdom of Crowds and Rank Aggregation Wisdom of crowds phenomenon: aggregating over individuals in a group often leads to an estimate that is better than.

Slides:



Advertisements
Similar presentations
Chapter 23: Inferences About Means
Advertisements

Chapter 6 Sampling and Sampling Distributions
LECTURE 11: BAYESIAN PARAMETER ESTIMATION
Wisdom of Crowds in Human Memory: Reconstructing Events by Aggregating Memories across Individuals Mark Steyvers Department of Cognitive Sciences University.
Hypothesis testing Week 10 Lecture 2.
Distribution of Tau Distances Assessing Episodic and Semantic Contributions in Serial Recall Pernille Hemmer, Brent Miller & Mark Steyvers University of.
Chapter 8 Estimating Single Population Parameters
Wisdom of Crowds in Human Memory: Reconstructing Events by Aggregating Memories across Individuals Mark Steyvers Department of Cognitive Sciences University.
Using Informative Priors to Enhance Wisdom in Small Crowds.
Matching Experiment Class Results. Experiment Analyzed 114 subjects after removal of subjects who completed fewer than 4 problems 8 problems 2.
The Wisdom of Crowds in the Aggregation of Rankings Mark Steyvers Department of Cognitive Sciences University of California, Irvine Joint work with: Michael.
Wisdom of Crowds and Rank Aggregation Mark Steyvers Department of Cognitive Sciences University of California, Irvine Joint work with: Brent Miller, Pernille.
Introduction  Bayesian methods are becoming very important in the cognitive sciences  Bayesian statistics is a framework for doing inference, in a principled.
Chapter 11 Problems of Estimation
Chapter 7 Sampling and Sampling Distributions
Item Response Theory. Shortcomings of Classical True Score Model Sample dependence Limitation to the specific test situation. Dependence on the parallel.
Chapter Topics Confidence Interval Estimation for the Mean (s Known)
Part III: Inference Topic 6 Sampling and Sampling Distributions
Chapter 7 Probability and Samples: The Distribution of Sample Means
Standard error of estimate & Confidence interval.
Chapter 9 Comparing Means
The Role of Prior Knowledge in Human Reconstructive Memory Mark Steyvers Pernille Hemmer University of California, Irvine 1.
Chapter 9 Statistical Data Analysis
Hierarchical Bayesian Models for Aggregating Retrieved Memories across Individuals Mark Steyvers Department of Cognitive Sciences University of California,
The Wisdom of Crowds in the Aggregation of Rankings Mark Steyvers Department of Cognitive Sciences University of California, Irvine Joint work with: Michael.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 11 th Edition.
Albert Morlan Caitrin Carroll Savannah Andrews Richard Saney.
Confidence Interval Estimation
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
Chapter 8 Introduction to Inference Target Goal: I can calculate the confidence interval for a population Estimating with Confidence 8.1a h.w: pg 481:
PARAMETRIC STATISTICAL INFERENCE
Wisdom of Crowds and Rank Aggregation Mark Steyvers Department of Cognitive Sciences University of California, Irvine Joint work with: Brent Miller, Pernille.
Wisdom of Crowds in Human Memory: Reconstructing Events by Aggregating Memories across Individuals Mark Steyvers Department of Cognitive Sciences University.
AP Statistics Section 7.2 C Rules for Means & Variances.
IRT Model Misspecification and Metric Consequences Sora Lee Sien Deng Daniel Bolt Dept of Educational Psychology University of Wisconsin, Madison.
Module 14: Confidence Intervals This module explores the development and interpretation of confidence intervals, with a focus on confidence intervals.
ANOVA Assumptions 1.Normality (sampling distribution of the mean) 2.Homogeneity of Variance 3.Independence of Observations - reason for random assignment.
 The point estimators of population parameters ( and in our case) are random variables and they follow a normal distribution. Their expected values are.
The Wisdom of Crowds in the Aggregation of Rankings Mark Steyvers Department of Cognitive Sciences University of California, Irvine Joint work with: Michael.
1 Differences Between Observed and Latent Confidence in Rank Ordering Brent J. Miller Mark Steyvers University of California, Irvine.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Adjudicator Agreement and System Rankings for Person Name Search Mark Arehart, Chris Wolf, Keith Miller The MITRE Corporation {marehart, cwolf,
Latent regression models. Where does the probability come from? Why isn’t the model deterministic. Each item tests something unique – We are interested.
Chapter 9: One- and Two-Sample Estimation Problems: 9.1 Introduction: · Suppose we have a population with some unknown parameter(s). Example: Normal( ,
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
INFERENCE Farrokh Alemi Ph.D.. Point Estimates Point Estimates Vary.
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
G. Cowan Lectures on Statistical Data Analysis Lecture 9 page 1 Statistical Data Analysis: Lecture 9 1Probability, Bayes’ theorem 2Random variables and.
Two Approaches to Estimation of Classification Accuracy Rate Under Item Response Theory Quinn N. Lathrop and Ying Cheng Assistant Professor Ph.D., University.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
Quantitative methods and R – (2) LING115 December 2, 2009.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate its.
Chapter 6 Sampling and Sampling Distributions
Pairwise comparisons: Confidence intervals Multiple comparisons Marina Bogomolov and Gili Baumer.
Chapter 8 Confidence Interval Estimation. 8.1 Confidence Interval Estimation of the Mean This section deals with the case of known σ. There are two kinds.
A Bayesian approach to recommender systems
Inference: Conclusion with Confidence
The Wisdom of Crowds in the Aggregation of Rankings
Dani Navarro, Kristy Martire, Kaye Ballantyne, Bethany Growns
Mark Steyvers Department of Cognitive Sciences
Using Informative Priors to Enhance Wisdom in Small Crowds
Effect of Prior Knowledge and Wisdom of Crowds
The wisdom of crowds A/Prof Danielle Navarro compcogscisydney.org.
Statistics for the Social Sciences
Chapter 9: One- and Two-Sample Estimation Problems:
CS639: Data Management for Data Science
Introduction to Inference
Matching Experiment Class Results.
Presentation transcript:

Wisdom of Crowds and Rank Aggregation Wisdom of crowds phenomenon: aggregating over individuals in a group often leads to an estimate that is better than any of the individual estimates (e.g. Surowiecki, 2004) Goal: apply this idea to human ordering / ranking data: how can we aggregate the recollected orderings across individuals to best approximate some underlying ground truth? Approach: develop unsupervised Bayesian models for rank aggregation that take individual differences into account Experiment to collect human ordering data We tested 78 individuals on their ability to reconstruct from memory the order of items in 17 different tasks Example tasks: order of US presidents, the order of countries by landmass, the order of the ten commandments and the ten amendments. Performance was measured using Kendall’s Tau: The number of adjacent pair-wise swaps between recalled and true order. The Wisdom of Crowds in the Recollection of Order Information Mark Steyvers, Michael Lee, Brent Miller & Pernille Hemmer University of California, Irvine More information about our lab: do the experiments yourself at Example Raw Data Thurstonian Model v2: allowing Partial Knowledge Assumption: each individual has a unique variance (same for all items) but shares the same set of item means with the group. This model can represent varying degrees of “expertise” = 1 = 1+1 Ordering by Individual ABECD True Order ABCDE C D E ABAB ABCDEABCDE = 2 D A B C A B D C B A D CA C B D A D B C Generative Model ? ? latent ground truth Incorporate individual differences A. George Washington B. James Madison C. Andrew Jackson j individuals Strong wisdom of crowds effect across tasks Conclusion Using unsupervised Bayesian models for rank data, we can aggregate orderings across individuals such that aggregated ordering better approximates ground truth than any individual in the crowd: strong wisdom of crowd effect It is important to incorporate individual differences – some individuals are more expert than others. Models can estimate expertise levels in unsupervised fashion – individuals near consensus orderings are likely to be more expert (if individuals performed task independently) Mallows Model Distance-based model that assumes that observed orderings that are close to the group ordering are more likely than those far away. The probability of any observed order, given the group order is: Two-state model: an individual either produces an ordering according to a Mallows model (z=1) or a guessing process (z=0). We estimate the latent assignment z for each individual. This approach is related to Klementiev, Roth et al Thurstonian Model (v1) Items are represented by coordinates on interval scale. Normal distributions represent uncertainty about item position – to order items, each individual draws one sample from each normal distribution and orders the items according to the samples. Means and standard deviations are shared among all individuals Individual differences: each individual is in one of two states: the Thurstonian state (z=1) and a guessing state (z=0) where there are no differences between items 78 individuals Thurstonian state (z=1) Guessing state (z=0) ordering by individual group ordering scaling parameter normalization constant Kendall tau distance function Experiment with 26 individuals ordering all 44 US presidents Mean Kendall tau averaged over all 17 tasks True ordering