High-Throughput Crowdsourcing Mechanisms for Complex Tasks Guido Sautter, Klemens Böhm ViBRANT Virtual Biodiversity.

Slides:



Advertisements
Similar presentations
Números.
Advertisements

Panos Ipeirotis Stern School of Business
AGVISE Laboratories %Zone or Grid Samples – Northwood laboratory
PDAs Accept Context-Free Languages
© Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems Introduction.
AP Stat Day Days until AP Exam Standard Normal Curves Z-scores Assessing Normality.
EuroCondens SGB E.
Worksheets.
STATISTICS HYPOTHESES TEST (I)
STATISTICS INTERVAL ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
1 Impact of Decisions Made to Systems Engineering: Cost vs. Reliability System David A. Ekker Stella B. Bondi and Resit Unal November 4-5, 2008 HRA INCOSE.
David Burdett May 11, 2004 Package Binding for WS CDL.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
Create an Application Title 1Y - Youth Chapter 5.
CALENDAR.
Year 6 mental test 10 second questions
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
Chapter 7 Sampling and Sampling Distributions
The 5S numbers game..
Solve Multi-step Equations
A Fractional Order (Proportional and Derivative) Motion Controller Design for A Class of Second-order Systems Center for Self-Organizing Intelligent.
Numerical Analysis 1 EE, NCKU Tien-Hao Chang (Darby Chang)
The basics for simulations
Factoring Quadratics — ax² + bx + c Topic
EE, NCKU Tien-Hao Chang (Darby Chang)
On Comparing Classifiers : Pitfalls to Avoid and Recommended Approach
How To Use Google Forms to Create A Test Quick Easy Self-Graded!! Instant Reports.
ABC Technology Project
1 Generating Network Topologies That Obey Power LawsPalmer/Steffan Carnegie Mellon Generating Network Topologies That Obey Power Laws Christopher R. Palmer.
MM4A6c: Apply the law of sines and the law of cosines.
TCCI Barometer March “Establishing a reliable tool for monitoring the financial, business and social activity in the Prefecture of Thessaloniki”
Factors, Prime Numbers & Composite Numbers
Chapter 4 Inference About Process Quality
Statistics Review – Part I
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
Measurements and Their Uncertainty 3.1
Progressive Aerobic Cardiovascular Endurance Run
Adding Up In Chunks.
Chapter 10 Software Testing
MaK_Full ahead loaded 1 Alarm Page Directory (F11)
Chapter 5 Test Review Sections 5-1 through 5-4.
GG Consulting, LLC I-SUITE. Source: TEA SHARS Frequently asked questions 2.
2011 WINNISQUAM COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=1021.
Before Between After.
2011 FRANKLIN COMMUNITY SURVEY YOUTH RISK BEHAVIOR GRADES 9-12 STUDENTS=332.
ST/PRM3-EU | | © Robert Bosch GmbH reserves all rights even in the event of industrial property rights. We reserve all rights of disposal such as copying.
Addition 1’s to 20.
25 seconds left…...
1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)
Detecting Spam Zombies by Monitoring Outgoing Messages Zhenhai Duan Department of Computer Science Florida State University.
Controlling as a Management Function
Week 1.
DTU Informatics Introduction to Medical Image Analysis Rasmus R. Paulsen DTU Informatics TexPoint fonts.
We will resume in: 25 Minutes.
Static Equilibrium; Elasticity and Fracture
Numerical Analysis 1 EE, NCKU Tien-Hao Chang (Darby Chang)
Chapter 8 Estimation Understandable Statistics Ninth Edition
Resistência dos Materiais, 5ª ed.
Lial/Hungerford/Holcomb/Mullins: Mathematics with Applications 11e Finite Mathematics with Applications 11e Copyright ©2015 Pearson Education, Inc. All.
Testing Hypotheses About Proportions
January Structure of the book Section 1 (Ch 1 – 10) Basic concepts and techniques Section 2 (Ch 11 – 15): Inference for quantitative outcomes Section.
9. Two Functions of Two Random Variables
Introduction into Simulation Basic Simulation Modeling.
4/4/2015Slide 1 SOLVING THE PROBLEM A one-sample t-test of a population mean requires that the variable be quantitative. A one-sample test of a population.
1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)
Introduction Embedded Universal Tools and Online Features 2.
Schutzvermerk nach DIN 34 beachten 05/04/15 Seite 1 Training EPAM and CANopen Basic Solution: Password * * Level 1 Level 2 * Level 3 Password2 IP-Adr.
Presentation transcript:

High-Throughput Crowdsourcing Mechanisms for Complex Tasks Guido Sautter, Klemens Böhm ViBRANT Virtual Biodiversity

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 2 Crowdsourcing – What is this? Means to distribute work requiring human input …... to a large user community, often via the Internet

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 3 Crowdsourcing - Examples Real-world examples: –ReCAPTCHA: Double-keying words from images –Distributed Proofreaders: Proofreading OCR results –Search for Ken Fosset: Satellite image processing –... Scientific studies: –OntoGame: Ontology construction –GalaxyZoo: Classification of galaxies –Word sense disambiguation & other NLP tasks –Image labeling & classification –...

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 4 Crowdsourcing - Discussion Advantages –Less expensive than hiring full-time personnel –Faster data processing due to large workforce Challenges: –Contributing users not trained... –... hard to supervise... –... and possible dishonest  Requires specific means to ensure result quality

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 5 Overview Crowdsourcing Formal Model State of the Art Increasing Throughput Evaluation Summary & Outlook

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 6 Decisions and Tasks Decision D: single parameter to obtain –Choosing appropriate option O from available options Opts(D) –Examples: Galaxy class, image label, transcript of word image  Variation in complexity Task T = (D 1,..., D d ): list of decisions –Unit of work assigned to users –Decisions can be independent or connected –Examples: Structuring page image into blocks

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 7 States of Decisions & Tasks State of decision D: Option selected for D at some point –Original state: Option selected for D when entering system (may be pre-selected by AI algorithm or empty) –Input state of user U: Option user U selected for D –Result state: Actual state for D when leaving system –Correct state: Correct result for D when leaving system (what experts would agree on) Correspondingly for task T = (D 1,..., D d ): –List of respective state of decisions D 1,..., D d Result? Original StateInput State of user U x Result State Original State Multiple different users U 1,...,U u Aggregation of input states into result state

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 8 Errors Two types of errors: –Miss Errors: User makes or fails to correct an error (can always happen) –Add Errors: User falsifies correct decision (can only happen with algorithmic pre-decisions) Sources of errors: –Mistakes: Resulting from sloppiness or misjudgment (both miss and add errors) –Cheating: User does not bother to check thoughtfully, accepting everything as correct (generally miss errors) –Others (e.g. destructive malevolence): User falsifies on purpose (more theoretical, same properties as mistakes)

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 9 Overview Crowdsourcing Formal Model State of the Art Increasing Throughput Evaluation Summary & Outlook

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 10 SotA: r-Redundancy Principle: –Gather input from r users –Result: Most frequently selected option Against all sorts of errors Value of r mostly ballpark figure so far, around 5 to 15 Problem: sub-optimal throughput of tasks Studies so far mostly focused on user behavior Result? Original StateInput State of user U x Result State Original State Multiple different users U 1,...,U u Aggregation of input states into result state

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 11 SotA: reCAPTCHA Principle: –Test user with decision C system knows correct result for –Consider actual input only if input for C correct Mainly against cheating Problem: deviates working time from decisions to process Problem: only works with small tasks Result? Original StateInput State of user U x Result State Original State Multiple different users U 1,...,U u Aggregation of input states into result state

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 12 Overview Crowdsourcing Formal Model State of the Art Increasing Throughput Evaluation Summary & Outlook

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 13 v-Voting - Idea Observation: r-Redundancy gathers inputs after result is clear Idea for increasing throughput: –Stop gathering input as soon as result emerges Maintains expected result accuracy of r-Redundancy while increasing throughput Result? Original StateInput State of user U x Result State Original State Multiple different users U 1,...,U u Aggregation of input states into result state

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 14 v-Voting - Mechanism Principle: Incremental majority vote –Gather input from users until v users agree –Result: The agreed-upon option Possible: weighting of votes depending on user but not investigated here Result? Original StateInput State of user U x Result State Original State Multiple different users U 1,...,U u Aggregation of input states into result state

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 15 Vote Boosting - Idea Observation: not all users make errors with same probability Idea for increasing throughput: –Measure error rate of users in v-Voting –Increate weight of input from users who make few errors Circumvents error prevention of v-Voting but increases throughput significantly Result? Original StateInput State of user U x Result State Original State Multiple different users U 1,...,U u Aggregation of input states into result state

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 16 Vote Boosting - Mechanism Principle: If user U is first to make input on some task T –Take his input for correct with some boost probability... –... depending on how many errors U made in the past –Otherwise, fall back to v-Voting mechanism Multiple definitions of boost probability possible Here: based on statistical test (next slide) Result? Original StateInput State of user U x Result State Original State Multiple different users U 1,...,U u Aggregation of input states into result state

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 17 Boost Probability Two parameters: –C: minimum probability of correct result –m: maximum probability of undeserved vote boost To compute / estimate: P('user U makes no error') Observable: number of correct inputs from user U since last error  H(U) Boostability Hypothesis: "P('user U makes no error')  C" Approach: significance of accepting boostability hypothesis for user U based on H(U) observed correct inputs  s Boost probability BP(U) := m/s

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 18 Overview Crowdsourcing Formal Model State of the Art Increasing Throughput Evaluation Summary & Outlook

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 19 Simulation Setup Users: 9 populations of 1,000 each, different combinations of –Mean probability of making mistakes (1%, 4%, 15%) –Mean probability of cheating (1%, 4%, 15%) Tasks: 9 lists of 1,000,000 each, different combinations of –Options per decision (2, 3, 4) –Mean probability of correct initial states (80%, 90%, 95%) 46 methods of combining user inputs into results –1-Redundancy (Base Case without any error prevention) –r-Redundancy with r=3,5,7 –v-Voting with v=2,3,4... –... each v with 14 parameter combinations for Vote Boosting

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 20 r-Redundancy vs. v-Voting Aggregated over all user populations and task lists  v-Voting requires fewer user inputs per task  v-Voting leaves fewer errors  Increase in both throughput and result accuracy Base Case 3-Red. 2- Voting 5-Red. 3- Voting 7-Red. 4- Voting Remaining Error (in %) Inputs per Task

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 21 Cost of 99.5% Result Accuracy Inputs per task (accuracy actually achieved) v controls v-Voting, m and C control Vote Boosting  Cost of data quality strongly dependent on users  Cheating prevention compulsory  Strategy: –Start with conservative parameters –Periodically assess result quality and adjust parameters Cheating 15%4%1% MistakesMean Prob. of 1% 3.78 (99.55%) v=3 m= 2% C=98% 1.78 (99.63%) v=2 m=4% C=96% 1.14 (99.51%) v=2 m=8% C=92% 4% 4.48 (99.51%) v=4 m=4% C=96% 1.93 (99.51%) v=2 m=4% C=96% 1.42 (99.57%) v=2 m=4% C=96% 15% not achieved 5.38 (98.62%) v=4 m=0 4.6 (99.61%) v=4 m=2% C=98% 3.94 (99.65%) v=4 m=2% C=98%

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 22 Overview Crowdsourcing Formal Model State of the Art Increasing Throughput Evaluation Summary & Outlook

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 23 Summary Formal model of crowdsourcing systems –Tasks –Errors –System layout Better input aggregation improves results and throughput –v-Voting –Vote Boosting Cheating prevention compulsory for data quality

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 24 Outlook Vote Boosting: alternative definitions of boost probability Cheating prevention mechanisms Verify simulation results in real-world deployment

Guido Sautter KIT High-Throughput Crowdsourcing Mechanisms for Complex Tasks 25 Questions? Guido Sautter, Klemens Böhm: High-Throughput Crowdsourcing Mechanisms for Complex Tasks ViBRANT Virtual Biodiversity