A Study of Computational and Human Strategies in Revelation Games 1 Noam Peled, 2 Kobi Gal, 1 Sarit Kraus 1 Bar-Ilan university, Israel. 2 Ben-Gurion university,

Slides:



Advertisements
Similar presentations
Intelligent Tutoring System based on Belief networks Maomi Ueno Nagaoka University of Technology.
Advertisements

Empirical Studies in Computer-Mediated Interest-Based Negotiations Sohan Dsouza MSc IT, 2009 British University in Dubai.
Galit Haim, Ya'akov Gal, Sarit Kraus and Michele J. Gelfand A Cultural Sensitive Agent for Human-Computer Negotiation 1.
Distributed Advice-Seeking on an Evolving Social Network Dept Computer Science and Software Engineering The University of Melbourne - Australia Golriz.
Ultimatum Game Two players bargain (anonymously) to divide a fixed amount between them. P1 (proposer) offers a division of the “pie” P2 (responder) decides.
NEURAL NETWORKS Backpropagation Algorithm
Nash’s Theorem Theorem (Nash, 1951): Every finite game (finite number of players, finite number of pure strategies) has at least one mixed-strategy Nash.
1
Lecturer: Moni Naor Algorithmic Game Theory Uri Feige Robi Krauthgamer Moni Naor Lecture 8: Regret Minimization.
Games With No Pure Strategy Nash Equilibrium Player 2 Player
Game Theory and Computer Networks: a useful combination? Christos Samaras, COMNET Group, DUTH.
MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)
Preference Elicitation Partial-revelation VCG mechanism for Combinatorial Auctions and Eliciting Non-price Preferences in Combinatorial Auctions.
Short introduction to game theory 1. 2  Decision Theory = Probability theory + Utility Theory (deals with chance) (deals with outcomes)  Fundamental.
1 * Culture sensitive agent – takes the culture of the other agent into consideration when making decisions.
Adverse Selection Asymmetric information is feature of many markets
Active Learning and Collaborative Filtering
Minority Games A Complex Systems Project. Going to a concert… But which night to pick? Friday or Saturday? You want to go on the night with the least.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Review Midterm3/21 3/7.
Approximate and online multi-issue negotiation S.S. Fatima Loughborough University, UK M. Wooldridge N.R. Jennings University of.
Lecture 1 - Introduction 1.  Introduction to Game Theory  Basic Game Theory Examples  Strategic Games  More Game Theory Examples  Equilibrium  Mixed.
BEE3049 Behaviour, Decisions and Markets Miguel A. Fonseca.
Semantic text features from small world graphs Jure Leskovec, IJS + CMU John Shawe-Taylor, Southampton.
Decision Making as Constrained Optimization Specification of Objective Function  Decision Rule Identification of Constraints.
DANSS Colloquium By Prof. Danny Dolev Presented by Rica Gonen
Value of Information for Complex Economic Models Jeremy Oakley Department of Probability and Statistics, University of Sheffield. Paper available from.
Towards Automated Bargaining in Electronic Markets: a Partially Two-Sided Competition Model N. Gatti, A. Lazaric, M. Restelli {ngatti, lazaric,
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Review Midterm3/19 3/5.
Alternating-Offers Bargaining under One-Sided Uncertainty on Deadlines Francesco Di Giunta and Nicola Gatti Dipartimento di Elettronica e Informazione.
Sarit Kraus Bar-Ilan University
Sarit Kraus Bar-Ilan University
1
Efficient Model Selection for Support Vector Machines
1 General Opponent* Modeling for Improving Agent-Human Interaction Sarit Kraus Dept. of Computer Science Bar Ilan University AMEC May 2010.
NOBEL WP Szept Stockholm Game Theory in Inter-domain Routing LÓJA Krisztina - SZIGETI János - CINKLER Tibor BME TMIT Budapest,
Chapter 12 Choices Involving Strategy Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written.
Coalition Formation between Self-Interested Heterogeneous Actors Arlette van Wissen Bart Kamphorst Virginia DignumKobi Gal.
ECO290E: Game Theory Lecture 12 Static Games of Incomplete Information.
林偉楷 Taiwan Evolutionary Intelligence Laboratory.
Senior Project – Computer Science – 2015 Modelling Opponents in Board Games Julian Jocque Advisor – Prof. Rieffel Abstract Modelling opponents in a game.
Team Formation between Heterogeneous Actors Arlette van Wissen Virginia Dignum Kobi Gal Bart Kamphorst.
Cooperative Meeting Scheduling among Agents based on Multiple Negotiations Toramatsu SHINTANI and Takayuki ITO Department of Intelligence and Computer.
Bargaining Whoever offers to another a bargain of any kind, proposes to do this. Give me that which I want, and you shall have this which you want …; and.
Artificial Intelligence Methods Neural Networks Lecture 4 Rakesh K. Bissoondeeal Rakesh K. Bissoondeeal.
Partially Supervised Classification of Text Documents by Bing Liu, Philip Yu, and Xiaoli Li Presented by: Rick Knowles 7 April 2005.
1 CS 391L: Machine Learning: Experimental Evaluation Raymond J. Mooney University of Texas at Austin.
Experimental Economics NSF short course David Laibson August 11, 2005.
Regret Minimizing Equilibria of Games with Strict Type Uncertainty Stony Brook Conference on Game Theory Nathanaël Hyafil and Craig Boutilier Department.
Experimental evidence of the emergence of aesthetic rules in pure coordination games Federica Alberti (Uea) Creed/Cedex/Uea Meeting Experimental Economics.
Monte-Carlo methods for Computation and Optimization Spring 2015 Based on “N-Grams and the Last-Good-Reply Policy Applied in General Game Playing” (Mandy.
Modeling Reasoning in Strategic Situations Avi Pfeffer MURI Review Monday, December 17 th, 2007.
Lecture 5 Financial Incentives This lecture is paired with our previous one that discussed employee benefits. Here we focus on the reasons why monetary.
GAME PLAYING 1. There were two reasons that games appeared to be a good domain in which to explore machine intelligence: 1.They provide a structured task.
1 - CS7701 – Fall 2004 Review of: Detecting Network Intrusions via Sampling: A Game Theoretic Approach Paper by: – Murali Kodialam (Bell Labs) – T.V. Lakshman.
Modeling Agents’ Reasoning in Strategic Situations Avi Pfeffer Sevan Ficici Kobi Gal.
Game tree search Chapter 6 (6.1 to 6.3 and 6.6) cover games. 6.6 covers state of the art game players in particular. 6.5 covers games that involve uncertainty.
Machine Learning 5. Parametric Methods.
Will Britt and Bryan Silinski
UNSW | BUSINESS SCHOOL | SCHOOL OF ECONOMICS Calling the shots Experimental evidence on significant aversion to non-existing strategic risk Ben Greiner.
Generalized Point Based Value Iteration for Interactive POMDPs Prashant Doshi Dept. of Computer Science and AI Institute University of Georgia
By: Donté Howell Game Theory in Sports. What is Game Theory? It is a tool used to analyze strategic behavior and trying to maximize his/her payoff of.
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
Q 2.1 Nash Equilibrium Ben
Management support systems II
Human-Computer Negotiation in Three-Player Market Settings
Erasmus University Rotterdam
MSDM AAMAS-09 Two Level Recursive Reasoning by Humans Playing Sequential Fixed-Sum Games Authors: Adam Goodie, Prashant Doshi, Diana Young Depts.
Extensive-form games and how to solve them
When Security Games Go Green
Alternative-offer bargainging
Presentation transcript:

A Study of Computational and Human Strategies in Revelation Games 1 Noam Peled, 2 Kobi Gal, 1 Sarit Kraus 1 Bar-Ilan university, Israel. 2 Ben-Gurion university, Israel. 1

Outline ● What is this research about? ● An agent that negotiates with people in incomplete information settings. ● People can choose to reveal private information. ● Why is it difficult? ● Uncertainty over people’s preferences. ● People are affected by social/psychological issues. 2

Our approach ● Opponent modeling + machine learning. ● Results ● Agent outperformed people. ● Learned to exploit how people reveal information. ● People increased their performance when playing with agent (as compared to other people). 3

Why is it interesting? 4

Revelation games ● Incomplete information over preferences: ● Players’ types are unknown. ● Players can reaveal their type before negotiating for a finite number of rounds. ● Revelation is costless. ● No discount factor. 5

Our setting ● This is the minimal setting where each of the two players can make one proposal: 6 Revelation phase 1 st proposal Counter proposal

Roadmap ● Probabilistic model of people. ● The model explicity represent social features. ● We built an agent that uses this model for negotiation with people. ● Extensive empirical evaluation ● over 400 subjects from different countries! 7

Colored Trails ● Open source empirical test-bed for investigating decision making. ● Family of computer board games that involve negotiation over resources. ● Easy to design new games. ● Built in functionality for conducting experiments with people. ● Over 30 publications. 8

Revelation game in CT 9

Shortest path to the goal 10

Objective ● The objective is to maximize your score: ● Try to get as close as possible to the goal. ● Using as few chips as possible. ● End up with as many chips as you can. 11

If “me” chooses to reveal… 12

Example proposal of ‘me’ player ● Here, the ‘me’ player is signaling about it’s true goal location. 13

The other participant decides whether to accept the proposal If it accepts the proposal, the game will end If it rejects, it will be able to make a counter proposal 14

The end of the game The ‘me’ player was moved to its goal using one gray and one red chips The other participant was moved to his goal using 2 cyan chips 15

Agent design ● SIGAL: SIGmoid Acceptance Learning. ● Models people’s behavior using probability distributions: ● Making/accepting offers. ● Whether people reveal their goals. ● Maximizes its expected benefit given the model using backward induction. 16

SIGAL strategy: Round 2 ● As a responder: accepts any beneficial proposal in terms of score in the game. ● As a proposer: ● Calculates its expected benefit from each proposal: ● Choose offer that maximizes expected benefit. 17 Revelation phase 1 st proposal Counter proposal

SIGAL strategy: Round 1 ● As responder: accept any proposal that gives it more than the expected benefit from round 2. ● As a proposer: ● Estimate its benefit from the other player counter proposal. ● Calculates the expected benefit from each proposal. ● Choose the offer that maximizes expected benefit. 18 Revelation phase 1 st proposal Counter proposal

Maximum expected benefit 19

Revelation phase ● Use decision theory: SIGAL Calculates its expected benefit for both scenarios – revealing or not revealing. ● Picks the best option. 20 Revelation phase 1 st proposal Counter proposal

Modeling acceptance of offers ● Use logistic function (Camerer 2001) Acceptance probability Social utility function 21

Social utility function ● People are not fully rational – a proposal is not desirable only for its benefit. ● Weighted sum of social features. 22

Benefit feature ● Benefit = proposed offer score – no agreement score ● As example, the score of the ‘me’ player from the demo game was 135. ● Without reaching and agreement, its score is 30 ● Its benefit from the proposal is =105 23

Other social features ● Difference in benefit = proposer’s benefit from the offer – responder’s benefit from the offer ● Revelation decision. ● Previous round: ● The first proposal, if rejected, may affect the probability to accept the counter-proposal. 24

Learning ● We used a genetic algorithm to find the optimal weights for people’s social utility function. ● Use density estimation to learn how people make offers ● Cross validation (10-fold). ● Over-fitting removal: Stop learning in the minimum of the generalization error. ● Error calculation on held out test set. 25

Fit to data ● The percentages are averages over similar utility ranges (bins) 26

What did SIGAL learn? ● Which proposal to give in each round. ● Whether to accept proposals or not in each round. ● Whether to reveal its goal or not. 27

Empirical methodology ● Israeli CS students and users over the web. ● Subjects received an identical tutorial on revelation, and had to pass a test. ● Each participant played two different boards ● Compared SIGAL performance with people’s performance playing other people. 28

Performance 29

Why did SIGAL succeed? ● Learned to ask for more from people when they reveal their goal. ● Learned to make ‘fair’ proposals: ● People dislike proposals with high benefits difference in favor to the proposer. ● Learned to exploit generous people: ● If people propose a generous offer in the first round, they are more willing to accept the counter offer. 30

People also benefit from SIGAL! ● People playing with SIGAL got much more than people playing with people! 31

Web users: Amazon Turk ● Web based bulletin board for ‘Human Intelligence Tasks‘. ● Millions of ‘workers’ are exposed to your task. ● We got 140 ‘qualified’ participants in 8 hours! 32

Baseline equilibrium agent ● There are lots of equilibria in the game. ● We’ve developed an agent based on a pure equilibrium strategy. 33

Equilibrium agent didn't work 34

Related work ● Did not take a decision theoretic approach. ● Repeated negotiation (Oshrat et al. 2009). ● Bayesian techniques (Hindriks et al. 2008). ● Approximation heuristics (Jonker et al. 2007). ● Did not evaluate a computer agent ● Repeated one-shot take-it-or-leave-it games (Gal et al. 2007). 35

Conclusions ● Revelation games are a new setting to study how people reveal information in a negotiation. ● Using a simple model, an agent can learn to outperform people in revelation games. ● Behavioral studies can actually help agent design. ● Combining decision theory with learning is a good approach for agent-design. 36

Future work ● Extend the argumentation framework. ● More signaling and revelation possibilities. ● Develop a model which predict in which extant the private information should be revealed during the game. ● EEG: Does features in brain waves can improve the prediction model? 37