Winning concurrent reachability games requires doubly-exponential patience Michal Koucký IM AS CR, Prague Kristoffer Arnsfelt Hansen, Peter Bro Miltersen.

Presentation transcript:

Winning concurrent reachability games requires doubly-exponential patience Michal Koucký IM AS CR, Prague Kristoffer Arnsfelt Hansen, Peter Bro Miltersen Aarhus U., Denmark

2 Example
Player 1 chooses A ∈ {t,h}. Player 2 chooses B ∈ {t,h}.
If A = B, then move one level up.
If A ≠ B = t, then move to the 1st level.
If A ≠ B = h, then Player 1 loses.
Entrance fee: $15. Win: $20.

3 Entrance fee: $15. Win: $20.
Observation: To break even, you need to win with probability at least ¾.
Good news: you can win with probability arbitrarily close to 1.
Bad news: the expected time to win the game with probability at least ¾ is years (one move per day). … the age of the universe: years
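The ¾ threshold in the observation can be checked with a one-line expected-value computation. This is a sketch that assumes the $20 is the gross payout on a win and the $15 fee is paid in either case; p denotes the probability of winning.

```latex
\[
\mathbb{E}[\text{profit}] = 20\,p - 15 \;\ge\; 0
\quad\Longleftrightarrow\quad
p \;\ge\; \frac{15}{20} = \frac{3}{4}.
\]
```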

4 Concurrent reachability games [de Alfaro, Henzinger, Kupferman ’98, Everett ’57]
Two players play on a graph of states. At each step, each player simultaneously and independently picks one of their possible actions, and a transition table determines the next state from the current state and the two chosen actions.
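A minimal sketch of the model described on this slide, assuming a transition table stored as a dictionary keyed by (state, action of Player 1, action of Player 2). The names `play`, `sample`, and `TOY_GAME`, and the one-state toy game itself, are hypothetical illustrations and not taken from the talk.

```python
import random

def sample(dist, rng):
    """Draw one action from a dict mapping action -> probability."""
    actions = list(dist)
    weights = [dist[a] for a in actions]
    return rng.choices(actions, weights=weights, k=1)[0]

def play(transitions, start, goals, strategy1, strategy2, max_steps, rng):
    """Return True if a goal state is reached within max_steps moves.

    strategy1 and strategy2 are memory-less: state -> {action: probability}.
    A state with no entry in a strategy is treated as absorbing.
    """
    state = start
    for _ in range(max_steps):
        if state in goals:
            return True
        if state not in strategy1 or state not in strategy2:
            return False  # absorbing non-goal state
        a = sample(strategy1[state], rng)  # both players choose
        b = sample(strategy2[state], rng)  # simultaneously and independently
        state = transitions[(state, a, b)]
    return state in goals

# Hypothetical one-state game: matching actions reach the goal.
TOY_GAME = {
    ("s", "t", "t"): "goal",
    ("s", "h", "h"): "goal",
    ("s", "h", "t"): "s",
    ("s", "t", "h"): "lose",
}

rng = random.Random(0)
always_h = {"s": {"h": 1.0}}
always_t = {"s": {"t": 1.0}}
reached = play(TOY_GAME, "s", {"goal"}, always_h, always_h, 10, rng)
lost = play(TOY_GAME, "s", {"goal"}, always_t, always_h, 10, rng)
```

With these pure strategies the outcome does not depend on the random seed: `reached` is True and `lost` is False.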

5 Goals: Player 1 wants to reach a specific state or states. Player 2 wants to prevent Player 1 from reaching these states.
Strategy of a player:
Memory-less (non-adaptive) – π : states → actions.
Adaptive – π : history → actions.
Probabilistic strategy: π gives a probability distribution over the possible actions.
Patience of a memory-less strategy π = 1/(minimum non-zero probability in π) [Everett ’57]
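The definition of patience can be written out directly. The sketch below assumes a memory-less probabilistic strategy is stored as a dictionary state -> {action: probability}; the function name `patience` and the example strategy are illustrative only.

```python
from fractions import Fraction

def patience(strategy):
    """Return 1 / (smallest non-zero probability used anywhere in the strategy)."""
    nonzero = [p for dist in strategy.values() for p in dist.values() if p > 0]
    return 1 / min(nonzero)

# Hypothetical two-state strategy with exact rational probabilities.
example_strategy = {
    1: {"t": Fraction(1, 1000), "h": Fraction(999, 1000)},
    2: {"t": Fraction(1, 10), "h": Fraction(9, 10)},
}
example_patience = patience(example_strategy)  # Fraction(1000, 1)
```

Exact fractions are used here because the probabilities of interest in the talk become far too small for floating point.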

6 Winning starting states:
Sure – Player 1 has a winning strategy that never fails.
Almost-Sure – Player 1 has a randomized strategy that reaches the goal with probability 1.
Limit-Sure – For every ε > 0, Player 1 has a strategy that reaches the goal with probability at least 1 – ε.

7 Purgatory n
Player 1 chooses A ∈ {t,h}. Player 2 chooses B ∈ {t,h}.
If A = B, then move one level up.
If A ≠ B = t, then move to the 1st level.
If A ≠ B = h, then move to state H.
(Figure: the levels 1, …, n of Purgatory n and the state H.)
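To experiment with these rules, the sketch below computes exactly how well a fixed memory-less strategy of Player 1 does. It rests on assumptions not stated on the slide: moving up from level n counts as a win for Player 1, play starts at level 1, and it is enough to check the 2^n pure memory-less replies of Player 2. The function names are hypothetical.

```python
from fractions import Fraction
from itertools import product

def win_probability(p, b):
    """Probability that Player 1 eventually wins, starting from level 1.

    p[i] is the probability that Player 1 plays t at level i+1;
    b[i] is the fixed action ("t" or "h") of Player 2 at level i+1.
    """
    reach = Fraction(1)  # chance of arriving at the current level in one pass
    lose = Fraction(0)   # chance that one pass ends in state H
    for p_i, b_i in zip(p, b):
        if b_i == "t":
            up = p_i               # A = B = t moves up; A = h resets to level 1
        else:
            up = 1 - p_i           # A = B = h moves up; A = t leads to H
            lose += reach * p_i
        reach *= up
    win = reach                    # one pass climbs all n levels
    if win + lose == 0:
        return Fraction(0)         # play resets forever, the goal is never reached
    # Passes from level 1 are independent, so condition on the pass that ends the game.
    return win / (win + lose)

def guaranteed_win_probability(p):
    """Worst case over all pure memory-less strategies of Player 2."""
    return min(win_probability(p, b) for b in product("th", repeat=len(p)))

value_two_levels = guaranteed_win_probability([Fraction(1, 100), Fraction(1, 10)])
```

For this two-level example the worst reply is to play h everywhere, giving 891/1000. A strategy that never plays t at some level guarantees nothing, since Player 2 can answer t there and play never leaves the lower levels.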

8 Our results
Thm: 1) For every 0 < … > 1/ε^(2^(n-2)). 2) For every l < … > 2^(2^(n-l-2)).
Thm: For every 0 < … > 61 actions in total, both players have ε-optimal strategies with patience < 1/ε^(2^(42m)).

9 Thm: 1) For every 0 < … > t then the expected time to win the game by any ε’-optimal strategy of Player 1 can be forced to be Ω(t).
patience ~ expected time to win
All the results essentially hold also for adaptive strategies.
Recall: the expected time to win Purgatory 7 with probability at least ¾ is years (one move per day).

10 Algorithmic consequences
Three algorithmic questions:
1. What are the *-SURE states? PTIME [dAHK]
2. What are the winning probabilities of different states? PSPACE [EY]
3. What is the (ε-)optimal strategy? EXP-EXP-TIME upper bound [CdAH,…]; EXP-SPACE lower bound [our results]
Cor: Any algorithm that manipulates winning strategies in explicit representation must use exponential space.
Explicit representation: integer fractions.

11 Purgatory n
p_i – probability of playing t in state i in an ε-optimal strategy of Player 1.
Claim:
1) 0 < p_i < 1, for all i.
2) p_i < ε, for all i.
3) p_1 ≤ p_2 · p_3 ⋯ p_n
4) p_i ≤ p_{i+1} · p_{i+2} ⋯ p_n
Transition table (rows: action of Player 1, columns: action of Player 2):
1\2    t          h
t      level+1    loss
h      level=1    level+1
(Figure: the probabilities p_1, …, p_n along the levels, for the cases "Player 2 plays h" and "Player 2 plays t".)
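The claims chain together into a doubly-exponential bound. The derivation below is a sketch of that step, not the authors' proof; it assumes that claim 2) reads p_i < ε, that n ≥ 2, and it uses claim 1) to multiply the strict inequalities of positive numbers.

```latex
\begin{align*}
p_n &< \varepsilon, \\
p_{n-1} &\le p_n < \varepsilon, \\
p_{n-2} &\le p_{n-1}\, p_n < \varepsilon^{2}, \\
p_{n-3} &\le p_{n-2}\, p_{n-1}\, p_n < \varepsilon^{4}, \\
&\;\;\vdots \\
p_{n-k} &\le \prod_{j=n-k+1}^{n} p_j
         < \varepsilon^{\,1 + \sum_{j=1}^{k-1} 2^{j-1}}
         = \varepsilon^{2^{k-1}} \qquad (k \ge 1), \\
p_1 &< \varepsilon^{2^{n-2}}
\quad\Longrightarrow\quad
\text{patience} \;\ge\; \frac{1}{p_1} \;>\; \left(\frac{1}{\varepsilon}\right)^{2^{n-2}}.
\end{align*}
```

Each level squares the bound of the level above it, which is where the exponent 2^(n-2) comes from.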

12 Open problems
Generic algorithm for an ε-optimal strategy with symbolic representation?
How to redefine the game to be more realistic?

13 Goals: Player 1 wants to reach a specific state or states. Player 2 wants to prevent Player 1 from reaching these states.
Winning starting states:
Sure – Player 1 has a winning strategy that never fails.
Almost-Sure – Player 1 has a randomized strategy that reaches the goal with probability 1.
Limit-Sure – For every ε > 0, Player 1 has a strategy that reaches the goal with probability at least 1 – ε.