• Keine Ergebnisse gefunden

Nash Equilibria in Reactive Strategies

N/A
N/A
Protected

Academic year: 2022

Aktie "Nash Equilibria in Reactive Strategies"

Copied!
22
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Nash Equilibria in Reactive Strategies

Artem Baklanov

(2)

Young Scientists Summer Program (YSSP)

Annually from 1 June - 31 August in Laxenburg, Austria Deadline: 11 January 2017

(3)

Motivation

Why infinitely repeated games?

Why cooperation?

Why stability?

Why complexity?

(4)

Related works and inspiration

Arkady Kryazhimskiy (2014)

Equilibrium stochastic behaviours in repeated games, 2012.

Main scope: infinitely repeated

game of 2 players x N strategies.

Q: Existence of equilibrium for

(5)

How does a tiny change in complexity of strategies influence properties of

the Nash equilibrium?

Big Question

What would you guess?

(6)

Strategies and payoff function

Infinitely repeated 2x2 game.

Payoff defined as limit of averages.

Reactive strategies = stochastic strategies

defined only on the last opponents action.

(7)

Reactive strategies

1st player (rows)

✓ A

1

A

2

B

1

B

2

C

1

C

2

D

1

D

2

2nd player (columns)

p

1

q

1

=

P

(1st row | last opponent’s action = 1st column)

=

P

(1st row | last opponent’s action = 2nd column)

=

P

(1st column | last opponent’s action = 1st row)

=

P

(1st column | last opponent’s action = 2nd row)

p

2

q

2

(8)

Complexity of strategies

Increasing complexity in 2x2 repeated games

mixed strategies in [0,1]

reactive strategies in [0,1]x[0,1]

1-memory strategies in

[0,1]x[0,1]x[0,1]x[0,1]

(9)

Rigorously answered questions

Q1. What are all possible pairs of reactive strategies leading to an equilibrium?

Q2. What are all possible symmetric games

admitting equilibria? How common are these

games?

(10)

Partly answered questions

Q3. Are there new effects of interactions in equilibria caused by the increase of strategy complexity?

Q4. If we replace reactive strategies with 1- memory ones, then what properties of

equilibria are affected?

(11)

For fixed strategies we observe Markov chain with stationary distribution on 4 states of one-shot game

Payoff equivalence

✓ A1A2 B1B2 C1C2 D1D2

s

1

1 s1

1 s2

s

2

Ji = Ais1s2 + Bis1(1 s2) + Ci(1 s1)s2 + Di(1 s1)(1 s2) Payoffs are Identical to one-shot game with mixed strategies

s1 = q2(p1 q1) + q1

1 (p1 q1)(p2 q2) s2 = q1(p2 q2) + q2

1 (p1 q1)(p2 q2)

✓ A1A2 B1B2 C1C2 D1D2

s

1

1 s1

1 s2

s

2

1 2

3 4

(12)

Sets of strategies

➡No Tit For Tat

➡Noise proof

➡First round does not matter

0 < p i , q i < 1

(13)

Equilibria generated by SD

ai, bi, ci are defined by one-shot game 8>

><

>>

:

q1 = c2s1+bc2s2+2a2s1s2

2+a2s2 , p1 q1 = bc2+a2s1

2+a2s2 , q2 = b1s1+cc1s2+2a1s1s2

1+a1s1 , p2 q2 = bc1+a1s2

1+a1s1 , 0 a2(p1 q1), 0 a1(p2 q2),

0 < p1, q1, p2, q2 < 1.

(p1, q1) and (p2, q2)

is a Nash equilibrium

(s1, s2)

with the corresponding SD

if

Theorem

(14)

Examples: Prisoners Dilemma

1

s1 Any level of C is possible

1

s1 Red region - both payoffs are higher than mutual C

Blue region = Equilibrium

Stationary

(15)

No brain game

Game with Pareto efficient pure equilibria

s2 1

s1

Red region = players’

payoffs > 7

Blue region = {all ESD}

(16)

Discontinuous equilibrium regions

1 s1

1 u1

All symmetric Nash equilibria p1

(17)

Main properties

Existence of equilibrium in games without mixed Nash equilibrium.

Reactive Nash equilibria yield same or higher payoffs for both players than traditional mixed Nash.

Continuum of equilibria is typical.

(18)

Main properties

Existence of equilibrium in games with Pareto efficient dominant pure Nash (no brain games).

Non-symmetric equilibria in games with symmetric payoff matrix, symmetric ESD in games with non- symmetric payoff matrix.

(19)

Attainability sets and stationary distributions

All feasible stationary distribution for a fixed

opponent’s strategy

p1

q1 q2 p2 8s2, s1<

0.0 0.2 0.4 0.6 0.8 1.0 s2

0.2 0.4 0.6 0.8 1.0

s1

AS for 1 player = red line AS for 2 player = green line

(20)

Necessary and sufficient conditions

Mixed strategies

Mutual indifference

(21)

Comparison

Dutta,P.K. & Siconolfi,P. Presented work

For high discount factor there is a simple criterion for the existence of Nash equilibrium

(reverse dominance)

Even for symmetric games the corresponding criterion requires much more tedious calculations.

Reverse dominance is not necessary.

Simple lower and upper bounds for equilibrium payoffs

There exist equilibria leading to higher payoffs than the upper bound for 1-memory strategies

Chance to have an equilibrium equals to 1/3

Chance to have an equilibrium equals to 31/96 (1/96 less)

(22)

Comparison

Dutta,P.K. & Siconolfi,P. Presented work

Payoff relevant indeterminacy holds true

(continuum of distinct equilibrium payoffs)

There is no folk theorem

Referenzen

ÄHNLICHE DOKUMENTE

Custom made uniform attenuator provides 15 dB of Custom made uniform attenuator provides 15 dB of attenuation up to 8000 Hz.. It uses an element that attenuation up to

By setting the unconditional benchmark as the least symmetric equilibrium payoff profile in memory-less RSs, we demonstrate that for most classes of symmetric stage games,

A pure Nash equilibrium is a state in which each player is choosing such a best response..

Hence, applying improvement steps is not an efficient (i.e. polynomial time) algorithm for computing Nash equilibria in these games.. However, it is possible to find a global optimum

We study the Nash equilibrium in infinitely repeated bimatrix games where payoffs are determined by reactive strategies [1]; we consider limit of means payoffs.. Reactive strategies

The third reason to be mentioned here is the multifactorial dependency of preferences. Preferences can be the result of a combination of factors such as the topic of the

INEFFICIENCY OF NASH EQUILIBRIA BY. PRADEEP

Our main result is that in the Wardrop network routing setting of multicommodity flow and infinitesimal agents, the flows will approach equilibrium in the sense that a 1 − ε fraction