• Keine Ergebnisse gefunden

Topics in Algorithmic Game Theory and Economics

N/A
N/A
Protected

Academic year: 2021

Aktie "Topics in Algorithmic Game Theory and Economics"

Copied!
214
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Topics in Algorithmic Game Theory and Economics

Pieter Kleer

Max Planck Institute for Informatics Saarland Informatics Campus

December 16, 2020

Lecture 6

Finite games III - Computation of CE and CCE

(2)

Hierarchy of equilibrium concepts

Finite (cost minimization) gameΓ = (N,(Si)i∈N,(Ci)i∈N)consists of:

Finite setN ofplayers.

Finitestrategy setSi for every playeri ∈N. Cost functionCijSj →Rfor everyi∈N.

PNE

Exists in any congestion game MNE

Exists in any finite game, but hard to compute CE

Computationally tractable CCE

(3)

Hierarchy of equilibrium concepts

Finite (cost minimization) gameΓ = (N,(Si)i∈N,(Ci)i∈N)consists of:

Finite setN ofplayers.

Finitestrategy setSi for every playeri ∈N. Cost functionCijSj →Rfor everyi∈N.

PNE

Exists in any congestion game MNE

Exists in any finite game, but hard to compute CE

Computationally tractable CCE

(4)

Hierarchy of equilibrium concepts

Finite (cost minimization) gameΓ = (N,(Si)i∈N,(Ci)i∈N)consists of:

Finite setN ofplayers.

Finitestrategy setSifor every playeri ∈N.

Cost functionCijSj →Rfor everyi∈N.

PNE

Exists in any congestion game MNE

Exists in any finite game, but hard to compute CE

Computationally tractable CCE

(5)

Hierarchy of equilibrium concepts

Finite (cost minimization) gameΓ = (N,(Si)i∈N,(Ci)i∈N)consists of:

Finite setN ofplayers.

Finitestrategy setSifor every playeri ∈N.

Cost functionCijSj →Rfor everyi∈N.

PNE

Exists in any congestion game MNE

Exists in any finite game, but hard to compute CE

Computationally tractable CCE

(6)

Hierarchy of equilibrium concepts

Finite (cost minimization) gameΓ = (N,(Si)i∈N,(Ci)i∈N)consists of:

Finite setN ofplayers.

Finitestrategy setSifor every playeri ∈N.

Cost functionCijSj →Rfor everyi∈N.

MNE CE

Computationally tractable CCE

(7)

Two-player games with mixed strategies (recap)

Two-player game(A,B)given by matricesA,B ∈Rm×n.

Row player Alice and column player Bobindependentlychoose strategyx ∈∆Aandy ∈∆B.

Givesproduct distributionσx,y :SA× SB →[0,1]over strategy profiles: σx,y(ak,b`) =σk` =xky` fork =1, . . . ,mand`=1, . . . ,n.

Example

Distribution over strategy profiles is given by x1y1 x1y2 x1y3 x2y1 x2y2 x2y3

b1 b2 b3

a1 (0,2) (1,0) (2,1) a2 (3,0) (0,1) (1,4)

Thenexpected cost(for Alice)CAx,y)is xTAy =E(ak,b`)∼σx,y[CA(ak,b`)] = X

(ak,b`)∈SA×SB

σk`CA(ak,b`)

Remember thatAk`=CA(ak,b`).

(8)

Two-player games with mixed strategies (recap)

Two-player game(A,B)given by matricesA,B ∈Rm×n.

Row player Alice and column player Bobindependentlychoose strategyx ∈∆Aandy ∈∆B.

Givesproduct distributionσx,y :SA× SB →[0,1]over strategy profiles:

σx,y(ak,b`) =σk` =xky` fork =1, . . . ,mand`=1, . . . ,n. Example

Distribution over strategy profiles is given by x1y1 x1y2 x1y3 x2y1 x2y2 x2y3

b1 b2 b3

a1 (0,2) (1,0) (2,1) a2 (3,0) (0,1) (1,4)

Thenexpected cost(for Alice)CAx,y)is xTAy =E(ak,b`)∼σx,y[CA(ak,b`)] = X

(ak,b`)∈SA×SB

σk`CA(ak,b`)

Remember thatAk`=CA(ak,b`).

(9)

Two-player games with mixed strategies (recap)

Two-player game(A,B)given by matricesA,B ∈Rm×n.

Row player Alice and column player Bobindependentlychoose strategyx ∈∆Aandy ∈∆B.

Givesproduct distributionσx,y :SA× SB →[0,1]over strategy profiles:

σx,y(ak,b`) =σk` =xky` fork =1, . . . ,mand`=1, . . . ,n.

Example

Distribution over strategy profiles is given by x1y1 x1y2 x1y3 x2y1 x2y2 x2y3

b1 b2 b3

a1 (0,2) (1,0) (2,1) a2 (3,0) (0,1) (1,4)

Thenexpected cost(for Alice)CAx,y)is xTAy =E(ak,b`)∼σx,y[CA(ak,b`)] = X

(ak,b`)∈SA×SB

σk`CA(ak,b`)

Remember thatAk`=CA(ak,b`).

(10)

Two-player games with mixed strategies (recap)

Two-player game(A,B)given by matricesA,B ∈Rm×n.

Row player Alice and column player Bobindependentlychoose strategyx ∈∆Aandy ∈∆B.

Givesproduct distributionσx,y :SA× SB →[0,1]over strategy profiles:

σx,y(ak,b`) =σk` =xky` fork =1, . . . ,mand`=1, . . . ,n.

Example

Distribution over strategy profiles is given by x1y1 x1y2 x1y3 x2y1 x2y2 x2y3

b1 b2 b3

a1 (0,2) (1,0) (2,1) a2 (3,0) (0,1) (1,4)

Thenexpected cost(for Alice)CAx,y)is xTAy =E(ak,b`)∼σx,y[CA(ak,b`)] = X

(ak,b`)∈SA×SB

σk`CA(ak,b`)

Remember thatAk`=CA(ak,b`).

(11)

Two-player games with mixed strategies (recap)

Two-player game(A,B)given by matricesA,B ∈Rm×n.

Row player Alice and column player Bobindependentlychoose strategyx ∈∆Aandy ∈∆B.

Givesproduct distributionσx,y :SA× SB →[0,1]over strategy profiles:

σx,y(ak,b`) =σk` =xky` fork =1, . . . ,mand`=1, . . . ,n.

Example

Distribution over strategy profiles is given by x1y1 x1y2 x1y3 x2y1 x2y2 x2y3

b1 b2 b3

a1 (0,2) (1,0) (2,1) a2 (3,0) (0,1) (1,4)

Thenexpected cost(for Alice)CAx,y)is

xTAy =E(ak,b`)∼σx,y[CA(ak,b`)] = X

(ak,b`)∈SA×SB

σk`CA(ak,b`)

Remember thatAk`=CA(ak,b`).

(12)

Two-player games with mixed strategies (recap)

Two-player game(A,B)given by matricesA,B ∈Rm×n.

Row player Alice and column player Bobindependentlychoose strategyx ∈∆Aandy ∈∆B.

Givesproduct distributionσx,y :SA× SB →[0,1]over strategy profiles:

σx,y(ak,b`) =σk` =xky` fork =1, . . . ,mand`=1, . . . ,n.

Example

Distribution over strategy profiles is given by x1y1 x1y2 x1y3 x2y1 x2y2 x2y3

b1 b2 b3

a1 (0,2) (1,0) (2,1) a2 (3,0) (0,1) (1,4)

Thenexpected cost(for Alice)CAx,y)is

Remember thatAk`=CA(ak,b`).

(13)

Two-player games with mixed strategies (recap)

Two-player game(A,B)given by matricesA,B ∈Rm×n.

Row player Alice and column player Bobindependentlychoose strategyx ∈∆Aandy ∈∆B.

Givesproduct distributionσx,y :SA× SB →[0,1]over strategy profiles:

σx,y(ak,b`) =σk` =xky` fork =1, . . . ,mand`=1, . . . ,n.

Example

Distribution over strategy profiles is given by x1y1 x1y2 x1y3 x2y1 x2y2 x2y3

b1 b2 b3

a1 (0,2) (1,0) (2,1) a2 (3,0) (0,1) (1,4)

Thenexpected cost(for Alice)CAx,y)is xTAy =E(ak,b`)∼σx,y[CA(ak,b`)] = X

σk`CA(ak,b`)

(14)

Beyond mixed strategies

(15)

Equilibrium concepts as distributions over S

A

× S

B

We have seen the following equilibrium concepts:

PNE: Strategy profiles = (sA,sB)∈ SA× SB. Givesindicator distributionσoverSA× SBwith

σ(t) =

1 t =s 0 t 6=s . MNE: Mixed strategies(x,y)withx ∈∆A,y ∈∆B.

Givesproduct distributionσoverSA× SB, where σ(ak,b`) =σk`=xky`.

(C)CE: (Coarse) correlated equilibrium will be defined as general distributionσ overSA× SB.

I.e., not induced by specific player actions.

(16)

Equilibrium concepts as distributions over S

A

× S

B

We have seen the following equilibrium concepts:

PNE: Strategy profiles= (sA,sB)∈ SA× SB.

Givesindicator distributionσoverSA× SBwith σ(t) =

1 t =s 0 t 6=s . MNE: Mixed strategies(x,y)withx ∈∆A,y ∈∆B.

Givesproduct distributionσoverSA× SB, where σ(ak,b`) =σk`=xky`.

(C)CE: (Coarse) correlated equilibrium will be defined as general distributionσ overSA× SB.

I.e., not induced by specific player actions.

(17)

Equilibrium concepts as distributions over S

A

× S

B

We have seen the following equilibrium concepts:

PNE: Strategy profiles= (sA,sB)∈ SA× SB. Givesindicator distributionσoverSA× SBwith

σ(t) =

1 t =s 0 t 6=s .

MNE: Mixed strategies(x,y)withx ∈∆A,y ∈∆B. Givesproduct distributionσoverSA× SB, where

σ(ak,b`) =σk`=xky`.

(C)CE: (Coarse) correlated equilibrium will be defined as general distributionσ overSA× SB.

I.e., not induced by specific player actions.

(18)

Equilibrium concepts as distributions over S

A

× S

B

We have seen the following equilibrium concepts:

PNE: Strategy profiles= (sA,sB)∈ SA× SB. Givesindicator distributionσoverSA× SBwith

σ(t) =

1 t =s 0 t 6=s . MNE: Mixed strategies(x,y)withx ∈∆A,y ∈∆B.

Givesproduct distributionσoverSA× SB, where σ(ak,b`) =σk`=xky`.

(C)CE: (Coarse) correlated equilibrium will be defined as general distributionσ overSA× SB.

I.e., not induced by specific player actions.

(19)

Equilibrium concepts as distributions over S

A

× S

B

We have seen the following equilibrium concepts:

PNE: Strategy profiles= (sA,sB)∈ SA× SB. Givesindicator distributionσoverSA× SBwith

σ(t) =

1 t =s 0 t 6=s . MNE: Mixed strategies(x,y)withx ∈∆A,y ∈∆B.

Givesproduct distributionσoverSA× SB, where σ(ak,b`) =σk`=xky`.

(C)CE: (Coarse) correlated equilibrium will be defined as general distributionσ overSA× SB.

I.e., not induced by specific player actions.

(20)

Equilibrium concepts as distributions over S

A

× S

B

We have seen the following equilibrium concepts:

PNE: Strategy profiles= (sA,sB)∈ SA× SB. Givesindicator distributionσoverSA× SBwith

σ(t) =

1 t =s 0 t 6=s . MNE: Mixed strategies(x,y)withx ∈∆A,y ∈∆B.

Givesproduct distributionσoverSA× SB, where σ(ak,b`) =σk`=xky`.

(C)CE: (Coarse) correlated equilibrium will be defined as general distributionσ overSA× SB.

I.e., not induced by specific player actions.

(21)

Equilibrium concepts as distributions over S

A

× S

B

We have seen the following equilibrium concepts:

PNE: Strategy profiles= (sA,sB)∈ SA× SB. Givesindicator distributionσoverSA× SBwith

σ(t) =

1 t =s 0 t 6=s . MNE: Mixed strategies(x,y)withx ∈∆A,y ∈∆B.

Givesproduct distributionσoverSA× SB, where σ(ak,b`) =σk`=xky`.

(C)CE: (Coarse) correlated equilibrium will be defined as general distributionσ overSA× SB.

I.e., not induced by specific player actions.

(22)

Game of Chicken

Game of Chicken

Alice and Bob both approach an intersection.

Bob

Stop Go

Alice Stop (0,0) (3,−1) Go (−1,3) (4,4)

Two PNEs: (Stop, Go), (Go, Stop).

One MNE: Both players randomize over Stop and Go. Distributions over strategy profiles(a,b)for these equilibria are

0 1 0 0

,

0 0 1 0

and 1

4 1 1 4 4

1 4

.

(23)

Game of Chicken

Game of Chicken

Alice and Bob both approach an intersection.

Bob

Stop Go

Alice Stop (0,0) (3,−1) Go (−1,3) (4,4) Two PNEs: (Stop, Go), (Go, Stop).

One MNE: Both players randomize over Stop and Go. Distributions over strategy profiles(a,b)for these equilibria are

0 1 0 0

,

0 0 1 0

and 1

4 1 1 4 4

1 4

.

(24)

Game of Chicken

Game of Chicken

Alice and Bob both approach an intersection.

Bob

Stop Go

Alice Stop (0,0) (3,−1) Go (−1,3) (4,4) Two PNEs: (Stop, Go), (Go, Stop).

One MNE: Both players randomize over Stop and Go.

Distributions over strategy profiles(a,b)for these equilibria are 0 1

0 0

,

0 0 1 0

and 1

4 1 1 4 4

1 4

.

(25)

Game of Chicken

Game of Chicken

Alice and Bob both approach an intersection.

Bob

Stop Go

Alice Stop (0,0) (3,−1) Go (−1,3) (4,4) Two PNEs: (Stop, Go), (Go, Stop).

One MNE: Both players randomize over Stop and Go.

Distributions over strategy profiles(a,b)for these equilibria are 0 1

0 0

,

0 0 1 0

and 1

4 1 1 4 4

1 4

.

(26)

Game of Chicken

Game of Chicken

Alice and Bob both approach an intersection.

Bob

Stop Go

Alice Stop (0,0) (3,−1) Go (−1,3) (4,4) Two PNEs: (Stop, Go), (Go, Stop).

One MNE: Both players randomize over Stop and Go.

Distributions over strategy profiles(a,b)for these equilibria are

(27)

Sensible ‘equilibrium’ would be the strategy profile distribution

σ =

0 12

1 2 0

.

Cannot be achieved as mixed equilibrium.

There arenox A,y Bsuch thatσk`=xky`for all k, `∈ {1,2}.

Idea is to introducetraffic light(mediator or trusted third party). Traffic light samples/draws one of the two strategy profiles from distribution.

Gives realization as recommendation to the players. Tells Alice to go and Bob to stop (or vice versa)

Conditioned on this recommendation, the best thing for a player to do is to follow it.

(28)

Sensible ‘equilibrium’ would be the strategy profile distribution

σ =

0 12

1 2 0

.

Cannot be achieved as mixed equilibrium.

There arenox A,y Bsuch thatσk`=xky`for all k, `∈ {1,2}.

Idea is to introducetraffic light(mediator or trusted third party). Traffic light samples/draws one of the two strategy profiles from distribution.

Gives realization as recommendation to the players. Tells Alice to go and Bob to stop (or vice versa)

Conditioned on this recommendation, the best thing for a player to do is to follow it.

(29)

Sensible ‘equilibrium’ would be the strategy profile distribution

σ =

0 12

1 2 0

.

Cannot be achieved as mixed equilibrium.

There arenox A,y Bsuch thatσk`=xky`for all k, `∈ {1,2}.

Idea is to introducetraffic light(mediator or trusted third party). Traffic light samples/draws one of the two strategy profiles from distribution.

Gives realization as recommendation to the players. Tells Alice to go and Bob to stop (or vice versa)

Conditioned on this recommendation, the best thing for a player to do is to follow it.

(30)

Sensible ‘equilibrium’ would be the strategy profile distribution

σ =

0 12

1 2 0

.

Cannot be achieved as mixed equilibrium.

There arenox A,y Bsuch thatσk`=xky`for all k, `∈ {1,2}.

Idea is to introducetraffic light(mediator or trusted third party).

Traffic light samples/draws one of the two strategy profiles from distribution.

Gives realization as recommendation to the players. Tells Alice to go and Bob to stop (or vice versa)

Conditioned on this recommendation, the best thing for a player to do is to follow it.

(31)

Sensible ‘equilibrium’ would be the strategy profile distribution

σ =

0 12

1 2 0

.

Cannot be achieved as mixed equilibrium.

There arenox A,y Bsuch thatσk`=xky`for all k, `∈ {1,2}.

Idea is to introducetraffic light(mediator or trusted third party).

Traffic light samples/draws one of the two strategy profiles from distribution.

Gives realization as recommendation to the players. Tells Alice to go and Bob to stop (or vice versa)

Conditioned on this recommendation, the best thing for a player to do is to follow it.

(32)

Sensible ‘equilibrium’ would be the strategy profile distribution

σ =

0 12

1 2 0

.

Cannot be achieved as mixed equilibrium.

There arenox A,y Bsuch thatσk`=xky`for all k, `∈ {1,2}.

Idea is to introducetraffic light(mediator or trusted third party).

Traffic light samples/draws one of the two strategy profiles from distribution.

Gives realization as recommendation to the players.

Tells Alice to go and Bob to stop (or vice versa)

Conditioned on this recommendation, the best thing for a player to do is to follow it.

(33)

Sensible ‘equilibrium’ would be the strategy profile distribution

σ =

0 12

1 2 0

.

Cannot be achieved as mixed equilibrium.

There arenox A,y Bsuch thatσk`=xky`for all k, `∈ {1,2}.

Idea is to introducetraffic light(mediator or trusted third party).

Traffic light samples/draws one of the two strategy profiles from distribution.

Gives realization as recommendation to the players.

Tells Alice to go and Bob to stop (or vice versa)

Conditioned on this recommendation, the best thing for a player to do is to follow it.

(34)

Sensible ‘equilibrium’ would be the strategy profile distribution

σ =

0 12

1 2 0

.

Cannot be achieved as mixed equilibrium.

There arenox A,y Bsuch thatσk`=xky`for all k, `∈ {1,2}.

Idea is to introducetraffic light(mediator or trusted third party).

Traffic light samples/draws one of the two strategy profiles from distribution.

Gives realization as recommendation to the players.

Tells Alice to go and Bob to stop (or vice versa)

(35)

Correlated equilibrium (CE), informal

Correlated equilibriumσ :SA× SB →[0,1]can be seen as follows. Mediator (third party) draws samplex = (xA,xB)∼σ.

σis known to Alice and Bob, but notx.

Givesprivate recommendationxAto Alice, andxB to Bob. Alice and Bob do not know each other’s recommendation!

Game of Chicken is the exception to the rule.

Recommendations give players some info on whichx was drawn. Player assumes all other players play private recommendation, i.e., Alice

assumes Bob follows his recommendation (and vice versa). In CE, no player has incentive to deviate given its recommendation.

Remark

We will later seeno-regretalgorithms whose output is a coarse correlated equilibrium (similar algorithms exist converging to CE). Therefore, for (C)CE, it’s not always necessary that all players know the distributionσ up front, nor that there is an actual third party that samples from it.

(36)

Correlated equilibrium (CE), informal

Correlated equilibriumσ :SA× SB →[0,1]can be seen as follows.

Mediator (third party) draws samplex = (xA,xB)∼σ. σis known to Alice and Bob, but notx.

Givesprivate recommendationxAto Alice, andxB to Bob. Alice and Bob do not know each other’s recommendation!

Game of Chicken is the exception to the rule.

Recommendations give players some info on whichx was drawn. Player assumes all other players play private recommendation, i.e., Alice

assumes Bob follows his recommendation (and vice versa). In CE, no player has incentive to deviate given its recommendation.

Remark

We will later seeno-regretalgorithms whose output is a coarse correlated equilibrium (similar algorithms exist converging to CE). Therefore, for (C)CE, it’s not always necessary that all players know the distributionσ up front, nor that there is an actual third party that samples from it.

(37)

Correlated equilibrium (CE), informal

Correlated equilibriumσ :SA× SB →[0,1]can be seen as follows.

Mediator (third party) draws samplex = (xA,xB)∼σ.

σis known to Alice and Bob, but notx.

Givesprivate recommendationxAto Alice, andxB to Bob. Alice and Bob do not know each other’s recommendation!

Game of Chicken is the exception to the rule.

Recommendations give players some info on whichx was drawn. Player assumes all other players play private recommendation, i.e., Alice

assumes Bob follows his recommendation (and vice versa). In CE, no player has incentive to deviate given its recommendation.

Remark

We will later seeno-regretalgorithms whose output is a coarse correlated equilibrium (similar algorithms exist converging to CE). Therefore, for (C)CE, it’s not always necessary that all players know the distributionσ up front, nor that there is an actual third party that samples from it.

(38)

Correlated equilibrium (CE), informal

Correlated equilibriumσ :SA× SB →[0,1]can be seen as follows.

Mediator (third party) draws samplex = (xA,xB)∼σ.

σis known to Alice and Bob, but notx.

Givesprivate recommendationxAto Alice, andxB to Bob. Alice and Bob do not know each other’s recommendation!

Game of Chicken is the exception to the rule.

Recommendations give players some info on whichx was drawn. Player assumes all other players play private recommendation, i.e., Alice

assumes Bob follows his recommendation (and vice versa). In CE, no player has incentive to deviate given its recommendation.

Remark

We will later seeno-regretalgorithms whose output is a coarse correlated equilibrium (similar algorithms exist converging to CE). Therefore, for (C)CE, it’s not always necessary that all players know the distributionσ up front, nor that there is an actual third party that samples from it.

(39)

Correlated equilibrium (CE), informal

Correlated equilibriumσ :SA× SB →[0,1]can be seen as follows.

Mediator (third party) draws samplex = (xA,xB)∼σ.

σis known to Alice and Bob, but notx.

Givesprivate recommendationxAto Alice, andxB to Bob.

Alice and Bob do not know each other’s recommendation! Game of Chicken is the exception to the rule.

Recommendations give players some info on whichx was drawn. Player assumes all other players play private recommendation, i.e., Alice

assumes Bob follows his recommendation (and vice versa). In CE, no player has incentive to deviate given its recommendation.

Remark

We will later seeno-regretalgorithms whose output is a coarse correlated equilibrium (similar algorithms exist converging to CE). Therefore, for (C)CE, it’s not always necessary that all players know the distributionσ up front, nor that there is an actual third party that samples from it.

(40)

Correlated equilibrium (CE), informal

Correlated equilibriumσ :SA× SB →[0,1]can be seen as follows.

Mediator (third party) draws samplex = (xA,xB)∼σ.

σis known to Alice and Bob, but notx.

Givesprivate recommendationxAto Alice, andxB to Bob.

Alice and Bob do not know each other’s recommendation!

Game of Chicken is the exception to the rule.

Recommendations give players some info on whichx was drawn. Player assumes all other players play private recommendation, i.e., Alice

assumes Bob follows his recommendation (and vice versa). In CE, no player has incentive to deviate given its recommendation.

Remark

We will later seeno-regretalgorithms whose output is a coarse correlated equilibrium (similar algorithms exist converging to CE). Therefore, for (C)CE, it’s not always necessary that all players know the distributionσ up front, nor that there is an actual third party that samples from it.

(41)

Correlated equilibrium (CE), informal

Correlated equilibriumσ :SA× SB →[0,1]can be seen as follows.

Mediator (third party) draws samplex = (xA,xB)∼σ.

σis known to Alice and Bob, but notx.

Givesprivate recommendationxAto Alice, andxB to Bob.

Alice and Bob do not know each other’s recommendation!

Game of Chicken is the exception to the rule.

Recommendations give players some info on whichx was drawn. Player assumes all other players play private recommendation, i.e., Alice

assumes Bob follows his recommendation (and vice versa). In CE, no player has incentive to deviate given its recommendation.

Remark

We will later seeno-regretalgorithms whose output is a coarse correlated equilibrium (similar algorithms exist converging to CE). Therefore, for (C)CE, it’s not always necessary that all players know the distributionσ up front, nor that there is an actual third party that samples from it.

(42)

Correlated equilibrium (CE), informal

Correlated equilibriumσ :SA× SB →[0,1]can be seen as follows.

Mediator (third party) draws samplex = (xA,xB)∼σ.

σis known to Alice and Bob, but notx.

Givesprivate recommendationxAto Alice, andxB to Bob.

Alice and Bob do not know each other’s recommendation!

Game of Chicken is the exception to the rule.

Recommendations give players some info on whichx was drawn.

Player assumes all other players play private recommendation, i.e., Alice assumes Bob follows his recommendation (and vice versa). In CE, no player has incentive to deviate given its recommendation.

Remark

We will later seeno-regretalgorithms whose output is a coarse correlated equilibrium (similar algorithms exist converging to CE). Therefore, for (C)CE, it’s not always necessary that all players know the distributionσ up front, nor that there is an actual third party that samples from it.

(43)

Correlated equilibrium (CE), informal

Correlated equilibriumσ :SA× SB →[0,1]can be seen as follows.

Mediator (third party) draws samplex = (xA,xB)∼σ.

σis known to Alice and Bob, but notx.

Givesprivate recommendationxAto Alice, andxB to Bob.

Alice and Bob do not know each other’s recommendation!

Game of Chicken is the exception to the rule.

Recommendations give players some info on whichx was drawn.

Player assumes all other players play private recommendation,

i.e., Alice assumes Bob follows his recommendation (and vice versa). In CE, no player has incentive to deviate given its recommendation.

Remark

We will later seeno-regretalgorithms whose output is a coarse correlated equilibrium (similar algorithms exist converging to CE). Therefore, for (C)CE, it’s not always necessary that all players know the distributionσ up front, nor that there is an actual third party that samples from it.

(44)

Correlated equilibrium (CE), informal

Correlated equilibriumσ :SA× SB →[0,1]can be seen as follows.

Mediator (third party) draws samplex = (xA,xB)∼σ.

σis known to Alice and Bob, but notx.

Givesprivate recommendationxAto Alice, andxB to Bob.

Alice and Bob do not know each other’s recommendation!

Game of Chicken is the exception to the rule.

Recommendations give players some info on whichx was drawn.

Player assumes all other players play private recommendation, i.e., Alice assumes Bob follows his recommendation (and vice versa).

In CE, no player has incentive to deviate given its recommendation.

Remark

We will later seeno-regretalgorithms whose output is a coarse correlated equilibrium (similar algorithms exist converging to CE). Therefore, for (C)CE, it’s not always necessary that all players know the distributionσ up front, nor that there is an actual third party that samples from it.

(45)

Correlated equilibrium (CE), informal

Correlated equilibriumσ :SA× SB →[0,1]can be seen as follows.

Mediator (third party) draws samplex = (xA,xB)∼σ.

σis known to Alice and Bob, but notx.

Givesprivate recommendationxAto Alice, andxB to Bob.

Alice and Bob do not know each other’s recommendation!

Game of Chicken is the exception to the rule.

Recommendations give players some info on whichx was drawn.

Player assumes all other players play private recommendation, i.e., Alice assumes Bob follows his recommendation (and vice versa).

In CE, no player has incentive to deviate given its recommendation.

Remark

We will later seeno-regretalgorithms whose output is a coarse correlated equilibrium (similar algorithms exist converging to CE). Therefore, for (C)CE, it’s not always necessary that all players know the distributionσ up front, nor that there is an actual third party that samples from it.

(46)

Correlated equilibrium (CE), informal

Correlated equilibriumσ :SA× SB →[0,1]can be seen as follows.

Mediator (third party) draws samplex = (xA,xB)∼σ.

σis known to Alice and Bob, but notx.

Givesprivate recommendationxAto Alice, andxB to Bob.

Alice and Bob do not know each other’s recommendation!

Game of Chicken is the exception to the rule.

Recommendations give players some info on whichx was drawn.

Player assumes all other players play private recommendation, i.e., Alice assumes Bob follows his recommendation (and vice versa).

In CE, no player has incentive to deviate given its recommendation.

Remark

We will later seeno-regretalgorithms whose output is a coarse

Therefore, for (C)CE, it’s not always necessary that all players know the distributionσ up front, nor that there is an actual third party that samples from it.

(47)

Correlated equilibrium (CE), informal

Correlated equilibriumσ :SA× SB →[0,1]can be seen as follows.

Mediator (third party) draws samplex = (xA,xB)∼σ.

σis known to Alice and Bob, but notx.

Givesprivate recommendationxAto Alice, andxB to Bob.

Alice and Bob do not know each other’s recommendation!

Game of Chicken is the exception to the rule.

Recommendations give players some info on whichx was drawn.

Player assumes all other players play private recommendation, i.e., Alice assumes Bob follows his recommendation (and vice versa).

In CE, no player has incentive to deviate given its recommendation.

Remark

We will later seeno-regretalgorithms whose output is a coarse correlated equilibrium (similar algorithms exist converging to CE).

Therefore, for (C)CE, it’s not always necessary that all players know

nor that there is an actual third party that samples from it.

(48)

Correlated equilibrium (CE), informal

Correlated equilibriumσ :SA× SB →[0,1]can be seen as follows.

Mediator (third party) draws samplex = (xA,xB)∼σ.

σis known to Alice and Bob, but notx.

Givesprivate recommendationxAto Alice, andxB to Bob.

Alice and Bob do not know each other’s recommendation!

Game of Chicken is the exception to the rule.

Recommendations give players some info on whichx was drawn.

Player assumes all other players play private recommendation, i.e., Alice assumes Bob follows his recommendation (and vice versa).

In CE, no player has incentive to deviate given its recommendation.

Remark

We will later seeno-regretalgorithms whose output is a coarse

(49)

Example

Distribution over strategy profiles is given by

σ =

σ11 σ12 σ13 σ21 σ22 σ23

=

0 1/8 1/8

2/8 1/8 3/8

b1 b2 b3

a1 (0,2) (1,0) (2,1) a2 (3,0) (0,1) (1,4)

Suppose Alice getssecond rowa2as recommendation.

This gives Alice a (conditional) probability distributionρfor column privately recommended to Bob:

Columnb1with probability

2 8 2

8+18+38 = 26. Columnb2with probability

1 8 2 8+1

8+3

8

= 16. Columnb3with probability

3 8 2

8+18+38 = 36.

Assuming distributionρover Bob’s recommendation, notion of CE says Alice should have no incentive to deviate to first rowa1(in expectation).

Eρ[Rowa2] =3×2/6+0×1/6+1×3/6=9/6. Eρ[Rowa1] =0×2/6+1×1/6+2×3/6=7/6.

σas above is not a CE!

(50)

Example

Distribution over strategy profiles is given by

σ =

σ11 σ12 σ13 σ21 σ22 σ23

=

0 1/8 1/8

2/8 1/8 3/8

b1 b2 b3

a1 (0,2) (1,0) (2,1) a2 (3,0) (0,1) (1,4)

Suppose Alice getssecond rowa2as recommendation.

This gives Alice a (conditional) probability distributionρfor column privately recommended to Bob:

Columnb1with probability

2 8 2

8+18+38 = 26. Columnb2with probability

1 8 2 8+1

8+3

8

= 16. Columnb3with probability

3 8 2

8+18+38 = 36.

Assuming distributionρover Bob’s recommendation, notion of CE says Alice should have no incentive to deviate to first rowa1(in expectation).

Eρ[Rowa2] =3×2/6+0×1/6+1×3/6=9/6. Eρ[Rowa1] =0×2/6+1×1/6+2×3/6=7/6.

σas above is not a CE!

(51)

Example

Distribution over strategy profiles is given by

σ =

σ11 σ12 σ13 σ21 σ22 σ23

=

0 1/8 1/8

2/8 1/8 3/8

b1 b2 b3

a1 (0,2) (1,0) (2,1) a2 (3,0) (0,1) (1,4)

Suppose Alice getssecond rowa2as recommendation.

This gives Alice a (conditional) probability distributionρfor column privately recommended to Bob:

Columnb1with probability

2 8 2

8+18+38 = 26. Columnb2with probability

1 8 2 8+1

8+3

8

= 16. Columnb3with probability

3 8 2

8+18+38 = 36.

Assuming distributionρover Bob’s recommendation, notion of CE says Alice should have no incentive to deviate to first rowa1(in expectation).

Eρ[Rowa2] =3×2/6+0×1/6+1×3/6=9/6. Eρ[Rowa1] =0×2/6+1×1/6+2×3/6=7/6.

σas above is not a CE!

(52)

Example

Distribution over strategy profiles is given by

σ =

σ11 σ12 σ13 σ21 σ22 σ23

=

0 1/8 1/8

2/8 1/8 3/8

b1 b2 b3

a1 (0,2) (1,0) (2,1) a2 (3,0) (0,1) (1,4)

Suppose Alice getssecond rowa2as recommendation.

This gives Alice a (conditional) probability distributionρfor column privately recommended to Bob:

Columnb1with probability

2 8 2

8+18+38 = 26.

Columnb2with probability

1 8 2 8+1

8+3

8

= 16. Columnb3with probability

3 8 2

8+18+38 = 36.

Assuming distributionρover Bob’s recommendation, notion of CE says Alice should have no incentive to deviate to first rowa1(in expectation).

Eρ[Rowa2] =3×2/6+0×1/6+1×3/6=9/6. Eρ[Rowa1] =0×2/6+1×1/6+2×3/6=7/6.

σas above is not a CE!

Referenzen

ÄHNLICHE DOKUMENTE

Today, the goal is to give a “quasi-polynomial” time algorithm that computes an -approximate mixed Nash equilibrium.. Supports of mixed strategies play an important

Once the support is fixed, the computation of an equilibrium (with that support) reduces to solving a linear program.. 6

Player assumes all other players play private recommendation, i.e., Alice assumes Bob follows his recommendation (and vice versa)!. In CE, no player has incentive to deviate given

In general, we assume to have an all-knowing, adaptive adversary Can choose which element to present in step i, based on.. Choices of online algorithm in

Side note: Original version of secretary problem asks for maximizing probability with which best element is selected.. If one picks maximum weight element

Sponsored search auctions (e.g., Google) Online selling platforms (e.g., eBay) (Stable) matching problems.. Matching children

Sponsored search auctions (e.g., Google) Online selling platforms (e.g., eBay) (Stable) matching problems.. Matching children

There exists a strategyproof 1 e -approximation for the online bipartite matching problem with uniform random arrivals of the bidders.. Strategyproof