• Keine Ergebnisse gefunden

Dynamic Allocation and Pricing: A Mechanism Design Approach

N/A
N/A
Protected

Academic year: 2022

Aktie "Dynamic Allocation and Pricing: A Mechanism Design Approach"

Copied!
73
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Dynamic Allocation and Pricing: A Mechanism Design Approach

Benny Moldovanu

September 2011

(2)

Modern Revenue Management

Starts with US Airline Deregulation Act of 1978.

Today mainstream business practice (airlines, trains, hotels, car rentals, holiday resorts, advertising, intelligent metering devices, etc..) Considerable gap between practitioners and academics in the …eld Major academic textbook : The Theory and Practice of Revenue Management by K. T. Talluri and G.J. van Ryzin

(3)

Basic RM Questions (Talluri & van Ryzin)

Quantity decisions: How to allocate capacity/output to di¤erent segments, products or channels ? When to withhold products from the market ?

Structural decisions: Which selling format ? (posted prices, negotiations, auctions, etc..). Which features for particular format

?(segmentation, volume discounts, bundling, etc..)

Pricing decisions: How to set posted prices, reserve prices ? How to price di¤erentiate ? How to price over time ? How to markdown over life time ?

(4)

Towards a Modern Theory of RM

Necessary blend of

1 The elegant dynamic modelsfrom the OR, MS, CS , Econ (search) literatures with historical focus on "grand, centralized

optimization" and/or "ad-hoc", intuitive mechanisms.

2 The rich, classical mechanism design literature with historical focus oninformation/incentives in staticsettings.

Blend fruitful for numerous applications.

(5)

Reviewed Papers (joint work with A. Gershkov)

Dynamic Revenue Maximization with Heterogeneous Objects: A Mechanism Design Approach,AEJ: Microeconomics 2009

E¢ cient Sequential Assignment with Incomplete Information,GEB 2010

Revenue Maximization for the Dynamic Knapsack Problem, TE 2011 (also joint with D. Dizdar)

Learning About The Future and Dynamic E¢ ciency, AER 2009 Optimal Search, Learning, and Implementation,mimeo 2010.

E¢ cient Dynamic Allocation with Strategic Arrivals, mimeo 2011

(6)

The Gallego & van Ryzin Model (MS 94 ) I

Revenue Maximizing (RM) seller has n identical objects that perish after deadlineT.

Agents arrive according to a Poisson process with intensity λ.

Agents’values are private information, represented by I.I.D random variables Xi =X on[0,+∞)with common c.d.f. F.

Agents desire one object only, and can only be served upon arrival (no recall)

After an item is assigned, it cannot be reallocated.

(7)

The Poisson Process

The Poisson process with intensity λis a continuous-time counting process fN(t),t 0gsuch that:

1 N(0) =0

2 Independent increments: the numbers of events in disjoint time intervals are random variables independent from each other.

3 Stationary increments: the probability distribution of the number of events in any time interval only depends on the length of the interval.

4 N(t+τ) N(t), the number of events in time interval (t,t+τ] follows a Poisson distribution with parameterλτ :

P[N(t+τ) N(t) =k] = e

λτ(λτ)k

k! , k =0,1,2...

5 The probability distribution of the waiting time until the next event is an exponential distribution.

(8)

Gallego & van Ryzin II

Gallego & van Ryzin restrict attention to posted prices: at each point in timet, seller sets pricept that needs to be paid by any buyer that arrives att.

Main Results:

1 Optimal posted-price revenue is concave in the number of objects, and in time until deadline.

2 Time pattern of optimal prices: Downward trend, interrupted by upward jumps after each sale.

3 Fixed price is approximately optimal if min (n,λT)is large enough.

(9)

New Research Questions

General Mechanisms

Multiple, Heterogenous Objects Multi-Unit Demand

Learning about Demand Recall and Strategic Arrivals

(10)

Derman-Liebermann-Ross (MS 72)

each item i =1, ..,m is characterized by a "quality"qi with 0 qm qm 1 ... q1

n agents arrive sequentially, one agent per period agents can only be served upon arrival

after an item is assigned, it cannot be reallocated each agent is characterized by a "type"xj

if an item with typeqi 0 is assigned to an agent with type xj, this agent enjoys a utility of qixj

agents’types are IID random variablesXi =X on [0,+)with common cdf. F

(11)

DLR Model II

Theorem (DLR 72)

Consider the arrival of an agent with type xk in period k 1. There exist k+1constants0=ak,k a1,k ... a0,k = such that:

The dynamically e¢ cient policy assigns the item with quality q(i) if xk 2 (ai,k,ai 1,k]

The constants are given recursively by ai,k+1 =

Z ai 1,k

ai,k

xdF(x) +ai,kF(ai,k) +ai 1,k[1 F(ai 1,k)]

where we set + 0= 0=0.

In a problem with n periods total expected welfare is given by Wn =ni=1q(i)ai,n+1.

(12)

Majorization: De…nition

De…nition

For any n tupleγ= (γ1,γ2, ..,γn) letγ(j) denote thejthlargest

coordinate (so that γ(n) γ(n 1) ... γ(1)). Vector α= (α1,α2, ..,αn) is majorized byβ = (β1,β2, ..,βn)and we writeα βif the following system of n 1 inequalities and one equality is satis…ed:

α(1) β(1) α(1)+α(2) β(1)+β(2)

... ...

α(1)+α(2)+..α(n 1) β(1)+β(2)+β(n 1) α(1)+α(2)+..+α(n) = β(1)+β(2)+..+β(n)

We say that αisweakly sub-majorized byβ and we writeα w βif all relations above hold with weak inequality.

(13)

Majorization: Main Results (Marshall & Olkin, 79)

Theorem

1 α β if and only if there exists a doubly stochastic matrix Q such that α= βQ.

2 A function Ψ:Rn !R is called Schur-convex ifΨ(α) Ψ(β)for any α,β such that α β.

3 Let Ψ:Rn !R be symmetric with continuous partial derivatives.

Then Ψ is Schur-convex if and only if for all(y1, ...,yn)2Rn and all i,j 2 f1, ..,ngit holds that

∂Ψ(y1, ...,yn)

∂yi

∂Ψ(y1, ...,yn)

∂yj (yi yj) 0.

(14)

Majorization in the DLR Model

Let X(i),k denote thei th highest order statistic out ofk copies of X,and let µ(i),k denote its expectation.

Let Sn denote the set of permutations off1,2, ..,ng.For each permutation σ2 Sn,de…ne µσn = µ(σ(1)),n , µ(σ(2)),n , ..., µ(σ(n)),n Total expected welfare in a static problem where n agents arrive simultaneously is given by ∑ni=1q(i)µ(i),n.

Theorem

The n tuple fai,n+1gni=1 is majorized by the n tuplefµi,ngni=1 . In particular, there exist n permutations fσigni=1 and non-negative weights fwigni=1 such that ∑i=1wi =1 and

ni=1wiµσ

i

n = (a1,n+1,a2,n+1, ..,an,n+1).

(15)

Albright’s Model (MS 74 ) I

Welfare Maximizing (WM) seller hasm items.

Each itemi =1, ..,n is characterized by a "quality"qi with 0 qn qn 1 ... q1

If an item with qualityqi 0 is assigned to an agent with typexj this agent has utility qixj

Complete information: current type is known upon arrival; future types are IID random variablesXi =X on [0,+∞) with common c.d.f. F.

Poisson arrivals, unit demand, deadline, etc..as in Gallego & van Ryzin.

(16)

The Welfare Maximizing (WM) Allocation

Theorem (Albright)

Denote by Πt the set of items available at t, with cardinality kt.There exist n+1 unique functions

∞ y0(t) y1(t) ..yn(t) 0, 8t

which do not depend on the q’s such that if an agent with type x arrives at time t, it is optimal to assign him the j’th highest element of Πt if x 2[yj(t), yj 1(t)) and not to assign any object if x <ykt(t).

1 For each k,the function yk(t)satis…es yk0(t) = λRyk 1

yk (1 F(x))dx.

2 The expected welfare from timet on is given byWt =ki=t1q(i)yi(t), whereq(i) is the i’th highest element ofΠt.

(17)

Albright: lllustration.

Example

There are three objects; λ=1; the distribution of agents’types is F(x) =1 e x.The following …gure depicts the solution forT =5:

0 1 2 3 4 5

0.0 0.5 1.0 1.5 2.0

t y

(18)

Second Order Stochastic Dominance I

Theorem

Consider two distributions of agents’types F and G such that

µF =µG =µand such that F second-order stochastically dominates G (in particular F has a lower variance than G).Then it holds that:

1 For any time t,and for any n,the vector fyiF(t)gni=1 of optimal cuto¤s under F is weakly sub-majorized by the vector fyiG(t)gni=1 of optimal cuto¤s under G.

2 For any time t and for any set of available objects at t, Πt 6=∅, the expected welfare in the e¢ cient dynamic allocation under F is lower than that under G.

(19)

SSD: Illustration

Example

Let F(x) =x on[0,1]so that F is IFR and thusF SSD

G(x) =1 e 2x.There is one object;λ=1;T =5.y1F(t) =1 72t is the dashed line, y1G(t) = 12ln[6 t]is the solid line.

0 1 2 3 4 5 6

0.0 0.2 0.4 0.6 0.8 1.0 1.2

t y

(20)

Second Order Stochastic Dominance (Static Counterpart)

Theorem (De La Cal & Caracamo, J. Appl. Prob. 06)

Consider two distributions of agents’types F and G . The following two assertions are equivalent:

1 F second-order stochastically dominates G .

2 For any n,the n tuple fµF(i),ngni=1 of mean order statistics under F is majorized by the n tuple fµG(i),ngni=1of mean order statistics under G. Corollary

Consider two distributions of agents’types F and G such that

F second-order stochastically dominates G . Then, for any set of available objects, expected welfare in the e¢ cient static allocation under F is lower than that under G .

(21)

The Loss from Sequentiality I

Assume that at timet there aren objects left.

Consider scenario where the allocation to all subsequently arriving agents can be made at the deadline T.

Expected welfare at t is given by ∑ni=1q(i)zi(t),wherezi(t)

represents the expected type of an agent who arrives aftert, and who get assigned to the object with the i th highest quality.

Let Prl(t) =e λ(T t)λl(Tl! t)l be the probability that there will be l arrivals,l 1,after timet. Then

zi(t) =

l=i

Prl(t)µ(l i+1),l =e λ(T t)

l=i

λl(T t)lµ(l i+1),l l! , 8i. where let µ(i),l is the expectation of thei th highest order statistic out of l copies ofX.

(22)

The Loss from Sequentiality II

Intuitively,∑ni=1q(i)[yi(t) zi(t)]measures the welfare loss due to the sequentiality constraint.

Theorem

1 For any period t,and for any n,the vector fyi(t)gni=1 of optimal cuto¤s in the dynamically e¢ cient allocation of n objects is weakly sub-majorized by the vectorfzi(t)gni=1.

2 Moreover, limn!ni=1yi(t) =limn!ni=1zi(t) =λ(T t)µ, where µis the mean of the distribution of agents’types.

(23)

In…nite Horizon & Discounting

In…nite horizon (T =), discount rate r(t) =e αt.

Arrival process is a renewal with general inter-arrival distributionB Theorem (Albright MS 1974)

The dynamically e¢ cient cuto¤ curves are stationary (i.e., independent of time) yn yn 1... y1.These constants do not depend on the q’s, and are given by the implicit recursion:

(yk +yk 1+..y1) = LB(α) 1 LB(α)

Z

yk

(1 F(x))dx , 1 k n where LB is the Laplace-transform of the inter-arrival distribution.

Recall that for random variableX with densityf,theLaplace Transform is given by L(s) =E[e sX]

(24)

Inter-Arrival Distributions I

De…nition

Let β,γ be two non-negative random variables governed by distributions B,C,respectively. Then B dominates C in theLaplace transform order if

E[e ] E[e ] for alls >0 Theorem

Consider two inter-arrival distributions B and C such that B Lt C .

1 For any …xed distribution of agents’characteristics F,and for any n, the n tuple fyiBgni=1 of optimal cuto¤s under B is weakly

sub-majorized by the n tuple vectorfyiCgni=1 of optimal cuto¤s under C.

2 In particular, for any t,and for anyΠt 6=,the expected welfare from t on in the e¢ cient dynamic allocation under B is lower than

(25)

Inter-Arrival Distributions II

De…nition

A non-negative random variableW is called NBU (NWU) if, for every y >0,W is stochastically larger (smaller) than the conditional random variable (W y/W y).

Example

Assume that there is one object, and consider a situation with an NBU (NWU) distribution of abilities with mean µ, and another NBU (NWU) distribution of inter-arrival times with mean ω.Then, the expected welfare under the e¢ cient dynamic policy is lower (higher) than

µLambertW( 1 ωα)

where the increasing functionLambertW(x) is implicitly de…ned by:

LambertW(x)eLambertW(x) =x

(26)

Mechanisms and Incomplete Info. I

Assume types are private information. If an item with quality qi 0 is assigned to an agent with type xj for pricepi, this agent has utility qixj pi.

W.l.o.g. restrict attention to deterministic, Markovian and direct mechanisms where every agent, upon arrival, reports his characteristic xi and where, at any point in time t, the mechanism speci…es:

1 a non-random allocation rule (which object is allocated, if any) that only depends ont,on the declared type of the arriving agent, and on the inventory of items available at t.

2 a payment to be made by the arriving agent which depends on t,on the declared type of the agent, and on the inventory of items available att.

(27)

Mechanisms II

Theorem

1) An non-randomized, Markovian policyfQt(x,Πt)gt is implementable i¤ at each t it partitions the set of agents’types into kt +1disjoint intervals such that all types in a given interval obtain the same quality, and such that higher types obtain a higher quality.

2) The associated payment scheme is given by Pt(x,Πt) =

kt

i=j

(q(i:Πt) q(i+1:Πt))yi,Πt (t) if x 2[yj,Πt (t),yj 1,Πt(t)) and by zero otherwise.

(28)

Mechanisms III

Corollary

The WM policy is implementable. Payments are the dynamic analogue of the Vickrey-Clarke-Groves mechanism (see also Bergemann and Valimäki, EC 10)

Payments have intuitive interpretation: look at static case with kt objects andkt +1 agents where, in addition to agent with typex, there are kt "dummies" with typesy1,Πt (t), y2,Πt(t), ...,ykt,Πt (t). Payment for object with the j-highest quality,

ki=tj(q(i:Πt) q(i+1:Πt))yi,Πt (t)represents the externality imposed by agent with type x on dummies

(29)

RM with Heterogenous Objects

Theorem

Assume that the virtual value x 1fF(x()x) is increasing. Then the RM allocation is given by n cut-o¤ functions that do not depend on the available qualities. These functions satisfy:

yi(t) 1 F (yi(t)) f (yi(t)) +λ

Z T

t

[1 F (yi 1(s))]2

f (yi 1(s)) ds =λ Z T

t

[1 F(yi(s))]2 f (yi(s)) ds or, equivalently

yi(t) 1 F (yi(t))

f (yi(t)) +R(1i 1,t) =R(1i,t)

where R(1j,t)is the expected revenue at time t from the optimal policy if j identical objects with q =1 are still available at that time.

(30)

Proof (Sketch) I

One object with qualityq is available at timet ;agent arriving att gets object ifxi y1(t),and no object otherwise. Expected revenue given by

q Z T

t

y1(s)h1(s)ds

whereh1(s)is the density of waiting time till the …rst arrival of an agent with a value abovey1(s).

By theColoring Theorem,this density equals the density of the …rst arrival in a non-homogenous Poisson process with rate

λ(1 F(y1(s)),given by

h1(s) =λ(1 F(y1(s))e λRts[1 F(y(z))]dz for t s T Perform calculus of variation exercise with respect to y1(t) Use backward induction.

(31)

Clearance Sales

Percentage Markdown: Di¤erence between prices of the same product att =0 and t =T, divided by the price att =0.

Pashigian and Bowen (QJE 91) empirically …nd that :

"More expensive apparel items within each product line are frequently sold at a higher average percentage markdown"

Theorem

Assume an RM seller and consider the scenario where at time t =0there are n1>0 items of quality q and n2 >0items of quality s <q,while at time t =T there are l1 >0 items of quality q , and l2 >0items of quality s left unsold. Then the percentage markdown is always higher for the higher quality.

(32)

Optimal Inventory Design

Theorem

Let y =fyi(t)gni=1 denote the allocation underlying the RM policy with n objects, and assume that the cost of producing qualities (q1,q2, ..qn)is given by C(q1,q2, ..qn) =ni=1φ(qi)whereφ:R !R is strictly increasing, convex and satis…es φ(0) =0. Then

1) The optimal number of objects n is characterized by φ0(0)2(yn +1(0) 1 F(yn +1(0))

f (yn +1(0)) , yn (0) 1 F (yn (0)) f (yn (0)) ] 2) The optimal qualities qi are given by:

φ0(qi) =yi(0) 1 F (yi(0))

f (yi(0)) , i =1, ...,n

(33)

Multi-Unit Demand: Dynamic Knapsack

An RM seller has capacity C 2R+ that perishes afterT periods.

In each period, impatient agent arrives with quantity request w, and per-unit value v.Type (w,v)is private information to the arriving agent.

Type (w,v)’s utility is given bywv p if at pricep he is allocated a capacityw0 w and by p if he is assigned an insu¢ cient capacity w0 <w.

Demands are I.I.D across periods, governed by c.d.f. F(w,v)with density f(w,v)>0 on [0,∞)2. For allw, the conditional virtual valuev 1fF(v(vjw)

jw) is an unbounded, strictly monotone function ofv.

Complete information optimization model due to Kleywegt &

Papastavrou,OR 01; they do not consider payments.

(34)

Dynamic Knapsack: Implementable Policies

A deterministic, Markovian allocation rule for timet with remaining capacityc has the form αct :[0,+)2 ! f1,0gwhere 1 (0)means that the reported capacity demand w is satis…ed (not satis…ed).

Theorem

A policy fαctgt,c is implementable i¤ for every t and every c it satis…es:

1) 8(w,v),v0 v, αct(w,v) =1 )αct(w,v0) =1.

2) The function wptc(w)is non-decreasing in w,where pct(w) =inffv / αct(w,v) =1g.

The maximal, individually rational payment function that implements fαctgt,c is given by

qct(w,v) =

(wpct(w) if αct(w,v) =1 0 if αct(w,v) =0

(35)

IIlustration: Implementable Policies

Example

Assume T =1. Weightw is realized according to an exponential distribution with parameter λ. Per-unit value is sampled from the following distribution

F(vjw) = 1 e

λv if w >w

1 e λv if w w

where λ>λand w 2(0,c). For observable weight requests, seller charges λ1 (1

λ)per unit if the w ( )w .This implies that wptc(w) =

( w

λ if w >w

w

λ if w w .

and therefore, wptc(w) is not monotone.

(36)

Dynamic Knapsack: RM I

Theorem Assume that:

1 For any w,the hazard rate 1f(Fv(jvw)

jw) is non-decreasing in v.

2 For any w0 w , and for any v, 1f(Fv(jvw)

jw)

f(vjw0) 1 F(vjw0).

For each c,t,w let pct(w)denote the recursive solution to the system w pct(w) 1 F(pct(w)jw)

f(ptc(w)jw) =R (c,T t) R (c w,T t). where R denotes the optimal revenue with R (c,0) = 0 for all c.

Then the underlying allocation whereαct(w,v) =1i¤ if v ptc(w)is implementable. In particular, the above system determines the RM policy.

(37)

Dynamic Knapsack: RM II

Optimal revenue may not be concave in capacity (see Kleywegt &

Papastavrou,OR 01 for non-concavity in WM) - this is connected to implementation problems.

Under a concavity conditions on the distribution of types, revenue is concave and above system also determines RM policy.

RM policy requires price adjustments for everyc,t,w - complicated dynamics. We construct a static nonlinear price schedule (it uses correlations betweenw andv !) that is asymptotically optimal if min(C,T)is large enough.

(38)

Learning about Values: Illustration 1

one object

two agents arrive sequentially, one per period each agent can only be served upon arrival after an item is assigned, it cannot be reallocated valuations xi are private

valuations distributed independently and uniformly on [0,2]

(39)

Illustration 1: continued

The dynamically e¢ cient allocation is given by

…rst agent gets the object if x1 1 second agent gets the object if x1<1 Implemntation with payment scheme:

for the …rst player: P1(x1) = 1 if x1 2[1,2] 0 if x1 2/[1,2] for the second player: P2(x2) =0

(40)

Learning about Values: Illustration 2

one object

two agents arrive sequentially, one per period each agent can only be served upon arrival

the designer does not know the distribution of values

with probability 0.5 the distribution of both agents’types is uniform on[0,1]

with probability 0.5 the distribution is uniform[1,2]

(41)

Value of keeping vs. value of allocating object

0 1 2

0 1 2

x

(42)

Illustration 2: continued

The dynamically e¢ cient allocation is

…rst agent gets the object if x1 2[0.5,1][[1.5,2] second agent gets the object if x12/[0.5,1][[1.5,2]

The payment scheme for the …rst agent (P,p)should satisfy:

x1+P p for any x12 [0.5,1] x1+P p for any x12 [1,1.5] But this is IMPOSSIBLE!

(43)

Complete Information & Learning

Theorem (Albright, 1977)

Consider the arrival of an agent with type x in period k 1,and denote by χk the vector of past reported signals. There exist k+1 functions

0= a0,k(χk,x)) a1,k(χk,x) a2,k(χk,x) ... ak,kχk,xk) =∞ such that the dynamically e¢ cient policy assigns the item with the i ’th smallest type if x 2 (ai 1,k(χk,x),ai,kχk,x)].

Each ai,k+1(χk+1,xk+1)equals the expected value of the agent’s type to which the item with i th smallest type is assigned in a problem with k periods before the period k signal is observed. These functions do not depend on the q’s.

(44)

Incomplete Information & Learning-I

Question: When do cuto¤s exist which

1 are independent of the signal of the current agent and

2 replicate the e¢ cient allocation ? The key element is the set

Ai,k(χk) =fx :ai 1,k(χk,x)) x<ai,k(χk,x))g.

(45)

B 4 - Incomplete Information & Learning-II

Theorem (Necessary and su¢ cient condition)

The e¢ cient allocation is implementable if and only if for any period k, any object qi and any history Hk the set Ai,k(χk)is convex.

Theorem (Su¢ cient condition)

Assume that for any k, χk, i 2 f0, ..,kg, the cuto¤ ai,k(χk,xk)is a Lipschitz function of xk with constant 1. Then, the e¢ cient dynamic policy is implementable.

Insight taken from the theory of mechanism design with interdependent values (Jehiel & Moldovanu,EC 01)

(46)

Su¢ cient Conditions I

Theorem

Assume that for any k,and for any pair of ordered lists of reportsχk χ0k that di¤er only in one coordinate, the following conditions hold:

1 Fek(xjχk)%FOSD Fek(xjχk0)

2 E(xjχk) E(xjχ0k) k1where∆ is size of the di¤erence between χk andχ0k

Then, the e¢ cient dynamic policy is implementable.

(47)

Su¢ cient Conditions II

Theorem

Assume that the conditional distribution function Fek(xjxn, ,xk+1)and density efk(xjxn, ...,xk+1) are continuously di¤erentiable with respect to xk+i for all x and for all n k i 1. If

0

∂xk+iFek(xjχk) 1

n kfek(xjχk) for all x and all n k i 1 then the e¢ cient dynamic policy is implementable.

(48)

Illustration

Example

Considerex N(µ,1)with unknown meanµ, and prior beliefs e

µ N(µ0,1

τ)with τ>0.

After observing xn, ..xk+1, posterior isµe N(µ,τ+1n k)with µ= τµτ+(0+n kx)i , and

Fek(xjxn, ...,xk+1) = N(µ,1+ 1 τ+n k); Fek(x+ z

τ+ (n k)jxn, ..,xi+z, ..,xk+1) = Fek(xjxn, ..,xi, ..,xk+1) SD holds, and

Fek(xjxn, ..,xk+1)

∂xk+i

= 1

τ+n kfek(xjxn, ..,xk+1) 1 fe(xjx , ..,x )

(49)

Remarks

Remark

1 The right-hand side condition says that the function e

Fk x+ n kz jxk+1, .,xk+i +z,xk+i+1, .,xn is non-decreasing in z. In other words, after having made n k observations, a small shift to the right - which moves the value of the distribution upwards - is enough to compensate the downward e¤ect on the distribution’s value caused by an (n k)times larger upward shift in one of the past observations (recall that, by stochastic dominance, higher

observations move the entire distribution downwards).

2 The condition guarantees that8i,k,n,∂ai,k∂x(χkk,xk)

1

n k although

∂ai,k(χk,xk)

∂xk 1 seems su¢ cient for the implementation of the e¢ cient allocation. Nevertheless, the long-term e¤ect of each non-terminal observation makes it impossible to obtain tighter conditions that apply generally.

(50)

Search for the Lowest Price (Rothschild1973):

Consumer obtains a sequence of prices, and must decide when to stop the (costly) search for a lower price.

Beliefs about distribution of prices updated (in a Bayesian way) after each observation.

Without learning, optimal stopping rule is characterized by a reservation price R : stops search at any price less than equal to R, and continue search at any price higher than R.

If all customers follow such a rule=) well-behaved demand function where sales are a non-increasing function of price

When does the optimal stopping rule with learning have the

reservation price property ? I.e.,when does a priceR(s)exist for each state s such that prices above are rejected, and prices below are accepted ?

(51)

Answers to Rothschild’s Question

Stochastic Dominance u:

Rosen…eld and Shapiro (1981): For all x,k,χk and alln k i 1 Z

x

∂xk+i

Fek(yjχk)dy 1

n k(1 Fek(xjχk) Seierstad (1992) : For all x,k andχk

n k

i=1

∂xk+i

e

Fk(xjχk) fek(xjχk)

Us (multiple objects !): For all x ,χk ,and alln k i 1

∂xk+i

Fek(xjχk) 1

n kfek(xjχk)

(52)

Non-Bayesian Learning I

Before stage n the designer’s prior belief about the distribution of the …rst type xn is given by G. Conditional on observingxn,xn 1...,xk+1 at stages n,n 1, ...k+1,the belief about the distribution of the next typex =xk is given by:

Fek(xjxn, ...,xk+1) = (1 βnk)G(x) +βnk 1

n k

n i=k+1

1[xi,)(x), k =1,2, ...n 1 where 0 <βnk <1 and where1[z,)(x) denotes the indicator function of the set [z,∞).

Theorem

Under "empirical" learning , the e¢ cient dynamic policy can always be implemented under incomplete information.

(53)

Non-Bayesian Learning II

Prior to any observation, the designer estimates the unknown distribution to be uniform. Suppose that mobservations were made, and order them in increasing order n

x(1), ...,x(m)o

, and let x(0)=0 and x(m+1)=1. The belief about the distribution of the next type x is given by:

fk(xjxn, ...,xn m+1) =

m+1 i

=1

1[x(i 1),x(i))(x) (m+1) x(i) x(i 1)

.

Theorem

Under "maximum entropy" learning, the e¢ cient dynamic policy can always be implemented under incomplete information.

(54)

Characterization of Second-Best Mechanism

Theorem

1 At each period k, expected welfare (calculated before the arrival of the period k agent) is a Schur-convex, linear function of the available qualities at that period.

2 The incentive compatible, optimal mechanism (second best) is

deterministic. That is, for every history at period k, Hk, and for every type xk of the agent that arrives at that period, there exists a quality q that is allocated to that agent with probability1 .

3 At each period, the optimal mechanism partitions the type set of the arriving agent into a collection of disjoint intervals such that all types in a given interval obtain the same quality with probability 1, and such that higher types obtain a higher quality.

(55)

Outline of Proof

Schur-convexity of expected welfare in a deterministic, incentive compatible mechanisms follows by induction.

) At each periodk it is optimal to leave for the future the "most disperse" set (in the sense of majorization) of feasible qualities that is consistent with incentive compatibility.

) Periodk’s optimal allocation must be the "most concentrated"

one that is consistent with monotonicity.

) Periodk’s allocation should be either deterministic, or should randomize among at most two neighboring qualities.

Finally, randomization among two neighboring qualities cannot be optimal. This is related to an argument due to Riley & Zeckhauser who studied revenue maximization for 1 seller-1 buyer problem without monotone virtual values.

(56)

Patient Agents & Complete Info. (Bertsekas 05)

Seller has 1 object.

Time is discrete, deadlineT.

One new buyer arrives at each period t (arrivals observed by seller).

Buyers have IID values for the object represented by random variable e

v

Buyers are long lived; common discount factor δ.

(57)

Patient Agents: Main Result

Theorem

The dynamically e¢ cient policy awards the good in period t if and only if the set of present buyers contains a buyer whose value exceeds a cuto¤ xt. If the good is allocated in period t , it is awarded to the agent with the highest value. The cuto¤s xt satisfy:

1 xT =0

2 xt =δE[max(ev,xt)],for t <T.In particular the optimal cuto¤ is constant in periods t <T.

Intuition: At the optimum in period t <T, the seller should be indi¤erent between awarding the good to a buyer with value xt, and waiting one more period and getting another draw. Note thatxt, t >T,also equals the optimal cuto¤ in the in…nite horizon problem with impatient agents !

(58)

Patient Agents: Incomplete Information (Board &

Skrzypacz 10)

Assume now that values are private information.

Optimal allocation can be implemented through a series of second-price auctions.

Although optimal cuto¤ is constant for all t <T,reserve prices that implement these cuto¤s decrease over time !

The reserve price makes the cuto¤-type indi¤erent between buying today and tomorrow; Prices thus decline both because discounting and because the risk of losing to a new bidder that comes tomorrow.

Agents with value below cuto¤ but above reserve price refrain from buying: they wait for the "…re-sale" at the last period.

(59)

Strategic and Unobservable Arrivals

Designer endowed with indivisible object;

Stream of randomly arriving, long-lived (or patient) agents; arrivals described by a counting processfN(t),t 0gin continuous time Agent’s private information is two-dimensional: arrival time t 0 and value v 0 for the object. Arrival time and value are independent of each other.

Values are I.I.D. random variables with common c.d.f. F,continuos density f.

If an agent arrives att, gets the object at τ t and pays p at τ0 2[t,τ], then her utility is given bye δτv e δτ0p where δ 2(0,1)is the discount factor.

Designer’s utility is given bye δτv (WM) ore δτ0p (RM).

(60)

Renewals under Complete Information

Theorem (Zuckerman)

A non-negative random variable W is called NBU (new better than old) if, for every y >0,W is stochastically larger than the conditional random variable (W y/W y).

Assume that the arrival process is a renewal, and that the inter-arrival distribution G satis…es the NBU property. Let φdenote the Laplace Transform of G . Then, the optimal policy allocates the object to the …rst arrival whose value is above v where v is the unique solution to

v = φ(δ) 1 φ(δ)

Z

v (v v )dF(v)dv.

In particular, recall is never used by the optimal policy, and all allocations occur upon arrival.

(61)

Application to RM (Gallien 06)

Private information is characterized by (vi,ti,di) wherevi is the value,ti is the arrival time, and di ti is the exit time. The designer’s goal is to maximize revenue.

Assume that arrivals are governed by a renewal process with a NBU distribution, and that the virtual valuation function x 1f(Fx()x) is strictly increasing, and denote byH the implied distribution of virtual values. Then, for any di, di ti,i =1,2, ..,the revenue maximizing policy is to charge a constant price P whereP is the unique solution to

p= φ(δ) 1 φ(δ)

Z

p

(z p)dH(z)dz

The solution coincides with the one found by Gallien for the case di = (perfect recall) and by GM for the case di =ti (no recall).

(62)

Learning from Arrivals: Example

Distribution of values isU[0,1].

Inter-arrival time is known: 1) U[1,2] or 2) U[2,3]. The optimal cuto¤s for a WM designer are

x[i,j](δ) = 1 φ[i,j](δ) 1

q

1 (φ[i,j](δ))2 where

φ[i,j](δ) = e

iδ e jδ

δ .

is the respective Laplace transform.

Fix any δ and note thatx[1,2] =x[1,2](δ)>x[2,3](δ) =x[2,3]. No recall used

(63)

Unknown Arrival Process / Complete Information

Designer does not know the distribution of inter-arrival times:

believes that it is eitherU[1,2]or U[2,3], with equal probabilities.

Under complete information optimal policy is:

1 For T 2(1,2]the cuto¤ is x[1,2]

2 For T 2(2,3]the cuto¤ is x[2,3] if there were no arrivals before time 2, otherwise the cuto¤ isx[1,2]

3 For T >3, the cuto¤ isx[1,2] if the …rst arrival happened during time interval (1,2], whereas the cuto¤ is x[2,3] if the …rst arrival happened during (2,3].

(64)
(65)

Unknown Arrival Process / Incomplete Information

The allocative externality payment, which needs to be paid for the object by an agent arriving att t1 is

P(t1) = x[1,2] ift1 2[1,2] x[2,3] ift1 2(2,3] .

Consider type (t,v)with t 2(1,2)andv 2(x[2,3],x[1,2]). Truthful reporting yields zero utility since object not allocated to him. But, a report (t0,v)wheret0 =t+12(2,3)yields utility

e δt0 v x[2,3] >0 .

Truthful reporting under standard Clarke-Groves-Vickrey mechanism is not optimal, and WM dynamic allocation cannot be implemented ! Problem: Informational externality on designer induced by early arrivals.

(66)

Subsidizing Early Arrivals I

Subsidy to be paid to agent that arrives at t, independently of whether he gets the object:

S(t) = x[1,2] x[2,3] >0 if t 2[1,2]

0 if t 2 .

General intuition: LetU(t,t0,v)be the utility of an agent with type (t,v)who arrives at t0 and optimally reportsv0 (given(t0,t,v)) in the version of the model where arrivals are observable and where the designer uses the dynamically e¢ cient mechanism.

Choose subsidy scheme S(t0)such that the function

U(t,t0,v) +e δt0S(t0)decreases in reported arrival time t0 for anyt andv.

(67)

Subsidizing Early Arrivals II

Theorem

Assume that there exists M 0such that for any t t0 t00 in [0,T] and for any v it holds that

U t,t00,v U(t,t0,v) M(t00 t0)

Then the subsidy S(t0) =eδt0M(T t0)together expected allocative externality payment implements the e¢ cient policy for any history which stops before T.

Let L(t0) =supv,t t0∂U(t,t0,v)

∂t0 and assume that there exists c(t0) L(t0)such thatC =R

0 c(z)dz< ∞.The subsidy function S(t0) =et0δ C

Z t0

0

L(z)dz works for T =∞.

(68)

Winner-Pay Mechanisms I

De…nition

A mechanism is calledwinner-pay mechanism if the transfers to all agents that do not get the object are zero.

Theorem

Assume that arrivals are unobservable, and that the arrival process is a known renewal with inter-arrival distribution G . Then a subsidy is not needed for e¢ cient implementation.

Proof.

(Sketch): Show that U(t,t0,v)decreases in t0 for anyv and for any t t0.At allocation, elapsed time from last arrival must be the same, independently of reported arrival time. Price charged depends on elapsed time since last arrival, and on second highest value reported so far. Later arrival postpones allocation ! more arrivals!increases second highest

(69)

Winner-Pay Mechanisms II

Theorem

Assume that arrivals are governed by a non-homogenous Poisson/pure-birth process with arrival rate λi(t) whereλ is

non-decreasing in both elapsed time t and the number of arrivals i up to t.

Suppose that there exists a non-decreasing function β(t) such thatλi(t) β(t)for all i,t,and such that R

0 β(t)e δtdt < .Then:

1 An optimal stopping rule exists for the setting with observable arrivals.

2 Charging for the object a payment equal to the optimal cuto¤ under observable arrivals, implements the e¢ cient allocation also if arrivals are non-observable. In other words, a subsidy is not needed for e¢ cient implementation.

(70)

Calculated Example

Example

Assume that the distribution of values is F(v) =1 e v , and consider a non-homogenous Poisson process with rate λ(t) =δ(t+2)ln(t+2) 1.

λis positive and increasing in t,and Z

0 [δ(t+2)ln(t+2) 1]e δtdt <∞

Optimal cuto¤y(t) in the optimal stopping problem where arriving agents are short lived is given by:

y0 δy = [δ(t+2)ln(t+2) 1]e y

The solutiony(t) =ln(t+2) increases int and satis…es all optimality conditions for δ 2 ln 21 =0.72.Charging P(t) =ln(t+2)implements the e¢ cient dynamic allocation also in the problem with recall and

1

Abbildung

Illustration 1: continued
Illustration 2: continued

Referenzen

ÄHNLICHE DOKUMENTE

Crystal structures of XenA have been determined for the oxidized enzyme alone and in complex with two different substrates bound to the active site (4,6).. A structure of

he CEPS Carbon Market Forum (CMF) feels that the clean development mechanism (CDM) has reached a very important moment and that the reflection undertaken through the High- Level

Because the morbidity rates and survival probabilities are relatively constant in different regions, it is possible to use the estimation of the morbidity rates and/or

(The higher values may be an artifact of the threshold and adjustment values used in the base case. I have assumed that target decreases are only half as fast as

To study the energy transduction theoretically, a combination of electrostatic calculations (MCCE) and molecular dynamic simulations (MD) is used to evaluate the impact of a

Polterovich, Victor and Levin, Mark and Lakhman, Iosif.

Heavy viable trajectories are the ones associated to prices in the feedback map which evolve as slowly as possible : at each instant, the norm of the velocity of the price

Therefore, the debt service of Hungarian municipalities can be financed from decreasing reserves (total deficit), from financing income which translates into increasing