Algorithms and Uncertainty Summer Term 2020

(1)

Thomas Kesselheim July 1, 2020

Alexander Braun Due: July 8, 2020 at noon

Algorithms and Uncertainty Summer Term 2020

Exercise Set 9

Exercise 1: (2+2 Points)

Let each l^(t)_i ∈ {0,1}. We consider the following Greedy algorithm. In each step t, the algorithm selects It which satisfies It = arg mini∈[n]L^(t−1)_i , i.e. the expert with the best cumulative cost so far (ties are broken adversarially).

(a) Show that L^(T_Alg⁾ ≤n·min_iL^(T_i ⁾+ (n−1)

(b) Is the result of (a) surprising? Argue by the use of an appropriate lower bound.

Exercise 2: (5 Points)

State a no-regret algorithm for the case that `^(t)_i ∈[−ρ, ρ] for all i and t. Also give a bound for the regret. You should reuse algorithms and results from the lectures.

Hint: Chapter 4 of lecture 16 might be helpful.

We consider a different form of feedback. After stept, the algorithm does not get to know`^(t)_i for allibut a noisy version. More precisely, an adversary first fixes the sequence`⁽¹⁾, . . . , `^(T⁾, where all costs are in [0,1]. Afterwards, from this sequence ¯`⁽¹⁾, . . . ,`¯^(T⁾ is computed, where

`¯^(t)_i =`^(t)_i +ν_i^(t) and ν_i^(t) is an independent random variable on [−, ] withE[ν_i^(t)] = 0.

State a no-regret algorithms and a bound for the regret. Use the previous exercise and the ideas presented in lecture 17.

In the lecture, we used that E h

miniPT t=1`^(t)_i

i

≤miniE hPT

t=1`^(t)_i i

orE h

maxiPT t=1r^(t)_i

i

≥ max_iEh

PT t=1r^(t)_i i

respectively. Give a proof of this inequality.