Ergodic Theory - A Summary

(1)

Ergodic Theory - A Summary

Ergodic Theory is the study of dynamical systems by means of invariant measures.

Denition: A measure is T -invariant ifµ(T⁻¹(A)) =µ(A)for every set A in the algebra ofµ-measurable sets.

Equivalently: for every measurable function f :X →R, Z

Xf dµ= Z

X f ◦T dµ.

A measure is callednon-atomicifµ({x}) =0 for every x ∈X . A measure is calledabsolutely continuous (w.r.t. Lebesgue m)if m(A) =0 impliesµ(A) =0.

(2)

Existence of Invariant Measures

The existence of invariant measures is (usually) guaranteed by the:

Theorem of Krylov-Bogol'ubov: If T :X →X is a continuous map on a nonempty compact metric space X , then the set of invariant probability measuresM(X,T)6=∅.

Examples:

I If T^p(x) =x, then the equidistribution _p¹P_p−1

j=0 δ_Tj(x) is an invariant measure.

I If P_Tf(x) =P

Ty=x f(y)

|DT(y)| is the transfer operator w.r.t.

Lebesgue measure, then

dµ=f dx is invariant if P_Tf =f.

(3)

Ergodicity

Denition: A measure µfor a dynamical system(X,T) isergodicif µ(A) =0 orµ(A^c) =0

for every measurable set A⊂X such that T⁻¹(A) =Amodµ. This says that (modµ), the space doesn't decompose into parts that don't communicate with each other.

Equivalent expression for ergodicity are:

I The only T -invariant functions ψ∈L¹(µ) i.e., ψ◦T =ψ µ-a.e., are constant µ-a.e.

I ¹_nP_n

j=0µ(A∩T^jB)−µ(A)µ(B)→0for all measurable sets A,B ⊂X .

(4)

Ergodicity

Examples:

I Thedoubling map T :S¹→S¹, x 7→2xmod1 preserves Lebesgue measure m, and it is ergodic. However, ¹₂m+¹₂δ₀ is invariant but not ergodic.

I TheGauÿ map G : [0,1)→(0,1], x 7→ _x¹ − b¹_xcpreserves the measure dµ= _log¹₂₁^dx₊_x, and it is ergodic (Folklore Theorem).

I Circle rotations

Rα:S¹ →S¹, Rα(x) =x+αmod1. preserve Lebesgue measure.

I Ifα∈Q, then every orbit is periodic. Lebesgue measure is not ergodic.

I Ifα /∈Q, then every orbit is dense inS¹. Lebesgue measure is ergodic; in fact it is the only Rα-invariant probability measure.

(5)

Unique Ergodicity

Denition: A system (X,T)is called uniquely ergodicif there is exactly one T -invariantprobability measure.

This measure is automatically ergodic.

Oxtoby's Theorem: Let X be a compact space and T :X →X continuous. A transformation(X,T) is uniquely ergodic if and only if, for every continuous functionψ andevery pointx ∈X , the Birkho averages

1 n

n−1

X

i=0

ψ◦Tⁱ(x) convergeuniformly to a constant.

(6)

Birkho's Ergodic Theorem

Birkho's Ergodic Theorem formalizes a frequent observation in physics:

Space Average = Time Average (for typical points).

This is expressed in:

Birkho's Ergodic Theorem: Letµbe a probability measure and ψ∈L¹(µ). Then theergodic average

ψ^∗(x) := lim

n→∞

1 n

n−1

X

i=0

ψ◦Tⁱ(x)

existsµ-a.e., andψ^∗ is T -invariant, i.e.,ψ^∗◦T =ψ^∗ µ-a.e.

If in additionµis ergodicthen ψ^∗=

Z

Xψ dµ µ-a.e.

(7)

Absolutely Continuous Measures

Denition: A measure µis called absolutely continuousw.r.t. the measureν (notation: µν) if ν(A) =0 impliesµ(A) =0. If both µν andν µ, thenµandν are calledequivalent.

Theorem of Radon-Nikodym: If µis a probability measure and µν then there is a function h ∈L¹(ν)(calledRadon-Nikodym derivativeordensity) such that µ(A) =R

Ah(x)dν(x) for every measurable set A.

Notation: h(x) = ^d_d^µ(_ν(^x_x⁾₎.

Suppose thatµν are both T -invariant probability measures, with a commonσ-algebraB of measurable sets. If ν is ergodic, thenµ=ν.

(8)

Poincaré Recurrence

The Poincaré Recurrence Theorem: If (X,T, µ) is a measure preserving system withµ(X) =1, then for every measurable set Y ⊂X of positive measure, µ-a.e. y ∈Y returns to Y , i.e., the rst return timeto Y : τ_Y(y)<∞ µ-a.e.

Denition: A system (X,T,B, µ) is calledconservative if for every set A∈ B withµ(A)>0, there is n≥1 such that

µ(Tⁿ(A)∩A)>0. The Poincaré Recurrence Theorem thus implies that probability measure preserving systems are conservative.

If not conservative, then the system is calleddissipative. It is called totally dissipativeif for every set A∈ B,

µ({x ∈A:Tⁿ(x)∈A innitely often}) =0.

(9)

Kac' Lemma

Kac's Lemma quanties the expected value of the rst return time τ_Y to Y ⊂X .

Kac' Lemma: Let(X,T) preserve an ergodic measureµ. Take Y ⊂X measurable such thatµ(Y)>0, and letτ :Y →Nbe the rst return time to Y . Take Y ⊂X measurable such that

µ(Y)>0. Then Eµ(τ_Y) =

Z

Y τ_Ydµ=X

n≥1

nµ(Y_n) =µ(X)

for Yn:={y ∈Y :τ(y) =n}.

(10)

Induced Systems

Proposition: Let (X,B,T, µ) be an ergodic dynamical system and Y ∈ B a set withµ(Y)>0. Let TY =T^τ^Y be the rst return map to Y .

Ifµ is T -invariant, thenν(A) := _µ(¹_Y₎µ(A∩Y) is T_Y-invariant.

Conversely, ifν is TY-invariant, and Λ :=

Z

Y τ(y)dν <∞, then

µ(A) = 1 Λ

∞

X

j=1

ν(T⁻^j(A)∩ {y ∈Y :τ(y)≥j})

is a T -invariant probability measure. Moreoverµis ergodic for T if and only ifν is ergodic for T_Y.

(11)

Isomorphic Systems

Denition: Two measure preserving dynamical systems (X,B,T, µ) and(Y,C,S, ν) are calledisomorphic if there are X⁰ ∈ B, Y⁰ ∈ C andφ:Y⁰ →X⁰ such that

I µ(X⁰) =1,ν(Y⁰) =1;

I φ:Y⁰ →X⁰ is a bi-measurable bijection;

I φis measure preserving: ν(φ⁻¹(B)) =µ(B) for all B ∈ B. I φ◦S =T ◦φ.

That is, the below diagram commutes, andφ:Y →X is one-to-one almost everywhere.

(Y,C, ν) −→^S (Y,C, ν)

φ↓ ↓φ

(X,B, µ) −→^T (X,B, µ)

(12)

The Bernoulli Property

Denition: Let (X,B, µ,T) be a measure preserving dynamical system.

1. If T is invertible, then the system is calledBernoulli if it is isomorphic to a 2-sided Bernoulli shift.

2. If T is non-invertible, then the system is called one-sided Bernoulli if it is isomorphic to a 1-sided Bernoulli shift.

3. If T is non-invertible, then the system is called Bernoulliif its natural extension is isomorphic to a 2-sided Bernoulli shift.

(13)

Mixing

Denition: A probability measure preserving dynamical system (X,B, µ,T) ismixing (orstrong mixing) if

µ(T⁻ⁿ(A)∩B)→µ(A)µ(B) as n→ ∞ for every A,B ∈ B.

This says that the events A and B are asymptotically independent.

Equivalently, a probability preserving dynamical system(X,B,T, µ) is mixing if and only if

Z

X f ◦Tⁿ(x)·g(x) dµ→ Z

X f(x) dµ· Z

Xg(x) dµas n→ ∞ for all f,g ∈L²(µ)

(14)

Measure-Theoretic Entropy

Given a nite partitionP of a probability space (X, µ), let Hµ(P) =−X

P∈P

µ(P) log(µ(P)). (1)

For a T -invariant probability measureµon (X,B,T), and a partitionP, dene theentropy ofµ w.r.t.P as

hµ(T,P) = lim

n→∞

1 nHµ(

n−1

_

k=0

T⁻^kP). (2)

Finally, themeasure theoretic entropy of µis

hµ(T) = sup{hµ(T,P) : P is a nite partition of X}. (3)

(15)

Measure-Theoretic Entropy

Remarks concerning measure-theoretic entropy I The existence of the limit in (2) depends on:

Fekete's Lemma: If(a_n)_n≥1 is subadditive, then

nlim→∞

a_n n = inf

q≥1

a_q q .

I By Sina's Theorem, instead of taking the supremum over all partitions, it suces to take a generating partition.

I Entropy is preserved under isomorphism (and is non-increasing under taking measure-theoretical factors).

I The entropy of the (p₁, . . . ,p_n)-Bernoulli shift (both one-sided and two-sided) is h(µ_p) =−P

ip_ilogp_i.

(16)

The Shannon-Breiman-McMillan Theorem

The Shannon-Breiman-McMillan Theorem uses entropy to measure how large sets in the n-th jointP_n are. Typically, they decrease exponentially and the exponential rate is exactly the

measure-theoretical entropy.

Shannon-McMillan-Breiman Theorem: Let (X,B, µ,T) be a measure-preserving transformation andP a (countable or nite) partition with H(P)<∞Let P_n=W_n−1

k=0T⁻^k(P) andP_n(x) the element ofP_n containing x. Then

− lim

n→∞

1

n logµ(P_n(x)) =h(P,T) µ-a.e.

(17)

Topological Entropy

Denition: Topological entropy can be dened as the xponential growth rate of the

I (Adler-Konheim-McAndrew) minimal cardinality of subcovers of joints:

h_top(T) = lim

ε→0sup

U nlim→∞

1

nlogN(Uⁿ), (4) where the supremum is taken over all openε-covers U. I (Bowen) maximal cardinalty of n, ε-separated sets:

h_top(T) = lim

ε→0lim sup

n→∞

1

nlogs_n(ε). (5)

I (Bowen) minimal cardinalty of n, ε-spanning sets:

htop(T) = lim

ε→0lim sup

n→∞

1

nlogrn(ε). (6)

(18)

Topological Entropy

For maps T on the interval we have:

Theorem of Szlenk & MisiurewiczLet T : [0,1]→[0,1]has nitely many laps. Then

htop(T) = lim

n→∞

1

nlog`(Tⁿ)

= lim sup

n→∞

1

nlog #{clusters of n-periodic points}

= max{0, lim

n→∞

1

n logVar(Tⁿ)}.

where two n-periodic points are in the same cluster if they belong to the same lap of Tⁿ.

(19)

The Variational Principle

Topological and measure-theoretical entropy are related by the Variational Principlewhich say that (for continuous map on compact metric space)

h_top(T) = sup{hµ(T) :µis T -invariant probability measure} Ifµ is such that hµ(T) =htop(T), thenµis called a measure of maximal entropy.

If there is aunique measure of maximal entropyµ_max, then(X,T) is calledintrinsically ergodic. In this case,µ_max is ergodic.

(20)

The Variational Principle

I The full shift on N symbols (one-sided or two-sided) has entropy logN, and the measure of maximal entropy is then(_N¹, . . . ,_N¹)-Bernoulli measure.

I A subshift of nite type with transition matrix A has the logarithm of the leading eigenvalue as entropy. The Parry measure is the measure of maximal entropy.

I Interval maps with constant slope±s for s >1 have an absolutely continuou smeasure, which is the measure of maximal entropy.

(21)

Mixing

Review of ergodic properties:

I Bernoulli if it is isomorphic to a two-sided Bernoulli shift.

I strong mixing if for all A,B∈ B

µ(T⁻ⁿ(A)∩B)−µ(A)µ(B)→0. I weak mixing if for all A,B ∈ B the average

1 n

n−1

X

i=0

|µ(T⁻ⁱ(A)∩B)−µ(A)µ(B)| →0.

I ergodic if T⁻¹(A) =Amodµ impliesµ(A) =0 or µ(A^c) =0.

I conservative if for all A∈ B with µ(A)>0 there is n≥1 such that µ(Tⁿ(A)∩A)>0.

(22)

Bernoulli - Mixing - Ergodic - Conservative

TheoremWe have the implications:

Bernoulli ⇒mixing ⇒ weak mixing ⇒ ergodic⇒ conservative.

None of the reverse implications holds.

(23)

Classifying Systems up to Isomorphism

Denition: Two dynamical systems (X,T)and(Y,S) are topologically conjugateif there is a homeomorphismφ:Y →X such thatφ◦S =T ◦φ.

We can classify dynamical systems up to conjugacy, and measure preserving systems up to isomorphism.

The one does not imply the other or vice versa:

I The doubling map (with Lebesgue measure) and the one-sided (¹₂,¹₂)-Bernoulli shift are isomorphic. They are not conjugate (note: they are dened onnon-homeomorphic spaces).

I The doubling map with Lebesgue measure is conjugate to the doubling map with δ₀, but they are not isomorphic.

(Conjugacies are topological objects; they don't care about measures).

(24)

Classifying Systems up to Isomorphism

The following properties and quantities are preserved under isomorphisms.

I ergodicity, weak and strong mixing, the Bernoulli property I measure-theoretic entropy. Moreover,

Ornstein's Theorem: Twotwo-sidedBernoulli shifts (X, µ_p, σ) and(X⁰, µ_p⁰, σ) are isomorphic if and only if h(µ_p) =h(µ_p⁰).

This fails for one-sided Bernoulli shifts.

I eigenvalues of the Koopman operator U_Tf =f ◦T . Moreover, Theorem (Halmos & Von Neumann) Two measure-preserving dynamical systems with pure point spectra are isomorphic if and only if their eigenvalues are the same.

This fails without the assumption of pure point spectrum.

(25)

Toral Automorphims

Denition: Atoral automorphism T :T^d →T^d is an invertible linear map on the (d-dimensional) torusT^d. Each such T is of the form TA(x) =Ax (mod 1), where the matrix A satises:

I A is an integer matrix withdet(A) =±1;

I To avoid degenerate examples including A=Id, we assume that A isprimitive, i.e., Aⁿ is strictly positive for some n ≥1.

I If the eigenvalues of A are not on the unit circle, then the toral automorphism is called hyperbolic.

(26)

Toral Automorphims

Properties of the toral automorphisms T_A:

I A preserves the integer lattice Z^d, so T_A is well-dened and continuous.

I det(A) =±1, so Lebesgue measure m is preserved(both by A and T_A). Also A and T_A are invertible, and A⁻¹ is still an integer matrix (so T_A⁻¹ is well-dened and continuous too).

I One can show that Lebesgue measure is ergodic if and only if A has no eigenvalues that are roots of unity.

I Hyperbolic toral automorphisms have a Markov partition w.r.t.

which the symbolic dynamics is asubshift of nite type, and Lebesgue measure is the measure of maximal entropy.

I Hyperbolic toral automorphisms are mixing w.r.t. Lebesgue.