• Keine Ergebnisse gefunden

Playable Differential Games

N/A
N/A
Protected

Academic year: 2022

Aktie "Playable Differential Games"

Copied!
29
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

W O R K I N G P A P E R

PLAYABLE DIFFERENTLAL GAMES

I Marc Quincampoiz

August 1989

W P-89-60

l n t e r n a t ~ o n a l I n s t i t u t e for Appl~ed Systems Analysis

(2)

PLAYABLE DIFFERENTIAL GAMES

Marc Quincampoiz

August 1989 WP-89-60

CEREMADE, Universite de Paris-Dauphine

Working Papers are interim reports on work of the International Institute for Applied Systems Analysis and have received only limited review. Views or opinions expressed herein do not necessarily represent those of the Institute or of its National Member Organizations.

INTERNATIONAL INSTITUTE FOR APPLIED SYSTEMS ANALYSIS A-2361 Laxenburg, Austria

(3)

Playable Differential Games

Marc Quincanlpoix

CEREMADE, UNIVERSITC DE PARIS-DAUPHINE

(4)

FOREWORD

Playability conditions of differential games are studied by using Viability Theory.

First, the results on playability of time independent differential games are extended to time dependent games. In fact, time is introduced in the dynamics of the game, in the state dependent contraints bearing on controls and in state contraints.

Second, some examples of pursuit games are studied. Necessary and sufficient conditions of playability of the game are provided. Here, pursuit games are directly considered as "games of kind" (in Isaacs's sense) and are not considered as "games of degree". The viability condition does not always provide the "optimal strategy" to be as close as possible to a certain goal, but it supplies strategies allowing the system to reach a given goal.

Alexander B. Kurzhanski Chairman System and Decision Science Program

(5)

Contents

1 Introduction 1

2 Time dependent differential games 4

3 Some applications t o pursuit Games: 12

. . .

3.1 The target guardian problem 13

. . .

3.2 Pursuit game with certain capture: 14

. . .

3.3 An affine differential pursuit game: 16

. . .

3.3.1 General case: 16

. . .

3.3.2 An example in a two dimensional space: 17

. . .

3.4 End time of a capture: 18

. . .

3.4.1 A general model: 18

. . .

3.4.2 A very simple example: 20

(6)

Playable Differential Games

M a r c Q u i n c a m p o i x

1 Introduction

We consider a two players differential game whose dynamics are described by:

( b )

{ 9

Yt(t) = g(t, ~ ( t ) , Y ( t ) , v ( t ) )

2 2 ) v ( t ) E V ( t , x(t), Y ( t ) )

The constraints of the game are the time dependent g a m e r u l e s P and Q P : R x Y - X & Q : X x R - Y

The playability t u b e of this game is:

K ( t ) := { ( x , Y )

/

a: E P ( ~ , Y ) and Y E Q ( z ( t ) , t )

The playability p r o p e r t y of the game holds when for all time to and for every initial state ( x o , yo) in K ( t o )

,

there exist solutions of the differ- ential game starting at time to for ( x o , yo) such that :

We shall characterize it by constructing t h e regulation m a p RpBQ in which we could choose playable controls. This map is built thanks t o

(7)

contingent derivatives of the rules1. We can introduce the subset of dis- criminating controls which allow the players to associate t o any control v played by the second player a t least a control u such that (u, v) is playable.

A P , Q ( x , ~ , Y , v ) := { U E U ( t , ~ , ~ ) ; ( u , v ) E R P , Q ( x , ~ , Y ) )

We also introduce the pure control map BpVQ which allows the first player t o choose a control u such that (u, v) is playable for any v E V(t, x , y).

Before going further, it may be use ful t o relate these concepts t o Isaacs- Hamilton- Jacobi equations.

Next, we adress the question of general pursuit games. A pursuer wants t o catch an invader. Of course the meaning of "to catch" will depend on each example, but, generally, it means t o be near enough as we shall see in the construction of the playability domains. At the beginning, we shall write a condition of playability for the famous R. Isaacs problem: The target and its guardian.

We solve the case of certain capture with playability rules of the form:

P ( t , y) = y

+

y ( t ) C and Q(x,t) = a:

-

y ( t ) C We then apply this t o affine differential games:

The regula.tion map of this game will be conducted in an example.

'Let us recall the contingent c o n e at x to a subset K:

TK ( t ) := { v

/

lim inf d ( t

+

hv, K ) / h = 0 )

h-O+

The contingent derivative of the set valued map Q X H Y is the set valued map D Q ( t , y) : X H Y defined by:

or, equivalently, by:

(8)

Another problem is the end time of a capture. So, we give conditions for a time T to be the end time of the pursuit. For this purpose, we consider an increasing nonnegative function w and write the viability tube in the following way:

(9)

2 Time dependent differential games

Let us consider two players Xavier and Yves. Xavier acts on a state space X with a control u

,

and, Yves on a state space Y with a control v . The controls depend on players states and on time through the set valued maps

U

and V respectively. Their actions on their states are governed by the controlled system:

Where X , Y are finite dimensional spaces, and where

f : GraphU X g : GraphV I-+ Y are single valued maps.

The influence between the two players is exerted through the r u l e s o f t h e g a m e :

P : [ o , T ] x Y H X & Q : X X [ O , T ] I - + Y It means that constraints of the game are:

V t E (0, TI x(t) E P ( t , ~ ( t ) ) and ~ ( t ) E Q(x(t),t) So we can define a playability t u b e :

K ( t ) =: {(x, y)

/

(x, t , y) E GraphQ

n

GraphP-' )

For any (xo, to, yo), let's also introduce the solution map S(xo, to, yo) of solutions to (1) starting on (x0,yo) a t to.

Now, we always assume that the playability domain is non empty and that graphs of P and Q are closed.

We need a suitable definition of playability:

Definition 2.1 The game enjoys the playability property if and only if:

v

to E [O, T[

v

(xo, Yo) E h'(t0) 3 (x(-1, ~ ( 9 ) E S(x0, to, Yo) v t E [to,T[ (x(t), Y(t)) E K ( t )

ii) if T

<

w V t >_ T (x(t), y(t)) E K ( T )

(10)

Before writting our first proposition, let us assume that:

2) f and g are continuous with linear growth and are afFine with respect to u and v

ii) The controls maps

U

and V are upper semi continuous with compact convex images and with a linear growth.

We have t o define notions of transversality and sleek sets:

Deflnition 2.2 A s e t K i s sleek a t z if a n d o n l y if t h e s e t valued m a p TK(-) i s l o w e r s e m i c o n t i n u o u s . ( A l l conuez subsets are sleek.) A s e t valued m a p i s sleek i f a n d o n l y if i t s graph i s sleek.

Deflnition 2.3 T h e rules will be said "transversal" i f a n d o n l y if:

v

t E [O, T]

v

(x,Y) E K ( t )

T ~ r a p h q ( x * t , y )

-

T ( ~ r a p ~ - l ) ) ( z , t , y ) = X x R x Y

P r o p o s i t i o n 2.4 U n d e r a s s u m p t i o n s ( 3 ) and if t h e rules

P,Q

are "sleek"

a n d transversal, a necessary and s u f i c i e n t c o n d i t i o n of g a m e playability i s t h e following Haddad's c o n t i n g e n t condition:

ii) if T

<

oo g ( T , x , ~ , v ) E DQ(x,T,y)(f(T,x,y,u),O)

f

(T, x, Y, u) E D P ( x , T, Y)(O,

f

(T, x, Y, 4 )

R e m a r k - The transversality condition is assumed because it is useful t o separate the rules following way:

( 4 ) T ~ ~ ~ ~ h ~ ~ ~ ~ a ~ h ~ p - 1 ) ( ~ t, Y) = T ~ r a p h ~ ( ~ , t, Y) T~raphp-I (z, t , Y) It is an obvious consequence of (2.3).

(11)

A necessary and sufficient condition for the transversality of the rules is that for all perturbations (e, f , g ), there exists ( u , 7, V) E X x R x Y such that:

See corollary 4-3 of [I]

P r o o f -

Let us consider H(x, s, y) =:

I

if { f ( s , x , Y , U ) } s E [O, T [ x { 1

1

x {g(s,x, y,v))

I

(u,v) E U(t, +, Y) x V(t,x, Y)}

The system now becomes:

( ~ ' ( t ) , s1(t), Yt(t)) E H(z, 3, Y)

Applying Haddad's Viability Theorem (see [2]), there exist viable solutions if and only if:

V s V (z, y) E K ( s )

H(x,s, y)

n

TK(x, S, y)

# 0

with K := GraphQ

n

G r a p h P 1

According to the definition of the transversality and the contingent derivative, it is possible to write:

T G ~ ~ ~ ) ( x t> ~ ~Y) ~= GraphDQ(x, t , Y) G ~ ~ ~ ~

n

Graph(DP(Y, ( ~ - I '1 z)-' ) With the expression of H we have proved the previous proposition

.

R e m a r k - In some particular cases, we can compute directly the contingent cone TGraphQ GraphP-], without assuming the transversality condition. In fact, very often it is more simple to write TGraphQnGraph(P-l) for instance when it is impossible t o separate the constraints sets of the two players (see further the example of pursuit game with certain capture).

(12)

We need to choose controls satisfying the previous proposition. For that purpose, let us define the retroaction rules C and D acting on the controls:

Definition 2.5 Xavier's retroaction rule b the set-valued map :

and Yves's retroaction rule is the set-valued map:

These maps allow t o replace the initial differential game by a game on controls parametrized by the state and the rules through the following regulation map. With these retroaction rules, we can define subsets in which it is possible t o choose playable controls, discriminating controls, pure controls, respectively.

Definition 2.6 We associate with the retroaction rules C and D the reg- ulation map R of playable controls defined by:

R P , Q ( ~ , 2, Y ) = { (21, v)

1

u E C(t, X , Y; U ) & v E D(t, X, Y; u)

1

The discriminating set valued map :

A P , Q ( ~ , ~ , Y , v ) := { u E U ( t , x , ~ ) ; ( u , ~ ) E RP,Q(~,x,Y)

1

The set valued map B:

B P , Q ( x , ~ , Y) := ~ V~ ( t . t , y ) A ~ , Q ( t , E 2, Y, U )

The concept of playable rules:(P and Q are playable if:) v t E [O,

T] v

(x, Y) E K ( t ) Rp,Q(t,z, Y)

#

0

(13)

Let's remark that RpVg is the set of fixed-points t o the set valued map C x D .

An obvious consequence of these definition is the easy result:

C o r o l l a r y 2.7 If the domain3 of the retroaction rules are equal to the con- trol set valued map3

U

and V, then the con~traint set and the regulation set are nonempty.

It can, then, be useful to translate the viability conditions of the game to Isaacs-Hamilton-Jacobi contingent equations2. Playability can be ex- pressed by an Isaacs Hamilton Jacobi equation thanks to contingent epi- derivatives.Consequently,let us recall the definition of the contingent. epi- derivative of V at x in the direction v :

Definition 2.8 Let V : X -+ R U { m )

D l V(x)(v) := lim i n f ~ - ~ + , ,,,(V(x

+

hu) - V(x))/h)

or in shorter way:

TEpigraphv(~, V ( ~ ) ) = Epigraph (DV(x))

P r o p o s i t i o n 2.9 The regulation map i~ nonempty if and only if:

ii)

if T < m

Here, the rules are characterized by indicators functions of their graphs

W p

WQ.

=See [2]

(14)

W P ( ~ , t , y ) := 0 i f z E P ( t , y ) 0 i f Y E Q ( z , t )

oo else oo else

Proof - It is only the translation of (2.4), if we notice that:

0 2 D, W Q ( x , t , y)(a, 1 , b) if and only if:

( a , 1 , b ) E T',aphg(x, t

,

9 )

To proceed further, it is convenient to write the differential game in a more compact form. The state ( z , y ) is now z E X x Y and this system includes the playability rules in the set valued maps U and V:

U ( t , z ) :=

0

if ( t , z )

4

GraphP V ( t , z ) :=

0

if ( t , z )

4

GraphQ

This is given by the following equations with the single valued map h ( t , z , u , v ) describing the evolution:

with constraints:

V t E [0, TI z ( t ) E K ( t ) := { z

/

U ( t , ~ ( t ) )

# 0

& V ( t , ~ ( t ) )

# 0

) We assume that:

' i ) h : X : = R x R n x R P x R q ~ R n

is continuous with a linear growth and is f f i n e with respect to u and v.

i i ) K is sleek.

i i i ) U, V are upper semi continuous with compact convex images and with a linear growth.

Under assumptions ( 6 ) , we can write the Haddad's contingent condition for the game playability:

V t E [0, T [ V z E K ( t ) 3 ( u , v ) E U ( t , 2 ) x V ( t , z ) i ) i f t E [0, TI h(t, z , u , v ) E D K ( t , z ) ( l ) ii) i f T < o o h ( T , z , u , v ) E D K ( T , z ) ( O )

(15)

We can, then, translate this viability condition into the following Isaacs- Hamilton-Jacobi contingent equation:

V t E [O,T] V z E K ( t )

i ) infuE v ( t , ~ ) v ( ~ , z ) D , W K ( ~ , z ) ( l , h ( t , z , u , v ) ) = 0

i i ) i f T < m

D W K ( T , z)(0, h ( T , 2 , u , v ) ) = 0 i n f u ~ U ( T , z ) WE V ( T , z ) 1

with W K ( t , z ) = 0 if z E K ( t ) im else

In the same way, we can associate to the control system four Isaacs- Hamilton- Jacobi contingent equations.

i i ) i f T

<

m

inf WE V ( T , z ) inf UE V ( T , z ) Dl @ ( T , z)(O, h ( T , 2 , u , v ) )

5

0

i i ) i f T < m

SUP^^ V(T.2) U ( T , z ) z ) ( O , h(T, z , u , v ) )

I

O

t i ) i f T < m

supv, V ( T , Z ) i n f u ~ U(T.2) Dl @ ( T , z)(O, h ( T , 2 , u , v ) ) 0

I "'I

ii) infu, i f V ( T , z ) T < m SUPvE V ( T , z ) ' ) ( O , h(T, ' 9 u ,

' 1 ) 5

O Theorem 2.10 We assume that :

10

(16)

The function h i~ c o n t i n u o u ~ with linear growth, ~ e t valued map3 U and V are c l o ~ e d with linear growth,and that

@ : R x X -, R U {oo} rJ nonnegative, contingently epidifferentiable ( ~ e e (2.8)) and that it3 domain i~ contained in the i n t e r ~ e c t i o n of domain3 of U and V .

T h e n the equation ( 9 ) rJ equivalent to :

- a ) If

U

and V have convez value3 and h i~ a f i n e with r e ~ p e c t t o the V ( s , Z ) E Dom(@) 3 z(.) solution to ( 5 )

two control^

.

v

t E [O, TI @ ( t ,

4 ) ) 5

@ ( s , z )

- p )

i f h i~ uniformely lipsitzchean

V ( s , z ) E Dam(@), V z(.) solution to ( 5 ) , V t E [O, T ] @ ( t , z ( t ) ) L @ ( s , z )

-7) If V a3 lower ~ e m i c o n t i n u o u ~

U

and V with convez value3 and h a f i n e with respect t o the two controb.

For any closed loop control C(s, z ) E V ( s , z )

V ( s , z ) E Dam(@) 3 z(.) solution to ( 5 ) with 6 ( t , z ( t ) ) such that V t E [0, T ] @ ( t , z ( t ) )

5

@ ( s , z )

-6) V i~ lower ~ e m i c o n t i n u o u ~ with convez value3 and T = oo.

B = { ti E U ( s , z) , i n f u € U ( t , z ) SUPvE V(1.t)

D t @ ( t , z ) ( l , h ( t , 2 , u ,

4)

= SUPvE V(,,,) Dl @ ( t , z ) ( l , h ( t , 2 , ti, v ) ) ) i~ lower ~ e m i c o n t i n u o u ~ with convez valued.

T h e equation ( 9 ) 6 is ~ a t i ~ f i e d if and only if:

There exists ii(s, z ) E U ( s , z ) played by Xavier

such that for any'closed loop strategyC(s, z ) E V ( s , z ) V ( s , Z ) E Dom(@) 3 z(.) solution to ( 5 ) with fixed C and ii such that: V t E [0, T ] @ ( t , z ( t ) )

5

@ ( s , z )

Remark - If @ = maxw,,w, the case a means that Dom(@) = K is a playability tube; the game has the playability property. In the case /3, K is an invariant tube; the game has the winnability property.

The case 7 define Xavier's discriminating property

(V 6 A p , ~ ( t , z , C)

#

0). The last case define Xavier's leading prop- erty ( B P , Q ( ~ , ~ )

#

0)-

'

(17)

P r o o f - For sake of simplicity, we only prove this theorem when

T

= a.

Let be:

First, it's convenient t o notice the following Lemma.

Lemma 2.11 We have D,iP(t,z)(l, h(t, z , u , u))

5

0 if and only if:

P r o o f - It's the obvious consequence of the definition of Hamilton Jacobi contingent equations for the system :

(st, zt, wt) E H ( s , z) x (0) with Epigraph O as a viability tube.

Equivalences cr and /3 are the application of the invariant tube and viability tube theorems

.

The lemme (2.11) shows that implications

( 2 . 1 0 ) a (9) are a simple translation.

Let's prove the third implication. According t o Michael's selection The- orem, for any (s, z , ) E Dom(iP), for any vo E V(s, z)

there exists cont,inuous 6 in the set valued map V such that 6(s, z) = uo.

Hence,inf,(D,O(t, z)(l, h(t, z, u, 6))

5

0) means that we can applie a similar lemme (that (2.11)) to H;. Consequently, Viability tubes Theorem proves the implication.

Finally, let's prove the last result:

According t o Michael's selection Theorem ( B is lower semi continuous with closed convex values), there exists continuous ii in the set valued map B and for He thanks to the lemme we can conclude.

3 Some applications to pursuit Games:

Let us study some cases t o which we can apply last results:

(18)

3.1 The target guardian problem.

[see Isaacs 1.9 p.181

We consider a game between a guardian (Xavier) and an invader (Yves).

The guardian's task is to guarantee that no one can go near some target (a set C ) and the invader has the opposite goal. The guardian's coordinates are z and, his opponent's coordinates are y. The evolution of the state (2, y) is given by equations (1).

If

the distance between Yves's state and C is lower than I(.) the invader wins; if the distance between Xavier's state and Yves's state is lower than w(.) the Guardian wins. These cases determinate the end of the game.

We can write this, using a viability tube, in the following way:

K(t) := { (2, Y)

/

4x7 Y) 2 w(t) and d(C, Y)

L

l(t)

1

We immediately give a viability condition for this system:

Proposition 3.1 If w and 1 are two nonnegative single valued C' difler- entiable maps

,

if the set C is reduced to a point {p), then the game is playable if and only if:

t / ( x , y ) E K(t) 3 (u,v) E U(t,x,y) x V(t,z,y) such that i) if d(x,y)=w(t)

<

5

-

Y, (f (z, Y,

4

- 9(z, Y, v )

>

-wt(t).w(t)

L

0 ii) if Ily - pll = l(t)

<

( Y - p),g(z, Y, v )

>

-wt(t).w(t) 2 l(t)

Before prooving this proposition let's write the following proposition for tangent cones calculus, it is an obvious consequence of (corollary 4-1 in [I]).

P r o p o s i t i o n 3.2 Let be X , Y two finite dimensional Banach spaces, A X H Y a map C1 -diflerentiable around z.

If

VA(x)(X) = Y, and if M i~ sleek, then

(VA(z))-' .TM (A(z)) = T A - ~ ( M ) ( ~ )

P r o o f - We have to compute the contingent-derivative of the set valued map K when the set C is a point p

.

(19)

Let be : A(x, y, t) := (I, IIx

- ~ J J ' -

w(t)', (ly

-

P ~ ~ 2

-

l(t)') The map A is obviously C1 (because w and 1 are C1 too) and Graph K = A-'(R+ x R+ x R+).

As w(t) and l(t) are nonnegative, VA(x, y,t) is surjective and we can apply (3.2):

Consequently:

T G r a p k ( t , Y) = (VA(t, I , Y)-~.TRxR+ X R + A(t, X , Y))

R e m a r k - In the case i , for instance, the condition means that if Xavier is near the prey, the game will be playable if and only if the relative velocity v j - v j has with the vector y2 an angle less than or equal to 90".

Here, it's easier to compute directly the cone, without separating rules.

Now, with these formulas, it will be possible to choose open loop and closed loop controls, in practical cases.

3.2 Pursuit game with certain capture:

Let us consider a pursuit between two players, Xavier the pursuer and Yves the quarry. We know that the evader can escape from Xavier if he is far enough : outside of a set which may depend on time (this is realistic, for example Xavier can have less and less energy in a two planes pursuit). We shall study the case with a certain capture. For this, let's introduce a set

C

of final states and a single valued map cp(t) which defines a tube. Players have to move in this tube. Here, for sake of simplicity let's assume that the end time T = oo. This is not very important because we can always modify the function 9 such that it is constant (=I) as soon as t

2

T.

The viability constraint is then:

A reasonable assumption is to have cp larger than or equal to 1 and C1 differentiable.

(20)

P r o p o s i t i o n 3.3 Let us posit the same assumptiom in first section. If C rj locally compact, the evader cannot escape if and only if:

V t E R+ V(x,y) E K(t) 3 (u,v) such that:

-

Y)

+

v(t)(f(t, +, !I, u)

-

x, Y, v)) E T c ( Z )

P r o o f - We can calculate in fact the contingent derivative of the tube thanks t o (3.2). As GraphP-I = GraphQ the consequence (4) of the transversality is satisfied.

In fact, here: GraphK = A-'(C) with A(t, x, y) :=

3

a C1 differen- tiable function.

Hence :

if (5, Y) E K ( t )

(u, v) E D K ( t , x, y ) ( ~ ) if and only if

'(cpf(t)(x C P ( ~ ) ~ - y1.r

+

v(t)(u - v)) E T c ( 3 ) because

T A - ~ ( c ) ( ~ , x, Y) = (VA(t, x, Y ) ) - ~ - ~ c ( ~ )

In fact, VA(x, y, t) is surjective because y

>

1 and we can use (3.2). D

We study more concrete cases:

For instance, in R3, if C = { x

/

llxll

<

1 )

this equation can easily be interpreted:

(cpf(t)(x

-

Y)

+

~ ( t ) ( f

0 ,

x, Y, u) - g(t, x, Y, v))).(x

-

Y)

5

0

It means that there is an angle less than 90' between the vector y 5 and cpf(t)(x - Y)

+

v(t)(f (t, x, Y, u)

-

g(t, x, Y,

4)

Very often, it is necessary to specify the function cp

,

for instance a "good one" is :

cp(t) = 1

+

~ e - ~ '

and, of course we should be able to choose a and b allowing the pursuit is possible for every pair of controls (u, v) just solving (if f := u

and g := v) :

V ( u , v )

-

ab11x - ylJ2

+

(1

+

~ e - ~ ' ) ( u

-

v)(x

-

y)

5

0

This is not very useful because this condition is depending on time, we shall try, now, t o have a condition independent on time. A way t o do this is t o determinate all suitable functions cp(.) t o describe the tube K(t). Let 's find such functions solving the following system :

(21)

In this case, it means that

K 1 ( t ) := { ( x , y , ~ ) E X x

Y

x

R+ /

(1s

- Y I ( 5

Q ) is a viability tube of this new system. We can write a necessmy and sufficient condition on W for this :

L e m m a 3.4 The function W will provide s o l u t i o ~ l ~ if and only if:

If

IIx

- Y I I

= 9

3 ( u , V ) such that ( x

-

y, u

-

v ) - W ( q ) q

5

0

P r o o f - Let's define B ( x , y, Q ) := Ilx

- Yl12 -

Q* and let's notice that

B-'(R- )

= Graphh" and thanks to (3.2) the lemme is proved. In fact, as soon as ( ( x - y ) , ( x - y ) , 9 )

#

( 0 , 0 , 0 )

,

V B ( x , y, 9 ) is surjective.

Let us study a case when f and g have explicit forms.

3.3

An affine differential pursuit game:

3.3.1 General case:

We are in the case when two players act on the same state z ( . ) . The first player tries to brake the system and the second player tries to accelerate it by using two controls u and v .

The evolution of the system is given by the following differential equa- tion:

( 1 2 )

{

i ) z1 = A z ( t )

-

u ( t )

+

v ( t )

i i ) u ( t ) E U ( t , z ( t ) ) v ( t ) E V ( t , ~ ( t ) )

The goal is to drive the system near a given target C. Consequently, let us consider the following constraints:

(22)

With C := { z E

Rn /

Mb = M z ) and (b,r,A) E

Rn

x

R+

x

R+

Let us write the playability condition of this game:

P r o p o s i t i o n 3.5 Let A:

Rn

I+

Rn , M: Rn

I+

Rk

be linear

.

The pur~uit h po~~ible if and only i f :

whenever z E e-At(r

+

C )

V t 3 ( u , v ) E U ( t , z ) x V ( t , z ) such that:

M[Az

+

A.z

-

u

+

v] = 0

P r o o f - According t o (3.2), the necessary and sufficient condition of playability is:

V Z E K ( t ) 3 ( u , v ) E U ( t , z ) x V ( t , z ) / A z - u + v E D K ( t , z ) ( l ) But, we know that (thanks t o (3.2)):

T ~ r a p h ~ ( t , z ) = Graph D K ( t , z ) = T L - l ( q = V L ( t , 2)-'.Tc(L(t, z ) ) with L(t, z ) := eAtz

-

r

And we can notice that:

V L ( t , z ) = (AeAtz, e A t )

is obviously always surjective. Hence:

x E D K ( t , z ) ( T ) @ [AeAt.zr

+

eAt.x] E kerM

We can write the discriminating set valued map:

A(t, z, u ) := { v E V ( z , t )

/

v E kerM

-

( A

+

A)z )

Now, let us apply this proposition t o the following example:

3.3.2 A n e x a m p l e i n a t w o d i m e n s i o n a l space:

As we just saw:

17

(23)

Proposition 3.6 T h i ~ game i ~ playable if and only if:

Xz

+

( 1

-

X ) y = u

+

v

Proof - It is just the translation of previous proposition with:

3.4

End time of a capture:

3.4.1 A general model:

Let us consider a two player pursuit game in which one player has to catch the other one in a finite time. The evolution of the game is governed by (1).

For this, let us introduce a constraint function w ( - ) which is the largest possible distance between the two players at time t . Let us consider the end time T as a variable related to t . Hence, we can write a condition for the existence of solutions to this game.

Then solutions have to belong to :

K := { ( 2 , y , t , T )

/

d ( z , Y ) I w ( T - t ) ) With the following assumption on w : V s S O w ( s ) = 0 and V s w ( s ) > O

It means that the distance between two players is equal to zero after cap- ture.

We need another assumption because the two players coordinates do not have to change after Xavier has caught Yves. Consequently f and g are such that:

V z E X f ( . , z , z , . ) = 0 V z E X g ( . , z , z , . ) = O

It means that as soon as z = y (the capture) the system does not evolve, forever the state will remain constant forever.

The set { ( z , y )

/

z = y) x

R+

x

R+

is a viability tube of the game.

(24)

P r o p o s i t i o n 3.7 Under assumptions (J), a necessary and suficieni con- dition for the game playability is :

forall ( x , y , t , T ) such that d(z,y) = w(T

-

t ) there exists ( u , v ) such that

P r o o f - We now write the inclusion to which we shall apply Haddad's theorem in the form:

The viability set is:

K := { ( x , y , s , T )

/

~ ( X , Y )

5

w ( T - 3 ) )

It is necessary to calculate the contingent cone a t K in ( z , y , t , T ) ; it is easy with assumption of C1 differentiability of w .

For this calculus let introduce the following C1 different,iable map:

A ( z , y , t , T ) := 112

-

yl12 - w(T

-

t ) 2

then K =

A-'(R- )

Hence (See 3.2)

T K ( ~ , Y , t , T ) = V ( A ( 2 , Y , t , T))-' .TR- ( A ( z , Y , t , T ) )

because V A ( z , y , t , T ) is surjective as soon as z

#

y or w'(T

-

t )

#

0 We can calculate the cone:

a- if d(z, y )

>

w(t

-

T ) then: T K ( z , y , t , T ) = 0(it is outside K ) because TR- ( A(z, y , t , T ) ) =

0.

b- if d ( z , y )

<

w(t

-

T ) then: T K ( z , y , t , T ) = X x

X

x R x R (it is in the interior of K )

because TR- ( A(z, y , t

,

T ) ) = R

(25)

c- if d(x, y) = w(t

-

T) then:

TK(x, y, t, T) = {(u, V, a, T)/(x

-

y).(u

-

v)

-

w(T

-

t)w'(T

-

t ) ( ~

-

0 )

5

0) (on the boundary of K)

because TR- (0) =

R-

0

3.4.2 A very simple example:

The two players can only choose their velocities u and v the norms of which have to be less than or equal to respectively cr and

p

(nonnegative numbers).

(u, v ) E B(O,cr) x B(O, P)

The playability condition now becomes:

If d(x,y) = w(t

-

T)

(U

-

v).(x - y)

+

w(T - t)wl(T - t)

5

0

It is always possible if :

-(a

+

P)))x

-

yll

+

w(T

-

t)wl(T

-

t)

5

0 i.e cr

+ P >

wl(T - t)

This condition means that the two players have to move fader than the

"slope

"

of the tube when they are on its boundary.

(26)

References

[I] AUBIN J.-P. (1987) Diflerential Calculus of set-valued maps:

an update. IIASA W P

[2] AUBIN J.-P. (1987) Viability Tubes and the Target Problem

.

Proceedings of the IIASA Conference, July 1986, IIASA WP-86 [3] AUBIN J.-P. & CELLINA A. (1984) DIFFERENTIAL IN-

CLUSlONS. Springer-Verlag (Grundlehren der Math. Wis- senschaften, Vo1.264, 1-342)

(41 AUBIN J.-P. & EKELAND I. (1984) APPLIED NONLINEAR ANALYSIS. Wiley-Interscience

[5] AUBIN J.-P. & FRANKOWSKA H. (1985) Heavy viable tra- jectories of controlled systems. Annales de 1'Institut Henri-

Poincard, Analyse Non Lindaire, 2, 371-395

[6] AUBIN J.-P. & FRANKOWSKA H. (1987) Observability of Systems under Uncertainty. IIASA W-P-87

[7] AUBIN J.-P. & WETS R. (1986) Stable approzimations of set- valued maps. IIASA WP-87

[8] BERNHARD P. (1979) Contribution B l'dtude des jeux diffkr- entiels B somme nulle et information parfaite. Thkse Universitk de Paris VI

[9] BERNHARD P. (1980) Ezact controllability of perturbed continuous-time linear systems. Trans. Automatic Control, 25, 89-96

[lo]

BLAQUIERE A. (1976) Dynamic Games With Coalitions and Diplomacies. in Directions in Large-Scale Systems, 95-115 [ll] BREAKWELL J.V. (1977) Zero -sum diflerential games with

terminal payofl. In Diflerential games and Applicatioru, Hage- dorn P.

,

Knobloch H. W.

,

Olsder G.H. Ed. Lecture Notes in Control and information Sciences N 3 Springer-Verlag.

(27)

(121 CORLESS M. & LEITMANN G. (1984) Adaptive Control for Uncertain Dynamical Systems. Dynarnical Systems and Micro- physics Control Theory and Mechanics, 91-158

(131 CORLESS M.

,

LEITMANN G. & RYAN E.P. (1984) R a c k i n g in the Presence of Bounded Uncertainties. Proceeding of the Fourth International Conference on Control Theory

(141 FALCONE M. & SAINT-PIERRE P. (1987) Slow and quasi-slow solutions of differential inclusions. J. Nonlinear Anal.,T.,M.,A., 3, 367-377

[15] FRANKOWSKA H. (1987) L 'e'quation d'Hamilton- Jacobi con- tingente. Comptes Rendus de 1'Acadimie des sciences, PARIS, [16] FRANKOIVSKA H. (1987) Optimal trajectories associated t o a

solution of contingent Hamilton-Jacob;. IIASA WP-87-069 [17] GUSEINOV H. G.

,

SUBBOTIN A. I. & USHAKOV V. N.

(1985) Derivatives for multivalued mappings with applications t o game theoretical problems of control. Problems of Control and Information Theory, Vo1.14, 155- 167

[18] HADDAD G. (1981) Monotone trajectories of differential in- clusions with m e m o r y . Israel J . Maths, 39, 38-100

[19] HAJEK 0.(1975) Pursuit games Academic Press.

[20] ISAACS R. (1965) DIFFERENTIAL GAMES. Wiley, New York [21] KRASOVSKI N. N. (1986) THE CONTROL OF A DYNAMIC

SYSTEM. Nauka, Moscow

(221 KRASOVSKI N. N. & SUBBOTIN A. I. (1974) POSITIONAL DIFFERENTIAL GAMES. Nauka, Moscow

[23] KURZHANSKII A. B. (1977) CONTROL A N D OBSERVATION

UNDER CONDITIONS OF UNCERTAINTY. Nauka

(28)

[24] KURZHANSKII A. B. (1986) O n t h e analytical properties of viability tubes of trajectories of differential s y s t e m . Doklady Acad. Nauk SSSR, 287, 1047-1050

[25] KURZHANSKII A. B. & FILIPPOVA T. F. (1986) O n viable solutioru f o r uncertain s y ~ t e m s .

[26] LEDYAEV Y U.S. (1985) Regular differential g a m e s w i t h m i z e d constraints o n the controls. Proceedings of the Steklov Institute of Mathematics, 167, 233-242

[27] LEITMANN G. (1980) Guaranteed avoidance strategies. Jour- nal of Optimization Theory and Applications, Vo1.32, 569-576 [28] LEITMANN G. (1981) THE CALCULUS OF VARIATIONS A N D

OPTIMAL CONTROL. Plenum Press

[29] LEITMANN G.

,

RYAN E.P. & STEINBERG A. (1986) Feed- back control of uncertain sysiems: robustness w i t h respect t o neglected actuator and sensor dynamics. Int. J. Control, Vo1.43, 1243-1256

[30] LIONS P. -L. (1982) GENERALIZED SOLUTIONS OF HAMILTON- JACOBI EQUATIONS. Pitman

[31] MICHAEL E. (1956) Continuous selections I. Annals of Math., 63, 361-381

[32] MICHAEL E. (1956) Continuous selections II. Annals of Math., 64, 562-580

[33] MICHAEL E. (1957) Continuous selections III. Annals of Math., 65, 375-390

[34] RAY A. & BLAQUIERE A. (1981) S u f i c i e n t Conditions for O p t i m a l i t y of Threat Strategies in a Differential G a m e . Journal of Optimization Theory and Applications, 33, 99-109

[35] SHINAR J. & DAVIDOVITZ A. (1987) Unified Approach for t w o - target G a m e Analysis. loth IFAC World Congress, Munich, F.R.G.

(29)

[36] SUBBOTIN A. I. (1985) Conditioru for o p t i m a l i t y of a guar- anteed outcome in g a m e problem3 of control. Proceedings of the Steklov Institute of Mathematics, 167, 291-304

[37] SUBBOTIN A. I. & SUBBOTINA N. N. (1983) D i f e r e n t i a b i l - i t y propertiea of t h e value f u n c t i o n of a d i f e r e n t i a l g a m e w i t h integral t e r m i n a l coata. Problems of Control and Information Theory, 12, 153-166

[38] SUBBOTIN A. I. & TARASYEV A. M. (1986) Stability proper- tiea of the value function of a diflerential g a m e and viacoaity ao- lutions of Hamilton- Jacobi equationa. Problems of Control and Information Theory, 15, 451-463

139) TCHOU N. A. (1988) Ezistence of alow m o n o t o n e aolutiona t o a d i f e r e n t i a l inclusion. J . Math. Anal. Appl.

Referenzen

ÄHNLICHE DOKUMENTE

in which the controls do not appear explicitly. These are obviously instances of differential inclusions. But, besides this array of mathematical and physical motiva- tions,

The subject nodes covered a breadth of areas including: Upper Limb Muscles – forearm and hand, Lower Limb – nerves, and Head and Neck Anatomy Once the content had been finalized

In nature, the role of asymmetries is much more pronounced still, and soon after the introduction of game theory in the study of biological contests, a series of papers

We shall prove the existence of such continuous aingle-valued playable feedbacks, as well as more constructive, but discontinuous, playable feed- backs, such as the

Since the convex selection procedure SG has a closed graph and convex values, the right-hand side is upper semicontinuous set-valued map with nonempty compact

This problem was studied in Aubin-Clarke [2] when U is convex, and by many other authors. The abstract theorems of Section 3 can be applied as well to this new problem, but we

A fractional order asymmetric game is shown to have a locally asymptotically stable internal solution. This is not the case for its integer

A fractional order asymmetric game is shown to have a locally asymptotically stable internal solution1. This is not the case for its integer