Mixed Strategies

Games which are not strictly determined can be further analysed by considering what happens when R and C play a game many times. Instead of choosing the same row (or column) on each occasion, R and C can adopt ``mixed'' strategies and thereby improve their expected payoff.

Definition 6..10   A strategy for the player R is a vector u in $ \mathbb {R}$m with $ \sum_{{i=1}}^{m}$ui = 1 and u$ \ge$ 0. A strategy for the player C is a vector v in $ \mathbb {R}$n with $ \sum_{{j=1}}^{n}$vj = 1 and v$ \ge$ 0.

What we have in mind is that if R has strategy u then he chooses row ai with probability ui. That is, over a series of games, he chooses ai with the frequency that this specifies. Similarly, if C has strategy v then he chooses column aj with the frequency given by the probability vj.

Definition 6..11   For strategies u for R and v for C, the expectation E(u,v) is given by

E(u,v) = $\displaystyle \sum_{{i=1}}^{m}$$\displaystyle \sum_{{j=1}}^{n}$uiaijvj.

The expectation gives the 'expected value' of the payoff that R receives from C. Using matrix notation, one has E(u,v) = uTAv. Note that Rowman is sure to win $ \min_{{\v}}^{}$E(u,v) if he uses mixed strategy u. If he now maximises his certain winnings by choosing u sensibly, he can be sure of winning

$\displaystyle \alpha^{*}_{}$ = $\displaystyle \max_{{\u}}^{}$$\displaystyle \min_{{\v}}^{}$E(u,v).

In the same way, Columnman, using strategy v, will lose at most $ \max_{{\u}}^{}$E(u,v), and he can reduce this loss as much as possible by choosing his strategy v sensibly; he then looses

$\displaystyle \beta^{*}_{}$ = $\displaystyle \min_{{\v}}^{}$$\displaystyle \max_{{\u}}^{}$E(u,v).

Definition 6..12   Let

$\displaystyle \alpha^{*}_{}$ = $\displaystyle \max_{{\u}}^{}$$\displaystyle \min_{{\v}}^{}$E(u,v) and $\displaystyle \beta^{*}_{}$ = $\displaystyle \min_{{\v}}^{}$$\displaystyle \max_{{\u}}^{}$E(u,v).

Then $ \alpha^{*}_{}$ and $ \beta^{*}_{}$ are the optimum lower and optimum upper values of the game A. Any u such that

$\displaystyle \min_{{\v}}^{}$E(u,v) = $\displaystyle \alpha^{*}_{}$

is an optimum strategy for R and any v such that

$\displaystyle \max_{{\u}}^{}$E(u,v) = $\displaystyle \beta^{*}_{}$

is an optimum strategy for C.

Ian Craw 2002-09-11