Example:
Using the same example we calculate the
optimal return value to be:
Thus
If we examine different values of the discount factor, we get different
optimal actions in S1.
For example:
Note that as the discount factor increases, the optimal policy at S1
changes from a12 to a11.
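The switch in the optimal action at S1 can be reproduced numerically. The sketch below is only an illustration: the rewards and transition probabilities of the example are not shown in this section, so the numbers used here are assumptions (a11 pays 5 and moves to S1 or S2 with probability 0.5 each, a12 pays 10 and moves to S2, and the single action a21 in S2 pays -1 and stays in S2). The function name and the sweep count are likewise hypothetical. It runs plain value iteration and reports which action is greedy at S1.

```python
def optimal_action_in_s1(discount, sweeps=5000):
    """Value iteration on an ASSUMED two-state MDP (numbers are not from the text).

    S1: a11 -> reward 5,  next state S1 w.p. 0.5, S2 w.p. 0.5
        a12 -> reward 10, next state S2 w.p. 1
    S2: a21 -> reward -1, next state S2 w.p. 1
    """
    v1, v2 = 0.0, 0.0
    for _ in range(sweeps):
        q11 = 5.0 + discount * (0.5 * v1 + 0.5 * v2)
        q12 = 10.0 + discount * v2
        # Synchronous backup: both updates use the old values.
        v1, v2 = max(q11, q12), -1.0 + discount * v2

    # Greedy action at S1 with respect to the converged values.
    q11 = 5.0 + discount * (0.5 * v1 + 0.5 * v2)
    q12 = 10.0 + discount * v2
    return "a11" if q11 > q12 else "a12"


if __name__ == "__main__":
    for discount in (0.5, 0.8, 0.95):
        print(discount, optimal_action_in_s1(discount))
```

Under these assumed numbers, a12 collects the larger immediate reward but commits to the costly state S2 at once, so it is preferred for small discount factors; a11 delays the move to S2 and becomes preferable once the discount factor is large enough (above 10/11 for this particular choice of numbers).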
Yishay Mansour
1999-11-24