Next: Example 1
Up: Infinite Horizon Problems
Previous: Infinite Horizon Problems
The Return Function
We will suggest three possible return functions for the infinite horizon problem:
- 1.
- The expected sum of the immediate rewards, i.e.
Note that this return function may diverge.
- 2.
- The expected discounted sum of the immediate rewards, i.e.
In this case, a suffice condition for convergence can be for example:
Under this condition we can find an upper bound to the return function:
Note that this bound is very sensative to the value of the paramter .
- 3.
- The expected average reward
This limit does not always exist. A sutisfactory demand for the limit's existance may be
- (a)
- S is finite
- (b)
-
is Markovian and stationary
- (c)
- the system is non periodic
These conditions will be discussed further in a later lecture.
Yishay Mansour
1999-11-18