Example 1

This example is an expansion of Example 2 given in Lecture 3.
We first examine the values obtained under different return functions for two specific policies:

1. $\pi_1$ - always chooses a₁₁ when in state s₁
2. $\pi_2$ - always chooses a₁₂ when in state s₁

**Figure:** Infinite Horizon Example

Let us start by calculating $V^{\pi}_N$ for each policy:

$\begin{eqnarray*}
V^{\pi_2}_N & = & 10 - (N-2) = 12 - N\\
V^{\pi_1}_N & = & 5 + \cdots\, (N-2) + \left(1-\frac{1}{2^{N-2}}\right)\\
 & = & 13 - N - \frac{6}{2^{N-2}}
\end{eqnarray*}$
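
The two closed forms can be checked numerically with a short backward recursion. The sketch below is only illustrative: the figure is not reproduced here, so the transition model in the comments (rewards 5, 10 and -1, and the 1/2 probability of staying in s₁) is an assumption chosen to be consistent with the formulas above, and the function names are hypothetical. It also assumes that $V_N$ counts $N-1$ decisions with no terminal reward.

```python
from fractions import Fraction

# ASSUMED model (not stated in the text, inferred from the closed forms):
#   s1, a11: reward 5,  stay in s1 w.p. 1/2, move to s2 w.p. 1/2
#   s1, a12: reward 10, move to s2 w.p. 1
#   s2:      reward -1 per step, stay in s2 forever
# ASSUMED convention: V_N sums the rewards of N-1 decisions, starting in s1.


def value_in_s2(steps):
    """Total reward of spending `steps` decisions in s2."""
    return Fraction(-steps)


def value_pi1(n):
    """V_N under pi_1 (always a11 in s1), by backward recursion."""
    v = Fraction(0)  # value with 0 decisions remaining
    for k in range(1, n):  # k = number of decisions remaining
        v = 5 + Fraction(1, 2) * v + Fraction(1, 2) * value_in_s2(k - 1)
    return v


def value_pi2(n):
    """V_N under pi_2 (always a12 in s1), for N >= 2."""
    return 10 + value_in_s2(n - 2)


def closed_form_pi1(n):
    return 13 - n - Fraction(6, 2 ** (n - 2))


def closed_form_pi2(n):
    return Fraction(12 - n)


if __name__ == "__main__":
    for n in range(2, 9):
        print(n, value_pi1(n), value_pi2(n))
```

Under these assumptions the recursion agrees with both closed forms for every $N \ge 2$. Subtracting the two formulas gives $V^{\pi_1}_N - V^{\pi_2}_N = 1 - \frac{6}{2^{N-2}}$, so $\pi_2$ has the larger value for $N \le 4$ and $\pi_1$ for $N \ge 5$.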