Next: Example
Up: Large State Space
Previous: Evaluation Of Approximate Policy
Approximate Value Iteration
We will make the following assumption.
This implies that using L operator on the inequality
We have also the next inequality
Using both inequalities
Thus for each k we have
If we look at
Therefore,
Although calculations are much simpler than in PI. The method is less natural.
Yishay Mansour
2000-01-11