Next:
Learning from experience
Learning from experience
Average Calculation
Convergence
Convergence rate
Weighted average
Stochastic Models
Using
to control the convergence rate
Choosing
Evaluating the average
Evaluating Policy Reward
The Naive Approach
First Visit
Every Visit
Example
About this document ...
Yishay Mansour
1999-12-16