Problem of sampling
We can estimate the value of one policy
from samples generated by another policy.
The problem is that the variance of the estimate can be large, even infinite (see the variance in Example 1).
A large variance can make the estimation error grow.
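
The growth of the variance can be illustrated with a small sketch. It is not the Example 1 referred to above. It assumes a hypothetical setting with two actions and two state-independent policies, `target` (the policy being estimated) and `behavior` (the policy that generated the samples), where the estimate reweights each sampled trajectory by the product of the per-step probability ratios. All names below are illustrative assumptions.

```python
def per_step_second_moment(target, behavior):
    """E_behavior[(target(a) / behavior(a))^2] = sum_a target(a)^2 / behavior(a)."""
    total = 0.0
    for p, q in zip(target, behavior):
        if p == 0:
            continue  # actions the target never takes contribute nothing
        if q == 0:
            # Assumption: the sampling policy must cover the target policy.
            raise ValueError("behavior policy gives zero probability to a target action")
        total += p * p / q
    return total


def weight_variance(target, behavior, horizon):
    """Variance of the product of `horizon` independent per-step ratios.

    Each ratio has mean 1 under the sampling policy, so the product has
    mean 1 and variance (per-step second moment) ** horizon - 1.
    """
    return per_step_second_moment(target, behavior) ** horizon - 1.0


# Hypothetical policies over two actions that disagree strongly.
target = [0.9, 0.1]
behavior = [0.1, 0.9]

for horizon in (1, 5, 10):
    print(horizon, weight_variance(target, behavior, horizon))
```

Under these assumptions the per-step second moment is greater than 1 whenever the two policies differ, so the variance of the weight grows exponentially with the horizon and is unbounded as the horizon goes to infinity. When the two policies are identical the variance is zero.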
Yishay Mansour
2000-01-07