Proof:step 1: we show that for every constant ,
for an infinite number of t's.
So the variance could be expressed as :
Since
and
, we have that
.
Let us assume (by contradiction) that there exists T such that for all
t>T,
and
. (since
). In such a case we have
.
Hence :
.
Since for any m we have
,then it has to be the case that
, which is a contradiction to the assumption in the lemma.
Therefore, for any
, there exists T such that
and
.
step 2: we show that
.
Under the hypothesis that
we showed that
, this implies that
.
we can substitute :
Remark : the condition
will
insure convergence with probability one.
We say that
with probability one if and only if
.
Next: Evaluating Policy Reward
Up: Stochastic Models
Previous: Choosing
Yishay Mansour
1999-12-16