next up previous
Next: Proof for Phased Q-Learning Up: Reinforcement Learning - Final Previous: The Main Theorem -

   
My Results

The article presents no proof to its theorem, so the core of my project is providing a proof to the theorem. The results I've accomplished depend on |A| (the size of the actions-set), but under the assumption of a constant |A|, the result are the same:

Yishay Mansour
2000-05-30