Introduction
The Problem
States and Actions
Immediate Reward and the Probability of Transition
Policy and Decision Rules
Policy
Example 1
Example 2
Finite Horizon
Expected Total Reward
Return Estimation of a Given Policy
Dynamic Programming Algorithm for
Computational Complexity
The Optimality Principle
Yishay Mansour
1999-11-15