Reinforcement Learning and Dynamic Programming Using Function Approximators: 39