Reinforcement learning with TensorFlow paperback