Deep Reinforcement Learning

Approximate value iteration

Q-learning