Reinforcement Learning: Moving an Agent to a Target

The agent (blue square) learns to reach the target (red square).
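
The text does not say which learning algorithm drives the agent, so the following is only an illustrative sketch using tabular Q-learning, one common choice for this kind of task. Everything specific here is an assumption made for the example: the 5x5 grid, the start cell (0, 0), the target cell (4, 4), the reward values, the hyperparameters, and the function names train_q_table and greedy_path.

```python
import random

# Four possible moves on the grid as (dx, dy) offsets (assumed action set).
MOVES = [(0, 1), (0, -1), (1, 0), (-1, 0)]


def step(state, move, size):
    """Apply a move and clamp to the grid, so bumping a wall leaves the agent in place."""
    x = min(max(state[0] + move[0], 0), size - 1)
    y = min(max(state[1] + move[1], 0), size - 1)
    return (x, y)


def train_q_table(size=5, start=(0, 0), target=(4, 4), episodes=1000,
                  alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Learn action values with tabular Q-learning (all parameters are assumed)."""
    rng = random.Random(seed)
    q = {(x, y): [0.0] * len(MOVES) for x in range(size) for y in range(size)}
    for _ in range(episodes):
        state = start
        for _ in range(200):  # cap the episode length
            if rng.random() < epsilon:
                # Explore: pick a random action.
                action = rng.randrange(len(MOVES))
            else:
                # Exploit: pick a best-valued action, breaking ties at random.
                best = max(q[state])
                action = rng.choice(
                    [a for a in range(len(MOVES)) if q[state][a] == best])
            nxt = step(state, MOVES[action], size)
            done = nxt == target
            # Assumed rewards: +1 on reaching the target, small cost per step.
            reward = 1.0 if done else -0.01
            future = 0.0 if done else max(q[nxt])
            q[state][action] += alpha * (reward + gamma * future - q[state][action])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, size=5, start=(0, 0), target=(4, 4), max_steps=50):
    """Follow the highest-valued action from start until the target or the step cap."""
    path = [start]
    state = start
    for _ in range(max_steps):
        if state == target:
            break
        action = max(range(len(MOVES)), key=lambda a: q[state][a])
        state = step(state, MOVES[action], size)
        path.append(state)
    return path


if __name__ == "__main__":
    table = train_q_table()
    print(greedy_path(table))
```

The small per-step cost is a design choice in this sketch: it nudges the agent toward shorter routes rather than any route that eventually reaches the target.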