Lunar Lander Reinforcement Learning Agent
Introduction
The OpenAI Gym provides a set of environments for training
a reinforcement learning agent. For this project, I trained an agent for
the Lunar-Lander-v2 environment. The environment consists of a model
of a lunar lander that must learn to land on a target landing pad.
The pad is represented by the coordinates
Do Nothing
Fire Left Enginge
Fire Right Engine
Fire Main Engine
An agent is considered successful if it scores an average of 200 points or more over the last 100 episodes. Here, I use a Deep-Q Learning algorithm called Double DQN to train a lander that achieves an average score of 272 over 100 episodes.