817 B
817 B
Lunar Lander
This is an example of an Actor-Critic learning Agent as part of the ASIM RL Tutorial
It uses gym for the environment and torch as the basis for the A-C Network
Action Space
- do nothing,
- fire left engine,
- fire bottom engine,
- fire right engine
Observation Space
- x,
- y,
- linear velocity in x,
- linear velocity in y,
- angle,
- angular velocity
- ground contact leg 1
- ground contact leg 2
Rewards
- landing at landing pad +[100-140] points.
- crash: -100 points
- landing: +100 points
- landing on leg: +10 points per leg
- firing an engine: -0.3 points per engine per frame
it counts as solved at 200 points
Starting State
The lander starts at the top center of the viewport with a random initial force applied to its center of mass.