207 B
207 B
Tic Tac Toe
A simple game of tic tac toe
Q-Learning example as part of the asim reinforcement learning tutorial. the agent (player) learns not to lose the game, given sufficiently long training period.