Many copies of a single reinforcement learning agent play Snake; the agent was trained with the deep Q-learning algorithm in ConvNetJS. The food is hidden, but all the agents see the same food item, which produces a mixture of random and coordinated movements across the screen. The agents were trained on a smaller 5x5 grid, and that is still all they see: the same 5x5 view repeated across the canvas.
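
One way to picture the repeated 5x5 view is a modulo mapping from canvas cells to grid cells. The sketch below is an illustration only and is not taken from the demo's source: the helper names `local_cell` and `observation`, the one-hot encoding, and the wrap-around rule are all assumptions. It shows why two agents in different parts of the canvas can receive identical inputs and so move in step.

```python
GRID = 5  # side length of the training grid, as stated in the description


def local_cell(x, y, grid=GRID):
    """Map a canvas cell to the matching cell of the tiled grid (assumed rule)."""
    return (x % grid, y % grid)


def observation(agent_xy, food_local, grid=GRID):
    """Hypothetical observation: one-hot planes for the agent head and the food."""
    ax, ay = local_cell(*agent_xy, grid)
    fx, fy = food_local
    head = [[0] * grid for _ in range(grid)]
    food = [[0] * grid for _ in range(grid)]
    head[ay][ax] = 1
    food[fy][fx] = 1
    return head, food


# Two agents in different tiles of a larger canvas, sharing one food cell.
shared_food = (3, 1)
obs_a = observation((2, 4), shared_food)
obs_b = observation((12, 9), shared_food)  # 12 % 5 == 2 and 9 % 5 == 4

print(obs_a == obs_b)  # True: both agents see the same 5x5 situation
```

Under these assumptions, agents whose positions match modulo 5 get the same input and pick the same action, while agents at other offsets see a different situation and appear to move independently.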

