Hi John,

I was actually working on training my A3C with Breakout yesterday. It turns out the model doesn’t converge with the hyperparameters used here, so it will need a little tweaking before it is competitive with the other implementations you’ve likely found around.

If I get something working well though, I will add it to the github repository, and comment here with the results.

Research Scientist. Interested in Artificial Intelligence, Neuroscience, Philosophy, and Literature.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store