Hi Daniel,

Thanks for your comment. I may have put too much emphasis on the word “deterministic.” The issue with MR is not just that it is deterministic, but that a single fixed path is optimal, so a demonstration of that path is sufficient to solve the level. In the other games you mention, that does not hold: for SCII, for example, no single universal demonstration would be sufficient for an agent to learn to play.

PhD. Interests include Deep (Reinforcement) Learning, Computational Neuroscience, and Phenomenology.
