Hi Daniel,

Thanks for your comment. I may have put too much emphasis on the word “deterministic.” The issue with MR is not just that it is deterministic, but that a single fixed path is optimal, so a demonstration of that one path is enough to solve the level. In the other games you mention, no such universal demonstration exists: no single demonstration would be enough for an agent to learn to play SCII, for example.

PhD. Interests include Deep (Reinforcement) Learning, Computational Neuroscience, and Phenomenology.

