Hi Ronan,

That is a cool challenge to take on! One thing you want to be sure of in Model-based approaches is that the model is actually learning the dynamics of the environment accurately. If the model doesn’t accurately predict future states, or termination conditions, then using it for planning won’t be successful. The model-based approach I used for CartPole was pretty tailored to that task, so you may need to tweak a number of things to ensure the model is learning well. Good luck!

PhD. Interests include Deep (Reinforcement) Learning, Computational Neuroscience, and Phenomenology.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store