Hi Sameh,

You are correct. Any action taken in the current state doesn’t affect future states. Environments with temporal dynamics like those you described are called Markov Decision Processes. All of the tutorials following this one contain descriptions of MDPs, and provide algorithms that can learn to solve them.

Hope that helps.

PhD. Interests include Deep (Reinforcement) Learning, Computational Neuroscience, and Phenomenology.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store