Hi,
just for a curiosity, did played with reinforcement learning to solve sokoban? Is there any repo, which would allow to play with it?
Thanks for response,
Tomas
Hi,
just for a curiosity, did played with reinforcement learning to solve sokoban? Is there any repo, which would allow to play with it?
Thanks for response,
Tomas
I am not aware of any existing Julia repo but here are some thoughts.
I think that learning a good sokoban player is a pretty hard problem given the current state of the art in RL. I would not expect an approach based purely on learning (DQN, policy gradient) to be successful unless you throw some Deepmind’s level of computing power at it.
Instead, I would really bet on a combination of learning and search, as done in AlphaZero. (Note that I may be a bit biased here as I just released the first version of AlphaZero.jl).
With a few modifications, I think AlphaZero could work pretty well. However:
Another big challenge in Sokoban is that some moves are irreversible and the game is never-ending from an impossible position. Therefore, you may want the agent to “learn when to give up” (or just put a limit on the length of episodes).
Hi Jonathan,
I am aware of the complexity of Sokoban and thats why I have choosen that. My students mainly use python and I wanted to know, if there is something in python. We are indeed interested in combination of RL a search method. Sad, that we cannot easily test AlphaZero.jl.
Tomas