I’ve been playing with reinforcement learning in Julia for the past few months. I’ve been using Crux.jl implementation of the proximal policy optimization (PPO) algorithm. The current master branch of Crux.jl (but not the latest release version in the general registry) works fine on my laptop as well as a remote server. You can try the classic CartPole training example from Gymnasium with the following example code:
https://github.com/sisl/Crux.jl/blob/master/examples/rl/cartpole.jl
if you also install the other dependencies (Flux, POMDPs, POMDPGym).
A more specialized implementation of Deep Q Networks (DQN) is available in DeepQLearning.jl. I recently posted an example, again for the CartPole environment:
https://discourse.julialang.org/t/reinforcement-learning-packages-for-cartpole-example-with-julia-v1-11-or-v1-10/125261/3
Please let me know if either of the above options works! I didn’t write any of the packages but submitted minor issues and PRs to fix broken dependencies recently.