Flux Pendulum DDPG example fails on GPU

Is ACTION_BOUND a Float64 number? Maybe const ACTION_BOUND = Float32(actions(env, env.state).hi[1]) helps… (I didn’t test).

If the problem is with Reinforce.jl you may have more luck with the DDPG implementation in ReinforcementLearning.jl.

1 Like