BoundsError: attempt to access 2-element Vector in DDPG algorithm

GK_197 · June 8, 2021, 12:03pm

Hi,

I was implementing DDPG algorithm for 1-D vector action space but, during run it shows error as:

BoundsError: attempt to access 2-element Vector{Float64} at index []

Now, when looked into DDPG policy on Github

# TODO: handle Training/Testing mode
function (p::DDPGPolicy)(env)
    p.step += 1

    if p.step <= p.start_steps
        p.start_policy(env)
    else
        D = device(p.behavior_actor)
        s = state(env)
        s = Flux.unsqueeze(s, ndims(s) + 1)
        action = p.behavior_actor(send_to_device(D, s)) |> vec |> send_to_host
        clamp(action[] + randn(p.rng) * p.act_noise, -p.act_limit, p.act_limit)
    end
end

It has action[], so is it expecting action to be scalar. As we have action_space of form [1..3,2..3].
I am using start policy in DDPG agent as -

start_policy = RandomPolicy(action_space(env);rng)

Any help or suggestion on how this is usually done is appreciated!

findmyway · June 14, 2021, 4:21am

Ah, it’s still not fixed yet after a year…

Topic		Replies	Views
Continuous action space array Machine Learning question , machine-learning	3	834	June 8, 2021
Flux Pendulum DDPG example fails on GPU General Usage cudanative , cuda , flux , zygote	2	675	November 12, 2020
DDPG using Flux General Usage	2	888	May 26, 2020
Using PPOPolicy with custom environment with action masking in ReinforcementLearning.jl Machine Learning question	14	1237	October 16, 2021
Using AbstractEnv from CommonRLInterface with POMDPs Machine Learning	22	776	September 27, 2021

BoundsError: attempt to access 2-element Vector in DDPG algorithm

Related topics