I defined the following model, with the weights stored in a dictionary:
mutable struct Affine
params
end
Affine(in::Integer, out::Integer) =
    Affine(Dict("W" => randn(out, in), "b" => randn(out)))
# Overload call, so the object can be used as a function
(m::Affine)(x) = m.params["W"] * x .+ m.params["b"]
a = Affine(1, 1)
# a.params
# Dict{String,Array{Float64,N} where N} with 2 entries:
#   "W" => [0.559984]
#   "b" => [0.841258]
Then I tried to specify params["W"] as the only trainable parameter:
Flux.@functor Affine
Flux.trainable(a::Affine) = (a.params["W"],)
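For comparison, the documented trainable pattern uses plain struct fields rather than Dict entries; restricting to W there looks like this (a sketch only, AffineFields is a hypothetical name I made up for the comparison):

mutable struct AffineFields
    W
    b
end
AffineFields(in::Integer, out::Integer) = AffineFields(randn(out, in), randn(out))
(m::AffineFields)(x) = m.W * x .+ m.b
Flux.@functor AffineFields
Flux.trainable(m::AffineFields) = (m.W,)  # params(m) should then collect only W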
Then I took the gradient and printed each entry of grads.grads:

grads = Flux.gradient(() -> a(3)[1], Flux.params(a))
for p in grads.grads
    println(p)
    println("--")
end
# Output
Pair{Any,Any}([0.5599836013031454], nothing)
--
Pair{Any,Any}(Dict{String,Array{Float64,N} where N}("W" => [0.5599836013031454],"b" => [0.841258238965482]), Dict{Any,Any}("W" => [3.0],"b" => [1.0]))
--
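For what it's worth, this is how I inspect what params actually collects after the trainable override (a diagnostic sketch; it only prints array summaries, no gradients):

for p in Flux.params(a)
    println(summary(p))  # size and element type of each collected array
end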
But the gradient is apparently still taken with respect to both parameters (the Dict itself gets gradients for both "W" and "b", while the W array gets nothing).
Can I restrict the gradient to just the one parameter I need?