Accessing a specific layer's weights in a Flux Chain

bad_at_math · July 6, 2021, 9:47pm

In PyTorch, I can define a network like this:

from torch import nn

class Network(nn.Module):
    def __init__(self, ...):
        ...
        self.ln_1 = nn.Linear(64, 32)
        self.ln_2 = nn.Linear(32, 16)
        ...

The named_parameters method (see Module in the PyTorch 1.12 documentation) lets you iterate over a network's parameters and access them (and their gradients) by name (e.g., ln_1.weight).

Does Flux have similar functionality? MWE:

using Flux 
network = Chain(Dense(64, 32, tanh), Dense(32, 16, tanh))
ps = Flux.params(network)
point = ... # data point
criterion = ... # loss function
gs = Flux.gradient(ps) do
    loss = criterion(point...)
    return loss
end

Then I can access the gradient of the first layer’s weights with gs[network[1].weight], but this is maybe a little less interpretable than in PyTorch, since gs’s keys are arrays, not strings.

ToucheSir · July 7, 2021, 1:15am

This is exactly what explicit parameters were designed for:

...
criterion(m, ...) = ... # loss function
gs = Flux.gradient(network) do m
    loss = criterion(m, point...)
    return loss
end

size(network[1].weight) == size(gs[1].layers[1].weight) # true
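
For anyone who wants to run this end to end, here is a minimal sketch of the explicit-gradient pattern above. It assumes Flux 0.12 or later, where Dense stores its matrix in a field named weight. The data point (x, y) and the sum-of-squares criterion are made up for illustration and are not from the original post.

```julia
using Flux

network = Chain(Dense(64, 32, tanh), Dense(32, 16, tanh))

# Hypothetical data point and loss, chosen only for illustration.
x, y = rand(Float32, 64), rand(Float32, 16)
criterion(m, x, y) = sum(abs2, m(x) .- y)

# Differentiate with respect to the model itself (explicit style).
gs = Flux.gradient(m -> criterion(m, x, y), network)

# gs is a 1-tuple; gs[1] mirrors the model's structure field by field,
# so each layer's gradient is reached through the Chain's `layers` field.
layer_grads = gs[1].layers
for (i, g) in enumerate(layer_grads)
    println("layer $i: weight ", size(g.weight), ", bias ", size(g.bias))
end
```

Because the gradient has the same shape as the model, layers are looked up by position and field name (layers[1].weight) rather than by using the parameter array itself as a key.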
