Initializing Flux weights the same as PyTorch?

DevJac · February 5, 2021, 5:05am

I came up with this function to initialize the weights the same way PyTorch does:

function Linear(in, out, activation)
    Dense(in, out, activation,
          initW=(_dims...) -> Float32.((rand(out, in).-0.5).*(2/sqrt(in))),
          initb=(_dims...) -> Float32.((rand(out).-0.5).*(2/sqrt(in))))
end

At least, for PyTorch’s Linear layers that’s how it works. You can easily verify this by creating a PyTorch Linear layer and looking at the minimum and maximum weight and bias values.

Topic		Replies	Views
How to implement custom weight initialization in Flux? General Usage	1	609	March 14, 2022
Impose initialization adn normalization on layers in Flux Machine Learning first-steps	2	730	September 11, 2020
Flux has no Lecun Normalization weight init function? New to Julia flux , machine-learning	0	84	October 9, 2024
Initialize weights for Flux.Dense New to Julia flux	1	1164	August 8, 2020
How to create Dense with initialed weights in vector General Usage flux	0	31	October 14, 2024

Initializing Flux weights the same as PyTorch?

Related topics