[ANN] Lux.jl: Explicitly Parameterized Neural Networks in Julia

Would it not be sufficient to do

ps, st = Lux.setup(rng, model)
ps = Float64.(ps)

?

That would work if ps were a flat vector, but Lux uses nested named tuples for its parameters.

It seems that the state st is even trickier, as it contains leaves of different types, e.g., training = Val{true} alongside Vector{Float32} parameters. Thus, ComponentVector(st) will not work. On the other hand, as soon as you run the model a single time via y, st = Lux.apply(model, x, ps, st), the state will be promoted to the type of x and ps anyway, so you can use that to convert instead.
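
For anyone searching later, here is a rough sketch of that workflow. This is my own illustration (not an official Lux recipe), using Functors.fmap to walk the nested NamedTuple of parameters:

using Lux, Random, Functors

model = Chain(Dense(2 => 3, tanh), BatchNorm(3))
ps, st = Lux.setup(Xoshiro(0), model)

# Walk the nested NamedTuple of parameters and convert every array leaf to Float64
ps64 = fmap(x -> x isa AbstractArray ? Float64.(x) : x, ps)

# As described above, running the model once with Float64 inputs and parameters
# also promotes the array-valued entries of the state
x = rand(Float64, 2, 4)
y, st64 = Lux.apply(model, x, ps64, st)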

The latest stable release has these functions available: Utilities | LuxDL Docs
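
If I am reading that page correctly, it turns the conversion above into a one-liner. A sketch, continuing from the ps in the previous snippet and assuming the helper is exported as f64:

# Assumed helper from the linked Utilities page; it should convert every
# floating-point leaf of the nested parameter NamedTuple to Float64
ps64 = Lux.f64(ps)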

1 Like

I love the style and theme of the website! Getting Started | LuxDL Docs could be more descriptive, though. Given that it's an introduction, focusing on the coding aspect is fine, but I think more useful and descriptive comments could be added.

1 Like

I started looking at Lux based on this thread, and I must say that I am truly appreciative of the documentation. Thanks to everyone who put in the time to work on the package and the website!

In the spirit of giving constructive feedback, there is just one suggestion I would like to make. The font used for code blocks makes very little distinction between a period and a comma. One example, from the section on “(Im)mutability” in the very first tutorial, “Julia & Lux for the Uninitiated”:

[screenshot of the code block as rendered in the docs, where the period and comma are nearly indistinguishable]

If you know any Julia, this won’t be a problem, but this tutorial is specifically aimed at people who don’t, which may result in a little (or a lot of) head-scratching.

For comparison, when I copy and paste the code to my REPL:

[screenshot of the same code pasted into the REPL, where the period and comma are clearly distinct]

In any event, I shall be directing my team members interested in learning Julia and deep-learning to the Lux docs. It does an infinitely better job at explaining things than I have been doing.

11 Likes

Some New Major Updates (up to v0.5.33)

  1. Lux now has built-in distributed training support via MPI – Distributed Data Parallel Training | Lux.jl Documentation. It is effectively a rewrite of my older (now archived) package GitHub - avik-pal/FluxMPI.jl: Distributed Data Parallel Training of Deep Neural Networks, but it also allows NVIDIA GPU communication via NCCL!
  2. SimpleChains is available as a backend (Switching between Deep Learning Frameworks | Lux.jl Documentation) for small neural networks, which means you can write everything in Lux and still use SimpleChains with just 1 additional line of code (MNIST Classification with SimpleChains | Lux.jl Documentation) – see the sketch below this list.
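
For item 2, here is a rough sketch of what that one extra line looks like, based on my reading of the linked tutorial. The adaptor name ToSimpleChainsAdaptor and the static input dimensions are taken from the docs, but the exact requirements may differ across Lux versions:

using Lux, SimpleChains, Static, Random

# An ordinary Lux model
lux_model = Chain(Dense(784 => 32, relu), Dense(32 => 10))

# The one extra line: adapt the Lux model to the SimpleChains backend
sc_model = ToSimpleChainsAdaptor((static(784),))(lux_model)

# The converted model is still used through the usual Lux API
ps, st = Lux.setup(Xoshiro(0), sc_model)
x = rand(Float32, 784, 16)
y, st = Lux.apply(sc_model, x, ps, st)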
28 Likes

That’s awesome! IIRC, one of SimpleChains.jl’s main selling points is that it can be allocation-free: how does this pair with Lux’s purely functional style?

2 Likes

It still maintains purity in the sense of no side effects and same inputs => same outputs. But yes, you can't get the full performance of SimpleChains while staying in the Lux API, or for that matter using the ChainRules API. FWIW, most SimpleChains users coming from the SciML side would use the ChainRules API (see Faster Neural Ordinary Differential Equations with SimpleChains · SciMLSensitivity.jl) rather than the fully non-allocating train_batched! API.

The Lux API is simpler to use and more people are familiar with it (since we mimicked the Flux API for layers). SimpleChains is extremely fast, but its API is somewhat non-traditional. So now users can write the model in Lux and (if possible) convert it to SimpleChains (getting the performance boost) without ever having to learn the details of writing SimpleChains code.

1 Like

Perhaps SimpleChains would be a good candidate for a DifferentiationInterface binding, which could then be accessed from Lux.

1 Like

New Additions (v0.5.36)

# `nn` (a Lux layer) and `p_true` (a vector of known parameters) are assumed to be
# defined beforehand; Tsit5, ODEProblem and solve come from the SciML stack.
using Lux, OrdinaryDiffEq

ude = @compact(; nn, p_true, solver=Tsit5(), tspan=(0.0, 1.0),
        kwargs...) do x, ps
    # Just an arbitrary UDE
    dudt(u, p, t) = nn(x, p.nn) .+ sum(p.p_true)
    prob = ODEProblem{false}(dudt, x, tspan, ps)
    return solve(prob, solver; kwargs...)
end
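
If it helps anyone reading along: the resulting ude object is an ordinary Lux layer, so my understanding is that it follows the usual setup/apply pattern. A sketch, assuming nn and p_true were defined before building ude and that u0 matches nn's input size:

using Random

u0 = rand(Float32, 2)                  # example initial condition; must match nn's input size
ps, st = Lux.setup(Xoshiro(0), ude)    # ps holds both the nn weights and p_true
sol, st_new = ude(u0, ps, st)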
12 Likes

20 posts were split to a new topic: Nested AD with Lux etc