I made a few changes: when the first layer uses relu, the Hessian at the origin is SPD.
using Lux, Random  # relu, softplus, and Lux.setup come from Lux; default_rng from Random

function create_icnn(n_vars::Int, hidden_dims::Vector{Int}=[32, 32];
                     T::Type=Float64, rng::AbstractRNG=Random.default_rng())
    @assert !isempty(hidden_dims) "hidden_dims cannot be empty"
    # ICNNFirstLayer, ICNNLayer, FinalICNNLayer, and ICNNChain are the custom
    # layer types from earlier in the thread.
    layers = []
    # First layer
    push!(layers, ICNNFirstLayer(n_vars, hidden_dims[1], relu; T=T))
    # Hidden layers
    for i in 1:(length(hidden_dims)-1)
        push!(layers, ICNNLayer(n_vars, hidden_dims[i],
                                hidden_dims[i+1], softplus; T=T))
    end
    # Output layer (no activation, with quadratic terms)
    push!(layers, FinalICNNLayer(n_vars, hidden_dims[end], 1, identity;
                                 use_bias=false, use_quadratic=true, T=T))
    model = ICNNChain(layers...)
    ps, st = Lux.setup(rng, model)
    return model, ps, st
end
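A minimal way to check the SPD claim numerically looks like the sketch below (not necessarily what examples/mwe.jl does). It assumes the custom layers are ForwardDiff-compatible, that the model output is a length-1 array, and that 2 input variables is a representative choice:

using ForwardDiff, LinearAlgebra

model, ps, st = create_icnn(2)          # 2 inputs, default hidden dims [32, 32]
f(x) = first(first(model(x, ps, st)))   # Lux models return (output, state); take the scalar output
H = ForwardDiff.hessian(f, zeros(2))    # Hessian of the scalar output at the origin
@show isposdef(Symmetric(H))            # SPD check
@show minimum(eigvals(Symmetric(H)))    # smallest eigenvalue; should be > 0 if SPD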
julia> include("examples/mwe.jl")
6.539322516230143
true
true
Or is this somehow a bug?