Minimizing a logsumexp function with many terms (Convex.jl)

I am trying to minimize a function that depends on only a few parameters but contains a logsumexp with many terms in the sum, i.e. a function of the form

f(x) = \log \left(\sum_{i=1}^n e^{a_i \cdot x}\right)

where a_i, x \in \mathbb{R}^m. In my case, the number of terms n is very large, but the number of parameters m is small. Typically, I might have n \sim 10^5 and m \sim 10.
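For concreteness, this is the function evaluated directly (a minimal sketch; the use of LogExpFunctions.jl and the sizes here are just for illustration, with row i of A playing the role of a_i):

using LogExpFunctions  # provides a numerically stable logsumexp

n, m = 10^5, 10
A = randn(n, m)          # row i is a_i
f(x) = logsumexp(A * x)  # f(x) = log(sum_i exp(a_i . x))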

Naively putting this function into Convex.jl will result in O(n) variables and constraints being passed to the solver, which I am hoping to avoid.

Is there any way of representing this function efficiently?

Have you tried JuMP?

Could you post a minimal example? (See also Please read: make it easier to help you). It looks like one might be able to avoid ~n variables/constraints but it would be easier to figure it out with some code to work with.


Thanks for your replies. Here is a minimal example for a moderate number of terms:

using Convex, SCS
A = randn(1000, 10)
x = Variable(10)
problem = minimize(Convex.logsumexp(A*x))
solve!(problem, SCS.Optimizer)

This example has 10 variables and 1000 terms, and looking at the output of SCS shows that 1012 variables and 3002 constraints are passed to the solver:

problem:  variables n: 1012, constraints m: 3002
cones:    z: primal zero / dual free vars: 1
          l: linear vars: 1
          e: exp vars: 3000, dual exp vars: 0
settings: eps_abs: 1.0e-04, eps_rel: 1.0e-04, eps_infeas: 1.0e-07
          alpha: 1.50, scale: 1.00e-01, adaptive_scale: 1
          max_iters: 100000, normalize: 1, rho_x: 1.00e-06
          acceleration_lookback: 10, acceleration_interval: 10
lin-sys:  sparse-direct-amd-qdldl
          nnz(A): 13002, nnz(P): 0

I found that problems with more than 10,000 terms easily consume >20 GB of memory.

It is my understanding that JuMP would require me to formulate the problem in a solver-compatible form myself, which I am not sure how to do with fewer than O(n) variables.

I found a relevant section of the docs, though I haven't tried it myself:

https://jump.dev/JuMP.jl/stable/tutorials/conic/tips_and_tricks/#Log-sum-exp
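Roughly, the trick described there is to minimize an epigraph variable t subject to

\log \left(\sum_{i=1}^n e^{y_i}\right) \le t \iff \sum_{i=1}^n e^{y_i - t} \le 1,

with y = A x, and to enforce the right-hand side with exponential-cone constraints: introduce u_i such that (y_i - t, 1, u_i) lies in the exponential cone (i.e. u_i \ge e^{y_i - t}) and require \sum_{i=1}^n u_i \le 1.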

The following seems to work, but it still has many variables:

using JuMP, SCS

N, M = 10, 1000
A = randn(M, N)

model = Model(SCS.Optimizer)

@variable(model, x[j = 1:N])
@variable(model, y[i = 1:M])  # y = A * x
@variable(model, u[i = 1:M])  # u[i] >= exp(y[i] - t)
@variable(model, t)           # epigraph variable: t >= logsumexp(y)

@objective(model, Min, t)

@constraint(model, A * x .== y)
@constraint(model, sum(u) <= 1)
@constraint(model, [i = 1:M], [y[i] - t, 1, u[i]] in MOI.ExponentialCone())

optimize!(model)
value(t)

Thanks a lot! I just tried your JuMP code with a larger number of terms and it seems to work. While this actually creates more variables and constraints than the Convex.jl version, the memory requirement is much, much lower (50,000 terms seem to use ~1 GB). So maybe a clever reformulation of the problem is not needed after all.

Maybe the excess memory requirement is an issue with Convex.jl (possibly related to this)?


More generally, Convex.jl is not actively maintained, whereas the JuMP developers are extremely responsive, so betting on the latter is a good call for current projects.


You could try Convex#master. Convex has not been maintained much over the years but we merged a big refactor and Oscar (who does an amazing job maintaining tons of JuMP packages) also spent some time cleaning up Convex, fixing bugs and writing tests. Those changes haven’t made it to a release yet though.


The smaller JuMP formulation is:

using JuMP, SCS
N, M = 10, 1_000
A = randn(M, N)
model = Model(SCS.Optimizer)
@variable(model, x[1:N])
@variable(model, u[1:M])  # u[i] >= exp(A[i, :]' * x - t)
@variable(model, t)       # epigraph variable
@objective(model, Min, t)
@constraint(model, sum(u) <= 1)
@constraint(model, [i in 1:M], [A[i, :]' * x - t, 1, u[i]] in MOI.ExponentialCone())
optimize!(model)
value(t)

Do you think it changes anything in terms of performance?

Not really. It just avoids adding the y variables and the equality constraints A * x .== y.