Optimize 0-1 loss in MIP

Maximilian · January 6, 2020, 3:14pm

I am currently trying to solve the following optimization problem

Where N is the number of observations on \mathbf{x} predictors, and y \in \{-1, 1\} is a binary vector of length N, \lambda has to be integer, C_0, C_1 are tuning parameters penalizing complexity of the model. Illustrative example: suppose you would like to predict if a mushroom is poisonous (y) and have predictors like its odor and it’s color (x_1, x_2). I would like to obtain a solution similar to this:

However, I am new to optimization problems (I have a statistical background) and thought I’d be a nice opportunity to get to know Julia better.
Now I am a bit lost - I understood how to generally formulate and solve MIP problems with JuMP. However, I don’t know how to optimize the 0-1 loss between a predicted category and the actual category. Also, I don’t know how to formulate the first penalty term C_0 ||\lambda||_0 where ||\lambda||_0 is one if lambda is nonzero, and 0 if \lambda is zero.

Any help, and also resources about such types of problems, would be greatly appreciated.

Pictures from
Ustun, B., Traca, S., & Rudin, C. (2013). Supersparse linear integer models for interpretable classification. arXiv preprint arXiv:1306.6677 .

leethargo · January 8, 2020, 11:18am

I believe the paper you mention actually provides a full MIP formulation.

In any case, to represent the “0-norm”, you could introduce a binary variable a_j \in \{0, 1\} for every component j of \lambda. Then you can minimize the C_0\sum_j a_j.
You also need to add constraints of the form \lambda_j \ge M a_j where M is a suitable upper bound on the values that \lambda can take.

robertfeldt · May 2, 2020, 12:11am

@Maximilian did you find a solution to representing 0-1 Loss in JuMP? When I try this:

loss01 = @expression(model, (y .* (X * lambda)) .<= 0.0)

I get a MethodError:

ERROR: MethodError: no method matching isless(::GenericAffExpr{Float64,VariableRef}, ::Float64)

Any tricks to handle this?

odow · May 2, 2020, 1:32am

Use @constraint instead of @expression. Take a read of the JuMP documentation under “constraints” and “expressions.”

robertfeldt · May 2, 2020, 6:15pm

Thanks, but this is not a constraint. It is a part of a larger expression that is to be in the objective. So not clear one can use contraint instead.

leethargo · May 2, 2020, 7:18pm

But in that case, you can’t use <=. The @expression should probably only cover the left-hand side of the inequality?

Topic		Replies	Views
Seeking a smart way to formulate an optimization problem Optimization (Mathematical)	4	519	November 6, 2021
Formulating objective function, DimensionMismatch Optimization (Mathematical) question , jump	6	163	January 19, 2024
SciML Optimization - algorithms and convergence speed Optimization (Mathematical)	10	588	May 29, 2023
Solving optimization problems with bilinear matrix inequalities (BMI) in Julia Optimization (Mathematical) jump , optimization	3	907	June 6, 2021
Linear equation with integer digits solution Optimization (Mathematical) jump	3	401	October 27, 2021

Optimize 0-1 loss in MIP

Related topics