How to access variables and expressions faster in jump

Betristor · February 6, 2023, 8:34am

I established a very large model using julia with plenty of variables and expressions. Though not all of these will be meaningful for accessing, I still wrote all into results in dataframe and then saved to csv files. I’d like to know how could I access a column or row of a large matrix, let’s say in two or three dimensions with index length over hundred, and fill them into a dataframe with fast speed.

For example, I have an expression named NeededExpr in JuMP with index f,z,t, the length of them is 6, 31 and 168 (or larger). And I need NeededExpr[f, z, t] with given f and z, should I use value.(NeededExpr[f, z, t]) or value.(NeededExpr)[f, z, t]?

jd-foster · February 6, 2023, 8:42am

Moving this to the Optimization category (you’re more likely to get people who know the answers to your question).

In the meantime, have a look at this section of the JuMP docs:

jd-foster · February 6, 2023, 10:02am

If you’re asking which is faster for access to the single value, it’s the former and the dot . is not needed. If you want to obtain a whole column of values, it’s the latter.

Betristor · February 6, 2023, 3:00pm

Typically, the last index represents time and is the biggest. In practice we need to store and access it for many times. When I discussed with others about how julia store and access data, we came into a claim that julia stores and accesses data in column order, so if we place t at the first position like NeededExpr[t, f, z] and execute value.(NeededExpr)[:, f, z], will this be faster than NeededExpr[f, z, t] and value.(NeededExpr)[f, z, :] with given f and z?

odow · February 6, 2023, 7:42pm

value.(NeededExpr[f, z, t]).

# value.(NeededExpr[f, z, t]) is equivalent to
a = NeededExpr[f, z, t]
value.(a)

# value.(NeededExpr)[f, z, t] is equivalent to

a = value.(NeededExpr)
a[f, z, t]

The second approach computes the value of all variables, and then subsets that. Computing the value of every variable only to pick a few is expensive.

we came into a claim that julia stores and accesses data in column order

Ignore this suggestion (for now). It is almost certainly not something that will make a material difference to the runtime of your function.

The real answer for what you should do depends on how you’re going to use the results.

If you want only a few individual values once, then use value.(NeededExpr[f, z, t]).
If you’re going to eventually access the value of every variable, and some indices multiple times, then store the solution once in solution = value.(NeededExpr) and query from that.

The most common bottleneck is getting solutions out of the solver with the value call.

It’s hard to provide more specific advice without a reproducible code example of exactly what you’re trying to do and achieve.

Topic		Replies	Views
About JuMP's speed of building an expression Optimization (Mathematical) question , jump	20	287	June 1, 2025
Matrix multiplication time with JuMP variables Optimization (Mathematical) jump	5	140	August 29, 2024
Alternatives to @expression for complex model constriction Optimization (Mathematical) jump	4	884	February 7, 2017
Performance: Fast way to access numbers in Dataframes or alternatives Performance dataframes , data_structures	12	1172	November 15, 2022
Reading lots of variables and passing them to JuMP model's variables and constraints automatically General Usage jump	1	362	April 13, 2018

How to access variables and expressions faster in jump

Related topics