Does a very large number of observed variables (3207 vs 1188 equations) significantly increase ODEProblem memory?

Hi SciML team and community,

I’m developing PowerEMT.jl, an open-source Julia package for Electromagnetic Transient (EMT) simulation of large power systems using ModelingToolkit.jl, specifically for IBR-dominated power systems. The package relies heavily on hierarchical component-based modeling with many nested @named subsystems (inverters, generators, transmission lines, controllers, etc.).

After applying structural_simplify (and attempting mtkcompile), one of my test systems reports the following structure:

```text
Equations (1188):
  1188 standard: see equations(sys)
Unknowns (1188): see unknowns(sys)
  ⋮ (e.g. vg25₊iq(t), vg25₊id(t), vg25₊ucq(t), ...)
Parameters (2160): see parameters(sys)
  ⋮
Observed (3207): see observed(sys)
```

The resulting ODEProblem is extremely large — approximately 437 MiB.

My main question is:

Does such a high number of observed variables (here ~2.7× the number of equations) significantly contribute to the memory footprint of the generated ODEProblem? Specifically, does it inflate the size of the compiled RHS function and the ObservedFunctionCache?

In power system EMT models, most of these observed variables are internal measurements automatically generated by sub-components (voltages, currents, powers, etc.). For typical use cases, we only need a small subset of them for post-processing and analysis.

  • Is this memory overhead expected for hierarchical models of this scale?
  • Are there recommended ways to minimize or eliminate the compilation of unnecessary observed functions while keeping the core 1188 differential/algebraic equations intact?
  • Would moving non-essential observed equations out of the system (and computing them manually after solve()) be an effective strategy for a library like PowerEMT.jl?
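To make the third question concrete, here is a minimal generic sketch of what I mean by "computing them manually after solve()". This is a toy system, not the PowerEMT.jl model; the variable names are placeholders:

```julia
using ModelingToolkit, OrdinaryDiffEq

@variables t x(t)
D = Differential(t)

# Tiny stand-in system: only the state equation is registered
@mtkcompile sys = System([D(x) ~ -x], t)
prob = ODEProblem(sys, [x => 1.0], (0.0, 1.0))
sol = solve(prob, Tsit5())

# Derived "measurement" computed by hand from the state trajectory,
# instead of carrying it as an observed equation inside the system
p_inst = sol[x] .^ 2
```

Since the derived quantity never enters the compiled system, no observed code would be generated for it.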

Any insights, best practices, or suggestions for reducing this overhead before the first public release of PowerEMT.jl would be greatly appreciated!

Thanks in advance!

Nope, the observed equations are only evaluated on demand, so the solver memory is w.r.t. the unknowns and independent of the observed.
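For example (a generic toy sketch, not the model above): indexing the solution with an observed variable reconstructs it from the unknowns at access time, rather than storing it during the solve:

```julia
using ModelingToolkit, OrdinaryDiffEq

@variables t x(t) y(t)
D = Differential(t)

# y is algebraic, so compilation moves it into the observed set
@mtkcompile sys = System([D(x) ~ -x, y ~ 2x], t)
prob = ODEProblem(sys, [x => 1.0], (0.0, 1.0))
sol = solve(prob, Tsit5())

sol[y]  # evaluated on demand from sol[x]; not saved by the solver
```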

Are you doing jac=true? With sparse=true? Usually if the memory is big, it's from dense Jacobians.

Thank you Chris (and Aayush too) for the quick and clear reply!

Just to confirm the setup I’m using:

```julia
@mtkcompile sys = System(eqs, t, [], []; systems = systems)
prob = ODEProblem(sys, [], (0.0, 5.0); jac = true, sparse = true)
sol = solve(prob, TRBDF2(linsolve = KLUFactorization());
            reltol = 1e-5, abstol = 1e-5,
            maxiters = 100, dt = 1e-3)  # power simulation
```

Yes, jac=true + sparse=true + KLUFactorization as you suggested. The second and all subsequent solves are amazingly fast.

The bottleneck is purely the creation of the ODEProblem itself (it takes a long time, and the resulting prob is huge: ~437 MiB, as noted above). Aayush mentioned that a lot of the compile time is spent generating code for the observed variables, which makes sense because in our model the number of observed variables is currently ~2.7× the number of states/equations.
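For anyone wanting to reproduce the measurement: I'm estimating construction time and problem size roughly as follows (Base.summarysize for the size; the toy system here is just a stand-in for the real model):

```julia
using ModelingToolkit, OrdinaryDiffEq

@variables t x(t)
D = Differential(t)
@mtkcompile sys = System([D(x) ~ -x], t)

# Time problem construction and estimate its in-memory size
@time prob = ODEProblem(sys, [x => 1.0], (0.0, 1.0); jac = true, sparse = true)
println(Base.summarysize(prob) / 2^20, " MiB")
```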

I’m going to refactor the model to reduce the number of observed variables and see whether that shrinks both the problem size and the compilation time. I’ll report back once I have numbers.

Thanks again for the insight — really appreciate the help!
