[ANN] Durbyn.jl — Time Series Forecasting in Julia

@langestefan I should clarify that I wasn’t referring specifically to the packages mentioned, but speaking more generally: I can’t just add dependencies simply because they exist. Every dependency introduces long-term maintenance risks, so I’m careful about what I include.

I’m also not dismissing the importance of Optim.jl; it’s a great package. But in this particular use case, it didn’t perform as well as needed. More broadly, time series forecasting in Julia is still not a mature area, and it sometimes requires specialized tools. That’s exactly what Durbyn.jl is trying to provide. For example, in the R ecosystem, tsibble was developed specifically for time series even though tibble and data frames already existed, because the needs were different.

Without tsibble, the tidy forecasting framework (Fable) in R simply wouldn’t have been possible.

Similarly, why should I bring in GLM.jl just to compute OLS residuals, when I can simply do:

β = X \ y
fitted = X * β
residuals = y - fitted


using LinearAlgebra: diag

function ols(y, X)
    β = X \ y
    fitted = X * β
    residuals = y .- fitted
    n, p = size(X)
    df_residual = n - p
    σ2 = sum(abs2, residuals) / df_residual
    cov_β = σ2 * inv(X' * X)        # OLS coefficient covariance
    se = sqrt.(diag(cov_β))
    # Returned as a NamedTuple to keep the snippet self-contained
    # (Durbyn wraps this in an OlsFit struct).
    return (; β, fitted, residuals, σ2, cov_β, se, df_residual)
end

and wrap that in a lightweight function called ols? That avoids adding a heavy dependency while still providing the functionality needed.

I’m a bit puzzled by the criticism here. For my use case I just need a very small, labeled matrix container (row/column names with shape checks), nothing like the full feature set of DataFrames.jl. Pulling in DataFrames.jl for this would add a heavy dependency which Durbyn doesn’t need.


struct NamedMatrix{T}
    data::Matrix{T}
    rownames::Union{Vector{String},Nothing}
    colnames::Vector{String}

    function NamedMatrix{T}(data::Matrix{T},
                            rownames::Union{Vector{String},Nothing},
                            colnames::Vector{String}) where {T}
        if rownames !== nothing && size(data,1) != length(rownames)
            error("Row names do not match number of rows")
        end
        if size(data,2) != length(colnames)
            error("Col names do not match number of columns")
        end
        new{T}(data, rownames, colnames)
    end
end
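To illustrate how such a container behaves, here is a quick sketch that repeats the definition above so it runs standalone; the outer convenience constructor and the example data are my additions, not Durbyn’s actual API:

```julia
struct NamedMatrix{T}
    data::Matrix{T}
    rownames::Union{Vector{String},Nothing}
    colnames::Vector{String}

    function NamedMatrix{T}(data::Matrix{T},
                            rownames::Union{Vector{String},Nothing},
                            colnames::Vector{String}) where {T}
        if rownames !== nothing && size(data, 1) != length(rownames)
            error("Row names do not match number of rows")
        end
        if size(data, 2) != length(colnames)
            error("Col names do not match number of columns")
        end
        new{T}(data, rownames, colnames)
    end
end

# Convenience outer constructor (my addition) so callers need not spell out T
NamedMatrix(data::Matrix{T}, rownames, colnames) where {T} =
    NamedMatrix{T}(data, rownames, colnames)

m = NamedMatrix([1.0 2.0; 3.0 4.0], ["r1", "r2"], ["a", "b"])
m.data[1, 2]    # → 2.0
```

Construction with mismatched dimensions (e.g. two row names for a one-row matrix) errors at the inner constructor, which is the whole point of the shape checks.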

1 Like

Sure, that makes sense. But packages like StatsModels.jl, Optim.jl, and Tables.jl are used by many downstream packages with many thousands of users depending on them. For me personally, that is enough assurance that those packages will continue to be maintained. But this is always a risk with open source software, and I don’t think you can ever avoid it.

I haven’t looked at your implementation, but can’t you just plug in any black-box optimizer from BlackBoxOptim.jl? Why specifically Nelder-Mead?

GLM.jl is a very old package, dating back to 2012. But actually, there are lots of reasons why you would not want to write your own solver and instead pick something proven and battle-hardened (absolutely not GLM.jl, because it is a very high-level package). I don’t want to discourage your development, by the way; if your solver works better, then it’s better. But as a user, I would prefer to just pick something off the shelf which I am already using in other packages.

I think you misunderstand that suggestion. You do not need a dependency on DataFrames.jl to support DataFrames.jl as an input. You only need to support the Tables.jl interface, and then you can take any input which is compatible with it. That includes dataframes, but your package isn’t even aware it is taking a dataframe as input 🙂
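As a hedged sketch of what that looks like in practice (assuming Tables.jl is installed; `fit_model` and the keyword `target` are hypothetical names, not Durbyn’s actual API):

```julia
using Tables   # the only dependency needed to accept DataFrames, CSV.File, etc.

# Hypothetical entry point: pull the target column out of any
# Tables.jl-compatible input and forward to the array-based core.
function fit_model(tbl; target::Symbol)
    cols = Tables.columns(tbl)                        # works for any table source
    y = collect(Float64, Tables.getcolumn(cols, target))
    return fit_model(y)                               # dispatch to the Vector method
end

# Stand-in "core" model that only ever sees a plain vector
fit_model(y::Vector{Float64}) = (; n = length(y), mean = sum(y) / length(y))

# A NamedTuple of vectors is itself a valid Tables.jl table:
fit_model((sales = [1.0, 2.0, 3.0],); target = :sales)   # → (n = 3, mean = 2.0)
```

The array-based internals stay untouched; the table method is just a thin adapter in front of them.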

1 Like

Can I ask how you’ve determined that your Nelder-Mead implementation is faster (for equivalent tolerances) than the Optim.jl one?

If I just run the example from the Optim Nelder Mead docs I get:

julia> using Chairmarks, Optim

julia> f(x) = (1.0 - x[1])^2 + 100.0 * (x[2] - x[1]^2)^2
f (generic function with 1 method)

julia> optimize(f, [.0, .0])
 * Status: success

 * Candidate solution
    Final objective value:     3.525527e-09

 * Found with
    Algorithm:     Nelder-Mead

 * Convergence measures
    √(Σ(yᵢ-ȳ)²)/n ≤ 1.0e-08

 * Work counters
    Seconds run:   0  (vs limit Inf)
    Iterations:    60
    f(x) calls:    117


julia> nmmin(f, [.0, .0], NelderMeadOptions())
(x_opt = [1.0006866976618765, 1.0021003276109683], f_opt = 5.324607341588884e-5, n_iter = 100, fail = 0, evals = 74)

julia> @b optimize(f, [.0, .0])
10.200 μs (320 allocs: 7.375 KiB)

julia> @b nmmin(f, [.0, .0], NelderMeadOptions())
8.600 μs (825 allocs: 36.078 KiB)

so the Durbyn implementation (which I just copy-pasted from the nmmin.jl file in the repo) is about 15% faster (while allocating 2.5x the memory), but it finds an objective value that’s four orders of magnitude worse, so this difference seems to be mostly a tolerance thing. When tuning the Durbyn implementation’s tolerance to get a similar objective value:

julia> nmmin(f, [.0, .0], NelderMeadOptions(; abstol = 1e-8))
(x_opt = [0.999950642570824, 0.9999048113257563], f_opt = 3.6778357780875667e-9, n_iter = 100, fail = 0, evals = 106)

julia> @b nmmin(f, [.0, .0], NelderMeadOptions(; abstol = 1e-8))
12.100 μs (1169 allocs: 51.016 KiB)

I see 20% lower performance (and 4x the memory allocation) relative to Optim.jl.

Of course the function I’m optimizing here might not be a good proxy for the typical workload in Durbyn, but my point more broadly is that it’s not easy to reliably benchmark these things, especially when building a user-facing package where you might not be able to perfectly anticipate the workloads.

More broadly, optimization is a fundamental building block of the Julia ecosystem, and there are many domain experts in the community willing to help out with optimization problems. By rolling your own, you’re missing out on one of the greatest strengths of the Julia community.

And just to add: the flip side of that is, of course, that if you actually have a Nelder-Mead implementation that reliably outperforms Optim.jl’s, the best thing to do for the community would be to upstream it into Optim.jl so that everyone (not just time series forecasters using Durbyn) can benefit from it! That will also buy you community support for maintaining and improving your implementation.

4 Likes

I did experiment with different solvers; they were not performing well for ets(), which is why I started working on a Nelder-Mead implementation.

A data frame interface is still a long way off, but an interface similar to R’s Fable would be nice, perhaps as another package.

@nilshg Optimazation in ETS function done vie opim · taf-society/Durbyn.jl@1db4d0f · GitHub

I worked a lot on integrating Optim.jl into Durbyn and experimented extensively. This commit is only one example; there were many other experiments that I did not commit to git. As I’ve said repeatedly, Optim.jl is a great package, but for this case I will go in-house. Please feel free to fork the package, change the optimization function, and create a pull request if it performs better than Durbyn’s Nelder-Mead implementation.

2 Likes

Thank you very much for your package, Resul Akay. It looks extremely well thought out with a super clean interface. I also hope that the TAFS Forecasting Ecosystem, supported by the Time Series Analysis and Forecasting Society, will become a permanent part of the Julia Ecosystem. Taking this opportunity, may I ask if you are planning to further develop the package and to support more contemporary time series analysis and forecasting methods in the future?

4 Likes

Congrats! A very nice package.

It’s also good to see that we have a Julia counterpart of R’s well-known forecast package. I’ve just tried it out, and it looks like the auto_arima function works like a charm.

Thank you!

2 Likes

@j_u Yes, I’m leading the development of Durbyn.jl and planning to extend it further — including ML-based forecasting. I also develop in R and Python, but Julia has been an amazing language to work with for high-performance forecasting.

Thank you very much for your kind words and encouragement! I really appreciate the friendliness of the community. I realize I may have gotten off on the wrong foot in some earlier discussions, and at times it felt a bit like the criticism was directed at me personally. But I take this as part of the learning process, and I’m committed to making Durbyn and the TAFS Forecasting Ecosystem a lasting contribution to Julia’s ecosystem.

10 Likes

I made a PR to show how you could implement the Tables.jl interface: Tables.jl interface example and other tweaks by langestefan · Pull Request #3 · taf-society/Durbyn.jl · GitHub

As you can see it’s only a single function, very simple.

But your package currently doesn’t have a unified interface, so you would have to implement the dispatch for every single model, which is not very nice. To make it more flexible, you can have a single forecast method which then dispatches to specific algorithms.
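A hedged sketch of that dispatch pattern (all type and function names here are hypothetical stand-ins, not Durbyn’s actual API): one generic `forecast` entry point, with a method per fitted-model type.

```julia
# All names are hypothetical sketches, not Durbyn's actual API.
abstract type AbstractModel end

struct EtsFit <: AbstractModel
    level::Float64              # toy state: last smoothed level
end

struct MeanFit <: AbstractModel
    mu::Float64                 # toy state: historical mean
end

# One generic user-facing entry point; Julia's multiple dispatch routes each
# fitted-model type to its own method.
forecast(m::EtsFit, h::Integer)  = fill(m.level, h)   # flat forecast from last level
forecast(m::MeanFit, h::Integer) = fill(m.mu, h)      # mean forecast

forecast(EtsFit(4.2), 3)    # → [4.2, 4.2, 4.2]
```

New models then only need to define their own `forecast` method; callers never change.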

I also noticed you didn’t have any unit tests or a runtests.jl, so I added that. Now you can run ] test to execute the unit tests. The tests also show how to use your forecast models with a table as input.

3 Likes

@langestefan Thank you for the effort. I will check.
Yes, tests are still missing; I had my hands full 🙂

1 Like

When I was talking about a data frame interface, I was talking about the following process, which allows forecasting at scale:

  1. Shape the data into a panel, imported from a CSV or Excel file containing, say, 100K products:

pt   = PanelTable(data; groups=[:product_id], date=:date, freq=:month, ml_data=false)

  2. Create a ModelSpec (example):

models = ModelSpec(
  ETS(y = . ),     # ETS example (univariate), auto ETS
  ARIMA(y = p() + d() + q() + P(0) + D(0) + Q(0)), # non-seasonal auto_arima with drift term
)

  3. Fit the 100K time series (products) in parallel over series:

fit = fit(model_spec = models, data = pt, parallel = true, ncore = -1)

  4a. Forecast a fixed horizon (univariate models like ETS):

fc = forecast(fit, 12)              # 12 steps ahead for all products

  4b. Or, if the model uses exogenous features:

fc = forecast(fitobj, newdata = X_future)  # X_future aligned with horizon and groups

The base models will always work on plain arrays; if I introduce tables at this stage, I know I will regret it in the future. Thanks again for your effort.
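For what it’s worth, the "fit in parallel over series" step above can be sketched in plain Julia with `Threads.@threads` over the group keys; everything here (the toy data, `fit_one`) is a hypothetical stand-in, not Durbyn’s actual API:

```julia
# Toy stand-ins: group ids and observations; in practice these would come
# from the panel container.
ids = repeat(1:4, inner = 5)        # 4 "products", 5 observations each
y   = rand(20)

# Collect the observation indices belonging to each group.
groups = Dict{Int,Vector{Int}}()
for (i, g) in enumerate(ids)
    push!(get!(groups, g, Int[]), i)
end

fit_one(series) = (; mu = sum(series) / length(series))   # stand-in model fit

gids = collect(keys(groups))
fits = Vector{Any}(undef, length(gids))
Threads.@threads for k in eachindex(gids)
    fits[k] = fit_one(y[groups[gids[k]]])    # each series fitted independently
end
```

Since each series is fitted independently, this is embarrassingly parallel; the thread count is controlled by starting Julia with `julia -t N`.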

But you can also do this perfectly well with StatsModels.jl, right?

My PR did not change that; it just added a dispatch on tables. Nothing was changed about your model internals. That’s the power of Julia’s dispatch!

I think the way you have implemented the interface right now just makes it more inconvenient for you to maintain.

Yes, I use it a lot.

I will check, thanks for the suggestion.

I’m also organizing a workshop on “The Grammar of Forecasting” with many leading forecasting experts from the Python and R developer and academic communities.

1 Like