Significantly Higher VRAM Usage and Slower Training on Flux Compared to PyTorch

yolhan_mannes · May 18, 2026, 6:43am

There is also Lux.jl/examples/ImageNet at main · LuxDL/Lux.jl · GitHub with some inside you could ask for yours to be added in there

giordano · May 18, 2026, 8:05am

You can disable that by setting the environment variable XLA_REACTANT_GPU_PREALLOCATE=false (it may make performance slightly worse though).

Alternatively, if you are on GPU you can get memory statistics with Reactant.XLA.allocatorstats

using Reactant
device = Reactant.XLA.default_device(Reactant.XLA.default_backend())
Reactant.XLA.allocatorstats(device)

This will tell you the memory actually used (and the peak memory at any past point) within the preallocated pool.

camilodlt · May 18, 2026, 11:51am

Another roadblock I encountered in my cv models in lux and flux was the dataloader and image augmentation. I don’t know if it has changed this past year, but there were no so many maintained image augmentations libraries ( at least close to the level of PyTorch). Another good thing in PyTorch is the multi process dataloader which can speed up training a lot.

yolhan_mannes · May 18, 2026, 12:17pm

Reactant let you mesh directly the data and the paremeters however you like between your devices and MLUtils.jl do provide a DataLoader. As for image augmentations I’m not very familiar with Images.jl and everything in there but its big and feature request are welcome I think.

csvance · May 18, 2026, 1:51pm

Julia is actually insanely capable for this sort of thing; the issue is that no one has put together an end-to-end off the shelf solution aimed at standard workflows. I use a Julia based data pipeline in both Python and Julia CV workflows, but I’m not sure it makes sense to try to open source since its specialized to medical imaging workflows (DICOM/Lossless JPEG, UInt8/UInt16 only) living entirely outside of the Images.jl ecosystem.

camilodlt · May 18, 2026, 2:59pm

I ended up using pytorch’s dataloader with augmentations through pythoncall for a while but it was not ideal. I agree that Julia has a lot of potential there too ofc, since multithreading and multi processing works really well.

Topic		Replies	Views
Slow LSTM on GPU in Flux Machine Learning gpu , flux , pytorch	21	2447	February 15, 2024
Flux ready for a beginner deep learning project? Machine Learning flux	31	9042	June 20, 2019
Deep learning in Julia Machine Learning	35	13349	April 22, 2024
Flux running slow? Machine Learning	16	2987	August 19, 2021
Is it a good time for a PyTorch developer to move to Julia? If so, Flux? Knet? Machine Learning	52	25950	January 11, 2021

Significantly Higher VRAM Usage and Slower Training on Flux Compared to PyTorch

Related topics