A implementation of ResNet-18 uses lot of GPU memory

Iulian.Cioarca · March 25, 2020, 8:55pm

In my case your Flux implementation takes around 7 mins per epoch with batchsize of 64, but my GPU might not be as fast as yours. It’s quite busy, at 100%.
Tensorflow trains in 6 min per epoch or total?

Edit: are you using FP16 on RTX2070?

Topic		Replies	Views
Memory challenges for Flux on Resnet Machine Learning gpu	8	1473	September 7, 2022
Flux runs out of memory Machine Learning memory-allocation , flux	25	4585	June 1, 2023
MNIST GPU CuArrays error GPU	23	3185	January 22, 2019
Memory usage increasing with each epoch Machine Learning cuda , flux	18	918	April 14, 2025
Flux Transformer Out of Memory Machine Learning	25	1670	March 13, 2023

A implementation of ResNet-18 uses lot of GPU memory

Related topics