Hi dear,
I did the same comparison using tensorflow, and the GPU performed better, so maybe Flux is not optimized on that type of GPU.
Anyway, thanks a lot for your support and guidance on this issue.
How much better?
If you feel up to it, you can profile the run, using e.g.
nvprof path/to/julia myfile.jl
and see what is taking time for the Flux model.
If not, it would make a valuable issue now that you have side-by-side Flux/TF implementations.
Hi,
I ran MNIST training for 45 epochs with different batch sizes and got the results below:
CPU:
batch size 100: 479.395901 seconds (52.17 M allocations: 303.536 GiB, 4.77% gc time)
batch size 512: 160.653196 seconds (6.71 M allocations: 184.139 GiB, 10.34% gc time)
batch size 1024: 256.346342 seconds (3.39 M allocations: 169.667 GiB, 53.28% gc time)
batch size 2048: 250.305340 seconds (1.73 M allocations: 162.432 GiB, 55.05% gc time)
GPU:
batch size 100: 483.669281 seconds (33.77 M allocations: 302.615 GiB, 4.81% gc time)
batch size 512: 159.605954 seconds (6.78 M allocations: 184.142 GiB, 10.33% gc time)
batch size 1024: 255.784214 seconds (3.45 M allocations: 169.670 GiB, 53.21% gc time)
batch size 2048: 246.802858 seconds (1.80 M allocations: 162.434 GiB, 55.49% gc time)
GPU (TensorFlow):
batch size 100: 368 seconds
batch size 512: 125 seconds
batch size 1024: 111 seconds
batch size 2048: 97 seconds
@kristoffer, actually I'm using a notebook, not a .jl file, so is there anything I need to do to run this command?
nvprof path/to/julia myfile.jl
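One way to do this (a sketch, assuming a Jupyter notebook with a Julia kernel; the name `mynotebook.ipynb` is a placeholder for your actual notebook) is to export the notebook to a plain Julia script and profile that script instead:

```shell
# Export the notebook's code cells to a plain script (requires jupyter nbconvert).
# For a Julia-kernel notebook this writes mynotebook.jl next to the notebook.
jupyter nbconvert --to script mynotebook.ipynb

# Profile the exported script; replace path/to/julia with your Julia binary.
nvprof path/to/julia mynotebook.jl
```

The resulting nvprof output should show which CUDA kernels and memory transfers dominate the run, which would help narrow down where the Flux model spends its time.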