Does Flux.jl layers make use of tensor cores in Nvidia GPUs?

lepton01 · August 27, 2023, 7:23am

I was searching for an answer in the Flux.jl, CUDA.jl, cuDNN.jl, but only found [https://juliagpu.org/2020-10-02-cuda_2.0/#low--and-mixed-precision-operations] which talks about independent GPU operations using CUDA.jl.
I have not found particular information about Flux.jl exploiting this technology.

maleadt · August 28, 2023, 7:31am

I don’t think Flux uses mixed-precision, so probably no. It is possible to configure CUDA.jl to use tensor cores more eagerly, at the expense of some precision, by starting Julia with fast math enabled or by calling CUDA.math_mode!(CUDA.FAST_MATH), which will e.g. use TF32 when doing an F32xF32 matmul. Further speed-ups are possible by setting CUDA.jl’s math precision to :BFloat16 or even :Float16. Ideally though, I guess Flux.jl would have an interface to use mixed-precision arithmetic.

Topic		Replies	Views
NVIDIA Tensor Cores not useful for double-precision simulations? GPU	12	5564	November 19, 2020
Flux.jl and the state of multi-processing Machine Learning	2	1632	February 27, 2019
Neural network in Flux.jl using CUDA is slower General Usage	0	478	July 15, 2020
Can Flux handle multiple GPUs? Machine Learning	16	2444	August 5, 2022
CUDA.jl 2.0: Per-thread streams, Float16, CUSPARSE clean-up Package Announcements	2	802	October 2, 2020

Does Flux.jl layers make use of tensor cores in Nvidia GPUs?

Related topics