In Flux.jl, I wanted to modify the behavior of BatchNorm, so I changed the code here. My change took effect when the model ran on the CPU, but I noticed it had no effect on the GPU.
After some investigation, I found that after transferring the model m to the GPU with m = m |> gpu, the code linked above was no longer executed.
What is the general procedure for making changes to the Flux.jl source code take effect on GPU as well? I think the code actually executed on GPU might be this, but I'm not sure how to modify it because it eventually goes through @ccall.
For historical reasons, the CUDA path for Flux's batchnorm lives in its own file: once the arrays are CuArrays, Julia's dispatch selects those GPU-specific methods instead of the CPU code you edited. So you'd either have to remove that CUDA code or redefine those methods so your own version also runs on GPU. Medium-term, we plan to clean up the API layering around the norm layers so that all batchnorm layer methods can live in one place.
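As a rough illustration of the second option, here is a minimal sketch. It assumes Flux's internal layout (a CuArray-specialized call method backed by cudnn) and is training-mode only: it ignores the layer's running statistics and active/testmode handling. `my_batchnorm` is a hypothetical name, and it is written with plain broadcasting and reductions so the same code runs on both CPU and GPU arrays without touching the @ccall wrapper:

```julia
using Flux, CUDA, Statistics

# Hypothetical custom forward pass (training-mode only, no running stats).
# Generic array ops mean this works for both Array and CuArray inputs.
function my_batchnorm(bn::Flux.BatchNorm, x::AbstractArray)
    N = ndims(x)
    chdim = N - 1                                   # channel dimension
    red = filter(!=(chdim), ntuple(identity, N))    # reduce over all other dims
    μ  = mean(x; dims = red)
    σ² = var(x; dims = red, corrected = false)
    shape = ntuple(i -> i == chdim ? size(x, chdim) : 1, N)
    γ = reshape(bn.γ, shape)                        # learned scale
    β = reshape(bn.β, shape)                        # learned shift
    return γ .* (x .- μ) ./ sqrt.(σ² .+ bn.ϵ) .+ β
end

# Shadow the cudnn-backed method with a more specific one, so GPU calls
# hit your code instead. The exact signature Flux uses internally is an
# assumption; check the CUDA extension file for the current one.
(bn::Flux.BatchNorm)(x::CUDA.CuArray{Float32,4}) = my_batchnorm(bn, x)
```

Redefining a method that Flux itself owns is essentially piracy and can break on Flux upgrades, so for anything beyond experimentation a custom layer type with its own call method is the safer route.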