I can’t comment on your first paragraph, but the truth is most DL applications just don’t need the level of precision afforded by 64-bit floats. Normalization and other forms of regularization generally encourage smaller-magnitude weights, which can more effectively use the limited precision of float32 or even float16. If anything, models that are sensitive to small weight perturbations are also more likely to be susceptible to adversarial attacks, and training with noisy data can actually improve network generalization.
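
For a rough sense of what "limited precision" means here, a quick check of the machine epsilon and range for each dtype (via `torch.finfo`) shows how much you give up going from float64 down to float16:

```python
import torch

# Machine epsilon (smallest relative spacing around 1.0) and max value per dtype.
# float64: ~2.2e-16, float32: ~1.2e-7, float16: ~9.8e-4
for dtype in (torch.float64, torch.float32, torch.float16):
    info = torch.finfo(dtype)
    print(dtype, "eps:", info.eps, "max:", info.max)
```

For well-regularized weights sitting in a modest range, the float32 (or even float16) spacing is usually far below the noise floor of the training process itself.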
As for speeding up training, mixed-precision training has been gaining traction of late (see e.g. torch.cuda.amp).
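
If it helps, here's a minimal sketch of how torch.cuda.amp is typically used; the model, optimizer, and `loader` below are just placeholders for your own:

```python
import torch
from torch import nn
from torch.cuda.amp import autocast, GradScaler

# Placeholder model/optimizer/loss for illustration only.
model = nn.Linear(1024, 10).cuda()
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()
scaler = GradScaler()  # scales the loss to avoid float16 gradient underflow

for inputs, targets in loader:  # `loader` is assumed to be your DataLoader
    inputs, targets = inputs.cuda(), targets.cuda()
    optimizer.zero_grad()
    with autocast():               # forward pass runs in mixed precision
        outputs = model(inputs)
        loss = loss_fn(outputs, targets)
    scaler.scale(loss).backward()  # backward on the scaled loss
    scaler.step(optimizer)         # unscales gradients, then steps the optimizer
    scaler.update()                # adjusts the scale factor for the next iteration
```

On recent GPUs with tensor cores the float16 matmuls in the autocast region can give a substantial speedup, while the master weights and optimizer state stay in float32.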