Hello,
I’ve recently started using Flux.jl. I’ve seen that Flux provides different kinds of optimizers, such as Descent, ADAM, and so on.
I was just wondering: does the Descent optimizer perform full-batch gradient descent (i.e. the gradient is accumulated over all examples), or does it perform mini-batch/stochastic gradient descent (i.e. the gradient is computed over a smaller number of examples given by the batch size)? If it’s the latter, how can I select the batch size?
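For concreteness, this is roughly the kind of training setup I have in mind. It’s just a sketch based on my reading of the docs, assuming the classic implicit-parameters `Flux.train!` API together with `Flux.DataLoader`; I’m not sure whether the `batchsize` keyword here is really what controls this, or whether the optimizer itself plays a role:

```julia
using Flux

# Toy data: 100 examples with 4 features each, plus scalar targets.
X = rand(Float32, 4, 100)
Y = rand(Float32, 1, 100)

model = Dense(4, 1)
loss(x, y) = Flux.Losses.mse(model(x), y)
opt = Descent(0.01)                      # plain gradient-descent update rule

# My guess is that the batch size is chosen here, in the data iterator,
# rather than in the optimizer itself:
data = Flux.DataLoader((X, Y), batchsize = 10, shuffle = true)  # mini-batches of 10 examples
# data = [(X, Y)]                        # whereas a single tuple would mean full-batch descent?

Flux.train!(loss, Flux.params(model), data, opt)
```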
Thanks in advance!