Lately, I’ve seen second-order optimization suddenly becoming a trend in deep learning.
Is this
- An opportunity that Julia neural network system can do well?
- Something that would rather shift the balance toward PyTorch/etc?
- Just yet another development that doesn’t shift the landscape too much?
Julia is said to support high-order automatic differentiation. Now that 2nd order optimization is becoming more popular in neural network, is this an opportunity for Julia?
Here’s arguably the paper that started the trend.
The landscape is changing fast.