Hello experts,
I have recently been trying out Flux.jl to start a research project on ML. I quickly ran into the topics of network compression and knowledge distillation. I have a background in physics, and I have heard about the success of SVD in tensor network algorithms. I then came across this paper:
"The Singular Values of Convolutional Layers" (by Hanie Sedghi, Vineet Gupta and Philip M. Long from Google Brain)
My question is: has anyone implemented a Flux equivalent of the kernel SVD described in this paper?
Thanks in advance