Knet vs Flux etc

denizyuret · November 9, 2018, 2:51pm

Sounds good. I think there are three sources of potential speed-up:

AD: AutoGrad vs Flux vs Zygote vs Capstain etc.
Alloc: KnetArray vs CuArray vs CPU etc.
Kernels: Knet kernels vs CUDANative/Flux vs CPU etc.

My GPU experiments vary all 3 components, which makes it difficult to pinpoint causes. My CPU experiments only vary the AD, so that can give us some clues right away. I think I can easily run Knet with CuArray alloc/kernels which should give another AD comparison. Your suggestion of using CuArray allocator with Knet kernels should highlight allocator differences. This is a bit more difficult to implement (the kernels dispatch based on the KnetArray type) but doable. We can probably figure out other combinations of the above three components that will inform the optimization work.

Topic		Replies	Views
Flux vs Knet for research and production Machine Learning knet , flux	8	1943	March 27, 2022
Flux vs Tensorflow training performance comparison? General Usage tensorflow , flux , machine-learning	5	6469	January 3, 2020
Best Julia Package for Neural Networks Machine Learning question	12	2668	September 16, 2020
Knet 1.1.1 is out: performance improvements, new monitoring tools, new gpu memory manager Machine Learning	0	660	October 1, 2018
Flux or Knet? General Usage question	7	2320	June 3, 2021

Knet vs Flux etc

Related topics