Speed Comparison Python v Julia for custom layers

That’s to be expected, since Flux is still doing the work of applying a null bias and a no-op activation function.

The faster Julia implementation is the one that avoids the Dense() layers, instead working with plain matrices and using Flux only to initialise the parameters. This means it skips the null-bias operation entirely, and the only activation function applied, in both the Python and Julia versions, is the softmax.
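To make the difference concrete, here is a rough NumPy sketch of the two formulations (the layer sizes and initialisation are hypothetical, just for illustration): a Dense-style pass that still adds a zero bias, versus a bare matrix multiply that skips it.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    # Subtract the column max for numerical stability, then normalise.
    e = np.exp(z - z.max(axis=0, keepdims=True))
    return e / e.sum(axis=0, keepdims=True)

# Hypothetical layer sizes, just for illustration.
n_in, n_out, batch = 64, 32, 128
W = rng.normal(size=(n_out, n_in)).astype(np.float32)
b = np.zeros((n_out, 1), dtype=np.float32)   # null bias
x = rng.normal(size=(n_in, batch)).astype(np.float32)

# Dense-style forward pass: matmul + (null) bias + identity activation.
dense_out = softmax(W @ x + b)

# Bare-matrix forward pass: the bias add is skipped entirely.
bare_out = softmax(W @ x)
```

The two outputs are identical; the bare-matrix version just avoids the redundant bias add and the identity-activation broadcast, which is where the timing gap comes from.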

To your original question, it may also be worth comparing backward-pass timings, unless you’re planning on loading pre-trained weights from somewhere else.

I can compare the backward pass as well, since I am interested in both forward-only and forward + backward timings. Any optimisation suggestions would have to account for the backward pass, but I am interested in forward execution time on its own too.
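As a hypothetical benchmark sketch of measuring the two separately, here is a standalone linear + softmax layer in NumPy (a stand-in, since the actual layer is not shown), with a hand-written cross-entropy gradient so the forward + backward cost can be timed without an autodiff framework:

```python
import time
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    e = np.exp(z - z.max(axis=0, keepdims=True))
    return e / e.sum(axis=0, keepdims=True)

# Hypothetical sizes, just for the sketch.
n_in, n_out, batch = 256, 64, 512
W = rng.normal(size=(n_out, n_in)).astype(np.float32)
x = rng.normal(size=(n_in, batch)).astype(np.float32)
# One-hot targets, only needed for the backward pass.
y = np.eye(n_out, dtype=np.float32)[:, rng.integers(n_out, size=batch)]

def forward():
    return softmax(W @ x)

def forward_backward():
    p = softmax(W @ x)
    # Gradient of mean cross-entropy w.r.t. W for a softmax output layer.
    return (p - y) @ x.T / batch

def bench(f, reps=50):
    f()  # warm-up run so compilation/caching doesn't skew the timing
    t0 = time.perf_counter()
    for _ in range(reps):
        f()
    return (time.perf_counter() - t0) / reps

t_fwd = bench(forward)
t_fb = bench(forward_backward)
```

The ratio `t_fb / t_fwd` is the quantity of interest here: it shows how much of the total training cost any proposed forward-pass optimisation would actually touch.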

the full networks you’re trying to run

I’m not going to share the full networks I want to run. I want to know how one would speed up this particular layer. I understand there are different ways to speed up the overall network by exploiting parallelism based on its overall structure, but in this question I am only interested in how one would speed up this layer (if you can think of a very similar layer that does naturally have an easy way to speed up, I would be interested in that as well).

whether you need GPU support

I want to know CPU speeds. I’m not familiar with the differences between parallelism on GPU vs CPU, but my interest is in what benchmark performance I will get on a CPU.

how important training speed is if at all

Training speed is of variable importance; it depends on the relative inference speed-up on offer. Roughly, inference speed is 10x more important. Having said that, I would prefer to see all inference speed-ups, as long as the backward pass still works.
