Is that really the case? I thought batched matmul functions generally had some optimizations, at least by automatically leveraging parallelism on multiple cores or a GPU. A plain for-loop or broadcasting won’t do that for us.
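To make the comparison concrete, here is a minimal sketch of the two approaches in Python with NumPy. The language and the function names `batched_matmul_loop` and `batched_matmul` are my own choices for illustration, not something from earlier in the thread. It only checks that both give the same result. Whether the single batched call actually runs in parallel depends on the library and the BLAS backend it is linked against, so treat any speed difference as something to measure rather than assume.

```python
import numpy as np


def batched_matmul_loop(a, b):
    """Multiply each pair a[i] @ b[i] with an explicit loop over the batch axis."""
    batch, rows, _ = a.shape
    cols = b.shape[2]
    out = np.empty((batch, rows, cols), dtype=np.result_type(a, b))
    for i in range(batch):
        out[i] = a[i] @ b[i]  # one ordinary 2-D matmul per batch element
    return out


def batched_matmul(a, b):
    """Same computation as one call; np.matmul treats leading axes as batch axes."""
    return np.matmul(a, b)


# Small illustrative inputs (shapes chosen arbitrarily for this sketch).
rng = np.random.default_rng(0)
a = rng.standard_normal((8, 3, 4))
b = rng.standard_normal((8, 4, 5))

# Both approaches should agree up to floating-point rounding.
assert np.allclose(batched_matmul_loop(a, b), batched_matmul(a, b))
```

Timing both versions on realistic sizes (many small matrices versus a few large ones) would be the way to see whether the batched call really buys anything on a given machine.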