Union splitting vs C++

melonedo · July 8, 2021, 3:54pm

Hello! I created an experimental package, SingleDispatchArrays.jl, that generates a dispatch table for non-homogeneous arrays. It achieves C++-like performance for such arrays and is much more flexible to manage than a manual switch-case.
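For readers unfamiliar with the manual switch-case mentioned above, here is a minimal sketch of what it tends to look like in Julia. This is not the package's API. The `Shape` types, `area`, and `area_split` are hypothetical names made up for illustration. The idea is that each `isa` branch narrows the element to a concrete type, so the call inside the branch can be resolved statically instead of through dynamic dispatch.

```julia
# Hypothetical types for illustration only; not taken from the package.
abstract type Shape end

struct Circle <: Shape
    r::Float64
end

struct Square <: Shape
    a::Float64
end

struct Rect <: Shape
    w::Float64
    h::Float64
end

area(s::Circle) = pi * s.r^2
area(s::Square) = s.a^2
area(s::Rect) = s.w * s.h

# Manual "switch-case": inside each branch the compiler knows the
# concrete type of `s`, so `area(s)` can be a static call.
function area_split(s::Shape)
    if s isa Circle
        return area(s)
    elseif s isa Square
        return area(s)
    elseif s isa Rect
        return area(s)
    else
        # Fallback for subtypes not listed above: ordinary dynamic dispatch.
        return area(s)
    end
end

# A non-homogeneous array: the element type is the abstract `Shape`.
shapes = Shape[Circle(1.0), Square(2.0), Rect(2.0, 3.0)]

total_dynamic = sum(area, shapes)      # dynamic dispatch per element
total_split = sum(area_split, shapes)  # manual if-elseif chain per element
```

Both totals are the same. The downside of the hand-written chain is that every new subtype has to be added to it manually, which is the maintenance burden a generated dispatch table is meant to avoid.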

However, C++-like performance is not necessarily a good thing. For a long if-elseif-elseif chain, LLVM generates a jump table. On x86 machines, loading a jump table is nearly as fast as branching, but on a Raspberry Pi 4B the jump table is actually twice as slow as branching (20 ms vs 10 ms), so it looks like LLVM has made a bad choice there.
