50x speed difference in gemv for different values in vector

StefanKarpinski · March 19, 2017, 3:13pm

Just to elaborate on that, subnormals (aka denormals) are floating-point values whose magnitude is below the smallest normal value of the type. Their exponent is already at its minimum, so they can no longer use all the bits of the significand and have less than full precision. This allows something known as “gradual underflow”, where you lose precision gradually as values approach zero instead of immediately getting a zero value. Arithmetic on subnormal values does not go through the normal CPU pathways (on Intel hardware) and thus takes considerably longer. In other words, floating-point ops do not take a fixed number of clock cycles, which is what you’re seeing here.
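To make this concrete, here is a small sketch in Julia that you can paste into the REPL. It only uses functions from Base (`floatmin`, `nextfloat`, `issubnormal`, `eps`); the helper `scale!`, the vector length, and the scale factor are made up for illustration and are not taken from the original gemv benchmark. The timing part is hardware-dependent, so treat it as something to try rather than a guaranteed result.

```julia
# Smallest positive normal Float64; nonzero values below it in magnitude are subnormal.
smallest_normal = floatmin(Float64)
x = smallest_normal / 4          # exactly representable, but subnormal
tiny = nextfloat(0.0)            # smallest positive subnormal

println(issubnormal(smallest_normal))   # false
println(issubnormal(x))                 # true
println(issubnormal(tiny))              # true

# Gradual underflow: the relative spacing between neighbouring values grows
# as we go deeper into the subnormal range, i.e. precision is lost bit by bit.
println(eps(1.0) / 1.0)     # full precision
println(eps(x) / x)         # a couple of bits lost
println(eps(tiny) / tiny)   # 1.0, only a single significant bit left

# Hypothetical helper for a rough timing comparison (not the original gemv code).
function scale!(y, v, a)
    @inbounds for i in eachindex(y, v)
        y[i] = a * v[i]
    end
    return y
end

n = 10^6
normal_in = fill(1.0, n)
subnormal_in = fill(x, n)        # every element is subnormal
out = similar(normal_in)

scale!(out, normal_in, 0.5)      # warm up so compilation is not timed
t_normal = @elapsed scale!(out, normal_in, 0.5)
t_subnormal = @elapsed scale!(out, subnormal_in, 0.5)
println("normal inputs:    ", t_normal, " s")
println("subnormal inputs: ", t_subnormal, " s")
```

How large the gap is (if any) depends on the CPU and on whether the loop ends up memory-bound. If you don't need gradual underflow, Julia also has `set_zero_subnormals(true)`, which asks the hardware to treat subnormals as zero on the current thread; it returns `false` if the hardware doesn't support that.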
