Surprising benchmark results for a basic mixed precision example

I started experimenting with mixed precision and benchmarked a basic example as follows:

using BenchmarkTools
using Quadmath
using LinearAlgebra
a16 = Vector{Float16}(undef, 100_000_000);
a32 = Vector{Float32}(undef, 100_000_000);
a64 = Vector{Float64}(undef, 100_000_000);
a128 = Vector{Float128}(undef, 100_000_000);
@benchmark norm(a16)
@benchmark norm(a32)
@benchmark norm(a64)
@benchmark norm(a128)

I get the following median times:
862.683 ms for a16
25.218 ms for a32
47.776 ms for a64
1.143 s for a128

I tried it several times, including with zeros instead of undef, and I keep getting similar results. Aren’t these results surprising? Why does norm(a16) take so much longer than norm(a32) and norm(a64)? And isn’t norm(a128) also rather slow compared to norm(a32) and norm(a64)?
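
For completeness, this is the interpolated form of the benchmark calls; BenchmarkTools recommends $-interpolating global variables, although with 100-million-element arrays the per-call overhead should be negligible either way:

@benchmark norm($a16)
@benchmark norm($a32)
@benchmark norm($a64)
@benchmark norm($a128)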

There’s no hardware support for Float16 arithmetic on your CPU, so it’s a slow software emulation: Julia essentially converts to and from Float32 around each operation, which is why norm(a16) ends up so much slower than norm(a32).
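
If you mainly need the result (rather than a benchmark of Float16 arithmetic itself), one workaround is to accumulate in Float32, so the Float16 elements are only widened, never operated on in half precision. A minimal sketch; norm32 is just a hypothetical helper name, not anything from LinearAlgebra:

using LinearAlgebra

# Hypothetical helper: sum of squares with a Float32 accumulator, so each Float16
# element is converted once and all arithmetic happens in Float32.
norm32(v::AbstractVector{Float16}) = sqrt(sum(x -> abs2(Float32(x)), v; init = 0.0f0))

# norm32(a16) should agree with norm(a16) up to rounding; it skips the overflow-guarding
# scaling that norm does, which is not an issue for Float16-range inputs.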

Again, no hardware support: Quadmath’s Float128 is implemented in software (via libquadmath), so it’s a slower software emulation as well. If you need more precision than Float64, you might want to look into double-double arithmetic, e.g. DoubleDouble.jl.
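
For a concrete feel of the double-double idea, here is a minimal sketch using DoubleFloats.jl and its Double64 type (a related package in the same spirit as DoubleDouble.jl; whether it fits your use case is something to check):

using DoubleFloats   # provides Double64, a double-double type with roughly twice Float64's precision
using LinearAlgebra

# Small vector just to show the type in action; Double64 arithmetic is still software,
# but it builds on native Float64 operations, so it is usually much faster than a full
# Float128 emulation.
a_dd = Double64.(rand(1_000))

norm(a_dd)   # generic LinearAlgebra norm, computed in double-double arithmetic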