Should `literal_pow` optimize for bigger exponents?

stevengj · February 10, 2022, 2:58pm

prod(ntuple(...)) is a very suboptimal way to compute large powers — you want to use repeated squaring, or more generally an optimal addition chain. This is implemented in:

but it isn’t the default for literal_pow because it is slightly less accurate (for floating-point types).

LLVM only does this by default for integer types; for floating-point types you have to use @fastmath because it changes (worsens) the roundoff errors. The FastPow package extends this to other types beyond the small set of built-in types supported by LLVM.

Topic		Replies	Views
Small, fixed powers Performance	5	503	May 28, 2021
Poll: speed vs accuracy for `Float64^-3` Performance poll , math , float	55	2654	March 31, 2022
Exponentiation with literals in both base and exponent not removed by compiler in 1.8 Performance compilation , math	10	581	July 6, 2022
Incorrect results from `@which` and `@code_warntype` for literal powers General Usage	1	653	March 14, 2017
Power function with Integer ( x^4 much slower than xxx*x ) Performance	12	1448	May 18, 2018

Should `literal_pow` optimize for bigger exponents?

Related topics