Repeated allocations for fft in pseudospectral code

giordano · July 4, 2025, 4:37pm

Use a plan:

julia> using FFTW, LinearAlgebra, BenchmarkTools

julia> u = randn(32);

julia> function apply_n_fft(u, n)
           uhat = complex.(similar(u));
           for _ in 1:n
               uhat .= fft(u);
           end
       end
apply_n_fft (generic function with 1 method)

julia> @btime apply_n_fft($(u), 5);
  7.727 μs (49 allocations: 7.83 KiB)

julia> @btime apply_n_fft($(u), 50);
  81.314 μs (454 allocations: 70.41 KiB)

julia> function apply_n_fft_plan(u, n)
           uc = complex(u)
           uhat = similar(uc);
           plan = plan_fft(uc)
           for _ in 1:n
               mul!(uhat, plan, uc)
           end
       end
apply_n_fft_plan (generic function with 1 method)

julia> @btime apply_n_fft_plan($(u), 5);
  1.713 μs (8 allocations: 1.34 KiB)

julia> @btime apply_n_fft_plan($(u), 50);
  2.932 μs (8 allocations: 1.34 KiB)

julia> @btime apply_n_fft_plan($(u), 500);
  15.597 μs (8 allocations: 1.34 KiB)

Without a precomputed plan, fft internally needs to (a) re-compute the same plan all over again which is also a very time-consuming task which you don’t really want to repeat, (b) convert the input array to complex, and (c) allocate the output array, every single time. With the pre-computed plan, and having the input array already as complex numbers, lets you save all these repeated allocations, and the mul! would be entirely in-place, with the number of total allocations not depending on n.

Topic		Replies	Views
About In-line-fft Performance question , fftw	6	765	October 24, 2023
How to avoid memory blow up with FFT without expensive garbage collection? Performance question , fftw , parallel , memory-allocation , fft	3	415	June 10, 2024
Avoid allocations in the naive discrete Fourier transform Performance question	12	891	September 19, 2020
Is multithreaded FFTW on multiple arrays causing too much overhead? Performance fftw	15	1001	August 19, 2023
Parallel FFT not that much faster General Usage fftw , parallel , multithreading	10	7144	December 19, 2016

Repeated allocations for fft in pseudospectral code

Related topics