Simple CRPS optimization

cmarcotte · December 9, 2022, 12:10pm

I have a short function for computing the Continuous Ranked Probability Score (CRPS) from an ensemble of states Xb::Array{Float64,2} and a set comparison state y::Array{Float64,1}, where y may have varying sparsity (i.e., findall(isnan.(y)) may or may not be empty), and when it is sparse, it typically only has ~8000 or so non-NaNs.

The cluster I am running the following function on has relatively low-performing cores on the compute nodes, but many of them, so I know that threading is helpful here. I compute the CRPS using the following function:

Xb = rand(Float64, 6_000_000, 20)
y  = rand(Float64, 6_000_000)

function CRPS(X,y)
	inds = findall(.!isnan.(y));
	N,M = size(X);
	out = zeros(eltype(X),N)
	Threads.@threads for n in inds
		for j in 1:M
			out[n] += abs(X[n,j]-y[n])/M;
			for k in 1:M
				out[n] -= abs(X[n,j]-X[n,k])/(2*M*(M-1));
			end
		end
	end
	return mean(out[inds])
end

My question is that running this function takes several hundred milliseconds, even with many threads. Looking at the output of @report_opt CRPS(Xb,y) suggests 4 problems with runtime dispatch (three from the lock-wait-unlock of threading and one from the call to mapreduce within mean(out[inds])) which I don’t think I can do much about without completely restructuring the function.

Am I leaving a lot of performance on the table with this function?

Oscar_Smith · December 9, 2022, 1:53pm

the main easy speedup I can see is using multiplication instead of division.

Topic		Replies	Views
Performance of Computation of Expectation of slow function Optimization (Mathematical) jump , performance , optimization	13	1012	January 30, 2022
Having issues speeding up code with multithreading Performance parallel , multithreading	19	584	July 16, 2023
Improve performance of CRRA utility function Performance performance , broadcasting , loopvectorization	6	379	April 29, 2022
Help with optimization Performance	18	123	September 28, 2024
(Blog Post) Optimizing Repeated Correlations Teaching & Outreach	5	295	August 3, 2024

Simple CRPS optimization

Related topics