Vector filtering performance and... gc?

cstjean · July 10, 2018, 1:54pm

I have vecs::Vector{Vector{Float32}} (really, the columns of a dataframe), each of which has length of 300000. I’m interested in the performance of v[t], where t is a BitVector. I’m finding that performance is strangely bimodal. Why?

# t = rand(Bool, length(vecs[1]))
p = plot(ylim=[0.0003,0.001], legend=false)
gc_enable(false)
for i in 1:10
    plot!([@elapsed(v[t]) for v in vecs])
end
gc_enable(true)
p

If I leave the gc on, I get more jumps

In the above, mean(t) is 0.75, but it has long stretches of true and false. If I use t = rand(Bool, length(vecs[1])), then the bimodality disappears.

Am I seeing memory hierarchy effects? GC generations?

Topic		Replies	Views
Performance of custom `Vec` type versus `SVector{3, Float64}` Performance staticarrays	4	435	February 1, 2022
A problem about performance Performance	30	717	October 3, 2022
SVector vs Vec usage: Why do I have an 8x speedup in a simple example? Performance	7	1036	August 17, 2019
BitVector vs Vector{Bool} as default on comparison operations Performance	8	9139	November 2, 2020
Help with optimizing GC time with large objects in memory Performance	3	657	November 10, 2018

Vector filtering performance and... gc?

Related topics