Shuffle values and sample indices of a sparse matrix

stevengj · March 23, 2021, 2:16pm

I’m thinking of something much simpler. When you are sampling from a range (or any array with unique elements), and the fraction of samples is small, you can just sample uniformly and check that the samples are unique; the probability of a collision is quite low so you will only need to re-sample a few times.

function seqsample_trivial!(rng::AbstractRNG, a::AbstractRange, x::AbstractArray)
    n, k = length(a), length(x)
    k <= n || error("length(x) should not exceed length(a)")
    while true
        for i in 1:k; x[i] = rand(a); end
        sort!(x)
        allunique = true
        for i = 2:k
            if x[i] == x[i-1]
                allunique = false
                continue
            end
        end
        allunique && return x
    end
end

(You can save the final shuffle! call in your code if you are willing to pass in a buffer array, or if you don’t care about the order. In your case, since you are calling this repeatedly, and only for lengths ≤ some threshold, I would just pass a preallocated buffer array and use it to cache the non-sorted values.)

Topic		Replies	Views
What is the fastest method to shift matrix? General Usage question	43	1294	August 5, 2022
Sparse matrix-vector product: much more slow than Matlab Performance matlab , optimization	24	4638	December 20, 2017
Inplace matrix sampling General Usage question	1	318	October 21, 2020
How to efficiently construct a large SparseArray? Packages for this? Performance package , performance , parallel , sparse	20	1839	May 15, 2022
Asymmetric speed of in-place `sparse*dense` matrix product General Usage	7	1559	November 8, 2018

Shuffle values and sample indices of a sparse matrix

Related topics