Efficient sorting of 2 Vectors with many duplicate values?

Datseris · September 29, 2017, 11:37am

I have two vectors, X and Y, that represent the positions of 2D points. Both vectors contain a lot of duplicate values, but there are no duplicate points (i.e. many points share the same x or the same y coordinate, but no two points share both).

I am trying to sort these vectors such that they are sorted first by increasing X and then, among equal X values, by increasing Y. Is there a straightforward way to do it?

My current approach is to first sort X. Then, for each section of X that has the same value, sort the corresponding section of Y. I get these sorting indices (with sortperm) and write them into the master sorting vector, continuing until the end.

This seems kinda bad though
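
For reference, the two-stage approach described above could be sketched roughly as follows. This is only an illustration of the idea, not the original poster's actual code; the function name `twostage_sortperm` and the sample data are made up for the example.

```julia
# Hypothetical helper: sort by X, then re-sort each run of equal X values by Y.
function twostage_sortperm(X::AbstractVector, Y::AbstractVector)
    length(X) == length(Y) || throw(ArgumentError("X and Y must have equal length"))
    perm = sortperm(X)            # master permutation, ordered by X only
    n = length(perm)
    start = 1
    while start <= n
        # Find the end of the run of equal X values that begins at `start`.
        stop = start
        while stop < n && X[perm[stop + 1]] == X[perm[start]]
            stop += 1
        end
        # Reorder the indices of this run by their Y values.
        block = perm[start:stop]  # slicing copies the indices of this run
        perm[start:stop] = block[sortperm(Y[block])]
        start = stop + 1
    end
    return perm
end

# Made-up sample data with repeated x values.
X = [3, 1, 3, 1, 2]
Y = [4, 9, 2, 5, 7]
p = twostage_sortperm(X, Y)
```

Here `X[p]` is `[1, 1, 2, 3, 3]` and `Y[p]` is `[5, 9, 7, 2, 4]`. It works, but it is a fair amount of bookkeeping for what amounts to a lexicographic sort.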

mauro3 · September 29, 2017, 11:49am

You could represent your points as tuples:

julia> sort([(3,4), (3,2)])
2-element Array{Tuple{Int64,Int64},1}:
 (3, 2)
 (3, 4)

Or use Point from GeometryTypes.jl (JuliaGeometry/GeometryTypes.jl on GitHub), which I suspect would allow similar sorting. (Edit: I’m not sure Point actually exists, but something along those lines.)
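
The tuple suggestion works because Julia compares tuples lexicographically: the first elements are compared, and the second elements only break ties. A small sketch with two separate coordinate vectors (the sample data is made up for illustration):

```julia
# Made-up sample data with repeated x values.
X = [3, 1, 3, 1, 2]
Y = [4, 9, 2, 5, 7]

# Tuples are ordered by their first element, then by their second.
@assert (3, 2) < (3, 4)
@assert (2, 9) < (3, 0)

# Pair the coordinates up, sort the pairs, then split them apart again.
points = collect(zip(X, Y))        # Vector of (x, y) tuples
sorted_points = sort(points)
Xs = first.(sorted_points)         # x coordinates in sorted order
Ys = last.(sorted_points)          # y coordinates in sorted order
```

After this, `Xs` is `[1, 1, 2, 3, 3]` and `Ys` is `[5, 9, 7, 2, 4]`.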

Datseris · September 29, 2017, 11:53am

Holy bananas it was that simple? Really? Damnit.

Yep, using sortperm(collect(zip(X,Y))) does exactly what I need! And it is surprisingly fast as well:
14 ms for Vectors with 30,000 elements.
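
A minimal sketch of how the resulting permutation can be applied to both vectors, again with made-up sample data. The second variant, which sorts the indices with a `by` function instead of building a vector of tuples, is an alternative shown for comparison and was not suggested in the thread; no claim is made here about which is faster.

```julia
# Made-up sample data with repeated x values.
X = [3, 1, 3, 1, 2]
Y = [4, 9, 2, 5, 7]

# Permutation that orders the points by x first, then by y.
p = sortperm(collect(zip(X, Y)))

# Apply the same permutation to both vectors so the points stay paired.
Xs, Ys = X[p], Y[p]

# Alternative: sort the indices directly, keyed on the (x, y) pair.
q = sort(collect(eachindex(X)); by = i -> (X[i], Y[i]))
```

Both `p` and `q` come out as `[4, 2, 5, 3, 1]` for this input, giving `Xs == [1, 1, 2, 3, 3]` and `Ys == [5, 9, 7, 2, 4]`.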
