How to efficiently find columns of the matrix which are the same?

Dan · November 7, 2023, 1:51pm

The question is a little vague on the desired output, but I think the following will succintly include the OP’s desired information
(just noticed this is pretty similar to stevengj’s answer):

julia> using StatsBase

julia> M = permutedims(reshape([[1, 1, 1, 1, 0, 0]
       [1, 1, 1, 0, 1, 0]
       [0, 0, 0, 0, 0, 0]
       [0, 0, 0, 0, 0, 0]
       [0, 0, 0, 0, 0, 0]
       [0, 0, 0, 0, 0, 0]], (6,6)))
6×6 Matrix{Int64}:
 1  1  1  1  0  0
 1  1  1  0  1  0
 0  0  0  0  0  0
 0  0  0  0  0  0
 0  0  0  0  0  0
 0  0  0  0  0  0

julia> countmap(eachcol(M))
Dict{SubArray{Int64, 1, Matrix{Int64}, Tuple{...} with 4 entries:
  [1, 1, 0, 0, 0, 0] => 3
  [1, 0, 0, 0, 0, 0] => 1
  [0, 1, 0, 0, 0, 0] => 1
  [0, 0, 0, 0, 0, 0] => 1

julia> sort(values(countmap(eachcol(M))); rev = true)
4-element Vector{Int64}:
 3
 1
 1
 1

The last expression’s first element is the count of the most popular column in the matrix. If several columns appear multiple times, the vector will reflect this.

Topic		Replies	Views
Count occurances for matrix rows (where column order does not matter) General Usage question , count	30	1061	December 13, 2022
Finding unique columns of a matrix New to Julia	3	783	March 18, 2021
Choosing only different vectors from a matrix New to Julia	8	579	September 1, 2019
Fastest way possible to find index of value equals 1 across a matrix column Performance question , performance , speed-optimization	7	1072	October 6, 2023
How do I get the indexes for matrix columns whose elements are all the same? General Usage	12	676	April 13, 2023

How to efficiently find columns of the matrix which are the same?

Related topics