Unique! and count

hieun · December 21, 2021, 2:22pm

Hi,
I’m trying to determine unique counts of values in an array
as an example, given the following array
data = [‘a’, ‘b’, ‘a’, ‘c’]
i wanna get: unique_array = [‘a’, ‘b’, ‘c’] and count_array = [2,1,1]
in python I can do like this: unique_array, count_array = np.unique(data, return_counts=True)
and i can also solve with julia like this: unique_array = unique!(data)
but when count, I use: count(i=>i==‘a’,data). I wonder if there are some other solutions in case I don’t know the value of data (a,b,c)

albheim · December 21, 2021, 2:27pm

Something like count_array = [count(==(x), data) for x in unique_array] should work, though this loops over the data many times so if you have a lot of data to crunch it might be worth to look at something smarter.

stevengj · December 21, 2021, 3:16pm

Sounds like you want countmap in the StatsBase.jl package.

Bardo · December 21, 2021, 5:23pm

Or

function uniquecount(data)
   unique_array = unique(data)
   counts = Dict(unique_array .=> 0)
   for (i, c) in enumerate(data)
      counts[c] += 1     
   end
   keys(counts), values(counts)
end

rafael.guerra · December 21, 2021, 6:41pm

For data input as: data = rand('a':'z', 1000), StatsBase’s countmap() (including collecting keys and values) seems to be 25% faster than the count() comprehension, and ~3x faster than uniquecount().

hieun · December 22, 2021, 4:20pm

Thanks, I got it

Topic		Replies	Views
Number of each unique value in an array General Usage	4	5269	March 26, 2024
How would I check for unique values across many arrays without for loops? General Usage	7	1048	June 2, 2020
Opposite of unique New to Julia sets	19	2395	March 25, 2021
Counting number of occurences in an array Tooling question , statistics , arrays , splitapplycombine	10	16135	December 18, 2019
Pandas value_counts() equivalent in base Julia or in Data frame package? General Usage	1	246	January 27, 2023

Unique! and count

Related topics