Hello,
Is there a way to remove duplicate row values in JuliaDB based on a subset of columns (which are also the keys in my case)? I’m working with a distributed table in chunks in case that matters.
For example, for data like the following where var1 and var2 are the keys:
var1,var2,var3
A, A, 1
B, D, 4
C, A, 9
B, D, 10
I would like returned
var1,var2,var3
A, A, 1
C, A, 9
Since rows 2 and 4 have the same values for var1 and var2. Thanks!