JuliaDB question

stillill · November 14, 2019, 11:13pm

Hello,

Is there a way to remove duplicate row values in JuliaDB based on a subset of columns (which are also the keys in my case)? I’m working with a distributed table in chunks in case that matters.

For example, for data like the following where var1 and var2 are the keys:

var1,var2,var3
A, A, 1
B, D, 4
C, A, 9
B, D, 10

I would like returned

var1,var2,var3
A, A, 1
C, A, 9

Since rows 2 and 4 have the same values for var1 and var2. Thanks!

Topic		Replies	Views
Remove duplicated rows General Usage juliadb	2	1505	May 1, 2019
Unique rows in distributed table New to Julia juliadb	1	526	March 27, 2020
JuliaDB delete row from table New to Julia juliadb	4	908	April 15, 2019
Remove all entries that occur more than once New to Julia dataframes	3	425	February 18, 2022
Filtering dataframe for unique rows with respect one of column New to Julia question , dataframes	1	52	July 18, 2024

JuliaDB question

Related topics