Delete all rows contained in a dataframe, as specified by an array of ids

Nash · March 10, 2021, 1:01pm

Using

ids = findall(nonunique(mydata))

I have found the ids of all rows that are duplicates. Now, I want to clean mydata by erasing the duplicate rows.

How is that done?

Christopher_Fisher · March 10, 2021, 1:20pm

Here is one approach:

df = df[Not(ids), :]

Rudi79 · March 10, 2021, 1:23pm

Or

df = unique(df)

sijo · March 10, 2021, 2:32pm

To do the same by modifying the existing dataframe:

delete!(df, ids)

or directly with the boolean array

delete!(df, nonunique(df))

or simply

unique!(df)

Topic		Replies	Views
Delete duplicate rows in a DataFrame New to Julia dataframes	10	6109	June 22, 2023
Delete rows in DataFrame Conditionally General Usage dataframes	4	1623	February 18, 2020
Filtering dataframe for unique rows with respect one of column New to Julia question , dataframes	1	52	July 18, 2024
Delete row from DataFrame in place based on entire row value New to Julia question , dataframes	7	617	April 4, 2023
Remove all entries that occur more than once New to Julia dataframes	3	425	February 18, 2022