I am trying to write a function that will efficiently delete rows from a large DataFrame if they exist in a separate, larger DataFrame.
So the function would look something like:

```julia
function removedup!(NewDF, dataDF)
    for row in Tables.namedtupleiterator(NewDF)
        in(row, Tables.namedtupleiterator(dataDF)) && # delete row from NewDF
    end
end
```
But I can’t quite figure out how to do this in place. Based on other posts, I think for a DataFrame alone it would look something like this?
```julia
function removedup!(NewDF, dataDF)
    for row in eachrow(NewDF)
        in(row, eachrow(dataDF)) && deleteat!(NewDF,
            findall(NewDF.col1 .== row.col1 .&& NewDF.col2 .== row.col2 .&& NewDF.col3 .== row.col3))
    end
end
```
But as the DataFrame is large, I think passing the argument as `Tables.namedtupleiterator` improves row-iteration efficiency. Also, I don’t know if there is a way to do it by passing the entire row without listing out each individual column? My understanding is that DataFrames doesn’t have row indexing, so I don’t know if that would work.
I am not an expert so my answer might not be that efficient. One way is to create a unique key for each row (e.g., concatenate all the columns separated by a unique character) for both datasets and then simply use the filter function.
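A minimal sketch of that key-based idea, assuming hypothetical column names and using `'\x1f'` (a character very unlikely to appear in the data) as the separator:

```julia
using DataFrames

# Build a string key per row; copy(r) turns a DataFrameRow into a NamedTuple.
rowkey(r) = join(string.(values(copy(r))), '\x1f')

function removedup!(NewDF::DataFrame, dataDF::DataFrame)
    # One pass over the reference table to collect keys, then an
    # in-place filter over NewDF with O(1) Set lookups per row.
    seen = Set(rowkey(r) for r in eachrow(dataDF))
    filter!(r -> rowkey(r) ∉ seen, NewDF)
end
```

This avoids the nested row-by-row search, at the cost of building the `Set` of keys once.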
In terms of deletion, I think your best bet is to get all of the row indices that are duplicates, and then call `deleteat!` once.
But in terms of efficiency, if `DataDF` is sufficiently large, I would guess that most of your time is spent searching it row by row. The way you’re doing it now, you basically have to iterate through every row of `DataDF` for each row of `NewDF`.

Are there any columns that are most likely to be unique? Maybe you could do an iterative approach where you check each column and bail if any of them are unique. An alternative, if there are one or more columns that are repeated, is that you could `groupby` those, and then index into the grouped data frame to reduce your search space.
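A sketch of that `groupby` idea, using the hypothetical columns `:col1`, `:col2`, `:col3` from the question and assuming `:col1` is the repeated column (indexing a `GroupedDataFrame` by a `NamedTuple` key is a lookup rather than a scan):

```julia
using DataFrames

function removedup_grouped!(NewDF::DataFrame, dataDF::DataFrame)
    gd = groupby(dataDF, :col1)
    todelete = Int[]
    for (i, row) in enumerate(eachrow(NewDF))
        key = (col1 = row.col1,)
        # No matching group means the row cannot be a duplicate.
        haskey(gd, key) || continue
        sub = gd[key]   # only search within the matching group
        if any(r -> r.col2 == row.col2 && r.col3 == row.col3, eachrow(sub))
            push!(todelete, i)
        end
    end
    deleteat!(NewDF, todelete)  # one deletion at the end, as suggested above
end
```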
You could do a `leftjoin!` of both tables (assuming the larger table does not have duplicates). This should be efficient.
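One way to realize the join idea with a marker column, assuming the `source` keyword (renamed from `indicator` in newer DataFrames.jl releases), which records for each row whether it came from the left table, the right table, or both:

```julia
using DataFrames

new = DataFrame(a = [1, 2, 3], b = ["x", "y", "z"])
ref = DataFrame(a = [2], b = ["y"])  # stand-in for the larger reference table

# Join on every shared column; rows found only in `new` are tagged "left_only".
marked = leftjoin(new, ref, on = [:a, :b], source = :src)
result = select(filter(:src => ==("left_only"), marked), Not(:src))
```

Note this builds a new DataFrame rather than mutating `new` in place.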
I would have said this is a task for `antijoin`.
Indeed, it is a better solution. It will not be in-place, but I assume the OP would accept that.
Awesome! Thanks for all the helpful approaches and useful suggestions. The DataFrame being used as a reference is a large Arrow file. Am I correct in assuming that I can avoid bringing the Arrow table into memory with the antijoin approach? The following seems to work, but I just wanted to make sure I wasn’t missing anything:
```julia
NewDF = antijoin(NewDF, DataDF, on = intersect(names(NewDF), names(DataDF)))
```
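On the Arrow question: Arrow.jl memory-maps the file, so a sketch like the one below (hypothetical file path and columns; the tiny write step is only there to make it self-contained) keeps `DataDF` backed by the mapped Arrow columns via `copycols=false`. Note that `antijoin` itself still allocates a fresh result table:

```julia
using Arrow, DataFrames

# Write a tiny Arrow file just so the example can run end to end.
path = tempname() * ".arrow"
Arrow.write(path, DataFrame(a = [1, 2, 3], b = ["x", "y", "z"]))

# Arrow.Table memory-maps the file; copycols=false avoids copying the
# mapped columns into ordinary Vectors when wrapping them in a DataFrame.
DataDF = DataFrame(Arrow.Table(path); copycols = false)

NewDF = DataFrame(a = [2, 4], b = ["y", "w"])
NewDF = antijoin(NewDF, DataDF, on = intersect(names(NewDF), names(DataDF)))
```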
Oh wow, I didn’t know antijoin was a thing. Neat!