DataFrames: obtaining the subset of rows by a set of values

pdeffebach · October 18, 2018, 3:49pm

iris |> 
Filter(Species -> Species == "versicolor") |>
Map((SepalLength, SepalWidth) -> SepalLength / SepalWidth)

This scares me a bit because it means that fast filtering would have to happen with variables themselves, named in the code, instead of with functions, due to the non-standard evaluation. It’s an R-ism that I want to avoid if possible.

nalimilan · November 17, 2018, 4:43pm

I’ve opened a WIP PR in DataFrames about using the select approach for combining results of a grouping operations: https://github.com/JuliaData/DataFrames.jl/pull/1601

josimar · December 5, 2022, 3:56pm

Could you please clarify how to type in the Greek symbol on the REPL?

I tried \epsilon but this is not the correct symbol.

I mean the symbol that is written here: filter(row → row.col ∈ [1,2,3], df)

Thanks!

pfitzseb · December 5, 2022, 4:16pm

\in

nilshg · December 6, 2022, 9:55am

When in doubt ask the REPL:

help?> ζ
"ζ" can be typed by \zeta<tab>

help?> ∈
"∈" can be typed by \in<tab>

merlin · April 27, 2024, 9:32pm

That ∈([1,2,3]).(df.col) is some niiiiice syntax!

Specifically, nice for getting row number or index to create a @view of a dataframe. Views cant be made from the dataframe output of a filter or subset operation. This had me stumped for a while.

For example, I need to change the cells in column :X2 if the rows in column label match a string:

df_vw = @view df[∈(["label_p", "label_t", "label_w"]).(df.label), [:X1, :X2]]

# X1 is fraction, make into bips and update cell in :X2
transform!(df_vw, :X1 => ByRow(x -> string(round(Int, x * 100000), " bps")) => :X2)

# alternately with direct assignment:
df_vw[:, :X2 = string.(round.(Int, df_vw[:, :X1] * 10000), " bps")

Thanks again, @ExpandingMan !!!

Topic		Replies	Views
Invert a row selection in DataFramesMeta Data dataframes	3	258	April 24, 2023
Filter dataframe with regular expression New to Julia regex , dataframes	8	2593	February 20, 2025
DataFramesMeta.jl insert @where subset programmatically? General Usage question	5	990	October 12, 2017
Confusing/misleading error message for a beginner New to Julia dataframes , error-message , dataframesmeta	10	5108	December 13, 2022
Filter DataFrame by an Array New to Julia	8	5909	December 10, 2019

DataFrames: obtaining the subset of rows by a set of values

Related topics