Create a new column that has a list of items of a second column, with a condition from a third column

Juan_Mac_Donagh · June 22, 2022, 9:40pm

Hi, I have a simple problem: I have a DataFrame that looks like this:

df = DataFrame()

df.sum = [[200,0,1,1,0], [0,0,0], [0,1,1,1,1,0,0]]
df.score= [[1,2,22,25,5], [1,1,2], [1,24,54,89,12,1,2]]
df.id = ["1", "2", "3"]

And I want to generate a column that has the values of df.score that coincide with the values that are >+ 1 in the df.sum column. This is my expected result:

df.x1 = [[1,22,25], [], [24,54,89,12]]

Where the second row should be empty, because there are no values greater than 1 in df.sum[1].

This is the code that I’ve been tinkering with (I took it from Change column to row using conditions), but I can’t get my head around on how to modify it so it works:

	dv2 = df.sum;
	for (i, v) in enumerate(dv2)
		for (j,h) in enumerate(v) 
    		if h >= 1
        		df[:,:new_col] .= df.Score[i][j]
			end
    	end
	end

The problem with this that it gives me a column that is composed only by the number 12 ([12, 12, 12]) in every row, and that is the last value that is greater than 1 of the last column. I don’t know why this happens, but I know that it has to do with how I the loop is written

Any help is welcome, thanks a lot!

pdeffebach · June 22, 2022, 9:54pm

I don’t understand what the actual transformation you are trying to do is, but here is an answer using DataFramesMeta.jl

julia> t = @rtransform df :x1 = begin 
           better_than = :score .> (:sum .+ 1)
           :score[better_than]
       end;

julia> @select t :id :x1
3×2 DataFrame
 Row │ id      x1                  
     │ String  Array…              
─────┼─────────────────────────────
   1 │ 1       [2, 22, 25, 5]
   2 │ 2       [2]
   3 │ 3       [24, 54, 89, 12, 2]

EDIT: you want

julia> t = @rtransform df :x1 = begin 
           better_than = :sum .>= 1
           :score[better_than]
       end;

Topic		Replies	Views
How to conditionally select values in a dataframe and insert to a new column New to Julia dataframes	6	2931	January 2, 2020
How do I add a new column to a dataframe in Julia using conditional logic? General Usage dataframes , dataframesmeta	5	2195	February 26, 2022
Adding a column to a dataframe and conditionally filling based on another column within the same dataframe New to Julia question	4	212	September 15, 2024
Filter rows after a perticular value in column New to Julia dataframes	5	1121	April 26, 2022
Replace all values greater than 1 in a DataFrame with 1 New to Julia question	10	843	October 1, 2023

Create a new column that has a list of items of a second column, with a condition from a third column

Related topics