Bad performance of group_by of DataFrames - updated -

Well, I construct a dataframe with columns of type String and type Float64. Then I do a group_by operation on the three String columns, aggregating the Float64 numbers with either sum or maximum.

That’s it.

The tricky part is that the columns selected for the group_by operation are chosen at random, and this sometimes triggers a recompilation.
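
To make the setup concrete, here is a minimal sketch of what is described above, assuming Julia with DataFrames.jl (where the operation is spelled `groupby` followed by `combine`). The column names, the number of key columns (five), the row count, and the helper `aggregate_random` are all made up for illustration and are not taken from my real code.

```julia
using DataFrames, Random

# Hypothetical data: five String key columns plus one Float64 value column.
n = 1_000
keycols = [:k1, :k2, :k3, :k4, :k5]
df = DataFrame([k => rand(["a", "b", "c"], n) for k in keycols]...)
df.value = rand(n)

# Hypothetical helper: pick three key columns at random, group on them,
# and reduce the Float64 column with `agg` (e.g. sum or maximum).
function aggregate_random(df, keycols, agg)
    cols = shuffle(keycols)[1:3]          # random choice of grouping columns
    gdf = groupby(df, cols)
    return combine(gdf, :value => agg => :result)
end

res_sum = aggregate_random(df, keycols, sum)
res_max = aggregate_random(df, keycols, maximum)
```

Calling `aggregate_random` repeatedly, with `@time` in front, is one way to see whether some calls are much slower than others. The sketch only reproduces the setup; it does not show where the recompilation comes from.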