I often need to apply functions or calculate statistics on grouped variables for large datasets (e.g., with SciPy's binned_statistic). I could not find a good function for this in Julia, hence BinStatistics.
BinStatistics provides the binstats function, which offers flexible and efficient computation of n-dimensional binned statistics for any number of variables. All of the heavy lifting is done by DataFrames.jl and CategoricalArrays.jl.
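
For anyone unfamiliar with binned statistics, here is a minimal sketch of the underlying idea in one dimension, written directly against DataFrames.jl and CategoricalArrays.jl. It is not the binstats API or its implementation; the data, the column names (`x`, `v`, `x_bin`), and the bin edges are made up for illustration.

```julia
using DataFrames, CategoricalArrays, Statistics

# Hypothetical data: 1000 evenly spaced samples of x, with a value v that depends on x.
x = collect(range(0.0, 10.0; length = 1000))
df = DataFrame(x = x, v = 2 .* x .+ 1)

# Assign each x to one of five equal-width bins.
# extend = true makes the last interval include its upper edge (x == 10.0).
edges = 0.0:2.0:10.0
df.x_bin = cut(df.x, edges; extend = true)

# Group by bin, then compute the number of rows and the mean of v in each bin.
stats = combine(groupby(df, :x_bin), nrow => :count, :v => mean => :v_mean)
```

Extending this to n dimensions amounts to cutting each axis column separately and grouping by all of the resulting bin columns at once.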