Also string interning incurs a cost when creating strings which may or may not be worth it depending on the number of times you perform grouping/sorting on the data. To take this into account one would need to include the time needed to create strings from raw bytes, or to read a text file.
In the previous thread you showed a graph where Julia was as fast as R. What’s the difference with the new graph? Is it just sort
vs. sortperm
?