We hadn’t made an announcement until now, but a new meta-package called StatsKit has been registered some time ago. This is just a convenience package which automatically installs and loads all the most common packages needed to work with statistics.
using StatsKit currently loads the Statistics standard library module, and the following packages:
This package is intended for users of statistics packages. Packages themselves should continue to list individual packages in they dependencies rather than StatsKit as a whole.
Just one question that came to my mind after reading the package list: Is there any specific reason, why there’s CSV.jl included instead of CSVFiles.jl?
Great work on StatsKit. Actually it would be very nice to have a blog for newcomers describing how to combine it with PackageCompiler to build them in into a system image - hopefully I will have some time in the future to write it up unless someone else does (@sdanisch made some great efforts with this recently). This would be a real killer, as it would start instantly - all past-R users would love it.
Yes, that’s because the design of CSV.jl is more powerful for the long term since it supports loading data on the fly via the Tables.jl streaming API. On the contrary, CSVFiles/TextParse loads the full dataset as vectors (even if it supports streaming from that).
There’s no common documentation at this point. Unfortunately I’m not sure it’s possible to automatically merge all Documenter manuals from separate packages into a single reference. Of course we can still write general tutorials, but they won’t be specific to StatsKit.