Suggestion: move DataFrames, plotting into standard distribution

ChrisRackauckas · February 19, 2018, 5:43pm

Cross-posting: Multivariate OLS - #15 by ChrisRackauckas

it is a matter of where to draw the line.

and you draw the line at whatever is needed for

finance, economics, and statistics classes

But that’s quite arbitrary. MATLAB and Octave come with pretty terrible statistics support but that’s fine. However, if they pulled out the ODE solvers people would rebel because that’s such core functionality. So why wouldn’t you say it’s a core functionality?

Because what’s core is personal, except for the extreme basics. Julia is trying to make Base be those extreme basics because otherwise everything is core to some segment of the population and you get bloat.

How do you handle it? You could just tell your class about the 5 or so packages to use. Or if it’s just stats, point them to JuliaStats. It really shouldn’t take more than 5 minutes to introduce the stats packages. If you want it pre-installed, as @aaowens suggests, just have them install JuliaPro which was created exactly for this purpose.

But you’re not going to convince anyone that the top 100 packages should go to the Julia Base repository, or even that the top 3 packages you care about should. Julia’s already been there, and what it does is the opposite of what you’re thinking. Packages in Julia Base cannot update regularly because they are tied to Julia releases. They are harder for contributors to jump into since there’s so much other code around them. They are harder to test because they are tested with the rest of Julia. In the end what it does is cause stagnation due to the inertia of larger repositories, while on its own DataFrames.jl is nimble and can release bugfixes almost instantly. Julia had a lot of stuff in Base and is getting leaner for this reason.

Topic		Replies	Views
Please recommend a Julia ecosystem for Statistics New to Julia	28	4507	June 8, 2019
How do DataFrames.jl compare to R's? And Interoperability between R and Julia General Usage	23	6760	January 3, 2018
DataFrames.jl development survey Data question , dataframes	52	3554	September 27, 2020
Things that are easier in Julia than Python/R etc Community python , r	59	7650	October 17, 2021
What's the current (spring 2024) canonical approach to data science in Julia? General Usage dataframes	34	4913	April 8, 2024

Suggestion: move DataFrames, plotting into standard distribution

Related topics