Suggestion: move DataFrames, plotting into standard distribution

piever · February 20, 2018, 2:01pm

This specific feedback is actually very useful.

Concerning verbosity, I’ve translated the same hflights tutorial to JuliaDB here and I find the syntax reasonably concise. Can you pinpoint specific cases that could be improved/specific suggestions on how to do so? As a caveat, you need a very recent version of JuliaDB and IndexedTables to run the tutorial as most syntax improvements are recent.

Concerning dplyr ggplot2 integration via pipeline (I’m not sure how that’d work exactly as I’m not a R user) the closest I can think of is the @df macro from StatPlots which is fully integrated with the Query/IterableTables framework (though I have some idea to simplify the syntax even further when plotting from a @map statement, but I haven’t quite decided how, I’m curious what @davidanthoff has in mind). There is also GroupedErrors to make plots from data tables if you’re working with grouped data.

See this announcement, though I haven’t focused on Query integration (as ShiftedArrays and Query have different missing data representation): will think about it once there is convergence.

I’d be curious to see how to add rownumber: what does it do exactly? You can use it inside a groupby and it will give you the row numbers of the group as computed inside the larger dataset?

Topic		Replies	Views
Please recommend a Julia ecosystem for Statistics New to Julia	28	4243	June 8, 2019
How do DataFrames.jl compare to R's? And Interoperability between R and Julia General Usage	23	6504	January 3, 2018
DataFrames.jl development survey Data question , dataframes	52	2943	September 27, 2020
Things that are easier in Julia than Python/R etc Community python , r	60	6999	October 17, 2021
What's the current (spring 2024) canonical approach to data science in Julia? General Usage dataframes	34	4168	April 8, 2024

Suggestion: move DataFrames, plotting into standard distribution

Related topics