I’m pleased to announce a new release of the CSV.jl package. This release provides notable improvements in several areas, including performance, additional features, and enhanced flexibility. Notable improvements: “Perfect” column typing: gone are the days of rows_for_type_detect and parsing get…

Great set of examples in the docs!

[image] quinnj: Improved performance: great care has been taken to improve performance on several levels; underlying type parsers (provided by Parsers.jl ), better data locality and cache friendliness, and greater use of custom Julia structures for efficiency It would be very useful if you (o…

Great and thank you - this is way faster! Immediately noticed the following parse behaviour change: 428.E+03 is default parsed as String now, used to be as Float64 with previous version of CSV.jl. If I try to force with types=Dict(Symbol(" S-Mises")=>Float64)) for the column in question, I…

The key trade-off is if you want to work with DataFrames.jl later if you want to materialize a DataFrame using DataFrame or DataFrame! - or equivalently if you use copycols=true vs copycols=false keyword argument in CSV.read (I am not trying to give a comprehensive answer to what @Tamas_Papp wants a…

Thanks for reporting! Would you mind opening an issue on the CSV.jl or Parsers.jl repo? This must have regressed with all the new work that’s gone in, should be a simple fix.

Issue filed in CSV.jl repo. Cheers, GC

Question: would it be possible to get this to infer? file = CSV.File("file.csv") test(it) = first(it).first_column @code_warntype test(file)

If the column type is determined from the file contents, no. Use a function barrier .

But isn’t the type of each column is already in file?

CSV.jl defines CSV.getcell(f::CSV.File, T, col, row) which would be inferrable for individual values. It also doesn’t require iteration. You can get the types for a file by doing CSV.gettypes(f).

[ANN] New CSV.jl 0.5 Release

Package Announcements

simeonschaub May 22, 2019, 5:46pm 16

Whoa, that went quick! Thank you, keep up the great work!

Reading Data Is Still Too Slow

Topic		Replies	Views
[ANN] CSV.jl 0.7 Release Data	38	5719	July 18, 2020
CSV.jl type stability General Usage csv , type-stability	26	1264	October 22, 2022
CSV read performance vs Pandas General Usage	29	8557	May 6, 2019
CSV Reading (rewrite in C?) Internals & Design	50	5632	October 1, 2018
CSV.read extremely slow wrt readtable Data	14	3794	July 27, 2018

[ANN] New CSV.jl 0.5 Release

Related topics