What's the difference between CSV.jl and CSVFiles.jl?

davidanthoff · November 24, 2018, 9:15pm

They are two independent CSV readers. You should use whichever suits your needs better

I maintain CSVFilesjl, so I can probably better speak to that one. Here are some things I like about it:

It is part of the larger Queryverse.jl file IO story, which gives you a nice uniform API not just for CSV files, but also ExcelFiles.jl, FeatherFiles.jl, StatFiles.jl (Stata, SPSS, and SAS files), and ParquetFiles.jl. All of that is documented here.
It works with any source or sink that implements the TableTraits.jl interface. When I last counted that was something like 21 packages.
It uses TextParse.jl under the hood, which is fast. And getting faster, there are a bunch of things on master and in branches that are not yet reflected in the benchmarks I posted there.
It is a mature package, by julia standards. Its been around for about 1.5 years. It gets a continues stream of improvements, but the basic structure has been settled and battle tested since the beginning. It has been a very stable story, and I expect that to stay that way going forward, i.e. I generally try very hard to not break things, and the track record so far has been pretty good, I think
You can read gz compressed files out of the box. I’m just mentioning this here, because in a hilarious twist the package has supported that for a long time, but I only found out a few weeks ago See here.

Topic		Replies	Views
[ANN] New CSV.jl 0.5 Release Package Announcements data , csv	18	5079	October 20, 2019
[ANN] Fread.jl - read CSVs faster with the help of R's {data.table} Package Announcements performance , data , csv	6	2053	October 9, 2019
CSV parsing performance Data	4	1404	July 4, 2017
CSV vs DelimitedFiles vs Numpy Performance	15	969	January 20, 2024
[ANN] TableReader.jl - A fast and simple CSV parser Package Announcements package , announcement , data , csv	24	5878	March 28, 2019