Request for feedback on potential CSV.jl feature

piever · June 7, 2021, 8:39am

Mainly because this has not been mentioned here, I was wondering how much of what is proposed is already possible with a reasonably simple API using CSV + shashi/FileTrees.jl: Parallel file processing made easy (github.com), see also the package announcement.

This probably does not address the parsing / promotion issues, but I thought it could be a relevant reference (it should definitely handle reading multiple files concurrently).

Another useful reference for the API could be JuliaDB.loadtable, which also allows to ingest multiple files at once. It also supports adding a separate column that is populated with the name of each file (or a function thereof). This can actually be pretty useful if some relevant information is encoded in the file name and not in the csv itself.

Topic		Replies	Views
[ANN] New CSV.jl 0.5 Release Package Announcements data , csv	18	5078	October 20, 2019
[ANN] CSV.jl 0.7 Release Data	38	5328	July 18, 2020
ANN: uCSV.jl Data	4	1206	October 3, 2017
[ANN] TableReader.jl - A fast and simple CSV parser Package Announcements package , announcement , data , csv	24	5875	March 28, 2019
What's the difference between CSV.jl and CSVFiles.jl? New to Julia	25	8101	January 29, 2020

Request for feedback on potential CSV.jl feature

Related topics