JuliaDB Questions/Issues

MaximilianJHuber · July 2, 2019, 3:50pm

If you have a single large file there are two ways to go about what you want:

Read the file into JuliaDB with loadtable in one go and call Dagger.distribute on that table. The only question is whether your memory can take it. If loadtable fails, maybe consider TextParse.jl or CSV.jl which will yield a DataFrame which you can turn into a JuliaDB table.
Split the CSV file first. Write a short piece of code that splits your large file into many small files and use loadtables distributed functionality.

Topic		Replies	Views
ANN: JuliaDB.jl Community	40	9705	November 13, 2018
JuliaDB, tutorial with large datasets and other questions General Usage tutorials	0	830	January 20, 2020
Struggling with Julia and large datasets General Usage question , big-data	67	11064	October 17, 2024
JuliaDB - Saving to CSV New to Julia question	11	5192	June 18, 2019
Performance of `Union{Missing,Float64}` General Usage question	14	1147	May 25, 2021