ANN: JuliaDB.jl

dfdx · May 8, 2017, 8:00pm

Could somebody please clarify what “distributed datasets” mean in this context? Coming from Hadoop and distributed databases I understand it as a set of files stored on multiple machines with the ability to run particular code or a query locally without copying data over a network. However, in a description of both - JuliaDB.jl and Dagger.jl - I can see only examples of loading data from a local disk and maybe copying it to other machines for processing.

In other words, is JuliaDB.jl more similar to DataTables or to Hadoop?

Topic		Replies	Views
JuliaData BoF @ JuliaCon2023 discussion Data discussion	2	466	August 14, 2023
[ANN] DataFrameDBs.jl Data package , announcement	60	4039	May 2, 2020
[ANN] A new lightning fast package for data manipulation in pure Julia Package Announcements data , dataframes , inmemorydatasets	95	10555	July 4, 2022
Difference between JuliaDB and DataFrames Data	13	1902	June 17, 2021
[ANN] New and Improved JuliaDB Community package , announcement	14	2808	August 7, 2018

ANN: JuliaDB.jl

Related topics