If tuplex can do it. So can Julia!

dfdx · July 20, 2021, 7:26pm

I think the question is not how to implement it, but who actually needs it. Some time ago I asked here what people need from a distributed computation framework and got silence in response. Spark evaluated into just a more flexible SQL database. Distributed machine learning is mostly concerned with multi-GPU training and has its own frameworks. UDFs are rare and usually it’s easier to just implement them in the native language for a framework (e.g. in Java/Scala for Spark).

Topic		Replies	Views
What do you need from a distributed computation framework? Julia at Scale	1	672	July 20, 2021
JuliaData BoF @ JuliaCon2023 discussion Data discussion	2	466	August 14, 2023
HPC / Julia, MPI / big data Julia at Scale	15	1481	October 13, 2020
Can Julia efficiently make use of 20+ cores for transforming hundreds of millions of rows for machine learning? Machine Learning question , big-data	27	2984	December 1, 2020
Expected Performance of Julia within a Spark Environment? General Usage performance , spark	7	545	December 7, 2022

If tuplex can do it. So can Julia!

Related topics