A serious data start-up structured around a Julia data manipulation framework for larger-than-RAM data

CameronBieganek · September 15, 2024, 3:30pm

Yeah, I think a Spark replacement probably has more commercial opportunity than a Pandas/Polars replacement. As you mentioned, adding in distributed ML algorithms can help. Spark has libraries for both distributed ML and distributed graph algorithms.

That being said, I want the library to be free and open-source, so I’m not sure exactly how commercialization would work. Some kind of cloud-computing services? Integration with JuliaHub?

Topic		Replies	Views
What's the latest and greatest in data in Julia Data	29	2379	August 15, 2024
Future directions for DataFrames.jl Data package , dataframes	47	6791	June 3, 2022
Struggling with Julia and large datasets General Usage question , big-data	67	11543	October 17, 2024
Direct interface to Polars Rust library Data question	13	1844	November 9, 2023
How is the data ecosystem right now for large datasets? Data	35	6922	July 13, 2017

A serious data start-up structured around a Julia data manipulation framework for larger-than-RAM data

Related topics