Why is Python, not Julia, still used for most state-of-the-art AI research?

Hmm, yes, multi-GPU was a blind spot for me; I see it was done as far back as 2016, but not specific to ANNs/DL.

Is it for sure dead? This one, or the one at JuliaGPU (both updated recently)? GitHub - vchuravy/NCCL.jl: a Julia wrapper for the NVIDIA Collective Communications Library.

There’s interesting work being done to scale NNs down, not just up (as with GPT-3), both for NLP and computer vision. Still, GPT-3 is huge (ALBERT, which I mentioned, is much smaller), so multi-GPU seems needed for sure (at least for good NLP now).

I’m curious: if the network itself doesn’t need to be that big (say it fits in one GPU’s memory), but the problem is the dataset/training time, what happens if you split it 2 or N ways and train independently? Can you in general (or say, for images only) combine two such trained networks? Isn’t that what people call minibatching? I could see it maybe not working for NLP.
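As I understand it, there’s a distinction here: what data-parallel training does is synchronize gradients every step (which is mathematically just a bigger minibatch), whereas training fully independently and merging the weights afterwards generally doesn’t work well for nonlinear nets. A minimal sketch of the synchronized version with Zygote; the linear model, array sizes, and learning rate are all made up purely for illustration:

```julia
using Zygote

# Toy linear model and data (made-up sizes, purely illustrative).
W = randn(Float32, 1, 10)
X = rand(Float32, 10, 64)
Y = rand(Float32, 1, 64)

# Mean squared error over a batch.
loss(W, x, y) = sum(abs2, W * x .- y) / size(x, 2)

# Split the batch N ways, as a data-parallel setup would across GPUs.
N = 4
shards = [(X[:, i:N:end], Y[:, i:N:end]) for i in 1:N]

# Each "worker" computes a gradient on its own shard...
grads = [Zygote.gradient(w -> loss(w, x, y), W)[1] for (x, y) in shards]

# ...and the average of those gradients equals the full-batch gradient
# (for equal-sized shards), so one synchronized step is just a bigger minibatch.
avg_grad = sum(grads) ./ N
full_grad = Zygote.gradient(w -> loss(w, X, Y), W)[1]
@assert isapprox(avg_grad, full_grad; rtol=1e-4)

# One SGD step with the averaged gradient.
η = 0.01f0
W -= η .* avg_grad
```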

And can you simply use:

Horovod is a distributed deep learning training framework for TensorFlow, Keras, PyTorch, and Apache MXNet
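Under the hood, that’s the pattern Horovod (and NCCL) implement: an allreduce that averages gradients across workers after each backward pass. A sketch of the same collective in Julia with MPI.jl, assuming a hypothetical per-worker gradient array (run with something like `mpiexec -n 4 julia script.jl`):

```julia
using MPI

MPI.Init()
comm = MPI.COMM_WORLD
nworkers = MPI.Comm_size(comm)

# Hypothetical per-worker gradient, e.g. computed on this rank's data shard.
local_grad = rand(Float32, 1000)

# Sum gradients across all ranks, then divide to get the average --
# the same collective that Horovod/NCCL perform after each backward pass.
avg_grad = MPI.Allreduce(local_grad, +, comm) ./ nworkers

# Every rank now holds the identical averaged gradient and can apply
# the same optimizer step, keeping the model replicas in sync.
MPI.Finalize()
```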

E.g. Julia has had official MXNet support for a long time now (though MXNet isn’t the most popular framework, for Julia or otherwise), and as I posted, there’s a PyTorch wrapper, while the TensorFlow one is a bit outdated.