ANN: MPI.jl v0.10.0: new build process and CUDA-aware support

simonbyrne · August 16, 2019, 4:18am

I have just tagged a new version of MPI.jl. Though the user-facing interface is largely the same, there has been extensive work underneath to use the C API internally (instead of the Fortran one). As a result, the build process is much simpler (it no longer requires CMake or a Fortran compiler). It also directly supports CUDA-aware MPI libraries, allowing CuArrays to be passed directly as buffers (thanks to Seyoon Ko).

I would greatly appreciate it if people could try it out, especially on different clusters and with different MPI implementations.

johnh · August 16, 2019, 5:06am

Is it worth sharing this on the OpenMPI mailing list? I would guess you work closely with those guys anyway.

johnh · August 16, 2019, 5:09am

I just noticed this is in Julia at Scale!
Picture me doing a happy dance. Or maybe my Miata doing doughnuts (see avatar image).

kose-y · September 3, 2019, 3:20am

Thank you @simonbyrne! Here are some code examples using the new version of MPI.jl.

https://github.com/kose-y/Julia-MPI-CuArray

mcreel · September 4, 2019, 12:46am

Basic Send and Recv! work fine on an ordinary Linux cluster with OpenMPI. Thanks!
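
For anyone who wants to repeat this kind of check, here is a minimal sketch of a blocking Send/Recv! exchange between two ranks. It assumes a recent MPI.jl where the positional forms `MPI.Send(buf, dest, tag, comm)` and `MPI.Recv!(buf, src, tag, comm)` are available; the function name `send_recv_demo` and the message contents are made up for illustration.

```julia
using MPI

MPI.Init()

# Send a small vector from rank 0 to rank 1; all other ranks do nothing.
# Returns the received vector on rank 1 and `nothing` everywhere else.
function send_recv_demo(comm)
    rank = MPI.Comm_rank(comm)
    nranks = MPI.Comm_size(comm)
    nranks < 2 && return nothing          # the exchange needs at least two ranks

    tag = 0
    if rank == 0
        data = collect(1.0:4.0)           # [1.0, 2.0, 3.0, 4.0]
        MPI.Send(data, 1, tag, comm)      # blocking send to rank 1
    elseif rank == 1
        buf = Vector{Float64}(undef, 4)   # receive buffer of matching size
        MPI.Recv!(buf, 0, tag, comm)      # blocking receive from rank 0
        return buf
    end
    return nothing
end

result = send_recv_demo(MPI.COMM_WORLD)
result === nothing || println("rank 1 received ", result)
```

Saved as, say, `sendrecv.jl` (a hypothetical file name), this would be launched with something like `mpiexec -n 2 julia sendrecv.jl`. Run on a single process, it simply skips the exchange.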

rveltz · September 9, 2019, 4:35pm

Nice! Is there a way to partition a big CuArray into p parts like the slab decomposition?
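
To make the question concrete, a slab decomposition splits one dimension of the array into p contiguous blocks, one per rank. The sketch below only computes the index ranges and is not an MPI.jl feature; the helper name `slab_range` is hypothetical, and ranks are taken to be 0-based as in MPI.

```julia
# Index range owned by `rank` (0-based) when `n` indices are split into
# `p` contiguous slabs. The first `n % p` ranks get one extra index.
function slab_range(n::Integer, p::Integer, rank::Integer)
    base, extra = divrem(n, p)
    start = rank * base + min(rank, extra) + 1
    len = base + (rank < extra ? 1 : 0)
    return start:(start + len - 1)
end

# Example: split the last dimension of a 4×10 array among 3 ranks.
A = reshape(collect(1:40), 4, 10)
slabs = [view(A, :, slab_range(size(A, 2), 3, r)) for r in 0:2]
```

Because Julia arrays are column-major, slabs along the last dimension are contiguous in memory, which is convenient when a slab is later used as a communication buffer.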

kose-y · September 10, 2019, 12:10am

I’m working on it based on barche/MPIArrays.jl. I think it can be released some time this year.

rveltz · September 10, 2019, 5:55am

You may be interested in the Python library.

kose-y · September 10, 2019, 7:36am

@rveltz This looks interesting. Thanks for the reference!

johnh · September 13, 2019, 10:11am

I have access to a DGX-1 GPU system. I’m not sure how much time I can get on it.
If there are any tests which could be run for this package I could give it a try.

simonbyrne · September 13, 2019, 4:11pm

Do you know if it has a CUDA-aware MPI? If so, it would be good to run the test suite with JULIA_PROJECT set to [pkgdir]/test/cudaenv.

samo · December 3, 2019, 5:53pm

@kose-y and @simonbyrne, a big thanks for making CUDA-aware MPI available to the Julia community! Unfortunately, when I tried CUDA-aware MPI with MPI.jl on two different systems, it failed in both cases. Could you have a look at this post where I reported the errors? It would be fantastic if I got it to work before the AGU conference this weekend…

rveltz · May 31, 2020, 8:06pm

Hi,

Is there any news about Distributed Arrays with MPI, or mixing CuArrays with MPI?

simonbyrne · June 1, 2020, 3:25pm

You will need to build and link against a CUDA-aware MPI implementation, but other than that, CuArrays should work with MPI.jl:
https://juliaparallel.github.io/MPI.jl/stable/usage/#CUDA-aware-MPI-support-1

There is a proof-of-concept package of distributed arrays:

but other than that, no. It really depends on what sort of functionality you will want, e.g. we’ve built our own to provide support for ghost elements (https://github.com/CliMA/ClimateMachine.jl/blob/master/src/Arrays/MPIStateArrays.jl), but it makes a lot of assumptions about data layout, etc.

rveltz · June 2, 2020, 8:31am

I see. Ideally, I want to perform FFTs on multiple GPUs…

simonbyrne · June 2, 2020, 6:34pm

There is some discussion here: PencilFFTs on GPUs? · Issue #3 · jipolanco/PencilFFTs.jl · GitHub

rveltz · June 2, 2020, 7:11pm

Yes! That is exactly what I am looking for.

jipolanco · June 2, 2020, 7:35pm

As mentioned in the issue, it would be great to add GPU support to PencilFFTs. The MPI-heavy part of the code was recently refactored, and hopefully, extending things to work with CuArrays should not take too much work.

I personally don’t have any experience with CUDA-aware MPI, and I don’t have access to multi-GPU systems (that I’m aware of), so any help with this is most welcome!

kose-y · October 28, 2020, 1:34pm

I have just released it here.

https://github.com/kose-y/DistStat.jl

rveltz · October 28, 2020, 4:59pm

This is really cool stuff. I am wondering if you should not rename it MPIArraysV2.jl

It would be a shame to have this work “duplicated”. I see there is already PencilArrays as well.
