Distributed, AllReduce, and Distributed Training

I’m currently trying to implement an equivalent to the MPI_Reduce operation in Julia. You may want to check out this similar thread. How to sum the chunks of a distributed array using a binary reduction tree?.

I’m not sure either if I used the correct section to post it…