With a sorting network, I can sort 4 values using 5 comparisons in three sequential groups.
The first two comparisons are internally independent, so ok to do in parallel. And so for the next two.
Is there a way to let Julia know a local sequence that is not a loop need not be sequential?