I this other thread an alternative in Java was tested:
But the conclusion (unexpected according to the author) was that it was slower than any of the Julia alternatives.
In that thread about the ray tracer someone suggests that C++ has a better way to deal with that, but we didn’t see that actually implemented. I think that this is a problem that deserves some attention, at least for a definitive answer and advice be available.