We have loop vectorization which optimizes a single, highly restricted loop, then we will have loopmodel with less restriction but no multithreading. Then, there is a “future work” for loopvectorization. Naturally, one would beg to wonder to combine all the advantages of the loop libraries. Then, where do we go from here? Beat C/C++ at this and Julia will be the first language to fully optimize loops.
Where is it now?