Looking for Some Best Practices for Optimizing Julia Code Performance?

Of course, start with this one: Performance Tips · The Julia Language

For allocations: Common allocation mistakes
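
To give a flavour of what "allocation mistake" means here, this is a minimal sketch of one typical case: slicing an array copies it, while a view does not. It is not taken from the linked post. The function names (`sum_head_copy`, `sum_head_view`, `measure`) are made up for illustration, and only Base Julia is used.

```julia
# Hypothetical helper names; only Base Julia is used.

# Allocating version: `x[1:n]` copies the slice into a new array on every call.
sum_head_copy(x, n) = sum(x[1:n])

# View version: `@view` wraps the same memory instead of copying it.
sum_head_view(x, n) = sum(@view x[1:n])

# Measure inside a function so the use of a global variable does not
# distort the numbers reported by `@allocated`.
function measure(x, n)
    bytes_copy = @allocated sum_head_copy(x, n)
    bytes_view = @allocated sum_head_view(x, n)
    return bytes_copy, bytes_view
end

x = rand(10_000)

measure(x, 5_000)                           # warm-up call, so compilation is not counted
bytes_copy, bytes_view = measure(x, 5_000)

println("copy: ", bytes_copy, " bytes, view: ", bytes_view, " bytes")
```

On a typical setup the copying version should report at least the size of the slice (5,000 `Float64` values, so about 40 kB), and the view version should report far less, often zero. Exact numbers depend on the Julia version, so measure on your own machine.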

For parallel computing, this package and its docs are very useful: OhMyThreads · OhMyThreads.jl
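
As a rough idea of what using it looks like, here is a small sketch with `tmapreduce`, the threaded counterpart of `mapreduce` that OhMyThreads exports. It assumes the package is installed (`] add OhMyThreads`) and that Julia was started with more than one thread, for example `julia --threads=auto`. The input `xs` is just made-up example data.

```julia
using OhMyThreads: tmapreduce   # assumes OhMyThreads.jl is installed

xs = 1:1_000                    # illustrative input, not from the original post

# Threaded map-reduce: apply `abs2` to every element and combine the results with `+`.
total = tmapreduce(abs2, +, xs)

# Sequential reference using Base, to check that both give the same answer.
reference = sum(abs2, xs)

println("threads: ", Threads.nthreads(), ", total: ", total, ", matches: ", total == reference)
```

With a single thread this still runs, just without any speedup. For a workload this tiny the threading overhead will likely dominate, so treat it as a shape-of-the-API example rather than a benchmark.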