My Julia code is slower than Python and Matlab

Did you try to add:

using AppleAccelerate

as first line?

On Intel or AMD you could try:

using MKL