Blog post: How to optimise Julia code: A practical guide

Mateusz_K · June 15, 2022, 4:40am

Nice post. Maybe it’s also worth adding the advantages of threads over processes for performance. Imagine a genomics app that loads data, spams fork(), and then passes data between processes: inter-process communication is less efficient than communication between threads. I think people use fork() because they don’t want to deal with race conditions and it’s easier for them, but threads are so much lighter and just better for performance.

Oscar_Smith · June 15, 2022, 5:21am

There is some nuance here. Threaded Julia can currently have problems with GC time, which can make it easier to reach full speed with multiple processes (for now).

Salmon · June 16, 2022, 7:43am

Thanks, this is an interesting read!
Could someone clarify this part which I’m not sure I understand correctly:

A practical advice is to look through your code and see when you conceptually use a type whose structure is not perfectly captured by any existing types in your code. For example, if you see Vector{Tuple{String, UInt8, DNA}}, in your code, then it probably means you need to refactor the tuple out to a new struct, which you then can optimise.

From a code design perspective this is clear but I wonder what exactly would be the performance gain in defining a new type for the tuple. From what I understand, small tuples like this are often indistinguishable from defined types and often get compiled to the same machine code.

Or is it simply that you can define and dispatch on more efficient methods for this datatype?

jakobnissen · June 16, 2022, 8:13am

That’s exactly right. For the compiler, structs and tuples are the same, and produce the same machine code. But my experience so far has been that for the programmer (i.e. me), creating a new type enables me to be more clear about what data I need to encode, how it can be encoded efficiently, and what kinds of operations I do on this thing.
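
To make that concrete, here is a minimal sketch of such a refactor. The names `Record`, `name`, `quality`, `base` and `is_high_quality` are made up for illustration, and the `DNA` enum is only a stand-in for whatever DNA type the original code uses.

```julia
# Hypothetical stand-in for the DNA type in the quoted passage.
@enum DNA::UInt8 DNA_A DNA_C DNA_G DNA_T

# A named struct with the same three fields as Tuple{String, UInt8, DNA}.
struct Record
    name::String
    quality::UInt8
    base::DNA
end

as_tuple  = ("read1", 0x1e, DNA_G)
as_struct = Record("read1", 0x1e, DNA_G)

# The struct and the tuple have the same field types in the same order,
# so their in-memory sizes can be compared directly.
same_size = sizeof(typeof(as_tuple)) == sizeof(Record)

# Only the struct gives a type to dispatch on and field names to read.
is_high_quality(r::Record) = r.quality >= 0x14

records = [as_struct, Record("read2", 0x05, DNA_A)]
n_good = count(is_high_quality, records)
```

Once the data has a name, it becomes easier to ask whether, say, `name` really needs to be a `String`, which is the kind of question the anonymous tuple tends to hide.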

jjdegruijter · June 26, 2022, 2:54pm

Could you please explain the concept of type stability to me as a newbie?

jules · June 26, 2022, 3:16pm

It just means that, for a given set of input types to a function, the return type is the same no matter what concrete values the inputs have. An extended version is that all intermediate values and results of function calls inside the function also have the same types for a given set of input types. Because the compiler compiles knowing only the input types, not the values (except for constant folding), you get inefficient machine code whenever it has to handle values whose types are unknown at compile time.

The simplest way to violate type stability is to do something like

if condition
    object_of_type_A
else
    object_of_type_B
end
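
As a runnable sketch of the same pattern, the two functions below (`unstable` and `stable` are made-up names) differ only in whether both branches return the same type. `Base.return_types` is an unexported helper that reports what inference concludes for a given tuple of argument types; interactively, `@code_warntype` is the more common way to look at this.

```julia
# Type-unstable: returns an Int for positive input and a Float64 otherwise,
# so the return type depends on the value of x, not just on its type.
unstable(x::Int) = x > 0 ? x : 0.0

# Type-stable: both branches return a Float64.
stable(x::Int) = x > 0 ? float(x) : 0.0

# Ask inference for the return type given an Int argument.
unstable_type = only(Base.return_types(unstable, (Int,)))
stable_type   = only(Base.return_types(stable, (Int,)))

# A concrete inferred return type is the sign of a type-stable function;
# a Union of several types is not concrete.
unstable_is_concrete = isconcretetype(unstable_type)
stable_is_concrete   = isconcretetype(stable_type)
```

For a small union like this one the compiler often copes well anyway, so treat it as an illustration of the concept rather than a performance benchmark.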

rafael.guerra · June 26, 2022, 4:02pm

Fyi, this blog post on type stability is a very good read.

jjdegruijter · June 27, 2022, 10:37am

That is helpful. Thank you.
