Using Tasks 101

Satvik · May 28, 2024, 9:31pm

I’m not sure if this is the right place to post this, but I spent a lot of time learning about Tasks recently for a project at work, and wrote up a blog post tutorial summarizing how to use them: https://www.lesswrong.com/posts/kPnjPfp2ZMMYfErLJ/julia-tasks-101

Feedback and corrections appreciated!

Sevi · May 31, 2024, 7:23am

I really appreciate this post! Especially as someone who needed several attempts to wrap my head around both the concepts and the nomenclature of tasks/threads/coroutines/… (in general, not particularly in Julia).

Given that the reader already knows about the existence of hardware/OS threads (which I think is a given if one is looking for this kind of information), it’s straight to the point and very accessible

Gast · July 21, 2024, 5:31pm

I liked your post, very useful! Thanks for sharing!

With thread-safe you basically mean avoiding data race between threads, right? Or something else?

Have you ever had the need of using Atomic operations to avoid data race conditions? In the Julia documentation atomic operations can become handy in these cases

nsajko · July 21, 2024, 5:44pm

Atomic operations are a very low-level thing, basically reserved for concurrency experts. In the majority of cases one should instead use higher level synchronization concepts.

Gast · July 21, 2024, 6:14pm

Got it
And what would be examples of higher level synchronization concepts?

Satvik · July 21, 2024, 6:41pm

With thread-safe you basically mean avoiding data race between threads, right? Or something else?

I mean a slightly more general definition of thread-safety. If you try to write to a Dict from multiple threads, you won’t get races, you’ll get a segfault. From Wikipedia: " a function is thread-safe when it can be invoked or accessed concurrently by multiple threads without causing unexpected behavior, race conditions, or data corruption."

Have you ever had the need of using Atomic operations to avoid data race conditions?

I haven’t used Atomic – Channels are a higher-level concept, that along with Tasks are (theoretically) general enough express all concurrent computations.

Atomics seem more like they’re oriented towards language developers, or developers of particularly high performance libraries – they only work on primitive times, only support a limited range of operations, etc.

Satvik · July 21, 2024, 6:59pm

And what would be examples of higher level synchronization concepts?

Channel, Threads.@spawn, and @sync are the main ones built in to Julia.

For higher-level options you can look at OhMyThreads.jl or ThreadsX.jl, both of which are under the JuliaFolds2 organization: JuliaFolds2 · GitHub

sgaure · July 21, 2024, 9:05pm

While Channels are great for serializing access to shared resources like a common Dict, you could add a chapter about locks. It is somewhere between atomics and channels, and fairly easy to understand. What to use depends on how much overhead one can afford. Atomics isn’t very complicated either, except the fine points of memory ordering semantics.

nsajko · July 21, 2024, 9:22pm

Atomics are at the lowest level. Above them are locks and semaphores, above those are condition variables (currently not available in Julia, AFAIK) and Channel.

See the Concurrency part of Operating Systems: Three Easy Pieces for a general traditional overview. For semaphores in particular, see The Little Book of Semaphores by AB Downey.

There was also this fun game, hope the Web site is still functional:

greatpet · July 21, 2024, 9:28pm

Your blog post says @async is deprecated? Julia’s official manual on Network and Streams uses @async in some examples. Should the examples be updated with @spawn as a drop-in replacement?

matthias314 · July 21, 2024, 10:25pm

In fact, @async appears 45 times in the Julia 1.10.4 manual and 48 times in the one for 1.11.0-rc1. Quite a bit for a deprecated macro!

Satvik · July 21, 2024, 11:46pm

While Channel s are great for serializing access to shared resources like a common Dict , you could add a chapter about locks.

A post about this would be interesting for sure, but I’ve had a lot of trouble trying to write correct code with locks. E.g. we left code like this in our codebase for months, which was causing deadlocks in notebooks about once a week:

function setindex!(tsd::ThreadSafeDict, k, v)
    lock(tsd.slock)
    h = setindex!(tsd.d, k, v)
    unlock(tsd.slock)
    return h
end

In contrast when I wrote a Channel version of a thread-safe Dict it just worked.

Your blog post says @async is deprecated? Julia’s official manual on Network and Streams uses @async in some examples. Should the examples be updated with @spawn as a drop-in replacement?

Probably? When I asked about it, I was told “for new code there is no reason to use @async”.

era127 · July 22, 2024, 2:18am

It might be worth mentioning ThreadSafeDicts.jl as well.

carstenbauer · July 22, 2024, 4:21am

Fwiw: GitHub - carstenbauer/LittleBookOfSemaphores.jl: Julia code snippets inspired by the Little Book Of Semaphores

nsajko · July 29, 2024, 7:17am

github.com/JuliaLang/julia

Documentation inconsistency concerning @async

opened 06:27PM - 29 Jun 23 UTC

algunion

domain:docs

The `@async` macro documentation includes the following [warning](https://github….com/JuliaLang/julia/blame/f6f35533f237d55e881276428bef2f091f9cae5b/base/task.jl#L495): > It is strongly encouraged to favor `Threads.@spawn` over `@async` always even when no parallelism is required especially in publicly distributed libraries. This is because a use of `@async` disables the migration of the parent task across worker threads in the current implementation of Julia. Thus, seemingly innocent use of `@async` in a library function can have a large impact on the performance of very different parts of user applications. However, the asynchronous programming piece of the manual presents `@async` as a legitimate way to do things - and `@async` is even used in the how-to related to Channels. Take a look [here](https://github.com/JuliaLang/julia/blame/f6f35533f237d55e881276428bef2f091f9cae5b/doc/src/manual/asynchronous-programming.md#L67C36-L67C36) and in general to the asynchronous programming section of the manual. From the timestamps, it seems that the asynchronous programming section of the manual might be outdated - and is basically suggesting practices that are labeled as to be avoided in other parts of the documentation. I also opened a topic on [discourse](https://discourse.julialang.org/t/inconsistency-between-async-warning-and-its-usage-in-documentation/100978?u=algunion), but it didn't grab much attention. Is the `@async/@swap` warning still standing and the manual is to be updated? Or the other way around?

github.com/JuliaLang/julia

Deprecate `Base.@async`

opened 08:31PM - 09 Nov 23 UTC

kpamnany

kind:julep domain:multithreading kind:deprecation

Julia does need the ability to pin a task to a thread (in our current terminology: make it sticky). This currently happens implicitly for tasks created with `Base.@async` to [support its semantics](https://github.com/JuliaLang/julia/issues/41324). To support those semantics, the parent task must also be [made sticky](https://github.com/JuliaLang/julia/blob/f31cd8ad2fecefa2717aa0f5462876c985acbdea/base/task.jl#L794). More precisely, if a newly created task `S` is sticky (which they are when created with `Task()` or `@task`), then whichever task calls `schedule(S)` will also be made sticky. A task that calls the `AsyncCondition(callback)` constructor (which spawns an `@async` task) will become sticky, as will a task that calls `Timer(callback)`. This sort of stickiness-infection ignores threadpool boundaries -- a task created on the default threadpool can end up being stuck to an interactive thread. We can fix some of these (as we have [previously](https://github.com/JuliaLang/julia/pull/49601)), but we cannot fix them all, due to `@async` semantics. Furthermore, such stickiness-infected tasks are [never](https://github.com/JuliaLang/julia/blob/f31cd8ad2fecefa2717aa0f5462876c985acbdea/base/task.jl#L785) [unstuck](https://github.com/JuliaLang/julia/blob/f31cd8ad2fecefa2717aa0f5462876c985acbdea/base/task.jl#L331). There is [this PR](https://github.com/JuliaLang/julia/pull/41393) which attempts to unstick such tasks, but adds too much overhead. We should introduce an explicit task pin/unpin interface (see https://github.com/JuliaLang/julia/issues/52108) and deprecate `Base.@async` so that all this implicit stickiness and associated cruft can be removed from the runtime.

Satvik · July 30, 2024, 7:49pm

Ok, I’ve attempted a PR to update the documentation, let’s see how this goes: Replace `@async` mentions in manual with `Threads.@spawn` by Satvik · Pull Request #55315 · JuliaLang/julia · GitHub

Topic		Replies	Views
Relation of coroutines, threads, and Tasks General Usage question , parallel , multithreading , task	4	1056	October 1, 2022
Is `@async` memory safe? General Usage multithreading	13	1681	August 19, 2021
Question regarding Julia's planned approach to composable, safe, and easy multithreading Internals & Design multithreading	14	3770	December 23, 2019
Changing and accessing data in different threads General Usage question	3	298	November 22, 2021
Basic examples of Tasks/Channels? (or more verbose documentation?) General Usage	10	4696	November 2, 2017

Using Tasks 101

Related topics