Thread- and process-based parallelisms in Transducers.jl (+ some news)

tkf · December 15, 2019, 2:39am

It’s been a while since I added parallelism supports in Transducers.jl but I’ve never announced this feature properly. I just added a few utility functions and a tutorial so I think it’s good timing to do this.

Quoting Overview of parallel processing in Transducers.jl:

Transducers.jl supports thread-based (reduce) and process-based (dreduce) parallelisms with the same composable API; i.e. transducers. Having a uniform API to cover different parallelisms as well as sequential processing foldl is useful. Using multiple cores or machines for your computation is as easy as replacing foldl with reduce or dreduce; you don’t need to re-write your transducers or reducing functions.

See also:

Parallel processing tutorial in Transducers.jl manual.

API documentation of reduce and dreduce.

Thread-based parallelism

Transducers.jl supports thread-based parallelism for Julia ≥ 1.0. You can use it by replacing foldl with reduce. With Julia ≥ 1.3, Transducers.jl supports early termination to avoid unnecessary computation while guaranteeing the result to be deterministic; i.e., it does not depend on how computation tasks are scheduled.

Process-based parallelism

Transducers.jl supports process-based parallelism using Distributed.jl. You can use it by replacing foldl with dreduce. It can be used for horizontally scaling the computation. It is also useful for using external libraries that are not “thread-safe.”

Note that early termination is not supported in dreduce yet.

Misc news

I managed to upstream a few transducers to Julia Base! It’ll be available in Julia 1.4. For example, it makes sum(y for x in 1:1000 for y in 1:x if y % 2 == 0) ~3x faster. See: Transducer as an optimization: map, filter and flatten by tkf · Pull Request #33526 · JuliaLang/julia
Recent versions of Transducers.jl include withprogress that can be used to monitor the progress of your computation. This is done by emitting ProgressLogging.jl-compatible progress events. It will show progress bars if you use Juno, ConsoleProgressMonitor.jl, or TerminalLoggers.jl. It can be used with thread- and process-based parallel reduce.
Transducers.jl now can be used to create various table types from DataFrames.jl, TypedTables.jl, StructArrays.jl, etc. See copy and its parallel versions tcopy and dcopy.

carstenbauer · December 15, 2019, 6:01am

Great work!

tkf · December 16, 2019, 3:10pm

I forgot to mention this, but Example for the Depth first multithread implementation performance gain as a motivation reminded me that the early termination feature depends on that Julia scheduler being depth-first. The computed result is deterministic and scheduler independent. However, the depth-first scheduling makes it possible to terminate as early as possible by writing the reduction in divide-and-conquer approach. It makes the implementation very straightforward, if not trivial. A big thanks to Julia dev team!

tkf · January 9, 2020, 1:51am

Cross-posting:

ianfiske · January 15, 2020, 7:23pm

Does Transducers.jl support nested threaded parallelism similar to raw @spawn? That is, in the [contrived] example

using Transducers

function f1(x)
    xs = x .+ rand(10000)
    return reduce(+, Map(sin), xs)
end

reduce(+, Map(f1), 1:10000)

Is threaded-parallelism used at both the top-level reduce and also within each f1?

tkf · January 15, 2020, 9:06pm

Transducers.jl is implemented with @spawn in Julia >= 1.3 so it naturally supports nested parallelism as in your example. (But I think it will crash in Julia < 1.3.)

Topic		Replies	Views
Parallel implementation of Transducers.jl General Usage parallel	1	542	February 22, 2021
Writing effective parallel code Performance parallel	8	1709	December 18, 2019
ANN: Transducers.jl 0.3. taking "zeros" seriously, type stability improvements, fusible groupby, OnlineStats, "GPU support", and more Package Announcements	29	3139	December 2, 2020
Some questions on Transducers.jl General Usage	6	973	February 22, 2021
Transducers and reduct-like reduction Performance	3	482	August 24, 2021

Thread- and process-based parallelisms in Transducers.jl (+ some news)

Thread-based parallelism

Process-based parallelism

Misc news

Related topics