I just released a new version of Transducers.jl with a new threads_basesize option to dreduce. “Two-level” parallelism (multiple worker processes, each running multiple threads) should kick in automatically if you set JULIA_NUM_THREADS appropriately on all workers.
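For anyone who wants to try it, here is a minimal sketch of what that could look like. It makes several assumptions: the worker count, thread count, input range, and both basesize values are arbitrary illustration values, and it relies on local workers inheriting the master's environment, so that setting JULIA_NUM_THREADS before addprocs reaches them. As I understand it, basesize controls the chunk sent to each worker process and threads_basesize the chunk handled by each task inside a worker.

```julia
using Distributed

# Assumption: local workers inherit the master's environment, so setting
# JULIA_NUM_THREADS before addprocs gives each worker multiple threads.
ENV["JULIA_NUM_THREADS"] = "4"   # arbitrary value for illustration
addprocs(2)                      # arbitrary number of worker processes

@everywhere using Transducers

# Check how many threads each worker actually got before relying on them.
worker_threads = [remotecall_fetch(Threads.nthreads, w) for w in workers()]
@show worker_threads

xs = 1:1_000_000

# First level: split xs across worker processes in chunks of `basesize`.
# Second level: each worker splits its chunk across tasks in pieces of
# `threads_basesize`. Both sizes are illustrative, not tuned.
total = dreduce(+, Map(x -> x^2), xs;
                basesize = 250_000,
                threads_basesize = 10_000)
```

If worker_threads shows 1 for every worker, the environment variable did not reach them; in that case starting the workers with the variable set in whatever way your cluster manager supports should have the same effect.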