How to parallelize within a function

tkf · October 2, 2021, 3:18am

Uh… I don’t think this is true yet actually. Some things (e.g., Array) work great but many things are hard to reason about the safety (e.g., sparse matrix). Importantly, there’s no documentation on what is safe when.

Sorry to nitpick, but I’d be careful about such a claim. For some definitions of “true threads”, one can argue Python has “more true” threads than Julia. For example, Python has a more transparent OS thread API than Julia; Julia only has tasks (and that’s kinda the point). But unfortunately to Python programmers, it was designed in 90s where threads for parallelism were not a thing (at least not for everyone) and so it’s useful mainly for I/O or GIL-releasing external code. On the other hand, Julia has “more true” threads than Python in another sense if one defines “threads” as a synonym of shared-memory parallelism.

If you have a rough idea of the applicability of process-based parallelism in the problem you have, I think starting from your comfort zone sounds like a good idea.

Going back to the problem in the OP, it’s not a good idea to use @everywhere inside a function. It’s mainly for “static things” like using Package and include("script.jl"). You’d probably want to use remotecall here (and maybe iterate over the worker ids returned from workers()).

Also, please quote your code: Please read: make it easier to help you

Topic		Replies	Views
Adding data to worker processes via @everywhere General Usage	12	1200	January 11, 2019
Using addprocs and pmap inside a function General Usage question	0	296	November 17, 2020
@everywhere and pmap inside of a function? Julia at Scale question , parallel	14	3824	August 17, 2017
Documentation for parallel computing - clarification Performance documentation	3	670	October 22, 2019
Baffling addprocs() with @everywhere Julia at Scale	17	3443	April 24, 2018

How to parallelize within a function

Related topics