@spawn at fails when called inside a function in a module

Sijun · April 4, 2022, 11:13am

I have @spawnat expression run inside a function in a module and get the following error:

module Test
using Distributed

function test()
    t = @spawnat :2 1==1
    @show fetch(t)
end

end

julia> Test.test()
ERROR: On worker 2:
UndefVarError: Test not defined

It looks like somehow the remote process needs to know about Test but I don’t understand why. t is just a local variable and it must have no binding to the name Test.

Sukera · April 4, 2022, 12:33pm

It’s because the remote process has to know where to put the result. There’s a module wide sync variable for distributed operations, according to the source of @spawnat and what the macro expands to:

[sukera@tempman ~]$ julia -q -p 2
julia> using Distributed

julia> module Test
       using Distributed

       function test()
           t = @spawnat :2 1==1
           @show fetch(t)
       end

       end
Main.Test

julia> using .Test

julia> @code_lowered Test.test()
CodeInfo(
1 ─       Core.NewvarNode(:(value))
│         Core.NewvarNode(:(t))
│         #1 = %new(Main.Test.:(var"#1#2"))
│   %4  = #1
│         ref = Distributed.spawnat($(QuoteNode(2)), %4)
└──       goto #3 if not false
2 ─       Distributed.put!(Main.Test.:(var"##sync#48"), ref)
3 ┄       t = ref
│   %9  = Main.Test.fetch(t)
│         value = %9
│   %11 = Base.repr(%9)
│         Base.println("fetch(t) = ", %11)
└──       return value
)

Distributed creates a Future, passes that to the external process to place the result into and then assigns that Future to t. ~~I’m not sure if that requires knowing the Test module though, may be a bug.~~

EDIT: thinking about this some more, it does require knowing about the containing module, because there may be more than one module referencing that variable. I guess it could be gensymd, but then how would that be communicated to the remote process?

Sijun · April 4, 2022, 1:54pm

How could there be more than one module referencing that variable t? t is only visible inside Test.test() and the module Test lives only in the calling process.

Sukera · April 4, 2022, 2:53pm

It’s not about t. @spawnat doesn’t even know about it - it’s a seperate, special variable that’s the problem here whose exclusive purpose is facilitating communication between processes. It’s the

that’s the problem, not t. This variable is required to make sure that the remote process knows where to send the result to.

Sijun · April 5, 2022, 12:10am

Thank you for the useful comment! Come to think of it, it seems inevitable for the remote process to know the namespace of where the remote call is made, in order to push a result back to the calling process. And the expanded macro looks in line with the speculation. So, the correct way to call test() would be (indeed it succeeds):

@everywhere module Test ... end

Test.test()

Topic		Replies	Views
Error running distributed code inside of a module General Usage	6	721	January 31, 2021
How do I run a function in a process and afterwards terminate the process? New to Julia question , distributed	0	309	August 9, 2021
Spawning inside a module General Usage question	2	685	April 30, 2021
Why do I have to specify the name of the module when code is included in remote processes? General Usage module , distributed	0	266	July 25, 2022
How to understand code availability of @spawn General Usage question , parallel	1	487	August 19, 2017

@spawn at fails when called inside a function in a module

Related topics