You have to disable the garbage collection while running the threaded Julia code. Something like this:
```python
def mddf(*args, **kwargs):
    jl.GC.enable(False)
    try:
        # this call is multi-threaded in Julia
        result = jl.cm.mddf(*args, **kwargs)
    finally:
        # re-enable the GC even if the Julia call throws
        jl.GC.enable(True)
    return result
```
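The same pattern can be wrapped in a context manager so the GC is guaranteed to come back on. A minimal sketch; the `GCStub` class below is a hypothetical stand-in for juliacall's `jl.GC` module, used only so the example runs without juliacall installed:

```python
from contextlib import contextmanager

@contextmanager
def julia_gc_disabled(jl_gc):
    """Disable the Julia GC for the duration of the block, then restore it."""
    jl_gc.enable(False)
    try:
        yield
    finally:
        jl_gc.enable(True)  # restored even if the body raises

# Hypothetical stub standing in for `jl.GC`; with juliacall installed
# you would pass the real `jl.GC` instead.
class GCStub:
    def __init__(self):
        self.enabled = True
    def enable(self, on):
        previous, self.enabled = self.enabled, on
        return previous

gc = GCStub()
with julia_gc_disabled(gc):
    assert not gc.enabled   # the threaded Julia call would go here
assert gc.enabled           # GC is back on afterwards
```

With juliacall this would read `with julia_gc_disabled(jl.GC): result = jl.cm.mddf(...)`.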
Is this a problem even if you aren’t passing any Python objects to Julia? Do PyCall & co. have the same limitation? This sounds like a severe limitation in JuliaCall and PythonCall.
Surprisingly, it works without much effort using PyJulia, without disabling the garbage collector. However, I haven’t tested it thoroughly enough yet to be sure it is bug-free.
Uhm… it worked here, but note that your function allocates a lot, so I cannot run it here with the GC turned off. I therefore modified it so that it does not allocate as much, and in that case it worked:
In [1]: from juliacall import Main as jl
In [2]: jl.seval("using FLoops")
In [3]: jl.seval("""
...: function my_func(N)
...: samples=Array{Float64}(undef,N)
...: @floop for i in 1:N
...: samples[i]=sum(i for i in 1:10^4)
...: end
...: return samples
...: end
...: """)
Out[3]: my_func (generic function with 1 method)
In [4]: jl.GC.enable(False)
Out[4]: True
In [5]: jl.my_func(10000)
Out[5]:
10000-element Vector{Float64}:
5.0005e7
...
5.0005e7
In [9]: jl.my_func(10000)
Out[9]:
10000-element Vector{Float64}:
5.0005e7
...
5.0005e7
In [10]: jl.my_func(10000)
Out[10]:
10000-element Vector{Float64}:
5.0005e7
...
5.0005e7
In [11]: jl.my_func(10000)
Out[11]:
10000-element Vector{Float64}:
5.0005e7
...
5.0005e7
In [12]: jl.my_func(10000)
Out[12]:
10000-element Vector{Float64}:
5.0005e7
...
5.0005e7
In [13]: %timeit jl.my_func(10000)
43.4 µs ± 2.72 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)
In [14]: jl.GC.enable(True)
Out[14]: False
In [15]: %timeit jl.my_func(10000)
Segmentation fault (core dumped)
I ran the function several times with the GC disabled and didn’t get a segmentation fault; then, after re-enabling it, I did. The %timeit magic (from IPython) runs the function many times.
If I run your original function, the session gets killed for lack of memory, but that’s another, albeit related, issue.
That is, this workaround does require that the threaded code allocates only minimally; otherwise one can run into memory issues, given that the GC is off on the Julia side.
Anyway, that is a workaround; it may not work in every case, and I’m not a specialist on it.
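If the threaded code does allocate, one mitigation (an untested sketch, not something from this thread) is to split the work into batches and re-enable the GC between them, forcing a collection while no Julia threads are running. The `_GC`/`_JL` classes below are hypothetical stubs standing in for juliacall's `jl.GC`, and the `sum(batch)` call is a placeholder for the threaded Julia call:

```python
# Stub standing in for `from juliacall import Main as jl`, so the sketch
# runs without juliacall installed.
class _GC:
    def __init__(self):
        self.enabled = True
        self.collections = 0
    def enable(self, on):
        previous, self.enabled = self.enabled, on
        return previous
    def gc(self):
        self.collections += 1  # with juliacall, jl.GC.gc() forces a collection

class _JL:
    GC = _GC()

jl = _JL()

def process_in_batches(batches):
    """Run each batch with the GC off, collecting between batches."""
    results = []
    for batch in batches:
        jl.GC.enable(False)
        try:
            results.append(sum(batch))  # placeholder for the threaded Julia call
        finally:
            jl.GC.enable(True)
        jl.GC.gc()  # collect while no Julia threads are active
    return results

print(process_in_batches([[1, 2], [3, 4]]))  # → [3, 7]
```

This keeps the GC disabled only inside each threaded region, bounding how much garbage can pile up before a collection.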
Does Python replace/override the fault handlers of Julia?
The multi-threaded GC uses read-protected pages to implement safepoints (e.g. to ask other threads to stop doing work). Hitting a safepoint causes a benign segmentation fault, but it must be handled by Julia’s own segfault handler; if another handler intercepts it instead, the process crashes.
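Related to this, the juliacall documentation describes an environment variable that lets Julia install its own signal handlers, which it recommends when running multithreaded Julia code; it must be set before juliacall is first imported. A sketch, assuming the variable name from those docs:

```python
import os

# Assumption, per the juliacall docs: setting this *before* the first
# `import juliacall` lets Julia keep its own signal handlers, so the
# benign safepoint segfaults are handled by Julia rather than crashing
# the Python process.
os.environ["PYTHON_JULIACALL_HANDLE_SIGNALS"] = "yes"

# from juliacall import Main as jl  # import only after setting the variable
```

Note that this trades away some of Python's own signal handling (e.g. Ctrl-C behavior may change), so it is a deliberate choice rather than a free fix.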