PyCall: Weirdish PyArray conversion performance behaviour

davidavdav · January 18, 2018, 12:35pm

Hello,

I have a PyCall-wrapped Python module (musdb) that natively gives me a PyArray{Float64}, which can be fairly large.

At some stage I need an Array{Float32} of this, but conversion times (in seconds, as reported by @elapsed, with pa being the PyArray) vary a lot:

@elapsed convert(Array{Float32}, pa) ## 15.8
@elapsed convert(Array{Float32}, convert(Array{Float64}, pa)) ## 3.1
@elapsed convert(Array{Float32}, view(pa, :, :)) ## 0.063

A PyArray is already a view of the underlying Python object, yet wrapping it in an explicit view() makes the conversion about 250 times faster than converting it directly.

—david

stevengj · January 18, 2018, 1:30pm

The PyArray type was implemented a fairly long time ago, before all of the IndexStyle stuff in Base. It could be that the convert routine is somehow using linear indexing with PyArray, which will be slow since it uses ind2sub, rather than the newer CartesianIndex loops that are used by SubArray.
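
To make the linear-versus-Cartesian distinction concrete, here is a minimal sketch. It does not use PyCall: CartWrap is a hypothetical stand-in for a wrapper type that natively supports only N-dimensional indexing, and sum_linear and sum_each are illustrative names. It is written for Julia 1.x, where the linear-to-subscript translation that ind2sub used to do happens inside Base's indexing fallback.

```julia
# Hypothetical stand-in for a wrapper such as PyArray: it holds a parent
# array and natively supports only N-dimensional (Cartesian) indexing.
struct CartWrap{T,N,P<:AbstractArray{T,N}} <: AbstractArray{T,N}
    parent::P
end

Base.size(w::CartWrap) = size(w.parent)
Base.IndexStyle(::Type{<:CartWrap}) = IndexCartesian()
Base.getindex(w::CartWrap{T,N}, I::Vararg{Int,N}) where {T,N} = w.parent[I...]

# Linear loop: each w[i] must first be translated into Cartesian
# subscripts by Base's fallback before our getindex is reached.
function sum_linear(w)
    s = zero(eltype(w))
    for i in 1:length(w)
        s += w[i]
    end
    return s
end

# Generic loop: eachindex picks the index type that IndexStyle advertises,
# so no per-element translation is needed.
function sum_each(w)
    s = zero(eltype(w))
    for I in eachindex(w)
        s += w[I]
    end
    return s
end

w = CartWrap(rand(200, 300))
@assert eachindex(w) isa CartesianIndices   # Cartesian iteration is selected
@assert sum_linear(w) ≈ sum_each(w)         # both loops visit the same elements
```

Timing the two loops with @elapsed (after a warm-up call to exclude compilation) is one way to check whether the translation cost matters for a given wrapper; the size of the gap will depend on the type and the Julia version.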

It would be interesting to drill down (with @which or @edit) to find what methods are being called by the convert routine, and what additional IndexStyle (or whatever) methods could be defined for PyArray to make it switch over to the faster path apparently used by SubArray. A PR would be welcome.
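
A sketch of what that drill-down could look like. A plain Array stands in for the PyArray here so the snippet runs without PyCall; with PyCall loaded, the same calls would be made on pa and on view(pa, :, :). The variable names are illustrative only.

```julia
using InteractiveUtils  # provides @which and @edit outside the REPL

A = rand(Float64, 4, 3)   # stand-in for the PyArray `pa`
v = view(A, :, :)

# Which convert method is dispatched for each argument type?
m_arr  = @which convert(Array{Float32}, A)
m_view = @which convert(Array{Float32}, v)
println(m_arr)
println(m_view)

# Which indexing style does each type advertise?
println(IndexStyle(typeof(A)))
println(IndexStyle(typeof(v)))
```

If both conversions resolve to the same generic method, the difference is further down the call chain, and repeating @which (or @edit) on the calls inside that method is the next step.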

davidavdav · January 19, 2018, 2:48pm

OK, thanks, I might have a look into this, but this would require quite a bit of study on my side.
