Allocations in multithreaded matrix operations (caused by BigFloats)

multithreading
memory-allocation

#21

Yes, that is true.
It doesn't make a big difference in this case for me, though.
I measured the times, and calling columnwise seems to be only 0.8% faster.


#22

No.


#23

To give you a quick update:

I tried to make my algorithm work using only Arbs, to avoid the new allocations caused by the BigFloats.

Unfortunately, this does not work, because Arbs do not seem to be made for this kind of usage.
They lose numerical stability during the matrix operations and give imprecise results (unless one sets the precision much higher than for BigFloats, but that makes the performance even worse than the BigFloat allocations).

I would like to know if someone knows another way to make this type of algorithm work.

Maybe there is another multiprecision floating-point type, or a way to preallocate the BigFloat calculations?


#24

Thank you for the update. Your experience is a matter of record, and I will refer back to it after finishing work on the functional part of ArbNumerics, and before considering whether there are aspects of the underlying library that might ameliorate some of this. Please recap:

(a) the dimensions of the matrices of interest,
(b) the bit precision of the numeric entries as given when creating the matrices,
(c) the bit precision you require of the numeric entries at the conclusion of all processing,
(d) the specific matrix-valued functions, transforms, factorizations, and any other computational work you apply, with the order of operations you adopt.

I do not suggest that there is help available within the Arb C library, nor that, if there were, it would be accessible to Julia in a manner I could utilize; still, it is not entirely impossible.


#25

The only way I know to preallocate BigFloats is to do it in C [and by ‘know’ read “assume it could be done”]. There is a modern arbitrary-precision Fortran library, mpfun2015, if you want to port that.


#26

In case there is a misunderstanding:
I could only test with Nemo Arbs, since you said there is no way to convert them to your arbs, and the matrix is generated through Nemo Bessel functions.
I don't know whether your arbs have the same kind of issues or whether they behave differently from the ones in Nemo.

Regarding your questions:

a) larger than 1000
b) 1.2× the matrix dimension (precision in digits, not bits!)
c) That is hard to tell; I need to find the value where the determinant reaches its minimum. The issue is that the determinants get really small (< 1e-100000), so with Arbs I often only get an error ball.
d) I am computing the determinant of a large, dense, ill-conditioned matrix with the parallel LU factorization algorithm by Beliakov & Matiyasevich.

If you are interested in more specific details, you can read my conversation in the Nemo-Forum:
https://groups.google.com/forum/?hl=en#!topic/nemo-devel/55Baht7u_yE

Thank you for your efforts!