MPLAPACK (multiprecision blas+lapack)

JeffreySarnoff · October 27, 2021, 8:33pm

MPLAPACK works with any of these multiprecision libs: GMP, MPFR+MPC, libquadmath, and David Bailey’s DD/QD library.
Here is the user guide and the repository,

All BLAS routines are available using multiple precision arithmetic.
LAPACK functions are available with multiple precision Real arithmetic. (Complex support is planned for v2):

Linear Equations
Linear Least Squares (LLS) Problems
Generalized Linear Least Squares (LSE and GLM)
Standard Eigenvalue and Singular Value Problems
Symmetric Eigenproblems (SEP)
Nonsymmetric Eigenproblems (NEP)
Singular Value Decomposition (SVD)
Generalized Eigenvalue and Singular Value Problems
Generalized Symmetric Definite Eigenproblems (GSEP)
Generalized Nonsymmetric Eigenproblems (GNEP)
Generalized Singular Value Decomposition (GSVD)

Ver 1.0 (2021-Sep-28) is released under the "2-clause" BSD license.

Oscar_Smith · October 27, 2021, 9:43pm

Would be cool to get that in Julia. I bet we could be faster and 1/10th loc.

stevengj · October 27, 2021, 10:15pm

Don’t we have most of it in Julia already, e.g. with GenericLinearAlgebra.jl and GenericSchur.jl?

JeffreySarnoff · October 27, 2021, 10:18pm

That is a question! I have not been using BLAS or LAPACK funcs, dunno.

stevengj · October 27, 2021, 10:19pm

I’m not talking about the BLAS/LAPACK API (which makes little sense to re-implement in Julia), but rather the functionality.

Oscar_Smith · October 27, 2021, 10:25pm

GenericLinearAlgebra doesn’t do any of the blocking or repacking that is necessary to achieve high performance (even for slower datatypes). An ideal version would be able to do all performance tricks that still matter while keeping the genericness.

Also, it’s probably worth implementing sub-cubic algorithms since they do better when multiplication is more costly than addition which is true for most of the more complex number types.

stevengj · October 27, 2021, 10:31pm

Blocking/repacking/re-ordering only matters for problems that are memory-bound (for simpler implementations). With arbitrary-precision arithmetic, most of these algorithms should be compute-bound, in which case you might as well use LINPACK/EISPACK-style “textbook” triple-loop algorithms.

Oscar_Smith · October 27, 2021, 10:56pm

With arbitrary precision, you are probably right, but I bet that for something like DoubleDouble the packing still matters. Also, as the blocking becomes less important, the sub-cubic methods become more important, so fanciness of some kind is still necessary for good performance.

Fredrik_Johansson · October 28, 2021, 8:32pm

There are order-of-magnitude speedups to be found by doing arbitrary-precision linear algebra using atomic dot products (small sizes) and block matrix multiplication (large sizes), converting FP matrix multiplication to exact matrix multiplication over Z where a multimodular + Strassen approach can be used. See my paper [1901.04289] Faster arbitrary-precision dot product and matrix multiplication and the blog post High-precision linear algebra in Julia: BigFloat vs Arb

Topic		Replies	Views
Native Julia gemm implementation Performance	16	3638	May 3, 2018
Performance gotcha in linear algebra lu() General Usage performance , linearalgebra	33	3722	February 11, 2020
File organization in Base, and modularity Internals & Design	3	1116	February 14, 2017
Exact linear algebra Numerics linearalgebra	10	2451	October 8, 2017
Linear solver \(A, B) performance vs Matlab A\b General Usage	32	7819	May 21, 2017

MPLAPACK (multiprecision blas+lapack)

Related topics