Computing eigenvalues/eigenvectors using GPU?

Noel_Araujo · July 15, 2018, 7:17pm

Hi everyone

I have a Vega RX 64 and I would like to speed up eigenvalues decomposition on it.
Does anybody know a (simple) solution with Julia ?

Ralph_Smith · July 15, 2018, 10:37pm

Short answer: AFAICT there isn’t even a not-simple solution.

To use a GPU effectively for dense eigensystem computations is quite difficult. The functionality was only recently added to the CUDA libraries (useless to you) and is conspicuously absent from the prominent projects using OpenCL (e.g. ArrayFire). There was some work in the clMagma project, but that seems to have been abandoned well before recent upgrades to the AMD software stack.

You say “decomposition”, so I assume you mean for dense matrices. For a few eigenvalues/vectors of sparse matrices there is more hope, but it’s still not simple.

Elrod · July 16, 2018, 5:28am

(Look at the develop-upstream branch.)

They have unit tests for HIP, but I’m not sure what functionality is currently supported. The last PRs were very recent. HIP is cross platform, compiling through hcc for AMD, and CUDA for NVidia. There’re also pure CUDA unit tests in eigen.

So it may be a matter of creating a few shared libraries and ccalling them.

Gregory Stoner of the ROCm project said months ago that there was some work underway for a ROCm-native in Julia, like CUDAnative. Not sure if there’s any progress on that front.

EDIT: HIP/ROCm only work on Linux. I think any distribution is supposed to work starting with Kernel 4.18, but at the moment only Ubuntu 16.04, and RHEL/CentOS 7.4 & 7.5 are officially supported.
This probably doesn’t qualify as “simple”.

Also, want to point out that HIP/ROCm is pretty good. Their BLAS libraries were almost twice as fast on Vega as CLBLAS when I benchmarked them about five months ago.

Noel_Araujo · July 18, 2018, 1:26pm

I’m trying to use ArrayFire.jl, but when I try some linear algebra operations I get error with MKL

using ArrayFire;
a = rand(10, 10);
ad = AFArray(a);
det(ad)

Intel MKL FATAL ERROR: Cannot load libmkl_avx2.so or libmkl_def.so.

Simple operations with variable ad like sin(ad) or maximum(ad) do work. However, operations like det, qr or svd don’t.

I’ve run the mkl script to fix PATH before running Julia

source /opt/intel/mkl/bin/mklvars.sh intel64

And also, I followed the comments on web [1]:

conda install nomkl numpy scipy scikit-learn numexpr
conda remove mkl mkl-service

But none of this commands solved my problem.
I’m open to suggestions

For reference:
[1] python - Intel MKL FATAL ERROR: Cannot load libmkl_avx2.so or libmkl_def.so - Stack Overflow

Elrod · July 18, 2018, 1:56pm

Conda is a Python package manager.

The following shared libraries (and versions) come with (a recent) ArrayFire:

libafcpu.so           libaf.so            libforge.so.1.1.0      libmkl_mc3.so
libafcpu.so.3         libaf.so.3          libglbinding.so.2      libmkl_mc.so
libafcpu.so.3.6.1     libaf.so.3.6.1      libglbinding.so.2.1.4  libmkl_tbb_thread.so
libafcuda.so          libcublas.so.9.2    libmkl_avx2.so         libnvrtc-builtins.so
libafcuda.so.3        libcufft.so.9.2     libmkl_avx512.so       libnvrtc.so.9.2
libafcuda.so.3.6.1    libcusolver.so.9.2  libmkl_avx.so          libOpenCL.so.1
libafopencl.so        libcusparse.so.9.2  libmkl_core.so         libtbb.so
libafopencl.so.3      libforge.so         libmkl_def.so          libtbb.so.2
libafopencl.so.3.6.1  libforge.so.1       libmkl_intel_lp64.so

You can see that libmkl_avx2.so and libmkl_def.so are in that list. Make sure they’re in your path, so Julia can find them.

However, that it was trying to use mkl sounds like it was trying to do the calculations on your CPU, and not your GPU?
I’m not familiar with ArrayFire.

Noel_Araujo · July 18, 2018, 2:01pm

Yep, all these files are there. And in my PATH

> echo $LD_LIBRARY_PATH
:/opt/arrayfire/lib

Should I change others PATH variables too ?

Elrod · July 18, 2018, 2:05pm

Hmm, just installed ArrayFire and…

julia> a = rand(Float32, 10^3, 10^3);

julia> ad = AFArray(rand(Float32, 10^3, 10^3));

julia> @time det(ad)
Intel MKL FATAL ERROR: Cannot load libmkl_def.so.

EDIT:
Also, when I tested last November, ArrayFire did not support Vega yet:
https://groups.google.com/forum/#!searchin/arrayfire-users/Vega|sort:date/arrayfire-users/SupCI8sTcdM/oiDH6bXtCAAJ
Not sure if that’s changed. Matmul works, but for large matrices is about 3x slower than HIP BLAS (still faster than CPU).

I am not sure what the problem is.
https://github.com/JuliaComputing/ArrayFire.jl/blob/943405b8b148d95e23ae9cea63c2579568f97cb0/src/wrap.jl
There are a lot of calls here, and most work.

I suspect it is the linear algebra ones that do not. They’re the ones that had problems then, too.

julia> qr(afc)
Intel MKL FATAL ERROR: Cannot load libmkl_def.so.

Ferran_Mazzanti · August 1, 2018, 8:32am

Just for the record, I’ve just tried a fresh, new install of ArrayFire and I’m getting the exact same problems
Intel MKL FATAL ERROR: Cannot load libmkl_avx2.so or libmkl_def.so.

…which makes it a pity as I was hoping to use the library for something more interesting that simply adding or multiplying matrices

Does anybody know the status of that?

BTW I’ve installed in ubuntu mate 18.04

Best,

Ferran.

rveltz · August 1, 2018, 2:52pm

Hi, it works on osx for me.


julia> using ArrayFire
ArrayFire v3.6.0 (OpenCL, 64-bit Mac OSX, build 49a5436)
[0] APPLE: AMD Radeon Pro 560 Compute Engine, 4096 MB
-1- APPLE: Intel(R) HD Graphics 630, 1536 MB

julia> a = rand(Float32, 10^3, 10^3);

julia> ad = AFArray(rand(Float32, 10^3, 10^3));

julia> @time det(ad)
  1.808117 seconds (1.56 k allocations: 84.891 KiB)
(Inf, 0.0)

julia> @time det(ad)
  0.061030 seconds (7 allocations: 224 bytes)
(Inf, 0.0)

julia> @which det(ad)
det(_in::ArrayFire.AFArray) in ArrayFire at .julia/v0.6/ArrayFire/src/wrap.jl:1585

Noel_Araujo · September 11, 2018, 6:14pm

For all of those that had the same issue, I’ve found the solution:

In summary, we need to load some paths with LD_PRELOAD . In my computer, libmkl_avx2.so and libmkl_def.so are in /opt/arrayfire/lib/ .

Then I should use:
export LD_PRELOAD=/opt/arrayfire/lib/libmkl_avx2.so:/opt/arrayfire/lib/libmkl_def.so

However , those files need libmkl_sequential.so and libmkl_core.so .

libmkl_core.so is included on ArrayFire folder, so just use the same path as the others
libmkl_sequential.so is not included with ArrayFire, but it does with Anaconda3 - in my computer is on the folder /opt/anaconda3/lib/

The final solution was :
> export LD_PRELOAD=/opt/arrayfire/lib/libmkl_avx2.so:/opt/arrayfire/lib/libmkl_def.so:/opt/anaconda3/lib/libmkl_sequential.so:/opt/arrayfire/lib/libmkl_core.so
> julia

julia> using ArrayFire
julia> ad = AFArray(rand(Float32,4,4))
julia> det(ad) # working !!

Then, after running Julia , operations like inv , det and qr are working

Topic		Replies	Views
GPU generalized eigendecomposition GPU question	1	383	July 1, 2023
CUDA eigenvalues of a sparse matrix GPU question	8	4322	November 17, 2021
Is it possible to diagonalize matrices with M-series Apple Silicon GPUs? GPU gpu , apple , metaljl	10	974	January 12, 2024
Acceleration of Intel MKL on AMD Ryzen CPU's Performance performance , mkl , linearalgebra	34	7020	May 9, 2024
Linear algebra of sparse matrices on GPU GPU gpu , linearalgebra	0	472	April 10, 2022

Computing eigenvalues/eigenvectors using GPU?

Related topics