Gradient for Cholesky decomposition with CuArrays with Zygote.jl

guixien · May 15, 2023, 9:47pm

Here’s a minimal example:

using Zygote, CUDA, LinearAlgebra
CUDA.allowscalar(false)
C = cu(rand(5,5))
f(x) = sum(cholesky(x’ * x).L)
Zygote.gradient(f, C)

While the CPU version works fine, this will throw the scalar indexing error. It seems like this is a recurring issue:

and there’s a purposed change with a customized adjoint:

But that doesn’t work for the above case.

Does anyone know how to get around this?

Topic		Replies	Views
Constructing a diagonal 2D CuArray while preserving Zygote's gradient functionality on the diagonal line New to Julia question , package	3	145	June 6, 2024
Zygote gradient error with `reduce` on GPU Machine Learning cuda , zygote	3	321	February 6, 2023
Efficient GPU dense-sparse matmul differentiable with Zygote Performance question , cuda , zygote , sparse	2	678	May 5, 2021
Zygote adjoints for cholesky factorization on sparse arrays Machine Learning	0	268	May 9, 2023
Zygote gradient Error General Usage question , flux , zygote	3	994	December 7, 2023