Questions with building a chainrules for mutating array function

chooron · July 19, 2024, 1:45pm

I want to build two functions for replacing values in a vector and an array, as shown below:

function mutate_vec(vec::AbstractVector{T}, new_value::T, idx::Int) where {T}
    vec[idx] = new_value
    vec
end

function mutate_arr(arr::AbstractArray{T}, new_value::AbstractArray{T}, idx::Tuple) where {T}
    arr[idx..., :] .= new_value
    arr
end

But it is known that Zygote usually does not support mutating arrays, so I wrote rrule rules for these two functions by following the guidelines from Which functions need rules? · ChainRules, as shown below:

function ChainRules.rrule(::typeof(mutate_vec), vec::AbstractVector{T}, new_value::T, idx::Int) where {T}
    vec = mutate_vec(vec, new_value, idx)
    function mutate_vec_pullback(ȳ)
        return NoTangent(), ones(T, size(vec)), T(1.0), NoTangent()
    end
    return vec, mutate_vec_pullback
end

function ChainRules.rrule(::typeof(mutate_arr), arr::AbstractArray{T}, new_value::AbstractArray{T}, idx::Tuple) where {T}
    arr = mutate_arr(arr, new_value, idx)
    function mutate_arr_pullback(ȳ)
        return NoTangent(), ones(T, size(arr)), ones(T, size(new_value)), NoTangent()
    end
    return arr, mutate_arr_pullback
end

Due to my insufficient understanding of gradients, I am not sure if the rules I wrote are correct, so I hope someone can give me advice.

gdalle · July 20, 2024, 9:16am

Hi @chooron!

This could be more explicit in the ChainRules docs, but you must distinguish between two kinds of mutation:

mutation of objects created inside the function
mutation of objects passed as arguments to the function

The first kind of mutation is exactly what ChainRules allows you to solve with custom rules. However, the second kind is still experimental, and you need to take a look at this documentation page to use it.

chooron · July 22, 2024, 8:30am

Sorry, I didn’t fully read this document, so I missed this part. Thank you very much for your suggestion. I will revise my code according to the document.

Topic		Replies	Views
Using ChainRules rrule for mutating function to work with Zygote General Usage zygote , forwarddiff , reversediff , autodiff , chainrulescore	2	732	August 31, 2021
Idea to make Zygote support mutation in easy cases Machine Learning zygote , ad	6	476	December 27, 2022
Zygote.gradient(): Mutating arrays is not supported General Usage	1	786	August 18, 2020
Mutating versus non-mutating arrays for Zygote Gradient General Usage	6	347	December 25, 2022
Help With Automatic Differentiation General Usage autodiff	8	415	June 26, 2023

Questions with building a chainrules for mutating array function

Related topics