What's wrong with this Flux model definition?

xiaodai · November 7, 2019, 3:13am

The error seems to come from not wrapping w in params. This works:

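using Flux
using CuArrays
CuArrays.allowscalar(false)  # error out on slow scalar indexing of GPU arrays

# Synthetic data on the GPU: y = 2*x1 + 2*x2 + noise
x = gpu(rand(Float32, 1_000_000, 2))
y = x*gpu(Float32[2, 2]) + gpu(rand(Float32, 1_000_000))

# Weights to learn; Float32 to match the element type of the data
w = gpu(rand(Float32, 2, 1))
loss(x, y) = Flux.mse(x*w,y) |> gpu

# w has to be wrapped in params so train! knows which array to update
Flux.train!(loss, params(w), ((x,y),), ADAM())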
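
For anyone without a GPU handy, here is a minimal CPU-only sketch of the same idea. It assumes a Zygote-based Flux release that still has the implicit-parameter API (`Flux.params` and the four-argument `Flux.train!`, roughly Flux 0.10 through 0.13); newer releases use a different training API. The smaller data size and the `w_before` name are only there for illustration.

```julia
using Flux

# Small synthetic problem on the CPU: y = 2*x1 + 2*x2
x = rand(Float32, 100, 2)
y = x * Float32[2, 2]

# Plain array of weights, same shape as in the post
w = rand(Float32, 2, 1)
loss(x, y) = Flux.mse(vec(x * w), y)

# train! expects a parameter collection, so wrap the array in params
ps = Flux.params(w)

w_before = copy(w)
Flux.train!(loss, ps, ((x, y),), ADAM())

# One optimiser step updates w in place
println(w != w_before)
```

Passing the bare array `w` instead of `ps` is what triggers the error, since `train!` needs the collection of arrays it is allowed to update.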
