Flattened data vs Flux.flatten layer

congUoM · September 23, 2021, 8:37pm

Hi all,

I am trying to use the following MLP model (for MNIST) in the model zoo as a template for my work, and I have this problem:
https://github.com/FluxML/model-zoo/blob/master/vision/mlp_mnist/mlp_mnist.jl

I remove the flattening data (lines 18 - 19) and try to add a Flux.flatten layer before the Dense layer, like this:

return Chain(
              Flux.flatten,
              Dense(prod(imgsize), 32, relu),
              Dense(32, nclasses))

but I got very bad training/testing accuracy (it still runs though). I thought they should be the same. Any idea?
Thanks.

ToucheSir · September 23, 2021, 10:43pm

I think they should be too. Can you confirm the shape of x is correct before it’s fed into the network (i.e. after https://github.com/FluxML/model-zoo/blob/master/vision/mlp_mnist/mlp_mnist.jl#L84)? If everything looks right there (the batch is 28 x 28 x [batch size]) and you experience this poor performance across multiple runs, please do file an issue

congUoM · September 23, 2021, 11:40pm

I check the shape of x before its fed into the networks and it is of the shape (28, 28, 256), which looks correct. I just submit the issue:
https://github.com/FluxML/model-zoo/issues/316

Topic		Replies	Views
Why the reshape in Flux mnist convolution example Machine Learning question	1	1041	August 25, 2018
Flux: multiple input of unequal dimensions Machine Learning flux	4	1304	September 7, 2020
Flux: Feed minibatch into Neural Network New to Julia flux	1	514	December 10, 2019
What is the Flux.flatten inverse operation? Machine Learning flux	2	956	August 27, 2022
Flux with Matrix input dimensions Machine Learning flux	2	1290	March 12, 2023

Flattened data vs Flux.flatten layer

Related topics