CNN for MNIST

mbauman · December 10, 2020, 7:30pm

The thing to remember about CNNs is that their input is 3d (instead of the “typical” 1d vector input): X × Y × channel. Thus, when you batch lots of them together, you get a 4d input (instead of the “typical” matrix).
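
A quick way to see those shapes, sketched in plain Julia with zero-filled dummy arrays rather than real MNIST data. The 28×28 image size and the batch of three are just illustrative assumptions:

```julia
# "Typical" dense-layer input: one sample is a 1d vector, a batch is a matrix.
vec_sample = zeros(Float32, 784)
vec_batch  = hcat(vec_sample, vec_sample, vec_sample)

# CNN input: one sample is 3d (X × Y × channel), a batch is 4d.
img       = zeros(Float32, 28, 28, 1)
img_batch = cat(img, img, img; dims=4)   # stack along a new fourth dimension

println(size(vec_sample), " -> ", size(vec_batch))
println(size(img), " -> ", size(img_batch))
```

Here `cat(...; dims=4)` is only one way to build the batch; any array whose last dimension indexes the samples has the same layout.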

I think what might be tripping you up is that the MNIST dataset is implicitly 1-channel, so you’ve used unsqueeze to add in that third dimension. The batching is what adds that fourth dimension. You can of course test single images from either the test or train set, but you need to either batch them together or make a single one 4d (again with unsqueeze).
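
For testing a single image, something like the following should work. This is a minimal sketch that uses base Julia's `reshape` in place of unsqueeze, so it does not depend on any particular Flux version. The array `raw` is a zero-filled stand-in for one MNIST image, not real data:

```julia
# Stand-in for one MNIST image as stored: 28×28, no channel dimension.
raw = zeros(Float32, 28, 28)

# Add the (implicit) single channel as a third dimension.
img = reshape(raw, size(raw)..., 1)

# Add a batch dimension of length 1 so a single image looks like a batch.
single = reshape(img, size(img)..., 1)

println(size(raw), " -> ", size(img), " -> ", size(single))
```

The resulting `single` array is 4d, which is the shape a convolutional layer expects even when there is only one image in the "batch".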
