Hi,
In Flux models (created using Chain), we give data array of the format D x N
(D
- data dimension, N
- number of samples). This is is different from other ML libraries such as Tensorflow/PyTorch where we use N x D
format.
This also results in the output (for a scalar prediction) being a 1 x N
matrix rather than a N
dimensional vector (which is more intuitive, atleast to me).
I am curious to know what is the reason for this design?
Thanks,
Vishnu