Why is Flux's data input format different?

v-i-s-h · November 19, 2022, 10:16am

Hi,

In Flux models (created using Chain), we pass the data as an array of shape D x N (D: data dimension, N: number of samples). This is different from other ML libraries such as TensorFlow/PyTorch, where the N x D format is used.

This also results in the output (for a scalar prediction) being a 1 x N matrix rather than an N-dimensional vector (which is more intuitive, at least to me).
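To make the shapes concrete, here is a minimal sketch in plain Julia (no Flux needed) of the affine map that a dense layer with one output unit applies. The sizes and the names `W`, `b`, `X` are made up for illustration; they are not taken from any particular model.

```julia
# Hypothetical sizes, chosen only for illustration.
D, N = 3, 5                 # D features, N samples

X = rand(Float32, D, N)     # each column is one sample (D x N)
W = rand(Float32, 1, D)     # weights of a layer with a single output unit
b = zeros(Float32, 1)       # bias

Y = W * X .+ b              # affine map applied to all N samples at once
y = vec(Y)                  # reshape the 1 x N matrix into a length-N vector

println(size(Y))            # (1, 5)
println(size(y))            # (5,)
```

So if the N-dimensional vector is what you want, calling `vec` on the output should be enough.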

I am curious to know: what is the reason for this design?

Thanks,
Vishnu

Tomas_Pevny · November 19, 2022, 11:31am

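Because PyTorch and TensorFlow are written in C/C++, which use a row-major layout for matrices, whereas Julia uses a column-major layout. With D x N in Julia, the features of one sample sit next to each other in memory, just as they do with N x D in a row-major library. Therefore, for efficiency, the dimension order is reversed.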
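A small sketch of what column-major means in practice, using only base Julia. The matrix `A` is an arbitrary example.

```julia
A = [1 2 3;
     4 5 6]                 # 2 x 3 matrix

println(vec(A))             # [1, 4, 2, 5, 3, 6]: columns are contiguous in memory
println(strides(A))         # (1, 2): moving down a column steps by 1 element

sample = A[:, 2]            # second column, read from adjacent memory
println(sample)             # [2, 5]
```

If each column is one sample, reading a sample touches adjacent memory, which is the access pattern the D x N convention is meant to favour.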
