I am trying to port to Julia a Python exercise where however the first step is in reading some “pickled” data.
If I open the file with a text file I notice it is a binary file with stamped “numpy.core.multiarray” on the first row.
This is the code given in Python:
def get_MNIST_data(): """ Reads mnist dataset from file Returns: train_x - 2D Numpy array (n, d) where each row is an image train_y - 1D Numpy array (n, ) where each row is a label test_x - 2D Numpy array (n, d) where each row is an image test_y - 1D Numpy array (n, ) where each row is a label """ train_set, valid_set, test_set = read_pickle_data('../Datasets/mnist.pkl.gz') train_x, train_y = train_set valid_x, valid_y = valid_set train_x = np.vstack((train_x, valid_x)) train_y = np.append(train_y, valid_y) test_x, test_y = test_set return (train_x, train_y, test_x, test_y)
Is there a native Julia package to read such format ? If not, should I use PyCall instead ?