Hello everyone,
I am trying to work with big data in Julia. I just exported a .parquet file/folder using python’s dask
library and wanted to load it in Julia. It follows a representation of my code
using Parquet2: Dataset
ds = Dataset("path/to/my/file.parquet/")
df = DataFrame(ds; copycols=false)
Julia reports that the ds
object relates to a dataset amounting to 22146824 bytes – so it is definately non-empty. I also cross-check this information using python, where the data is properly read by dask
.
However, the df
object is an empty dataframe and when I run println("Number of row groups: ", length(ds.row_groups))
I get that there are 0 row groups, which is unexpected.
Does anyone have any tips or insights to share?
Bestest,
P.