Another thing that came up on Slack is GitHub - ablaom/Koala.jl: Julia machine learning environment, which appears to be it’s own little (unregistered) ML ecosystem. It has a few interesting ideas, one being that data splitting (e.g. train/test, or cross-validation) is just a wrapper type around the full dataset, along with a binary mask.