https://sco.library.emory.edu/research-data-management/publishing/copyright-data.html
Facts are not copyrightable. Kaggle can’t copyright a set of data. There is no legal basis on which they can prevent you from just including the ENSO dataset in your example.
The part about databases as a whole being copyrightable is mainly for something like an indexed book of facts, where effort goes into making it easily searchable and so forth. It doesn’t prevent someone from yoinking out one particular table and using it elsewhere.
Would you mind sharing some reference for this? What makes a dataset different from any other “work” such as a document, book, code or piece of art?
The "Daily Climate time series data" dataset on Kaggle is listed as Creative Commons CC0 1.0 Universal, so I guess we can pull it and put it somewhere else accessible to DataDeps.jl?
Where should the data be hosted?
The GitHub repo might be fine? How big is it?
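For what it's worth, once the file is hosted somewhere, registering it with DataDeps.jl could look roughly like the sketch below. The dependency name `DailyClimate` and the URL are placeholders I made up, not the real hosting location, so swap in wherever the CSV actually ends up.

```julia
using DataDeps

# Minimal sketch: the name "DailyClimate" and the URL are hypothetical
# placeholders, not an existing hosting location.
register(DataDep(
    "DailyClimate",
    # Message shown to the user before the first download.
    """
    Dataset: Daily Climate time series data (originally published on Kaggle)
    License: Creative Commons CC0 1.0 Universal
    """,
    # Hypothetical location of the re-hosted file.
    "https://example.com/path/to/daily_climate.csv",
))
```

After that, `datadep"DailyClimate"` should resolve to the local folder holding the file, downloading it on first use. No checksum is given here, so I'd expect DataDeps.jl to warn about that and print one you can paste in as a fourth argument.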
I also tried with the code on GitHub but am getting the following error (please see the attached photo).
It was already there in the post; see the link repeated below. But yes, to make this explicit, people should release their datasets under CC0. Datasets are copyrightable in other countries.