How to add metadata info to a DataFrame?

Storing both data and metadata in the same object or container is a very common approach in many research fields, and it has proven very successful in (for instance) astronomy, where a few standards have been established and are now widely used (e.g. FITS and VOTable).

In Julia a common way to represent tabular data is by means of a DataFrame, and I (as an astronomer) would like to associate metadata with such an object, both with the table as a whole and with individual columns. However, DataFrame has no support for metadata. This topic has already been discussed (e.g. here and here), but no solution has been implemented.

How can I add metadata support to a DataFrame object?

To clarify the question as much as possible, let’s assume I wish to add a “source” label to the whole table to specify where the data comes from, and a “unit” label for each column specifying the units for the numbers. A String type for the label contents is fine, but a more general Any would be preferable.

In an OOP language I would simply use class inheritance, but this is not possible in Julia. Hence, I can foresee four possibilities, none of which seems optimal to me:

  1. Use composition to create a new struct as follows:
mutable struct DataFrame_withMeta
  meta::Any
  data::DataFrame
end

and use DataFrame_withMeta objects in place of DataFrame ones. However, this would force me to always add .data wherever a DataFrame object is needed, i.e. in the vast majority of cases. In other words, I would lose much of the very simple interoperability between DataFrames and other packages (e.g. Gadfly);

  2. Use delegation for the above-mentioned struct through TypedDelegation. Although effective, this approach appears quite tricky since it requires me to list all the possible methods accepting a DataFrame object;

  3. Inherit from the AbstractDataFrame type:

mutable struct DataFrame_withMeta <: AbstractDataFrame
  meta::Any
  ...
end

where ... is the actual content of the DataFrame structure. This means that each time the DataFrame structure is changed/updated I will also need to change DataFrame_withMeta accordingly. Moreover, this would only allow interoperability with packages accepting AbstractDataFrame objects, not DataFrame ones;

  4. Issue a PR to the DataFrames.jl package maintainers where I simply add a meta field to the relevant structures, to be accessed as follows:
df = DataFrame(:col1=>1:10, :col2=>rand(10))
df.meta[:source] = "www.some.site"
df.meta[:col1, :unit] = "km / s"

This is the easiest and most straightforward approach: it does not add any package dependency or breaking change. Moreover, it would allow packages which return DataFrame objects (such as RDatasets, CSV, etc.) to provide metadata, and packages which accept DataFrame objects (such as Gadfly) to exploit metadata information.

The drawback is that this change would not add any new functionality to the DataFrames package itself, since the meta facility would mainly be used by related packages. Hence, one could argue there is no point in adding it to the main DataFrames package.

Is there any other simple and effective solution?

(sorry for the very long question… :wink:)


I think this is a fantastic idea and was going to propose a similar thing myself. There are a lot of use cases where you simply cannot work with a dataset without metadata attached, like World Development Indicators or large surveys.

This is the easiest and most straightforward approach: it does not add any package dependency or breaking change. Moreover, it would allow packages which return DataFrame objects (such as RDatasets, CSV, etc.) to provide metadata, and packages which accept DataFrame objects (such as Gadfly) to exploit metadata information.

I think this highlights the main issue. We wouldn’t explicitly be adding functionality to DataFrame objects but this change would require major updates to existing plotting libraries.

Here is my own use case for metadata in dataframes that I have been meaning to post here.

With impact evaluations based on survey data, you want to use a variety of smaller questions to aggregate into a larger index. For instance, our financial well-being index is composed of current income, durable assets, current consumption, and many other variables. We construct the variables in the variable-construction code; then, when it comes time for the analysis, it's a pain to always go back and forth to figure out what is in each variable.

The way I get around this in Stata is to wrap variable construction in a small program:

program define myStandardize
    syntax varlist, newvar(name)
    gen `newvar' = 0
    foreach var of varlist `varlist' {
        replace `newvar' = ... // standardized `var'
    }
    note `newvar': A standardized additive index of `varlist'
end

This way I can use the command note list to see how each variable was constructed, without having to sort through thousands of lines of cleaning code.
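For comparison, here is a rough Julia analogue of the Stata pattern above. All names (standardized_index!, the notes Dict) are hypothetical, and a plain Dict of columns stands in for a DataFrame so the sketch is self-contained:

```julia
# Build a standardized additive index from a set of columns, and
# record how it was constructed in a `notes` Dict.
notes = Dict{Symbol,String}()

function standardized_index!(notes, cols, names, newname)
    n = length(cols[first(names)])
    idx = zeros(n)
    for name in names
        v = cols[name]
        m = sum(v) / n                          # column mean
        s = sqrt(sum((v .- m) .^ 2) / (n - 1))  # sample std. deviation
        idx .+= (v .- m) ./ s                   # add the standardized column
    end
    cols[newname] = idx
    notes[newname] = "A standardized additive index of $(join(names, ", "))"
    return idx
end

cols = Dict(:income => [1.0, 2.0, 3.0], :assets => [10.0, 20.0, 30.0])
standardized_index!(notes, cols, [:income, :assets], :fwb)
# `notes[:fwb]` now plays the role of Stata's `note list` entry
```

The point is the same as in Stata: the provenance of each constructed variable travels in the notes Dict rather than being buried in the cleaning code.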

This is just one, albeit niche, use of metadata in dataframes. The most pressing reasons to add metadata are super large datasets and automatic creation of tables. If you don’t think metadata belongs in DataFrames, I think you are ignoring a lot of very common use cases from the social sciences.

I think the best place to start would be to submit a PR to DataFrames where the metadata is just a simple Dict from column names to strings. Then we can expand on that.
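To make the starting point concrete, a minimal sketch of that simplest design, a Dict from column names to label strings kept alongside the table (names here are hypothetical, not part of any real API):

```julia
# Per-column metadata as a plain Dict of label strings.
colmeta = Dict{Symbol,String}(
    :income => "Current income (household)",
    :assets => "Standardized additive index of durable assets",
)

# Looking up a label, with a fallback for unlabeled columns:
collabel(meta, name) = get(meta, name, "(no label)")
```

A note-list-style summary then reduces to iterating over the Dict, and expanding later (e.g. to Dict{Symbol,Any}) would not break this access pattern.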


I rather liked this suggestion in that other thread:

Generally speaking, I think in julia it’s better to avoid accessing the internals of an object directly (eg a.meta[:source] = "www.some.site") since a function can be generic and do similar things to objects that have different internal structure.
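As a hedged sketch of that accessor-function style: generic code calls a metadata function instead of reaching for a .meta field, so the internal layout can change freely. The names metadata, setmeta! and MyTable below are hypothetical, not an actual DataFrames API:

```julia
# Generic entry point; any type can opt in by adding a method.
metadata(x) = error("no metadata defined for $(typeof(x))")
setmeta!(x, key, value) = (metadata(x)[key] = value; x)

struct MyTable
    meta::Dict{Any,Any}   # internal detail, never touched directly
end
metadata(t::MyTable) = t.meta

t = MyTable(Dict{Any,Any}())
setmeta!(t, :source, "www.some.site")
```

Code written against metadata/setmeta! works unchanged for any future type that defines a metadata method, whatever its internals look like.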


Well, my proposal no. 4 would not be a breaking change, hence all plotting libraries would keep working seamlessly. An update is required only if a plotting package wants to use the new meta facility.

True. I suppose that in the end, however, we would want an ecosystem where it is very easy to replace :x with the appropriate label from a dataframe wherever it appears in a plot.

Unfortunately I have to agree.
Unfortunately, because I think that Julia would greatly benefit from some kind of access control for struct fields (such as private in C++). Access control is too often mistaken for a way to hide internal details, while it is something completely different: it is a way to organize data structures and clearly distinguish what is supposed to be read/written by users from what is not. But this is of course off topic…

Exactly!

Maybe off topic, and there are certainly people more knowledgeable about this than me, but I'll lightly disagree :smile: . It's just a different way of thinking about access control: you clearly know whether something should be accessible to users by whether there are methods defined for reading/writing it. This makes writing generic code much more pleasant, since I don't have to know about the internals of an Array or a DataFrame; in fact, the internals can be whatever the authors of those objects want, yet I can use filter! on both and know what to expect.

Or if I end up implementing my MetaDatum object, and write a getmeta(MetaDatum, inds...) method, it can be generic with the getmeta(DataFrame, inds...) method if it ever gets written, even if the internals of MetaDatum and DataFrames are wildly different. I think this ends up being one of the best things about julia from a development standpoint…

I think this essentially means that whatever metadata we have has to come in at least two kinds: one for “nice” labels that are meant to be displayed, and another that records things about the variable that the user of a dataset needs to know. Stata handles this via a 'label' metadata category and a 'notes' metadata category.

Today or tomorrow I will put together a PR to get the ball rolling. Unless you beat me to it.


I’m already on it … :wink:

Let me slightly disagree :grin:. It mainly depends on the situation; consider the following:

mutable struct Example
  private_var0::Int
  public_var1::Int
  public_var2::Int
  ...
  public_var1000::Int
end

Would you write two methods (one to read and one to write) for each of the 1000 fields public_var1, public_var2, etc., just to remain agnostic of the private_var0 field?
Or would you simply prefer to use .public_var1, .public_var2, and so on?

Again, this is too off topic; it would be interesting to discuss it somewhere else, but I would keep the focus of this discussion on metadata :wink:

At least as far as I’m concerned, a PR implementing meta-data support in DataFrames would be welcome.


I hope that whatever approach to metadata is taken, it is as generic as possible (which seemed to be the case in some of the above proposals), and that it can keep track of more than one kind of metadata (i.e. one from the database, one for display labels, etc.). Trying to merge them would be a difficult problem; I think it's better to be able to handle the different sorts from the start.

Done: https://github.com/JuliaData/DataFrames.jl/pull/1413

In the PR comments you will find the description and an example.
I hope the approach is sufficiently generic.

There can be several approaches to distinguishing a database label from a plot one, and none is obviously right to me. The current PR implements metadata as a Dict{Any,Any}, so I believe there is enough room for specialization.

Comments and suggestions are more than welcome!

Maybe you could put Dict{Any,Any} into that dict then, i.e. :db => Dict(...), :labels => Dict(...), etc.

Well, a :db entry in the metadata is not necessarily needed in all cases, e.g. if you create a DataFrame directly in Julia from calculated values. The same applies to :labels. Hence, I would suggest the opposite: setmeta!(df, :db, @NT(label="DB label"))
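A small sketch of this suggestion, with nesting added only where it is needed (setmeta! is hypothetical, and a modern named tuple literal stands in for the @NT macro):

```julia
# Attach a whole group of related metadata under one key only when
# that group actually exists for the table at hand.
setmeta!(meta::Dict, key, value) = (meta[key] = value; meta)

meta = Dict{Any,Any}()                      # table built from computed
                                            # values: no :db entry at all
setmeta!(meta, :db, (label = "DB label",))  # added only when loading
                                            # from a database
```

A sink can then simply check haskey(meta, :db) instead of every table carrying empty :db and :labels slots.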

The difficulty with metadata is that it is really hard to find an implementation which is sufficiently general and informative, i.e. one where all possible data sources can provide all the necessary metadata, and all sinks know how to interpret it.

A Dict{Any,Any} ensures no assumption is made about the communication between the source and the sink; it simply provides the “communication channel”.

If I understand that correctly, that's similar to what I was suggesting: a 3-level approach, where the top-level metadata Dict has a :db tag, which then stores all of the metadata from the database. You've represented that as a named tuple instead of another level of Dict, which will be more efficient if the data is fixed (as it probably would be once loaded into Julia from a database).

My database background was mostly with fast distributed persistent multi-dimensional associative arrays,
so handling that sort of thing with multiple levels was a no-brainer.

Can you check out my package Labels.jl at GitHub - mwsohn/Labels.jl: Provides functionality to attach variable and value labels to DataFrames? This is a simple package that implements functionality by which variable labels and value labels can be attached to DataFrame columns and values. Maybe similar functionality could be implemented as part of the metadata. The Labels object can exist separately from a DataFrame object, or it can be part of it.

Yes, the top level should be one Dict{Any,Any} for the table as a whole, and one Dict for each column, in order to accommodate all the possible key/value pairs required in each situation.
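A minimal self-contained sketch of that two-level layout, one Dict for the table as a whole plus one Dict per column (struct and function names are hypothetical, not the PR's actual API):

```julia
# Table-level metadata plus an on-demand Dict per column.
struct TableMeta
    table::Dict{Any,Any}
    cols::Dict{Symbol,Dict{Any,Any}}
end
TableMeta() = TableMeta(Dict{Any,Any}(), Dict{Symbol,Dict{Any,Any}}())

# Fetch (creating on demand) the metadata Dict of a given column:
colmeta!(m::TableMeta, col::Symbol) = get!(m.cols, col, Dict{Any,Any}())

m = TableMeta()
m.table[:source] = "www.some.site"
colmeta!(m, :col1)[:unit] = "km / s"   # values are Any, so a Unitful
                                       # quantity would fit as well
```

Because both levels are Dict{Any,Any}, each situation can store whatever key/value pairs it needs without any schema being imposed up front.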

Nice, thank you for pointing it out.

However, I would like metadata to be part of the DataFrame object, i.e. when I write df2 = copy(df1) I wish to copy all the metadata as well.

Moreover, the metadata value should be stored as Any not String, in order to accommodate all kinds of metadata (such as, e.g., a Unitful object).