Saving data tables at different timepoints

Unfortunately, space becomes an issue if the models are large and run for millions of time-steps. My current ad-hoc solution is to use a dictionary of dataframes. Each dictionary key is a column name from the original dataframe (i.e. “A”, “B” and “C”) and the values are new dataframes that look like this:

Row  │ time   position  value
     │ Int64  Int64     Int64
─────┼────────────────────────
   1 │    21        12    500
   2 │    27         3    400
   3 │    52        12    500
   4 │    55         9      0
   5 │    59         9    500
   6 │    71         5    400
   7 │    93        11    500

where “time” is when the change occurred, “position” is where in the column the change occurred, and “value” is the new value matching the type of the original column.
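For reference, here is a minimal sketch of how such a change log could be populated (the `recordChange!` helper and the column sizes are hypothetical, assuming DataFrames.jl):

```julia
using DataFrames

# One empty (time, position, value) log per column of the original table
initialState = DataFrame(A = zeros(Int, 12), B = zeros(Int, 12), C = zeros(Int, 12))
columnDict = Dict(name => DataFrame(time = Int[], position = Int[], value = Int[])
                  for name in propertynames(initialState))

# Whenever a cell changes, append one row to the log instead of saving the whole table
recordChange!(d, col, t, pos, val) = push!(d[col], (time = t, position = pos, value = val))

recordChange!(columnDict, :A, 21, 12, 500)
recordChange!(columnDict, :A, 27, 3, 400)
```

As long as changes are recorded in simulation order, each log stays sorted by time, which the replay function below relies on.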

To get the dataframe back at any given time step, I use this function:

function getState(t, columnDict, initialState)

    out = deepcopy(initialState)

    # Replay each column's change log up to time t.
    # Assumes each log is sorted by time, so we can stop at the first future row.
    for (property, df) in pairs(columnDict)
        for row in eachrow(df)
            if row.time > t
                break
            end
            out[property][row.position] = row.value
        end
    end

    return out
end
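For example, replaying the log from the table above to t = 60 (with a hypothetical initial state of twelve zeros in column A; `getState` repeated so the snippet runs standalone):

```julia
using DataFrames

function getState(t, columnDict, initialState)
    out = deepcopy(initialState)
    for (property, df) in pairs(columnDict)
        for row in eachrow(df)
            row.time > t && break   # logs are sorted by time
            out[property][row.position] = row.value
        end
    end
    return out
end

initialState = Dict(:A => zeros(Int, 12))
columnDict = Dict(:A => DataFrame(
    time     = [21, 27, 52, 55, 59, 71, 93],
    position = [12,  3, 12,  9,  9,  5, 11],
    value    = [500, 400, 500, 0, 500, 400, 500]))

state = getState(60, columnDict, initialState)
# state[:A][9] reflects the t = 59 change; the t = 71 and t = 93 rows are skipped
```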

It’s a little messy but it works :man_shrugging: