Hi,
I’ve got a large CSV file I’m trying to analyse in Julia. I can load it into a data frame easily and I’ve got the column types identified correctly.
Format is basically
Location datetime value
area1, 2024-07-20 20:30:05, 20.0
…
area2, 2024-07-20 20:30:07, 60.0
…
area3, 2024-07-20 20:30:30, 20.0
…
area4, 2024-07-20 20:30:22, 660.0
There are 4 areas, and the file contains a whole year of data for one area before moving on to the next. This means there’s a lot of duplication in the datetime column. I don’t actually need accuracy to the level of seconds; this is just how the data is dumped from the network.
What I’d like to do is, for each location, produce stats (min, max, mean) for each year, month, and week.
I don’t know where to start for this last part - I’d appreciate any help.
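One possible starting point, as a minimal sketch only: it assumes DataFrames.jl is in use, that the columns are named `location`, `datetime` and `value` (the real names may differ), and that `datetime` has already been parsed as `DateTime`. The tiny table is made-up stand-in data, and `stats_by` is a hypothetical helper name. The idea is to derive year, month and week columns, then group and summarise.

```julia
using DataFrames, Dates, Statistics

# Made-up stand-in for the real CSV; column names are assumptions.
df = DataFrame(
    location = ["area1", "area1", "area1", "area2", "area2"],
    datetime = DateTime.(["2024-07-20T20:30:05", "2024-07-21T08:00:00",
                          "2024-08-01T12:00:00", "2024-07-20T20:30:07",
                          "2024-07-20T21:00:00"]),
    value    = [20.0, 40.0, 30.0, 60.0, 80.0],
)

# Derive the calendar fields to group on (week is the ISO week number).
transform!(df,
    :datetime => ByRow(year)  => :year,
    :datetime => ByRow(month) => :month,
    :datetime => ByRow(week)  => :week)

# Hypothetical helper: min, max and mean of :value for each group of `cols`.
stats_by(data, cols) = combine(groupby(data, cols),
    :value => minimum => :min,
    :value => maximum => :max,
    :value => mean    => :mean)

by_year  = stats_by(df, [:location, :year])
by_month = stats_by(df, [:location, :year, :month])
by_week  = stats_by(df, [:location, :year, :week])
```

Each result has one row per location and period. One caveat: `week` follows ISO numbering, so the last days of December can belong to week 1 of the following year, and the first days of January to week 52 or 53 of the previous one. Grouping on `[:location, :year, :week]` may therefore need adjusting around year boundaries.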