I have a (very simplified below) xlsx file like:
Start | End | Q1 | Q2 | Q3 10 Jan 2021 15:00 | 10 Jan 2021 17:00 | 5.50 | 2.70 | - 10 Jan 2021 15:03 | 10 Jan 2021 16:53 | 2.70 | 3.00 | 1.50
In fact, I have several more columns like Q1, Q2, Q3,… and there are around 300 rows. I would like to use Julia to make an analysis of this dataset, which arises from amultiple choice examination given online and downloaded from a Moodle server.
I have been able to read it via XLSX and then transform it to a dataframe. The columns are all recognized, to begin with, as strings. However, I would like to:
(1) transform the columns Q1, Q2, … to float numbers (where obvious), with the corresponding “-” strings transformed to missing values (not strings).
I do not want the “-” string to be replaced by, for instance, the float 0.00, because I would like to keep the record as to which questions were left unanswered (of course, the corresponding grade will, effectively, be considered 0.0, for the calculation of the final total grade)…
Later I will also want to
(2) transform the columns Start and End to proper Date/Time objects, possibly from the Dates package (I have never used it) and then, by subtracting End from Start to have a new column Duration for each test/examination.
I tried the first issue (1) quoted above without success and I will then try the second one later on. Right now, I am particularly concerned about issue (1).
Thanks for any help!