CSV.jl open file error, contain data in scientific notation format

zhangliye · May 4, 2018, 6:33am

When I open the CSV file containing integer file in the scientific notation format, such as 2e2 for the integer 200. When I use CSV.jl to open this file, I got the following error.

CSV.ParsingException(“error parsing a Int64 value on column 2, row 7190; encountered ‘e’”)

CSV file data table as f as
x, y
12, 567e3
18, 2356

The Julia code is

using CSV
dtype = Type[Union{Missing,Int64} for i=1:2];
df_ais = CSV.read( file_path; delim=",", datarow=2, header=["x", "y"], types=dtype )

yakir12 · May 4, 2018, 8:25am

That’s because the type of 2e2 is a float not an integer:

julia> typeof(2e2)
Float64

zhangliye · May 4, 2018, 8:31am

Thank you so much! It works.

traktofon · May 4, 2018, 10:14am

While the Julia type of 2e2 and 567e3 is Float64, they can be cleanly converted to Int64. In the interest of “being liberal in what you accept”, maybe CSV.jl should attempt to do a conversion? After all, the CSV file might come from an application that does not adhere to Julia semantics.

yakir12 · May 4, 2018, 10:52am

Yea, I’ve come across situations where I wanted a round Float64 to magically become an Int, but this behavior would be outside the scope of CSV and downright unexpected: in my (albeit limited) experience, number notations with the exponential e (e.g. 2e2) are always treated as floats.
Part of the problem is that if we treat such numbers as Ints where we can, then their type won’t be guaranteed, it would depend on their value. But that issue doesn’t have to do with CSV, it’s a separate discussion.
BTW, I think there is a package that automatically converts floats to integers when it can, but I forgot what it’s called…

Topic		Replies	Views
CSV.read() faults on exponentially notated integers General Usage	2	636	December 28, 2017
CSV ruins scientific notation General Usage dataframes , csv	10	2663	July 17, 2019
Bug in CSV.read? General Usage	7	609	March 5, 2020
CSV.read with really small decimal value Data	3	627	September 3, 2018
Csv error reading numbers as string General Usage	16	2294	December 6, 2020

CSV.jl open file error, contain data in scientific notation format

Related topics