I am trying to find out in what data formats Julia 1.0 can currently [august 2018] use to read and write data frames. for a sample, I am using something simple like
julia> df
6×4 DataFrame
│ Row │ n1 │ n2 │ n3 │ n4 │
├─────┼────┼──────┼───────────┼─────┤
│ 1 │ 99 │ 9801 │ -0.999207 │ 'a' │
│ 2 │ 1 │ 1 │ 0.841471 │ 'b' │
│ 3 │ 3 │ 9 │ 0.14112 │ 'c' │
│ 4 │ 5 │ 25 │ -0.958924 │ 'd' │
│ 5 │ 7 │ 49 │ 0.656987 │ 'e' │
│ 6 │ 9 │ 81 │ 0.412118 │ 'f' │
I plan to retry this with missing
once I know I have the basics working.
-
I know that Serializer works, but it is not a long-term storage format.
-
The most important I/O format may well be csv files.
- I do not know if DelimitedFiles works. I have confirmed that some
readdlm()
work well. But the following throws an error about ‘ERROR: MethodError: no method matching iterate(::DataFrame)’:
- I do not know if DelimitedFiles works. I have confirmed that some
fo= open("sample-df.tab", "w"); writedlm(fo, df, '\t'); close(fo);
-
-
or is
CSV
now the preferred csv reader/writer? -
is there native
.csv.gz
support, or do I write to a pipe? -
CSV reading and writing, esp from gzip compressed files, needs to be fast.
-
ideally, I get one great package, rather than a few almost-working ones.
-
-
JLD does not work. the
using JLD
dies. -
is there an SQLite writer ?
-
are there other important input/output formats for data frames?
pointers appreciated.
regards, /iaw