Possibly confusing or inconsistent methods for read()

oatlzzvztd · February 14, 2017, 6:38pm

Ran into this gotcha today and felt like I should flag it down for anyone else who runs into it.

From the v0.5.1 docs, read() has the following two methods:

read(stream::IO, T, dims)

Read a series of values of type T from stream, in canonical binary representation. dims is either a tuple or a series of integer arguments specifying the size of the Array{T} to return.

read(s::IO, nb=typemax(Int))

Read at most nb bytes from s, returning a Vector{UInt8} of the bytes read.

What confused me here is that the first method must read prod(dims) * sizeof(T) bytes, whereas the second method may read at most nb bytes. In the 1D case, read(stream, nb) looks very similar to read(stream, T, numT) (and even does the same thing when T == UInt8.)

To get around this, use the second method in coordination with reinterpret(), like so:

temp = read(stream, numT * sizeof(T))
data = reinterpret(T, temp) # possibly followed by reshaping.

I’m open to other suggestions for reading at most numT bitstypes from a stream.

Topic		Replies	Views
Read a single element vs array from stream General Usage question , binaryio , dispatch	3	537	August 13, 2021
Read multiple variables from binary file at once General Usage binaryio	6	778	October 8, 2021
Read binary data of arbitrary dims and type New to Julia binaryio	8	3145	September 9, 2019
read!(io,Vector{MyStruct} not equivalent to loop over read with eachindex General Usage binaryio	4	102	August 10, 2024
Reading binary file in julia 1.0 New to Julia binaryio	13	7813	August 29, 2019

Possibly confusing or inconsistent methods for read()

Related topics