Zip with length checking

Tamas_Papp · October 16, 2018, 11:28am

Consider a loop

for (i, elt) in zip(indexes, itr)
    do_something(i, elt)
end

where Base.IteratorSize(itr) can potentially be Base.HasLength() or Base.SizeUnknown(). I want the code to work for both.

indexes is an AbstractVector so I know its length.

I want to check that the iteration does not terminate “early” because itr is shorter than indexes. What’s the idiomatic way to do so?

I thought of the following:

manually iterate itr by calling iterate directly,
use a counter and check that.

But neither seems elegant.
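
For reference, the counter approach could look like the following minimal sketch. The name foreach_checked is hypothetical and not part of Base or any package; the sketch only assumes that indexes has a known length, and it works whether itr is HasLength or SizeUnknown.

```julia
# Hypothetical helper (not Base API): apply `f(i, elt)` over zip(indexes, itr)
# and throw if `itr` ran out before `indexes` did.
function foreach_checked(f, indexes::AbstractVector, itr)
    n = 0  # number of pairs actually processed
    for (i, elt) in zip(indexes, itr)
        f(i, elt)
        n += 1
    end
    n == length(indexes) ||
        throw(ArgumentError("itr ended after $n elements, expected $(length(indexes))"))
    return nothing
end
```

Note that the error is only raised after the loop body has already run for the elements that were available, so any side effects of f have happened by then.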

mschauer · October 16, 2018, 11:44am

zip(indexes, takestrict(itr, length(indexes))) from IterTools.jl would be one option. Actually, I might add zipstrict to IterTools.jl.

RGonTheNoble · July 23, 2025, 3:23pm

Sorry to resurrect this thread, but has this actually been implemented in IterTools.jl or Base.Iterators? zipstrict seems like it would be a very helpful function for doing “map” with multiple arguments in a way that’s similar to broadcast.

adienes · July 23, 2025, 3:43pm

there is Iterators._zip_lengths_finite_equal which is undocumented & internal, but I think in this case it’s pretty fine to just use it anyway

it might be reasonable to define

zipstrict(a...) = _zip_lengths_finite_equal(a) ? Zip(a) : throw(ArgumentError(...))

or use a strict kwarg. but there isn’t really precedent for this kind of API in any existing iterators in Base.Iterators

mbauman · July 23, 2025, 3:58pm

That won’t work, unfortunately, with SizeUnknowns. For that you do need to reach into the zip implementation to demand that running out of one iterator asserts that all iterators are complete.

Tamas_Papp · July 24, 2025, 8:12am

AFAIK, no. I think it would be a fine addition to IterTools.jl.

The implementation would have to consider all the size trait combinations mentioned in this topic, also for more than two arguments, but in principle this should not be difficult.

Ken_Williams · July 24, 2025, 11:25pm

Which one would be preferable?

lazy: as soon as one of the iterators runs out, raise an exception if there were any iterators before it in the ziplist that didn’t run out, or any iterators after it that don’t run out
eager: raise an exception if we can’t validate ahead of time that all iterators are either infinite or the same length, and then just let a normal zip happen

The lazy version seems pretty easy to implement in a way that covers all cases, right? And the eager version is impossible to get to work with SizeUnknowns, as @mbauman points out?
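
A minimal sketch of the lazy variant, under stated assumptions: ZipStrict, zipstrict and _strict_step are hypothetical names (nothing here is existing Base or IterTools.jl API), and the wrapper simply advances every iterator on each step and throws if only some of them are exhausted. It relies only on the iterate protocol, so it does not care whether lengths are known.

```julia
# Hypothetical sketch, not existing API: a zip that throws when the
# wrapped iterators do not all finish on the same step.
struct ZipStrict{T<:Tuple}
    its::T
end

# Require at least one iterator so the wrapper always terminates or throws.
zipstrict(it, its...) = ZipStrict((it, its...))

# The length is not computed up front, so report it as unknown.
Base.IteratorSize(::Type{<:ZipStrict}) = Base.SizeUnknown()

# First step: start every iterator. Later steps: advance every iterator,
# even if an earlier one is already exhausted, so all of them get checked.
Base.iterate(z::ZipStrict) = _strict_step(map(iterate, z.its))
Base.iterate(z::ZipStrict, states) = _strict_step(map(iterate, z.its, states))

function _strict_step(steps)
    done = map(isnothing, steps)
    if any(done)
        all(done) ||
            throw(ArgumentError("zipped iterators did not all end at the same step"))
        return nothing
    end
    # Each step is a (value, state) pair; split them into two tuples.
    return map(first, steps), map(last, steps)
end
```

As written, an infinite iterator such as Iterators.cycle never finishes and is therefore reported as a mismatch once the others end; skipping iterators whose IteratorSize is IsInfinite would be one way to relax that. The sketch also leaves eltype at its default, so collect returns a Vector{Any}.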

Tamas_Papp · July 27, 2025, 2:44pm

I would go with lazy first, and also as a fallback, and then specialize to eager for the cases where this leads to a significant improvement.
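
To decide when an eager specialization applies, one could check the size traits up front. The following is a sketch using only the public Base.IteratorSize trait and length; lengths_known_equal is a hypothetical helper name, and treating infinite iterators as compatible is an assumption taken from the eager description earlier in this topic.

```julia
# Hypothetical helper (not Base API): return true only when it can be
# verified ahead of time that all finite iterators have the same length.
function lengths_known_equal(its...)
    n = nothing  # length of the first finite iterator seen, if any
    for it in its
        sz = Base.IteratorSize(it)
        sz isa Base.IsInfinite && continue       # cannot end the zip early
        sz isa Base.SizeUnknown && return false  # cannot be validated eagerly
        len = length(it)
        n === nothing && (n = len)
        len == n || return false
    end
    return true
end
```

When this returns true, a plain zip is already safe; when it returns false, the cause may be either a genuine mismatch or a SizeUnknown iterator, which is the case where a lazy check is still needed.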
