Replacement for find(all) on zip?

oschulz · November 16, 2018, 2:41pm

In Julia 0.6, we could to

julia> find(x -> x[1] + x[2] > 0.5, zip(rand(10), rand(10)))

However, in Julia v1.0 this doesn’t work, because Base.Iterators.Zip2 has no keys:

julia> findall(x -> x[1] + x[2] > 0.5, zip(rand(10), rand(10)))
ERROR: MethodError: no method matching keys(::Base.Iterators.Zip2{Array{Float64,1},Array{Float64,1}})

Do we have a good replacement for this?

nalimilan · November 16, 2018, 8:45pm

Not really AFAIK. findall(rand(10) .+ rand(10) .> 0.5) should do the trick, but it allocates.

See this issue for the history.

Seif_Shebl · November 16, 2018, 8:56pm

filter(x -> x[1] + x[2] > 0.5, collect(zip(rand(10), rand(10))))

Also, this generator is much better:

(x -> x[1] + x[2] > 0.5, zip(rand(10), rand(10)))

mbauman · November 16, 2018, 9:11pm

Just do the find over the set of indices:

A, B = rand(10), rand(10)
findall(i->A[i]+B[i]>0.5, keys(A))

Actually, if you do this, you probably do want to use filter instead:

filter(i->A[i]+B[i]>0.5, keys(A))

as then you don’t need to worry about indexing back into your vector of indices.

oschulz · November 17, 2018, 9:43am

Sure, that’s my current workaround. But the collect is expensive in my use case, it’s a lot of data.

oschulz · November 17, 2018, 9:48am

Just do the find over the set of indices:

Unfortunately, the function comes from somewhere else and is supposed to operate on values, not indices.

Actually, if you do this, you probably do want to use filter instead:
as then you don’t need to worry about indexing back into your vector of indices.

Normally, I would, of course. But in this case, i want the indices: The result is a data selector for a another large dataset (big entries). Have to avoid copies, and data may be out-of-core.

Tamas_Papp · November 17, 2018, 1:25pm

Compose it with getindex then.

mbauman · November 17, 2018, 4:39pm

I’m filtering the keys of the collection, not the values. Even if it’s an opaque function, this should just work. You can also customize which kinds of keys you get back (linear or Cartesian), for example:

filter(i->f(A[i], B[i]), eachindex(A,B))

oschulz · November 17, 2018, 7:15pm

Yes, that might work.

Topic		Replies	Views
Extracting indices using findall() New to Julia	4	239	October 25, 2022
Anonymous function applied to multiple arguments General Usage question	13	1491	June 12, 2020
How to obtain indices of an array satisfying boolean condition New to Julia	20	21370	December 1, 2022
Findall slow General Usage	8	1913	October 24, 2019
Where possible and sensible, allow `zip` to pass `getindex` through to its underlying iterables General Usage	2	350	April 2, 2023

Replacement for find(all) on zip?

Related topics