Is there a function for creating n (nearly) evenly-spaced integers for indexing?

rafael.guerra · January 13, 2024, 9:48pm

This seems to be different from the OP example, as it produces indexes: 1:3:88, and not variable step (= 3 or 4) indexes.

rocco_sprmnt21 · January 13, 2024, 10:06pm

From the proposed solutions I seemed to understand that this was the expected result.

A=rand(100)
A[range(begin, step=end ÷ len, length=len)]

It is not so?
I just wanted to write one of the solutions a little differently

mkitti · January 13, 2024, 10:16pm

Let’s say I want an integer difference between integers, and I care that that each step between each number is an integer. Julia has a convenient syntax for this: begin:step:end.

My criterion may be distinct from yours.

I want my indices to be exactly spaced apart by the same step.
I do not care if the last index is included.

julia> function helper(A; step)
           B = A[begin:step:end]
           @info "B = A[begin:step:end]" B[1] B[2] B[end] length(B)
           return B
       end
helper (generic function with 1 method)

julia> A = 1:100
1:100

julia> helper(A, step = 1)
┌ Info: B = A[begin:step:end]
│   B[1] = 1
│   B[2] = 2
│   B[end] = 100
└   length(B) = 100
1:1:100

julia> helper(A, step = 2)
┌ Info: B = A[begin:step:end]
│   B[1] = 1
│   B[2] = 3
│   B[end] = 99
└   length(B) = 50
1:2:99

julia> helper(A, step = 3)
┌ Info: B = A[begin:step:end]
│   B[1] = 1
│   B[2] = 4
│   B[end] = 100
└   length(B) = 34
1:3:100

julia> helper(A, step = 4)
┌ Info: B = A[begin:step:end]
│   B[1] = 1
│   B[2] = 5
│   B[end] = 97
└   length(B) = 25
1:4:97

julia> helper(A, step = 5)
┌ Info: B = A[begin:step:end]
│   B[1] = 1
│   B[2] = 6
│   B[end] = 96
└   length(B) = 20
1:5:96

julia> helper(A, step = 6)
┌ Info: B = A[begin:step:end]
│   B[1] = 1
│   B[2] = 7
│   B[end] = 97
└   length(B) = 17
1:6:97

julia> helper(A, step = 7)
┌ Info: B = A[begin:step:end]
│   B[1] = 1
│   B[2] = 8
│   B[end] = 99
└   length(B) = 15
1:7:99

For many arrays, 1:step:end might do, but not all Julia arrays start at index 1.

Next we need to determine step. It could be length(A) ÷ desired_length. Note that ÷ is an alias for div, which will do integer division, rounding towards zero. ÷ is similar to \\ in Python.

Simply setting step as above would create an array that is longer than desired. Thus we have to further truncate the result to the desired length.

Below I will use the idiom end-begin+1 to stand in for length.

julia> function helper2(A; length)
           B = A[begin:(end-begin+1)÷length:end]
           B = B[begin:begin+length-1]
           @info "B:" B[1] B[2] B[end] Base.length(B)
           return B
       end
helper2 (generic function with 1 method)

julia> helper2(A, length = 100)
┌ Info: B:
│   B[1] = 1
│   B[2] = 2
│   B[end] = 100
└   Base.length(B) = 100
1:1:100

julia> helper2(A, length = 95)
┌ Info: B:
│   B[1] = 1
│   B[2] = 2
│   B[end] = 95
└   Base.length(B) = 95
1:1:95

julia> helper2(A, length = 50)
┌ Info: B:
│   B[1] = 1
│   B[2] = 3
│   B[end] = 99
└   Base.length(B) = 50
1:2:99

julia> helper2(A, length = 45)
┌ Info: B:
│   B[1] = 1
│   B[2] = 3
│   B[end] = 89
└   Base.length(B) = 45
1:2:89

julia> helper2(A, length = 32)
┌ Info: B:
│   B[1] = 1
│   B[2] = 4
│   B[end] = 94
└   Base.length(B) = 32
1:3:94

julia> helper2(A, length = 16)
┌ Info: B:
│   B[1] = 1
│   B[2] = 7
│   B[end] = 91
└   Base.length(B) = 16

In summary, if you are flexible on the last index but want an integer step, there is an easy syntax if you can specify step.

parb · January 15, 2024, 2:41pm

Great idea

parb · January 15, 2024, 2:57pm

Thank you for your help! I like you and @rocco_sprmnt21 using the div function, though I haven’t quite made it work.

I tried

helper2(1:41, 23)

and got back 1:1:23. This would downsample the long array but instead give only the first section of it.

parb · January 15, 2024, 3:15pm

This is very similar to @mbauman’s solution - which is equally good. I slightly prefer using div rather than round (and rather than ÷ which I can’t find on my keyboard).

So in the original notation, from now on I’ll use

sampleindices = range(start=1, step=div(length(longarray), numsamples), length=numsamples)
downsampled = longarray[sampleindices]

The reason for making sampleindices is that other parallel arrays need indexing my typical use cases.

Thank you everyone!

rafael.guerra · January 15, 2024, 3:44pm

In one case (round) the range 1:100 is sampled with variable step (3 or 4) but does cover the full interval from 1 to 100, while in the simpler/trivial case, a constant step of 3 is used to produce 30 indexes: 1:3:88, leaving a big gap in the tail (from 89 to 100).

rocco_sprmnt21 · January 15, 2024, 3:54pm

help?> ÷
"÷" can be typed by \div<tab>

parb · January 15, 2024, 4:04pm

These are important nuances to keep in mind for each use case! I’d mark both as solutions if I could.

aplavin · January 15, 2024, 7:09pm

I wrote a more general function for similar usecases:

julia> using DataManipulation

julia> discreterange(identity, 1, 100; length=30)
30-element Vector{Int64}:
   1
   4
   8
...
  90
  93
  97
 100

Its main target are transformed ranges, like logarithmically-spaced discreterange(log, 1, 1000; length=30). There, you cannot just create a float range and round it to integers.
But discreterange works for regular linear ranges as well, as shown in the example.

rafael.guerra · January 15, 2024, 7:37pm

I noticed that the initial indexes don’t really follow a logarithmic proportion.
In this specific example, the output is equivalent to:

[1:4; round.(Int, exp.(range(log(5), log(1000), length=30-4)))]

aplavin · January 15, 2024, 9:56pm

Of course, because that would be impossible (:

rafael.guerra · January 15, 2024, 10:06pm

In some applications, it makes sense to allow repeated indexes at the beginning:

round.(Int, exp.(range(log(1), log(1000), length=30)))

aplavin · January 15, 2024, 10:20pm

Sure, the whole point of discreterange() is to give distinct integers. So both variants are available, either for regular linear ranges of for mapped ones:

julia> round.(Int, maprange(log, 1, 100; length=20))
[1, 1, 2, 2, 3, 3, 4, 5, 7, 9, 11, 14, 18, 23, 30, 38, 48, 62, 78, 100]

julia> discreterange(log, 1, 100; length=20)
[1, 2, 3, 4, 5, 6, 7, 9, 11, 14, 17, 20, 25, 30, 37, 45, 55, 67, 82, 100]

apo383 · January 15, 2024, 11:52pm

Any particular reason for the name discreterange, as opposed to something like integerrange? I suppose “discrete” is meant to imply integers, in the sense that floats are (nearly) continuous. But discrete mathematics can refer to finite countable objects, not necessarily one-to-one with all integers (or Int64).

aplavin · January 16, 2024, 12:33am

Any particular reason for the name discreterange , as opposed to something like integerrange ?

Not really, just the first name that came to mind when I needed such a function (:
uniqueintegerrange would be the most descriptive, even though a bit on the longer side…

Topic		Replies	Views
Any shortcut to 1:length(myVector)? General Usage	34	3086	January 12, 2021
Not evently-spaced grids/ranges New to Julia question	11	1442	April 10, 2019
How to generate a sequence of numbers in julia General Usage	2	25895	January 21, 2019
Accessing every elements of an array except a certain range General Usage question , array , arrays	10	2834	February 13, 2021
How do I create an array with random unique numbers in a specific range? New to Julia question , random	20	1425	June 24, 2024

Is there a function for creating n (nearly) evenly-spaced integers for indexing?

Related topics