How often do you use the |> operator?

rikh · March 1, 2021, 10:20pm

In response to the home/end keys. Do you know that CTRL+A and CTRL+E jump respectively backward and forward on the line? This works in shells and the Julia REPL.

vancleve · March 1, 2021, 10:31pm

cough, à la emacs, cough cough

iviar · March 5, 2021, 2:37am

The form above has a significant weak point of readability in the chaining variables, sometimes I use it with indentation vertically aligning the intermediate results variables, so for example seems you (1+11=author+upvoters) would have noticed that the given example accidentally do not chain result1 to result2, here is adjusted with my preferred readability.

# coder is not drunk, keep it aligned also vertically
                            result1 = function1(input_data)
        result2 = function2(result1)
a = gpu(result2)

Tamas_Papp · March 5, 2021, 5:38am

Good catch! In real life-examples I try to use descriptive variable names though, so typos like this are less of a concern.

gustaphe · March 5, 2021, 6:03am

I wouldn’t rely on manual indentation like this. I prefer

gpu(
    function2(
        function1(
            input_data
            )
        )
    )

If we’re anyway stacking far out horizontally.

iviar · March 5, 2021, 7:41pm

The many-lines simply nested form depend on taste, is not storing intermediate evaluations which sometimes is preferable (as was under consideration in this sub-thread),
The following one line seems to me fits just right the purpose of spacing

gpu( function2( function1( input_data )))

in place of the same

It might be subjective but in the one-line the closing parenthesis pictures immediately what is input_data and do not consume 7 lines of vertical space for something that to me do not seem a visual improvement.
I’m also influenced by lisp practice which welcome closing many parenthesis in the last line of the evidenced block (that can be navigated with % in vi or C-M-p and C-M-n in emacs).

pyrex41 · March 5, 2021, 7:52pm

For me, the big advantage in drafting code at least is that the pipe operator lets me write down code in the order I think about it. Sometimes I think “hey, i need to apply function f to x”, then I type that and then remember I need to pipe it into some new function next. Instead of adding a new line / local variable, or adding more parens and moving the cursor back, i can just add a pipe operator. For me, it adds flexibility that matches the way I tend to think, which is not always linear but sometimes is.

As far as good practice / how you should use it in a library…there seem to be a lot of opinions on that, I stay out of that and just try to keep my code relatively clean.

One related I found in using the Elm language for a website is a reverse pipe operator:

f <| 2+2 == f(2+2) == 2+2 |> f

Not really sure why, but it made it easier for me to see at a glance how my code was structured.

TL;DR: syntactic convention is just a tool to help translate human thoughts to code, so, use what seems clear to you and supports your thought process.

anon92994695 · March 5, 2021, 8:10pm

I use it every day. Usually when I am hacking away at something.

it’s a good way to separate types of functions in a long composition chain.

a = g(f(x)) |> i |> h

so if f and g are similar but I and h are not it can help break up the code a bit.

mthelm85 · March 5, 2021, 8:15pm

I use DataFramesMeta.jl all the time and I personally love the @linq macro. I work a lot with survey data that is often tens of millions of rows by a few hundred columns in size and I’m typically measuring all sorts of different things that can be measured from the data so a typical operation looks something like this:

total = @linq data |>
    where(:foo .> 10) |>
    transform(bar = :baz .* 2) |>
    by(:blurb, total = sum(:blob)) |>
    sort(:total, rev = true)

In my actual work there are often many steps that one has to take to arrive at the correct answer so I like being able to have each operation separated out on its own line so that I can see exactly what steps were taken when I come back to the code at a later date. I really don’t like reading code where a bunch of intermediate/successive variables are created just to store one step of a data transformation process when it would be possible to chain everything together and do it at once and assign the result to a single logically-chosen variable name.

Maximilian_Roos · March 6, 2021, 11:25pm

More so than the pipe operator, much of this discussion comes down to whether you prefer prefix or postfix functions. Julia’s functions are prefix, and using the pipe operator allows using postfix functions.

An advantage of postfix functions is the lexical order is the same as the execution order:

Prefix:

[parse(Int, x) for x in split(strip(foo))]

(hypothetical) Postfix functions:

foo.strip.split.map(parse(Int, _))

Postfix with pipes — very similar to postfix functions above:

@pipe foo |> strip |> split .|> parse(Int, _)

As pipelines get longer, the difference between lexical & execution ordering grows when using prefix functions. Here’s a pipeline that would arguably be very difficult to read with prefix syntax — but would by fine with the hypothetical postfix functions above:

x = @pipe foo |> strip |> split .|> parse(Int, _) |> sort |> [0, _..., maximum(_) + 3]

tkf · March 7, 2021, 1:52am

I think the usefulness of |> stems from the “algebraic” property that is similar to matrix * vector:

(C' * B' * A')' * x == A * (B * (C * x))
x |> (f ∘ g ∘ h) == x |> h |> g |> f

In fact, if we define f <| x = f(x), the similarity is much clearer

(A * B * C) * x == A * (B * (C * x))
(f ∘ g ∘ h) <| x == f <| g <| h <| x

Or, equivalently, with the opposite composition operator g ⨟ f = f ∘ g which is similar to (_' * _')':

(C' * B' * A')' * x == A * (B * (C * x))
(h ⨟ g ⨟ f)(x) == x |> h |> g |> f

Given this observation, it is somewhat interesting that some comments against |> can also be applied to * (or any infix operators).

I think using binary operators is a good way for emphasizing the algebraic property of your program. This, in turn, can help readability and editability. (The argument is true for the other way around. It probably is a bad idea to use a binary operator if you don’t have any algebraic properties.)

jakobnissen · March 7, 2021, 7:39am

I’ve begun to love it. A chain like

filter(!isempty, map((n,L) -> (n, strip(L), filter(x -> isodd(first(x)), enumerate(eachline(file)))))

is much more easily read as

eachline(file) |> enumerate |> filter(isodd ∘ first) |> map() do (n, L)
    n, split(L)
end |> filter(!isempty)

In fact, the chain reads like a series of assignments (which some would prefer, exactly because they are read in order).
Some languages allow for these chains with dot syntax. I think the |> operator is better.

anon74562486 · March 12, 2021, 9:02pm

Thanks for the replies.
From what I understand the use of |> is a matter of style, but many people don’t use the pipe operator because of its limitations (having to write a lambda every time the function to the right of |> takes more than one argument, or having to rely on external macros). I also don’t like writing lambda functions.

For example:

a = [1:100;]
a |> 
    x -> reshape(x, 10, 10)   |>    # reshape to 10x10 matrix
    x -> my_function(x, args) |>    # apply a function with other args to the matrix
    x -> reshape(x, size(a))  |>    # convert to original size
    x -> filter(iseven, a)

What do you think about this? (it’s probably a stupid solution but it seems to work, but I don’t know if it’s a bad practice)

|>₁(a, f_args::Tuple) = f_args[1](a, f_args[2:end]...)
|>₂(a, f_args::Tuple) = f_args[1](f_args[2], a, f_args[3:end]...)
|>₃(a, f_args::Tuple) = f_args[1](f_args[2], f_args[3], a, f_args[4:end]...)

a = [1:100;]
a |>₁
    (reshape, 10, 10)   |>₁    
    (my_function, args) |>₁
    (reshape, size(a))  |>₂
    (filter, iseven)

pdeffebach · March 12, 2021, 9:26pm

Use Chain.jl. It solves a lot of these pain points.

anon74562486 · March 12, 2021, 9:32pm

Thank you, but I would prefer not to use macros from external libraries

tbeason · March 12, 2021, 9:34pm

Yea perhaps what this thread is still missing is a list of all the packages that improve upon the base pipe operator. To name a few:

Tamas_Papp · March 14, 2021, 8:32am

I am curious about the reason for this.

StatisticalMouse · March 14, 2021, 9:15am

I’ve sometimes tried to use the |> from Base, but then I run into cases when it doesn’t work, find I don’t understand why, and give up.

The examples in Chain.jl readme probably explain why, not 100% sure.

I’ve not used the ones in packages because I haven’t wanted to spend time to learn the differences and choose one. What criteria would I use for choosing anyway? For me it would serve best if there was a ”community recommendation” to use one and not the others. (This is also true more generally.)

There’s also the minor point that to type | with a Finnish/Swedish keyboard requires option-7. The keyboard key itself has 7 and / on it.

jzr · March 14, 2021, 10:00am

There is an issue

https://github.com/JuliaLang/julia/issues/5571

and a proposal

https://github.com/JuliaLang/julia/pull/24990

anon74562486 · March 14, 2021, 11:19am

I’m a student and I don’t like people reading my code to have to know about libraries that aren’t strictly necessary.

Topic		Replies	Views
Please help understand \|> operator General Usage flux	1	308	December 12, 2020
Piping operator \|> not documented? New to Julia piping	4	15167	June 27, 2018
Piping in Julia New to Julia piping	13	21272	August 30, 2019
Opinions on piping into variable? General Usage piping	11	793	April 19, 2023
When to use pipes? Offtopic	4	840	April 21, 2021

How often do you use the |> operator?

Related topics