Is there a function for the inverse of `ismissing()`

drarnau · February 1, 2022, 1:14pm

Hi all,

Is there a function that returns the inverse of ismissing()?

Obviously, I can use ismissing(foo) == false but it seems not elegant.

Thanks!

nilshg · February 1, 2022, 1:21pm

!ismissing?

Vasily_Pisarev · February 1, 2022, 1:22pm

!ismissing()
Or, generally, for any function foo that returns a boolean value, !foo is a valid function that returns the inverse value.

jling · February 1, 2022, 1:28pm

it’s not because !ismissing() is a separately defined function though, this is defined in Julia Base:

!(f::Function) = (x...)->!f(x...)

stevengj · February 1, 2022, 1:45pm

To be clear, if you do !ismissing(foo), this just calls ismissing(foo) and then negates the result with !

If you use !ismissing with no function call (...), then Julia uses the higher-order function @jling mentioned, which yields a function that is the antonym of ismissing. This is useful for passing to other functions, e.g. for calling filter(!ismissing, somearray) to extract the non-missing elements.

drarnau · February 1, 2022, 1:57pm

So the answer is that there is no inverse function to ismissing(). But there are plenty of ways to generate the behaviour that such function would have.

lawless-m · February 1, 2022, 1:58pm

anonymous functions are compiled too, so you lose no performance

jling · February 1, 2022, 2:00pm

this usually have a different meaning btw, an inverse function of f(), let’s call it g(), should have the property g(f(x)) == x, which is impossible for ismissing for obvious reason.

there’s nothing mystical about built-in functions, you can see the source code:

ismissing(x) = x === missing

you can define your:

isnotmissing(x) = x!== missing

if you really want…

stevengj · February 1, 2022, 2:03pm

(Yes, I would call what @drarnau wants the “antonym” of ismissing.)

pdeffebach · February 1, 2022, 2:33pm

Fwiw I would like for this to exist. The ! makes broadcasting ugly. .!ismissing.(x). But its purely aesthetic so it can live somewhere else.

stevengj · February 1, 2022, 2:42pm

@. !ismissing(x) isn’t so bad, especially since you probably will have other function calls too. (I feel like @. is under-used.)

pdeffebach · February 1, 2022, 2:44pm

It can’t be used in a function call without more parentheses is one issue.

Actually that might be a nice 2.0 breaking change, make

foo(@mymacro a, b)

have @mymacro only apply to a.

stevengj · February 1, 2022, 2:46pm

@. $foo(!ismissing(x), b) should work, though I don’t know that it is better than foo(.!ismissing.(x), b). But I agree that macro calls inside function arguments are a bit annoying because of the parens.

Henrique_Becker · February 1, 2022, 5:51pm

I have defined the following function for my Jupyter notebooks:

broadwrap(f) = function (args...) broadcast(f, args...) end

But the name could be smaller or a single symbol. This function returns the broadcasted version of a function and I find it useful for some of the DataFrame manipulation functions, which ask for a function working over a whole column as an argument.

pdeffebach · February 1, 2022, 5:53pm

This is what ByRow does. It was written to do this exact operation, right?

In DataFramesMeta there is @rtransform, @rselect, @rsubset, etc. for row-wise operations.

Henrique_Becker · February 1, 2022, 5:57pm

No, it seems like they have some distinctions:

The wrapped function is called exactly once for each element. This differs from map and broadcast, which assume for some types of source vectors (e.g. SparseVector) that the wrapped function is pure, allowing them to call the function only once for multiple equal values. When using such types, for maximal performance with pure functions which are relatively costly, use x → map(f, x) instead of ByRow(f).

But it may be the case that I created broadwrap either by ignorance of ByRow or before it existed; nonetheless, I am not sure if there is an advantage of using a more purpose-specific solution as I may end up using broadwrap for one or another thing that does not relate to DataFrames.jl (this is just its main use).

pdeffebach · February 1, 2022, 5:58pm

I would consider that a relatively niche difference. It’s also quite useful for data analysis, for example if you have a PooledArray and a function like f(x) = x + rand().

But you are right that ByRow lives in DataFrames.jl. Maybe it should live somewhere else. But I’m not sure where.

stevengj · February 1, 2022, 6:02pm

The practical problem with this is that, unlike dot calls, it won’t fuse.

Henrique_Becker · February 2, 2022, 2:20am

Yes… but the goal of broadwrap is to create a function/closure/callable to be passed as an argument, there is a way to guarantee loop fusion when you are passing a function as argument?

Topic		Replies	Views
Ismissing(x) versus x === missing General Usage	15	3026	May 26, 2021
Using `isnan()` with missing values leads to hard to find bugs General Usage	6	520	April 12, 2020
Base including notnan, notnothing, ...? Internals	9	830	March 25, 2022
There is a `f(g) = function(args...) broadcast(g, args...) end` in `Base`? New to Julia	3	365	June 6, 2020
Operations on missing values General Usage question	8	1386	March 19, 2018

Is there a function for the inverse of `ismissing()`

Related topics