Num.min/max with floats · API design

so in a recent Software Unscripted chat with Matt Godbolt, he noted that if min or max are given NaN, it's important that they always return NaN - we actually have a bug with this right now, where you get a different answer depending on the order of the arguments:

» Num.min (0f64 / 0) 1

1 : F64
» Num.min 1 (0f64 / 0)

NaN : F64

Richard Feldman (Oct 23 2023 at 02:31):

Richard Feldman (Oct 23 2023 at 02:32):

so I can see an argument for 3 different potential designs for what we should do instead of this:

Richard Feldman (Oct 23 2023 at 02:33):

Brendan Hansknecht (Oct 23 2023 at 02:39):

Brendan Hansknecht (Oct 23 2023 at 02:40):

I would either do 1 (most correct) or only return NaN if both are NaN (not correct but very useful in certain situations)

Brendan Hansknecht (Oct 23 2023 at 02:40):

Brendan Hansknecht (Oct 23 2023 at 02:41):

Floats are meant explicitly to accept NaN in order to delay error handling and keep code fast. That is why I think 2 is bad.

Richard Feldman (Oct 23 2023 at 02:41):

in defense of 3, wouldn't some of the same arguments apply here as applied when we decided not to give floats Eq? :thinking:

Brendan Hansknecht (Oct 23 2023 at 02:42):

I think 3 is bad cause comparing floats is super common. For example, which is faster?

Brendan Hansknecht (Oct 23 2023 at 02:43):

There are lots of cases for float ordering where you really don't care if two values are basically the same. If they are basically the same, either answer is really fine. Unlike with equality where being off by a little is super common and causes issues.

Brendan Hansknecht (Oct 23 2023 at 02:44):

Like if 2 values are essentially equal and you call max, it really doesn't matter which value is return or if they are actually equal. So I think not the same as Eq

Richard Feldman (Oct 23 2023 at 02:48):

Ayaz Hafiz (Oct 23 2023 at 02:49):

3 should really be an implements Ord constraint right? I think it's reasonable because floats don't have a total ordering. but the problem is you may get the very reasonable question "why is there no min for floats when I can do if a < b then a else if b < a then b else a"

Richard Feldman (Oct 23 2023 at 02:50):

hm, interesting - we don't have Ord yet, but I was assuming we'd implement it for all numbers

Ayaz Hafiz (Oct 23 2023 at 02:50):

Richard Feldman (Oct 23 2023 at 02:50):

Richard Feldman (Oct 23 2023 at 02:52):

although I certainly think there's an argument for those being consistent - like surely if we define Ord for floats, then Num.min and Num.max should be available on them (or else none of them should be available)

Brendan Hansknecht (Oct 23 2023 at 02:54):

For 2, NaN is often expected. It means that the rest of a float computation is invalid. So you let the NaN propagate to the end like you would a result (just faster). At the end of the calculations, you handle the NaN. Crashing on NaN would ruin the usability

Ayaz Hafiz (Oct 23 2023 at 02:55):

well, it depends what Ord is. Is it a total ordering? If so then yeah, certainly min/max must be available. But if Ord is defined such that a < b being false does not imply that a >= b is true, then it's a little bit more free (but at that point you probably want to have something like rust's partial Ord too)

Brendan Hansknecht (Oct 23 2023 at 02:56):

Also, I think it is more reasonable to define min and max on floats than to define ordon them.

Brendan Hansknecht (Oct 23 2023 at 02:57):

That said, I think they should have both (or at least min, max and PartialOrd)

Richard Feldman (Oct 23 2023 at 02:57):

I definitely would prefer not to have both Ord and PartialOrd abilities in builtins :sweat_smile:

Richard Feldman (Oct 23 2023 at 02:58):

Brendan Hansknecht (Oct 23 2023 at 03:00):

The reason min and max are more reasonable than Ord on floats is the potential damage that can be done. With Ord, you will hit cases where two floats are approxEq, but Ord gives them a strict ordering. This can lead to mistakes where a user actually wants to treat to approxEq floats as Eq, but doesn't realize that by using Ord everything is exact instead of approximate.

Brendan Hansknecht (Oct 23 2023 at 03:01):

With min and max. If two floats are approxEq the min or max of them will still be approxEq. So no harm done.

Brendan Hansknecht (Oct 23 2023 at 03:02):

min and max don't get harmed by float values being super close but not exactly equal.

Brendan Hansknecht (Oct 23 2023 at 03:02):

Brendan Hansknecht (Oct 23 2023 at 03:03):

but yeah, float sorting with NaN falls appart because there is no clear place to sort NaNs to.

Brendan Hansknecht (Oct 23 2023 at 03:03):

Ayaz Hafiz (Oct 23 2023 at 03:04):

why is there no clear place? we could follow the ieee 754 standard, or come up with another one. I don't think there's a technical limitation there.

Ayaz Hafiz (Oct 23 2023 at 03:05):

I don't think Ord is a concern for floating points if you only allow a strict order, that is, only comparing a < b rather than a <= b (which is what min/max already do). I think we cannot do a <= b-style total ordering anyway, because otherwise floats can implement Eq via Ord, which is incorrect

Brendan Hansknecht (Oct 23 2023 at 03:07):

There is no clear place because if you implement a userland sort using < or > you will get strange results when NaN is in a the array. It depends on if the comparisons are x < NaN or NaN < x.

Brendan Hansknecht (Oct 23 2023 at 03:08):

The only way the ieee gets a total ordering is by bitcasting to an integer (which roc doesn't allow users to do)

Richard Feldman (Oct 23 2023 at 03:09):

another way of thinking of this: if we don't support Num.min and Num.max on floats, then people will probably implement their own in the naive way and accidentally end up with the same bug we did :grimacing:

Brendan Hansknecht (Oct 23 2023 at 03:09):

So multiple "correct" strict ord based sorting algorythms in roc would put NaNs in different places. This can even happen with the same alg if NaN is just in a different place in the array.

Richard Feldman (Oct 23 2023 at 03:10):

Richard Feldman (Oct 23 2023 at 03:11):

Brendan Hansknecht (Oct 23 2023 at 03:13):

Brendan Hansknecht (Oct 23 2023 at 03:14):

Brendan Hansknecht (Oct 23 2023 at 03:15):

I mean, I guess if we want to follow the float total ordering standard, we would change all comparison operators and floats to treat them as signed integers.

Brendan Hansknecht (Oct 23 2023 at 03:15):

Brendan Hansknecht (Oct 23 2023 at 03:16):

Richard Feldman (Oct 23 2023 at 03:17):

a relevant consideration is that we have Dec for reasonableness and floats for performance

Ayaz Hafiz (Oct 23 2023 at 03:18):

the total ordering isn't based on casting to integers. it's the "regular" ordering, with negative and positive NaNs pinnned to either ends.

Brendan Hansknecht (Oct 23 2023 at 03:18):

Richard Feldman (Oct 23 2023 at 03:19):

so there's a case to be made that our tolerance for error-prone-ness in floats should be reduced for the sake of increasing performance, since that's kind of their whole point

Richard Feldman (Oct 23 2023 at 03:19):

Brendan Hansknecht (Oct 23 2023 at 03:19):

Brendan Hansknecht (Oct 23 2023 at 03:20):

Brendan Hansknecht (Oct 23 2023 at 03:22):

Can you word that differently? At least as I am reading it, I think it is backwards from what I feel you are trying to say.

Richard Feldman (Oct 23 2023 at 03:26):

haha yeah - basically error-prone APIs are more acceptable when it comes to floats

Richard Feldman (Oct 23 2023 at 03:26):

Martin Stewart (Oct 23 2023 at 06:39):

Maybe I misunderstood the decision made in an earlier thread or maybe plans have since changed but I thought Roc wasn’t going to allow NaN to exist in order to avoid these sorts of issues. Any functions that could potentially return NaN would either panic or return an error if it were to happen?

Martin Stewart (Oct 23 2023 at 13:20):

Richard Feldman (Oct 23 2023 at 15:23):

the problem was that if we don't have it in the language, there's no way to get maximum float performance - and the main point of having floats at all is performance!

Stream: API design

Topic: Num.min/max with floats

Richard Feldman (Oct 23 2023 at 02:31):

Richard Feldman (Oct 23 2023 at 02:31):

Richard Feldman (Oct 23 2023 at 02:32):

Richard Feldman (Oct 23 2023 at 02:33):

Brendan Hansknecht (Oct 23 2023 at 02:39):

Brendan Hansknecht (Oct 23 2023 at 02:40):

Brendan Hansknecht (Oct 23 2023 at 02:40):

Brendan Hansknecht (Oct 23 2023 at 02:41):

Richard Feldman (Oct 23 2023 at 02:41):

Brendan Hansknecht (Oct 23 2023 at 02:42):

Brendan Hansknecht (Oct 23 2023 at 02:43):

Brendan Hansknecht (Oct 23 2023 at 02:44):

Richard Feldman (Oct 23 2023 at 02:48):

Ayaz Hafiz (Oct 23 2023 at 02:49):

Richard Feldman (Oct 23 2023 at 02:50):

Ayaz Hafiz (Oct 23 2023 at 02:50):

Richard Feldman (Oct 23 2023 at 02:50):

Richard Feldman (Oct 23 2023 at 02:52):

Brendan Hansknecht (Oct 23 2023 at 02:54):

Ayaz Hafiz (Oct 23 2023 at 02:55):

Brendan Hansknecht (Oct 23 2023 at 02:56):

Brendan Hansknecht (Oct 23 2023 at 02:57):

Richard Feldman (Oct 23 2023 at 02:57):

Richard Feldman (Oct 23 2023 at 02:58):

Brendan Hansknecht (Oct 23 2023 at 03:00):

Brendan Hansknecht (Oct 23 2023 at 03:01):

Brendan Hansknecht (Oct 23 2023 at 03:02):

Brendan Hansknecht (Oct 23 2023 at 03:02):

Brendan Hansknecht (Oct 23 2023 at 03:03):

Brendan Hansknecht (Oct 23 2023 at 03:03):

Ayaz Hafiz (Oct 23 2023 at 03:04):

Ayaz Hafiz (Oct 23 2023 at 03:05):

Brendan Hansknecht (Oct 23 2023 at 03:07):

Brendan Hansknecht (Oct 23 2023 at 03:08):

Richard Feldman (Oct 23 2023 at 03:09):

Brendan Hansknecht (Oct 23 2023 at 03:09):

Richard Feldman (Oct 23 2023 at 03:10):

Richard Feldman (Oct 23 2023 at 03:11):

Richard Feldman (Oct 23 2023 at 03:11):

Brendan Hansknecht (Oct 23 2023 at 03:13):

Brendan Hansknecht (Oct 23 2023 at 03:13):

Brendan Hansknecht (Oct 23 2023 at 03:14):

Brendan Hansknecht (Oct 23 2023 at 03:15):

Brendan Hansknecht (Oct 23 2023 at 03:15):

Brendan Hansknecht (Oct 23 2023 at 03:16):

Richard Feldman (Oct 23 2023 at 03:17):

Ayaz Hafiz (Oct 23 2023 at 03:18):

Brendan Hansknecht (Oct 23 2023 at 03:18):

Richard Feldman (Oct 23 2023 at 03:19):

Richard Feldman (Oct 23 2023 at 03:19):

Brendan Hansknecht (Oct 23 2023 at 03:19):

Brendan Hansknecht (Oct 23 2023 at 03:19):

Brendan Hansknecht (Oct 23 2023 at 03:19):

Brendan Hansknecht (Oct 23 2023 at 03:20):

Brendan Hansknecht (Oct 23 2023 at 03:22):

Richard Feldman (Oct 23 2023 at 03:26):

Richard Feldman (Oct 23 2023 at 03:26):

Martin Stewart (Oct 23 2023 at 06:39):

Martin Stewart (Oct 23 2023 at 13:20):

Richard Feldman (Oct 23 2023 at 15:23):

Richard Feldman (Oct 23 2023 at 15:23):