Trying to resolve some fuzzer issues · compiler development

Is there any reason someone can think of why this line inside of the expr_to_pattern_help function in parse/src/expr.rs could not or should not be implemented?

        Expr::SpaceBefore(..) | Expr::SpaceAfter(..) | Expr::ParensAround(..) => unreachable!(),

Anthony Bullard (Jan 04 2025 at 22:49):

Seems easy enough to translate into a pattern, but I want to make sure there isn't an invariant that I'm missing

Anthony Bullard (Jan 04 2025 at 22:49):

Anthony Bullard (Jan 05 2025 at 00:16):

Anthony Bullard (Jan 05 2025 at 00:17):

For now, I'm going to wrap the fuzz_module function in a std::panic::catch_unwind so that it doesn't blow up on parse errors

Anthony Bullard (Jan 05 2025 at 01:14):

Ok, that doesn't seem to be helping. But it seems like, as Joshua warned me, the problem is trying to extract spaces when there is a SpacesBefore inside a SpacesBefore or a SpacesAfter inside a SpacesAfter....

Anthony Bullard (Jan 05 2025 at 01:16):

It seems to me that to do that would require the ExtractSpaces trait's extract_spaces method to also take an arena as an arg.

Anthony Bullard (Jan 05 2025 at 01:16):

Anthony Bullard (Jan 05 2025 at 16:26):

Alright, I have this compiling. This is either my best idea, or the worst idea. Got rid of several todo!()s though

Anthony Bullard (Jan 05 2025 at 16:26):

Anthony Bullard (Jan 05 2025 at 16:43):

* * * Source code before formatting:
dbg(2ii/fi&fh
)

i


* * * Source code after formatting:
dbg 2ii / fi &fh

i

* * * AST before formatting:
Expr(
    @0-15 SpaceAfter(
        DbgStmt {
            first: @4-13 SpaceAfter(
                BinOps(
                    [
                        (
                            @4-7 Num(
                                "2ii",
                            ),
                            @7-8 Slash,
                        ),
                    ],
                    @8-13 Apply(
                        @8-10 Var {
                            module_name: "",
                            ident: "fi",
                        },
                        [
                            @10-13 RecordUpdater(
                                "fh",
                            ),
                        ],
                        Space,
                    ),
                ),
                [
                    Newline,
                ],
            ),
            extra_args: [],
            continuation: @17-18 SpaceBefore(
                Var {
                    module_name: "",
                    ident: "i",
                },
                [
                    Newline,
                    Newline,
                ],
            ),
        },
        [
            Newline,
        ],
    ),
)

* * * AST after formatting:
Expr(
    @0-19 Defs(
        Defs {
            tags: [
                EitherIndex(2147483648),
            ],
            regions: [
                @0-16,
            ],
            space_before: [
                Slice<roc_parse::ast::CommentOrNewline> { start: 0, length: 0 },
            ],
            space_after: [
                Slice<roc_parse::ast::CommentOrNewline> { start: 0, length: 0 },
            ],
            spaces: [],
            type_defs: [],
            value_defs: [
                Stmt(
                    @0-16 BinOps(
                        [
                            (
                                @0-7 Apply(
                                    @0-3 Dbg,
                                    [
                                        @4-7 Num(
                                            "2ii",
                                        ),
                                    ],
                                    Space,
                                ),
                                @8-9 Slash,
                            ),
                        ],
                        @10-16 Apply(
                            @10-12 Var {
                                module_name: "",
                                ident: "fi",
                            },
                            [
                                @13-16 RecordUpdater(
                                    "fh",
                                ),
                            ],
                            Space,
                        ),
                    ),
                ),
            ],
        },
        @18-19 SpaceBefore(
            Var {
                module_name: "",
                ident: "i",
            },
            [
                Newline,
                Newline,
            ],
        ),
    ),
)

Anthony Bullard (Jan 05 2025 at 16:44):

Ayaz Hafiz (Jan 05 2025 at 16:45):

Anthony Bullard (Jan 05 2025 at 16:46):

Ayaz Hafiz (Jan 05 2025 at 16:47):

Anthony Bullard (Jan 05 2025 at 16:48):

Anthony Bullard (Jan 05 2025 at 16:50):

Ayaz Hafiz (Jan 05 2025 at 16:50):

because you need something to indicate what the rest of the expression after the debug is. So it's like dbg <Expr> <Expr>. otherwise you would need a list as you mention but a nice property of nesting the next expression is that you can get the type of the whole expression just by looking at one node.

Anthony Bullard (Jan 05 2025 at 16:51):

We don't have a separate Statement type. So something that is a statement needs to have a continuation that has the rest of the function

Anthony Bullard (Jan 05 2025 at 16:52):

The problem here is we didn't print the parens around the binop when we formatted

Anthony Bullard (Jan 05 2025 at 16:53):

So now it thinks we are applying dbg to the number 2ii and then that is the left-hand side of the binop

Anthony Bullard (Jan 05 2025 at 16:53):

Anthony Bullard (Jan 05 2025 at 16:54):

Let me know if that sounds correct to you @Ayaz Hafiz and thanks for talking sense into me

Anthony Bullard (Jan 05 2025 at 17:23):

Joshua Warner (Jan 05 2025 at 19:18):

I've made a few fixes for PNC issues locally (didn't notice this thread until after I them!)

Joshua Warner (Jan 05 2025 at 19:19):

Joshua Warner (Jan 05 2025 at 19:23):

Anyway, one thing that I think can't be preserved in the current AST is comments in an empty PNC apply, e.g.:

I think this will require using a different variant for PncApply in the AST, something like:

PncApply(&'a Expr<'a>, Collection<'a, &'a Expr<'a>>),

Anthony Bullard (Jan 05 2025 at 19:24):

Well the args use the collection helper so wouldn’t this apply to empty collections too?

Anthony Bullard (Jan 05 2025 at 19:25):

Joshua Warner (Jan 05 2025 at 19:25):

Anthony Bullard (Jan 05 2025 at 19:25):

Joshua Warner (Jan 05 2025 at 19:25):

Anthony Bullard (Jan 05 2025 at 19:26):

Joshua Warner (Jan 05 2025 at 19:26):

I would rather just have a new Expr variant for Pnc, and eventually delete the old WS one

Anthony Bullard (Jan 05 2025 at 19:26):

Joshua Warner (Jan 05 2025 at 19:27):

Anthony Bullard (Jan 05 2025 at 19:27):

Joshua Warner (Jan 05 2025 at 19:27):

Anthony Bullard (Jan 05 2025 at 19:28):

Not yet. I’m at the museum with my kids, but I’ll be able to push up what I got in a couple of hours

Anthony Bullard (Jan 05 2025 at 19:31):

Joshua Warner (Jan 05 2025 at 19:32):

Anthony Bullard (Jan 05 2025 at 19:33):

Read the above for context. Just made extract spaces take an arena so I could merge spaces recursively from SpacesBefore/SpacesAfter nodes

Anthony Bullard (Jan 05 2025 at 19:35):

Joshua Warner (Jan 05 2025 at 19:35):

Joshua Warner (Jan 05 2025 at 19:36):

I'm going to pull back the PNC-related fuzzing fixes I had in the PR I linked, to avoid conflicting with you

Anthony Bullard (Jan 05 2025 at 19:38):

Anthony Bullard (Jan 05 2025 at 23:18):

Joshua Warner (Jan 06 2025 at 00:35):

Anthony Bullard (Jan 07 2025 at 20:48):

I am in a horrible back and forth where I have all of the tests passing, and then exceptions gone, but the fuzzer still fails relatively quickly

Anthony Bullard (Jan 07 2025 at 20:48):

Sam Mohr (Jan 07 2025 at 20:50):

Suggestion: the fuzzer is really useful for making sure we have coherent coverage for our syntax when it's stable, but is blocking us from progressing when we're making tons of syntax changes

Sam Mohr (Jan 07 2025 at 20:50):

Anthony Bullard (Jan 07 2025 at 20:50):

Sam Mohr (Jan 07 2025 at 20:51):

Anthony Bullard (Jan 07 2025 at 20:51):

(L5(L5
0)
(
5)
(L5
0)
e
0)
dbg(L22
0)

Anthony Bullard (Jan 07 2025 at 20:51):

Brendan Hansknecht (Jan 07 2025 at 20:55):

Up to you guys, but often I find it a mistake to disable fuzzers temporarily. Turning them back on is a huge pain and often never happens.

Brendan Hansknecht (Jan 07 2025 at 20:55):

The fuzzer is catching real bugs in the new syntax. Even if they are ridiculous bugs.

Brendan Hansknecht (Jan 07 2025 at 20:57):

As long as there is a definite plan of when work will be put in to re-enable it, I think it could be reasonable to disable temporarily. Especially if after removing old syntax it will be easier to fix.

Sam Mohr (Jan 07 2025 at 20:58):

I just don't want Anthony Bullard to put his computer in the garbage disposal one piece at a time

Sam Mohr (Jan 07 2025 at 20:58):

Anthony Bullard (Jan 07 2025 at 21:02):

(L5(L5
0)
(
0)
(L5
0)
e
0)
dbg(L(
L5
0)
+
0asebg,L(
5is
if1)
e
0)
(21
0)

Luke Boswell (Jan 07 2025 at 21:02):

Anthony Bullard (Jan 07 2025 at 21:03):

Luke Boswell (Jan 07 2025 at 21:03):

I thought Josh was standing by with some fuzzer fixes until Anthony lands his change

Luke Boswell (Jan 07 2025 at 21:03):

I think we just ignore the fuzzer for now... if @Joshua Warner is ok with that. We can follow up with fixes in a separate PR

Luke Boswell (Jan 07 2025 at 21:04):

And by we I am thinking Josh... he seems to fix these bugs faster than I can find them (and I'm just running an automated tool)

Sam Mohr (Jan 07 2025 at 21:05):

As Brendan said, if we have a concrete plan to disable the fuzzer and how we can improve the parsing impl such that re-enabling the fuzzer puts us in a better spot, I think disabling it is good

Joshua Warner (Jan 07 2025 at 21:05):

If we have folks actively working on this (and we do!), I think temporarily disabling in CI is fine

Sam Mohr (Jan 07 2025 at 21:05):

Anthony Bullard (Jan 07 2025 at 21:05):

(L5(L5 0)(0)(L5 0)e 0)
dbg (L(L5 0) + 0asebg, L(5is ifl) e 0)
(21 0)

Joshua Warner (Jan 07 2025 at 21:06):

Anthony Bullard (Jan 07 2025 at 21:06):

Joshua Warner (Jan 07 2025 at 21:06):

Luke Boswell (Jan 07 2025 at 21:07):

I just ran cargo run --bin minimize expr on the above and concur with Anthony... this is "minimal" according to our tool

Joshua Warner (Jan 07 2025 at 21:08):

Anthony Bullard (Jan 07 2025 at 21:08):

:rofl::rofl::rofl::rofl::rofl::rofl::rofl::rofl::rofl::rofl::rofl::rofl::rofl::rofl:

Joshua Warner (Jan 07 2025 at 21:08):

Anthony Bullard (Jan 07 2025 at 21:08):

Anthony Bullard (Jan 07 2025 at 21:09):

Joshua Warner (Jan 07 2025 at 21:12):

Anthony Bullard (Jan 07 2025 at 21:13):

Joshua Warner (Jan 07 2025 at 21:13):

FWIW I eventually want to get rid of DbgStmt as a separate node (i.e. so this is just an apply in the syntax tree), so I would rather not go down the road of making a special Malformed node for this

Joshua Warner (Jan 07 2025 at 21:14):

Anthony Bullard (Jan 07 2025 at 21:14):

Anthony Bullard (Jan 07 2025 at 21:16):

Joshua Warner (Jan 07 2025 at 21:17):

Anthony Bullard (Jan 07 2025 at 21:17):

Anthony Bullard (Jan 07 2025 at 21:18):

Joshua Warner (Jan 07 2025 at 21:19):

Anyway, I agree this commit will have conflicts with your PncApply refactor, but I think it should be possible to adapt

Anthony Bullard (Jan 07 2025 at 21:19):

Anthony Bullard (Jan 07 2025 at 21:20):

Joshua Warner (Jan 07 2025 at 21:20):

Anthony Bullard (Jan 07 2025 at 21:20):

Joshua Warner (Jan 07 2025 at 21:20):

Anthony Bullard (Jan 07 2025 at 21:21):

Joshua Warner (Jan 07 2025 at 21:22):

Anthony Bullard (Jan 07 2025 at 21:22):

One thing that will make you happy is that I did split a fmt_pnc_apply from fmt_apply

Anthony Bullard (Jan 07 2025 at 21:23):

Sam Mohr (Jan 07 2025 at 21:23):

Joshua Warner (Jan 07 2025 at 21:23):

Anthony Bullard (Jan 07 2025 at 21:23):

Sam Mohr (Jan 07 2025 at 21:23):

I was running into issues formatting abilities that such a change would have solved

Anthony Bullard (Jan 07 2025 at 21:23):

Joshua Warner (Jan 07 2025 at 21:23):

Sam Mohr (Jan 07 2025 at 21:23):

Joshua Warner (Jan 07 2025 at 21:24):

Anthony Bullard (Jan 07 2025 at 21:24):

Anthony Bullard (Jan 07 2025 at 21:25):

Threading the last arg after and the final_comments together consistently made me so mad - solved problem with collection

Anthony Bullard (Jan 07 2025 at 23:23):

Joshua Warner (Jan 08 2025 at 03:07):

Nice; left a few review comments. Most (maybe all?) are things that can be fixed up in follow-up PRs.

Joshua Warner (Jan 08 2025 at 03:08):

I'm not too worried about this stuff going stale / not being actioned, so I'm generally fine either way.

Joshua Warner (Jan 08 2025 at 03:09):

There are a bunch of different changes we have out-standing right now and keeping the momentum feels important - so maybe bias towards action now?

Joshua Warner (Jan 08 2025 at 03:10):

Anton (Jan 08 2025 at 09:49):

Yeah, to give some context; it was failing the ubuntu_x86_64.yml workflow very often, but you can never be sure that nothing else failed so you need to check every time. Also, the ubuntu_x86_64.yml is a required workflow so basically all PRs needed to be force merged. I think the fuzzer should be moved to a new standalone workflow that is not required to pass. So it can inform us but does not require a force merge by an admin.

Luke Boswell (Jan 08 2025 at 09:52):

I saw that and thought it was a good idea.. but wasn't sure how to implement it.

Sam Mohr (Jan 08 2025 at 09:53):

Sam Mohr (Jan 08 2025 at 09:54):

There's an issue to allow CI actions to fail and not block merging, but GH doesn't want to implement it, apparently

Sam Mohr (Jan 08 2025 at 09:54):

So we can probably just do run_fuzzer || true in the shell command in the GH action definition

Anton (Jan 08 2025 at 09:54):

I'll take care of it, there's a way around it, you still get the red x but it's fine to merge

Stream: compiler development

Topic: Trying to resolve some fuzzer issues

Anthony Bullard (Jan 04 2025 at 22:49):

Anthony Bullard (Jan 04 2025 at 22:49):

Anthony Bullard (Jan 04 2025 at 22:49):

Anthony Bullard (Jan 05 2025 at 00:16):

Anthony Bullard (Jan 05 2025 at 00:17):

Anthony Bullard (Jan 05 2025 at 01:14):

Anthony Bullard (Jan 05 2025 at 01:16):

Anthony Bullard (Jan 05 2025 at 01:16):

Anthony Bullard (Jan 05 2025 at 16:26):

Anthony Bullard (Jan 05 2025 at 16:26):

Anthony Bullard (Jan 05 2025 at 16:43):

Anthony Bullard (Jan 05 2025 at 16:43):

Anthony Bullard (Jan 05 2025 at 16:44):

Ayaz Hafiz (Jan 05 2025 at 16:45):

Anthony Bullard (Jan 05 2025 at 16:46):

Ayaz Hafiz (Jan 05 2025 at 16:47):

Ayaz Hafiz (Jan 05 2025 at 16:47):

Anthony Bullard (Jan 05 2025 at 16:48):

Anthony Bullard (Jan 05 2025 at 16:48):

Anthony Bullard (Jan 05 2025 at 16:50):

Ayaz Hafiz (Jan 05 2025 at 16:50):

Anthony Bullard (Jan 05 2025 at 16:51):

Anthony Bullard (Jan 05 2025 at 16:52):

Anthony Bullard (Jan 05 2025 at 16:53):

Anthony Bullard (Jan 05 2025 at 16:53):

Anthony Bullard (Jan 05 2025 at 16:54):

Anthony Bullard (Jan 05 2025 at 17:23):

Anthony Bullard (Jan 05 2025 at 17:23):

Joshua Warner (Jan 05 2025 at 19:18):

Joshua Warner (Jan 05 2025 at 19:19):

Joshua Warner (Jan 05 2025 at 19:23):

Anthony Bullard (Jan 05 2025 at 19:24):

Anthony Bullard (Jan 05 2025 at 19:25):

Anthony Bullard (Jan 05 2025 at 19:25):

Joshua Warner (Jan 05 2025 at 19:25):

Anthony Bullard (Jan 05 2025 at 19:25):

Anthony Bullard (Jan 05 2025 at 19:25):

Joshua Warner (Jan 05 2025 at 19:25):

Anthony Bullard (Jan 05 2025 at 19:26):

Joshua Warner (Jan 05 2025 at 19:26):

Anthony Bullard (Jan 05 2025 at 19:26):

Anthony Bullard (Jan 05 2025 at 19:26):

Joshua Warner (Jan 05 2025 at 19:27):

Anthony Bullard (Jan 05 2025 at 19:27):

Joshua Warner (Jan 05 2025 at 19:27):

Anthony Bullard (Jan 05 2025 at 19:28):

Anthony Bullard (Jan 05 2025 at 19:31):

Joshua Warner (Jan 05 2025 at 19:32):

Anthony Bullard (Jan 05 2025 at 19:33):

Anthony Bullard (Jan 05 2025 at 19:35):

Joshua Warner (Jan 05 2025 at 19:35):

Joshua Warner (Jan 05 2025 at 19:36):

Joshua Warner (Jan 05 2025 at 19:36):

Anthony Bullard (Jan 05 2025 at 19:38):

Anthony Bullard (Jan 05 2025 at 23:18):

Joshua Warner (Jan 06 2025 at 00:35):

Anthony Bullard (Jan 07 2025 at 20:48):

Anthony Bullard (Jan 07 2025 at 20:48):

Sam Mohr (Jan 07 2025 at 20:50):

Sam Mohr (Jan 07 2025 at 20:50):

Anthony Bullard (Jan 07 2025 at 20:50):

Sam Mohr (Jan 07 2025 at 20:51):

Anthony Bullard (Jan 07 2025 at 20:51):

Anthony Bullard (Jan 07 2025 at 20:51):

Brendan Hansknecht (Jan 07 2025 at 20:55):

Brendan Hansknecht (Jan 07 2025 at 20:55):

Brendan Hansknecht (Jan 07 2025 at 20:57):

Sam Mohr (Jan 07 2025 at 20:58):

Sam Mohr (Jan 07 2025 at 20:58):

Anthony Bullard (Jan 07 2025 at 21:02):

Luke Boswell (Jan 07 2025 at 21:02):

Anthony Bullard (Jan 07 2025 at 21:03):

Luke Boswell (Jan 07 2025 at 21:03):

Luke Boswell (Jan 07 2025 at 21:03):

Luke Boswell (Jan 07 2025 at 21:04):

Sam Mohr (Jan 07 2025 at 21:05):

Joshua Warner (Jan 07 2025 at 21:05):

Sam Mohr (Jan 07 2025 at 21:05):