Updated Abilities Syntax · contributing

Over the last couple days I started poking around the compiler to see if I could figure out how to update the abilities syntax. I think I need a hint :upside_down:

Specifically, what part of the compiler identifies specific character sequences in the source file and translates them to AST nodes?

Agus Zubiaga (May 18 2023 at 13:29):

What you describe is the parse crate in general. I don't know the specifics about the abilities parser but I would start at the type_annotations module.

Agus Zubiaga (May 18 2023 at 13:30):

Agus Zubiaga (May 18 2023 at 13:40):

Agus Zubiaga (May 18 2023 at 13:41):

Bryce Miller (May 18 2023 at 13:47):

Ah ha! I did spend some time digging around in the parse crate, but not quite enough by the looks of it.

Bryce Miller (May 18 2023 at 13:49):

This is very helpful, I think I can change the behavior of the parser now. Once I have that working, I’ll update the various types s/Has/Implements/g Then I can update tests, comments, etc.

Bryce Miller (May 18 2023 at 13:51):

I realize we also want to require curly brackets for new ability definitions but I’m hoping I can make that a separate PR.

Bryce Miller (May 18 2023 at 13:53):

Agus Zubiaga (May 18 2023 at 13:54):

Agus Zubiaga (May 18 2023 at 13:57):

As you reading more of the parsing code you'll start to notice some of these helpers everywhere: and!, loc!,wordN, map!, etc.

These are defined in parser.rs. Looking at how these are implemented really helped me demystify them.

Agus Zubiaga (May 18 2023 at 14:12):

Oh, also! You can find where things are parsed by opening parse/src/ast.rs and using your editor's Go to References feature. Look at those under crates/compiler/parse/src and you'll find where the AST node is created:

Bryce Miller (May 18 2023 at 17:41):

Ok, looks like it’s time for me to bite the bullet and learn how macros are defined in Rust. This day was bound to arrive eventually.

Bryce Miller (May 18 2023 at 17:42):

I did use the go to references feature in my editor, but I’m pretty sure I was looking at references to the wrong symbols or that I just missed the key references I needed.

Bryce Miller (May 18 2023 at 17:43):

I also just realized there’s an entire README in the compiler crate that I missed. I guess being productive on a plane is harder than I thought lol.

Bryce Miller (May 18 2023 at 17:43):

Bryce Miller (May 21 2023 at 00:50):

Set k := Dict.Dict k {}
     has [
         Eq {
             isEq,
         },
     ]

Set k := Dict.Dict k {} | k implements Hash & Eq
     implements [
         Eq {
             isEq,
         },
     ]

Bryce Miller (May 21 2023 at 00:51):

Brendan Hansknecht (May 21 2023 at 00:55):

Assuming it doesn't break the formatter anymore, that is a valid change. Just is clearer specification of the type.

Brendan Hansknecht (May 21 2023 at 00:55):

Bryce Miller (May 21 2023 at 01:06):

Ayaz Hafiz (May 21 2023 at 01:57):

Yeah the constraint on k should not be required. Did the definition of Dict change in a similar way?

Bryce Miller (May 21 2023 at 22:31):

All that changed for the constraint on k in Dict is the keyword has changing to implements

Bryce Miller (May 21 2023 at 22:31):

Bryce Miller (May 21 2023 at 22:33):

Set k := Dict.Dict k {}
     implements [
         Eq {
             isEq,
         },
     ]

 ── TOO MANY TYPE ARGUMENTS ─────────────────────────────────────────── Set.roc ─

  The Dict opaque expects 2 type arguments, but it got 4 instead:

  31│>  Set k := Dict.Dict k {}
  32│>       implements [
  33│>           Eq {
  34│>               isEq,
  35│>           },
  36│>       ]

  Are there missing parentheses?

Bryce Miller (May 21 2023 at 22:35):

Set k := Dict.Dict k {} | k implements Hash & Eq
     implements [
         Eq {
             isEq,
         },
     ]

An internal compiler expectation was broken.
This is definitely a compiler bug.
Please file an issue here: https://github.com/roc-lang/roc/issues/new/choose
thread 'main' panicked at 'Formatting bug; formatted code isn't valid

I wrote the incorrect result to this file for debugging purposes:
crates/compiler/builtins/roc/Set.roc-format-failed

Parse error was: NotEndOfFile(@686)

', crates/cli/src/format.rs:79:13

Bryce Miller (May 21 2023 at 22:39):

#  crates/compiler/builtins/roc/Set.roc-format-failed

## Provides a [set](https://en.wikipedia.org/wiki/Set_(abstract_data_type))
## type which stores a collection of unique values, without any ordering
Set k := Dict.Dict k {} | k has Hash & Eq
     has [
         Eq {
             isEq,
         },
     ]

Bryce Miller (May 21 2023 at 22:40):

Are the builtins formatted during the build process? Looks like I failed to update the formatter.

Bryce Miller (May 21 2023 at 22:52):

after fixing the formatter, it seems to be working fine as long as I keep the constraint on k for Set

Bryce Miller (May 21 2023 at 22:57):

Bryce Miller (May 21 2023 at 22:59):

I'm also getting some weird failures on reporting tests. for example, cargo test --package roc_reporting --test test_reporting -- test_reporting::ability_first_demand_not_indented_enough gives me:

-old snapshot
+new results
────────────┬──────────────────────────────────────────────────────────────────────────────────
    0       │-── UNFINISHED ABILITY ── tmp/ability_first_demand_not_indented_enough/Test.roc ─␊
          0 │+── SYNTAX PROBLEM ──────────────────────────────────────── /code/proj/Main.roc ─␊

...

Bryce Miller (May 21 2023 at 23:00):

Bryce Miller (May 21 2023 at 23:42):

This seems to me to be a strong indicator that I either introduced or uncovered a bug when making changes to the parsing logic.

Bryce Miller (May 21 2023 at 23:50):

Ayaz Hafiz (May 22 2023 at 00:23):

I’ll take a look in a few minutes. I think larger sized PRs are fine for this kind of thing, namely an atomic refactor of the language. I think we should figure out why the type definitions changed though, that should not be necessary.

Bryce Miller (May 22 2023 at 00:42):

The type changes were things such as ast::Has -> ast::Implements, ast::HasClause -> ast::ImplementsClause, etc. My thought was that like leaving them as-is could be a source of confusion in the future.

Ayaz Hafiz (May 22 2023 at 00:45):

Bryce Miller (May 22 2023 at 01:56):

Bryce Miller (May 23 2023 at 16:14):

I noticed that has was not defined as a constant in roc_parse::keywords is there a specific reason for this, or is it ok if I add a new constant there for implements and replace the various string literals?

Bryce Miller (May 23 2023 at 16:16):

(I definitely did not spend hours troubleshooting a simple failure to find and replace. Not me. Why would you think that?)

Richard Feldman (May 23 2023 at 18:23):

we should probably rename that - it's just for reserved keywords in expressions specifically

Richard Feldman (May 23 2023 at 18:24):

really, what we should do is have a separate list of keywords that are reserved in expressions vs in types!

Kilian Vounckx (May 23 2023 at 19:54):

If we did that, would that mean you could use type keywords in expressions and vice versa? E.g. Could you have a type signature List if -> List if? Not saying anyone would do this, but would that change make it possible?

Richard Feldman (May 23 2023 at 20:02):

Richard Feldman (May 23 2023 at 20:03):

to be honest, that does make a decent argument for unifying the keywords list just to make the rules easier to understand :thinking:

Bryce Miller (May 23 2023 at 21:11):

Ohh, are the strings in the keywords array automatically allowed in expressions? I was wondering what that was for.

Bryce Miller (May 23 2023 at 21:11):

Bryce Miller (May 23 2023 at 21:18):

How would a separate list of keywords reserved in types allow keywords from the other list to be used in types?

Bryce Miller (May 27 2023 at 03:44):

Aside from names changing in the AST files for test_snapshots, would changes to regions also be expected?

Bryce Miller (May 27 2023 at 03:48):

I'd also like to confirm whether you want me to keep the changes I made to names of types, functions, values, etc. I could revert those changes so that this PR contains only functional changes. Then make another PR changing the names of things to reflect the new syntax.

Bryce Miller (May 27 2023 at 03:50):

The PR is already at +665 −617 and I haven't changed | to where or added the brackets for ability definitions yet.

Richard Feldman (May 27 2023 at 11:40):

Richard Feldman (May 27 2023 at 11:41):

I don't think that's necessary; I think it's fine if both changes are in the same PR :thumbs_up:

Richard Feldman (May 27 2023 at 11:47):

I'd expect regions to change if keywords change lengths, so yeah I'd assume there would be some region changes here!

Richard Feldman (May 27 2023 at 11:48):

if we only give errors for identifiers from one list vs the other depending on whether we're parsing a type vs an expression

Richard Feldman (May 27 2023 at 11:49):

but honestly based on this discussion I think we should unify those lists and just have one list of reserved keywords for both types and expressions :big_smile:

Bryce Miller (May 27 2023 at 13:52):

Bryce Miller (May 27 2023 at 13:56):

Ah ha, right. So it looks like the only changes to these ASTs are names and regions. Would it make sense to just overwrite the old AST files with the new output? And if so, is there a script somewhere to do that?

Bryce Miller (May 27 2023 at 14:00):

Speaking of ASTs, while I was looking at the code I was kinda wondering why most nodes included a reference to a Loc. It just occurred to me that that’s probably there to facilitate error messages. Is that correct?

Bryce Miller (May 29 2023 at 11:04):

I'm running into thread 'gen_abilities::encode_derive_to_encoder_for_opaque' has overflowed its stack
Ignoring that test results in a failure for the next test that uses derived abilities.

Bryce Miller (May 29 2023 at 11:51):

Bryce Miller (May 29 2023 at 11:57):

now gen_tags::issue_5162_recast_nested_nullable_unwrapped_layout is failing with a stack overflow, but we've made progress

Ayaz Hafiz (May 29 2023 at 12:59):

Bryce Miller (May 29 2023 at 13:38):

Bryce Miller (May 30 2023 at 11:29):

Ayaz Hafiz (May 30 2023 at 12:57):

okay then you will want to use the with_larger_stack helper to increase the test stack size in debug mode

Bryce Miller (Jun 02 2023 at 12:16):

Bryce Miller (Jun 10 2023 at 18:03):

I have completed s/has/implements/g and s/|/where/g. I'm now working on adding the braces around ability members in ability definitions.

## Definition of the [DecoderFormatting] ability
DecoderFormatting implements {
    u8 : Decoder U8 fmt where fmt implements DecoderFormatting,
    u16 : Decoder U16 fmt where fmt implements DecoderFormatting,
    u32 : Decoder U32 fmt where fmt implements DecoderFormatting,
}

Ability {
        header: TypeHeader<'a>,
        loc_implements: Loc<Implements<'a>>,
        members: &'a [AbilityMember<'a>],
},

Ability {
        header: TypeHeader<'a>,
        loc_implements: Loc<Implements<'a>>,
        ann: Loc<TypeAnnotation<'a>>,
},

Bryce Miller (Jun 10 2023 at 18:06):

I noticed that the implements keyword plays the same role as : or := in alias or opaque type definitions, but is the only type definition that maintains the location of that sequence in the AST.

Bryce Miller (Jun 10 2023 at 18:10):

An alternative I see to the above approach is to parse the braces in roc_parse::expr::finish_parsing_ability_def_help

Bryce Miller (Jun 10 2023 at 18:20):

I was planning to bring this up later, but I wonder if there is any merit to using something other than the implements keyword for ability definitions. In this context, it has a different meaning than it does in other contexts.

In the the following contexts, the implements keyword means "the preceding type implements the following ability":

# Type annotations:
foo : a -> Str where a implements Bar

# Implementing abilities for opaque types:
Foo := U64 implements [Eq]

In the context of an ability definition, the implements keyword means "The preceding name identifies an ability which can be implemented by providing the following functions"

Bryce Miller (Jun 10 2023 at 18:21):

One idea I had for disambiguating these use cases was to reserve a new keyword such as implemented-with to use instead of implements in ability definitions. I don't like reserving another keyword though, and that one is long and cumbersome.

Bryce Miller (Jun 10 2023 at 18:24):

It occurred to me today that perhaps another approach would be to use a symbol or set of symbols, like all other type and value definitions use:

# Value Def
nice = 69

# Type Def (alias)
Foo : { bar : U64 }

# Type Def (opaque)
Baz := U64

# Ability Def (???)
SomeAbility :@ {
    foo : a -> Str where a implements SomeAbility,
}

Bryce Miller (Jun 10 2023 at 18:27):

This seemed relevant to me because I think the first approach I suggested for implementing the braces requirement would make more sense if ability defs looked more like the other type annotations whose parsers live in type_annotation.rs

Bryce Miller (Jun 10 2023 at 18:29):

Anyway, the last few messages should probably be moved to #ideas > Abilities Syntax. I wasn't even planning to bring that up until after we had a chance to use the new syntax for a while.

Bryce Miller (Jun 10 2023 at 18:34):

When I add the implements keyword to roc_parse::keyword::KEYWORDS, the compiler chokes on ability definitions in the builtins.

Bryce Miller (Jun 12 2023 at 15:44):

Does anybody have any advice on which direction to go in regards to parsing the braces? I’m not liking my first idea because an ability definition really isn’t a type annotation. I know I will also need to provide error messages for missing braces.

Bryce Miller (Jun 12 2023 at 15:46):

I apologize for the parsing questions. I think I’d be much better off here if I had experience implementing a parser for a toy language. I suspect I’m getting a little tripped up simply because I haven’t worked with parsers before.

Richard Feldman (Jun 13 2023 at 11:32):

have you taken a look at the record! macro in the parser? there are various places which use it, and I think that's what I would use here!

Bryce Miller (Jun 13 2023 at 12:26):

Richard Feldman (Jun 13 2023 at 12:58):

Bryce Miller (Jun 19 2023 at 15:55):

I’ve noticed special treatment of indentation for ability definitions. For example, ability::demand_parser takes an ability::IndentLevel and returns a parser which produces a (u32, AbilityDemand) value.

Bryce Miller (Jun 19 2023 at 15:57):

Does adding braces around the list of demands change how we treat indentation here?

Bryce Miller (Jun 19 2023 at 15:59):

It looks like I’m going to have to make more than a couple tweaks to get the types to work out for the record! macro, because the parsing logic for ability definitions seems different from most of the rest of the parsing logic.

Richard Feldman (Jun 19 2023 at 16:48):

yeah I don't think we should have to care about indentation anymore after introducing the record literal :thumbs_up:

Bryce Miller (Aug 11 2023 at 21:02):

I see @Richard Feldman and @Anton have merged my branch into main. I feel I owe you all a status update, and regret that I haven't done so before now.

As you can see, I completed the | -> where and has -> implements changes. I have not yet completed the proposed addition of curly braces in ability definitions. This is because I was getting tripped up a lot in my attempts to implement the change, and my personal life has also been hairy.

Realistically, I don't think I'll be able to implement the last part of the proposal. I apologize for not recognizing this and letting you know sooner. I kept thinking that I might be able to try again and get it, so that we would have only one breaking syntax change instead of two. But alas, weeks have passed and nothing of the sort has happened.

I could possibly manage it with a pairing session, but anyone who would pair with me on this could probably get it done faster alone. I don't want to take anyone away from the other important things they are doing to make Roc awesome. Ergo, I think I'm going to need to pass the torch.

Richard Feldman (Aug 11 2023 at 21:18):

hey no worries! I really appreciate all the work you did on it - you got the most valuable part done for sure!

Stream: contributing

Topic: Updated Abilities Syntax

Bryce Miller (May 18 2023 at 12:30):

Agus Zubiaga (May 18 2023 at 13:29):

Agus Zubiaga (May 18 2023 at 13:30):

Agus Zubiaga (May 18 2023 at 13:40):

Agus Zubiaga (May 18 2023 at 13:41):

Bryce Miller (May 18 2023 at 13:47):

Bryce Miller (May 18 2023 at 13:49):

Bryce Miller (May 18 2023 at 13:51):

Bryce Miller (May 18 2023 at 13:53):

Agus Zubiaga (May 18 2023 at 13:54):

Agus Zubiaga (May 18 2023 at 13:57):

Agus Zubiaga (May 18 2023 at 14:12):

Bryce Miller (May 18 2023 at 17:41):

Bryce Miller (May 18 2023 at 17:42):

Bryce Miller (May 18 2023 at 17:43):

Bryce Miller (May 18 2023 at 17:43):

Bryce Miller (May 21 2023 at 00:50):

Bryce Miller (May 21 2023 at 00:51):

Brendan Hansknecht (May 21 2023 at 00:55):

Brendan Hansknecht (May 21 2023 at 00:55):

Bryce Miller (May 21 2023 at 01:06):

Ayaz Hafiz (May 21 2023 at 01:57):

Bryce Miller (May 21 2023 at 22:31):

Bryce Miller (May 21 2023 at 22:31):

Bryce Miller (May 21 2023 at 22:33):

Bryce Miller (May 21 2023 at 22:35):

Bryce Miller (May 21 2023 at 22:39):

Bryce Miller (May 21 2023 at 22:40):

Bryce Miller (May 21 2023 at 22:52):

Bryce Miller (May 21 2023 at 22:57):

Bryce Miller (May 21 2023 at 22:59):

Bryce Miller (May 21 2023 at 23:00):

Bryce Miller (May 21 2023 at 23:42):

Bryce Miller (May 21 2023 at 23:50):

Ayaz Hafiz (May 22 2023 at 00:23):

Bryce Miller (May 22 2023 at 00:42):

Ayaz Hafiz (May 22 2023 at 00:45):

Bryce Miller (May 22 2023 at 01:56):

Bryce Miller (May 23 2023 at 16:14):

Bryce Miller (May 23 2023 at 16:16):

Richard Feldman (May 23 2023 at 18:23):

Richard Feldman (May 23 2023 at 18:24):

Richard Feldman (May 23 2023 at 18:24):

Kilian Vounckx (May 23 2023 at 19:54):

Richard Feldman (May 23 2023 at 20:02):

Richard Feldman (May 23 2023 at 20:03):

Bryce Miller (May 23 2023 at 21:11):

Bryce Miller (May 23 2023 at 21:11):

Bryce Miller (May 23 2023 at 21:18):

Bryce Miller (May 27 2023 at 03:44):

Bryce Miller (May 27 2023 at 03:48):

Bryce Miller (May 27 2023 at 03:50):

Richard Feldman (May 27 2023 at 11:40):

Richard Feldman (May 27 2023 at 11:41):

Richard Feldman (May 27 2023 at 11:47):

Richard Feldman (May 27 2023 at 11:48):

Richard Feldman (May 27 2023 at 11:49):

Bryce Miller (May 27 2023 at 13:52):

Bryce Miller (May 27 2023 at 13:56):

Bryce Miller (May 27 2023 at 14:00):

Bryce Miller (May 29 2023 at 11:04):

Bryce Miller (May 29 2023 at 11:51):

Bryce Miller (May 29 2023 at 11:57):

Ayaz Hafiz (May 29 2023 at 12:59):

Bryce Miller (May 29 2023 at 13:38):

Bryce Miller (May 30 2023 at 11:29):

Ayaz Hafiz (May 30 2023 at 12:57):

Bryce Miller (Jun 02 2023 at 12:16):

Bryce Miller (Jun 10 2023 at 18:03):

Bryce Miller (Jun 10 2023 at 18:06):

Bryce Miller (Jun 10 2023 at 18:10):

Bryce Miller (Jun 10 2023 at 18:20):

Bryce Miller (Jun 10 2023 at 18:21):

Bryce Miller (Jun 10 2023 at 18:24):

Bryce Miller (Jun 10 2023 at 18:27):

Bryce Miller (Jun 10 2023 at 18:29):

Bryce Miller (Jun 10 2023 at 18:34):

Bryce Miller (Jun 12 2023 at 15:44):