braces syntax · ideas · Zulip Chat Archive

I found a design I really like that removes significant indentation from Roc's syntax in a way that feels mostly like a visual tweak to me instead of something really invasive

Richard Feldman (Feb 11 2025 at 20:47):

Richard Feldman (Feb 11 2025 at 20:48):

Richard Feldman (Feb 11 2025 at 20:51):

Richard Feldman (Feb 11 2025 at 20:53):

this design is 100% indentation-agnostic, which means copy/paste from anywhere can Just Work, and also large language models should work fine with them (at work, I've seen them struggle to get indentation right when suggesting changes to an existing code base)

Richard Feldman (Feb 11 2025 at 20:53):

compared to the do and end designs we'd discussed in #ideas > insignificant whitespace, the { and } delimiters are a lot less noisy. As a bonus, they work for if and when and for and while, and they are more mainstream.

Richard Feldman (Feb 11 2025 at 20:54):

I also like that the { and } are only a character apiece, and can very often be omitted without sacrificing any clarity or losing any of the above benefits. It makes the change feel more like a tweak to me than an overhaul.

Richard Feldman (Feb 11 2025 at 20:55):

I know there was a ton of iteration on the other thread, but this is the design I've ended up liking the most. I'm curious what others think of it!

Sam Mohr (Feb 11 2025 at 20:57):

This would avoid commas in weird places, like on the last line of a function in a record:

logger = {
    init: |config|
        env_level =
            config.env.get("log_level") ?? "warn"
        set_up_logs(path, env_level),
}

logger = {
    init: |config| {
        env_level =
            config.env.get("log_level") ?? "warn"
        set_up_logs(path, env_level)
    },
}

Joshua Warner (Feb 11 2025 at 20:59):

Coming from using 90+% {}-delimited languages throughout my career (C/C++, Java, JavaScript, TypeScript, Rust), this definitely makes me feel more comfortable

Sam Mohr (Feb 11 2025 at 20:59):

Anyway, I think braces aren't as nice looking as today's brace-less syntax, but they are more readable because I can scan scope easily and more consistently

Joshua Warner (Feb 11 2025 at 20:59):

Sam Mohr (Feb 11 2025 at 20:59):

Joshua Warner (Feb 11 2025 at 20:59):

I think my primary concern here would be alienating other members of the community

Richard Feldman (Feb 11 2025 at 21:01):

Brendan Hansknecht (Feb 11 2025 at 21:03):

I'm overall for it. I think braces are standard and consistent while being pretty visually minimal (especially if you can avoid them for simple cases). I feel like removal of WSA was a much bigger change and this is just minor and not a big deal either way.

Brendan Hansknecht (Feb 11 2025 at 21:04):

I personally am 100% in on auto formatting that everyone should use. I think braces make it easier for that to be consistent.

Luke Boswell (Feb 11 2025 at 21:18):

Luke Boswell (Feb 11 2025 at 21:19):

Lambda expressions having just one expression for their body seems new... but it feels familiar in practice.

Dawid Danieluk (Feb 11 2025 at 21:27):

I really like changes in the diff.
when -> match - new people no longer need to think about the difference between when and if, and anyone who's heard of pattern matching will immediately recognize what's going on
I also really like braces to limit scopes. It helps both - visually but also when editing.
There's something that I just thought of, sometimes I want to delete whole function body and start writing it again from scratch, using braces instead of whitespace allows me to do that using modal editors (mi{d in helix, di{ in vim).
With all the latest changes and braces Roc is more similar to mainstream languages (TS/Rust) in a good way as if it was taking good ideas from all of them. I know that usually comparing something to TS is used as an insult but I promise that I mean no disrespect here xD

Richard Feldman (Feb 11 2025 at 21:31):

Jared Ramirez (Feb 11 2025 at 21:38):

One possible use case for a 1-field record is when you expect the return type of a function to evolve. Say you have a function that initially just needs to return some data { x: Str }, but you know that soon you're going to expand the function to also return { y }. You can do this without single-field records by first having fun : Str -> Str, then updating it to fun : Str -> { x: Str, y: Str }, but then you have to update all the call sites because the return type is different (it was Str but now is a record). But if it was a record from the start, it's a non-breaking change to add y to the return type.

Jared Ramirez (Feb 11 2025 at 21:38):

Dawid Danieluk (Feb 11 2025 at 21:38):

Personally I'd use the phrase "TS syntax done right". Less noisy (no parens around if conditions, no semicolons when they provide no benefits, match instead of if/else chains or switch statements requiring 'break'), more feature rich and information dense.
Definitely "good and familiar", even "cozy" :-)

Joshua Warner (Feb 11 2025 at 21:43):

Do we still want to do some amount of indent-sensitivity?
How would you expect we distinguish these, if at all?

a = |x| {
  x!()-x!()
}
b = |x| {
  x!() -x!()
}
c = |x| {
  x!()
  -x!()
}
d = |x| {
  x!()
    -x!()
}
e = |x| {
  x!()
    - x!() # note the space!
}

Which of those are one statement with a subtract op vs two statements one of which is negated?

Joshua Warner (Feb 11 2025 at 21:48):

(I specifically used - because that's the only op I know of that could be either unary or binary; I don't _think_ there are other ambiguous cases right now, but I also don't know if I could completely rule it out)

Joshua Warner (Feb 11 2025 at 21:50):

Richard Feldman (Feb 11 2025 at 22:02):

Richard Feldman (Feb 11 2025 at 22:03):

Richard Feldman (Feb 11 2025 at 22:05):

  x!()-x!()

  x!() -x!()

  x!()
  -x!()

  x!()
    -x!()

Richard Feldman (Feb 11 2025 at 22:05):

Richard Feldman (Feb 11 2025 at 22:11):

Sam Mohr (Feb 11 2025 at 22:21):

debugged_negative_epoch = || {
    Stdout.line!("Getting the negative epoch")
    -get_epoch!()
}

Sam Mohr (Feb 11 2025 at 22:22):

Richard Feldman (Feb 11 2025 at 22:35):

Richard Feldman (Feb 11 2025 at 22:36):

also in that case it's prob enough to just see that the dash is touching on one side only

Anthony Bullard (Feb 11 2025 at 22:37):

Anthony Bullard (Feb 11 2025 at 22:38):

Luke Boswell (Feb 11 2025 at 22:38):

Anthony Bullard (Feb 11 2025 at 22:40):

And also Richard, I think that actually |x| { x } could be treated unambiguously as a single-field record, as a block is expecting more than a single expression. Don't know what's more surprising:

|x| {
    x
}

Joshua Warner (Feb 11 2025 at 22:45):

debugged_negative_epoch = || {
    Stdout.line!("Getting the negative epoch")
    - get_epoch!()
}

That would compile and be treated as Stdout.line!("Getting the negative epoch") - get_epoch!() (one expression)

i.e. the user didn't realize there needs to be no space between the negative and the value - how do we make sure that's not a super surprising experience?

Joshua Warner (Feb 11 2025 at 22:46):

This isn't a _new_ problem, since exactly the same thing can happen currently - but I imagine folks coming from '{}' languages may not realize the spaces around a unary - are important.

Joshua Warner (Feb 11 2025 at 22:49):

I'd be tempted to say something like: you _should_ always indent binary operators that continue from the previous line, and we warn if you don't.

Anthony Bullard (Feb 11 2025 at 22:52):

I hate to say this, but I didn't realize that the unary operator DID NOT have to touch in other languages, as I've never NOT done that, or inadvertently done that in a way that was surprising in 27 years of coding

Joshua Warner (Feb 11 2025 at 22:52):

Joshua Warner (Feb 11 2025 at 22:54):

:chili_pepper: Maybe you did do that at one point but never noticed because the compiler didn't yell at you and in that language it meant the same thing anyway.

Joshua Warner (Feb 11 2025 at 22:54):

Anthony Bullard (Feb 11 2025 at 22:54):

Anthony Bullard (Feb 11 2025 at 22:55):

Richard Feldman (Feb 11 2025 at 22:56):

yeah I think it's 100% fine to assume people only want unary op if it's touching
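To make the "only unary if it's touching" idea concrete, here is a minimal Python sketch of how a lexer might classify a dash. It is an illustration under assumptions, not the Roc parser: the function name classify_minus is hypothetical, and the rule it encodes (unary only when there is whitespace on the left and the operand touches on the right) is one reading of the messages above.

```python
def classify_minus(src: str, i: int) -> str:
    """Classify the '-' at index i as 'unary' or 'binary'.

    Assumed rule (hypothetical, for illustration): a dash is unary only when
    it has whitespace (or the start of input) on its left and touches its
    operand on the right. Every other spacing is treated as binary.
    """
    if src[i] != "-":
        raise ValueError("index does not point at a '-'")
    space_before = i == 0 or src[i - 1].isspace()
    space_after = i + 1 >= len(src) or src[i + 1].isspace()
    if space_before and not space_after:
        return "unary"
    return "binary"
```

Under this rule, x!()-x!() and x!() - x!() are both subtraction, while x!() -x!() and a -x!() on its own line are a negation that starts a new statement. The sketch only looks at whitespace, so cases like a dash directly after an opening brace would need extra handling.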

Joshua Warner (Feb 11 2025 at 22:56):

I think we probably ought to have the formatter indent that line to make things clear tho

Richard Feldman (Feb 11 2025 at 22:57):

Anthony Bullard (Feb 11 2025 at 22:57):

Anthony Bullard (Feb 11 2025 at 23:00):

I think in JS I may have been saved by ASI (automatic semicolon insertion). It's incredibly hard for me to do what you did above in JS

Richard Feldman (Feb 11 2025 at 23:01):

I actually like the idea of having the parser detect semicolons and then we have the formatter discard them for you

Richard Feldman (Feb 11 2025 at 23:01):

Anthony Bullard (Feb 11 2025 at 23:01):

Richard Feldman (Feb 11 2025 at 23:02):

Anthony Bullard (Feb 11 2025 at 23:02):

Yep, have it (collapsed with a following newline if it exists) into a newline token
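As a rough illustration of collapsing a semicolon (plus a following newline, if any) into a single newline, here is a small Python sketch. The function name normalize_separators is hypothetical, and doing this as a text rewrite rather than inside a tokenizer is a simplification made only for the example.

```python
import re


def normalize_separators(src: str) -> str:
    """Replace each ';' (and any spaces/tabs and one newline after it)
    with a single newline.

    Simplification: this works on raw text, so a ';' inside a string
    literal or comment would be rewritten too. A real lexer would do this
    while producing tokens instead.
    """
    return re.sub(r";[ \t]*\n?", "\n", src)
```

With this, a = 1; b = 2 and the same code with a semicolon at the end of the first line both normalize to the same two-line form, which is what lets a formatter drop the semicolons afterwards.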

Anthony Bullard (Feb 11 2025 at 23:02):

Anthony Bullard (Feb 11 2025 at 23:03):

Anthony Bullard (Feb 11 2025 at 23:04):

Richard Feldman (Feb 11 2025 at 23:12):

Anthony Bullard (Feb 11 2025 at 23:13):

Richard Feldman (Feb 11 2025 at 23:13):

Anthony Bullard (Feb 11 2025 at 23:13):

Richard Feldman (Feb 11 2025 at 23:13):

Anthony Bullard (Feb 11 2025 at 23:14):

Richard Feldman (Feb 11 2025 at 23:14):

like maybe something related to string interpolation (but probably not that one)

Richard Feldman (Feb 11 2025 at 23:14):

Anthony Bullard (Feb 11 2025 at 23:14):

Joshua Warner (Feb 11 2025 at 23:17):

Technically multiline strings need a non-context-free grammar in order to understand the semantics correctly, but not for merely building a valid-enough syntax tree

Richard Feldman (Feb 11 2025 at 23:18):

what if they were just "line begins with """ and ends with newline, and indentation doesn't matter"?

Richard Feldman (Feb 11 2025 at 23:18):

and then as many consecutive lines of those as you have, they all go together in one string literal

Joshua Warner (Feb 11 2025 at 23:19):

Richard Feldman (Feb 11 2025 at 23:19):

my_str =
    """foo
  """bar
       """baz

Richard Feldman (Feb 11 2025 at 23:19):

Anthony Bullard (Feb 11 2025 at 23:19):

Notification Bot (Feb 11 2025 at 23:37):

Elias Mulhall (Feb 12 2025 at 01:43):

Richard Feldman (Feb 12 2025 at 01:43):

Isaac Van Doren (Feb 12 2025 at 03:07):

This looks really nice. My only gripe is that now there are multiple ways to write lambdas. In languages with optional braces around lambdas like this, I always want my lambda to be without braces if possible and then end up a tiny bit frustrated when I need to add braces later to move to multiple lines.

I will happily make that sacrifice for the familiarity boost and getting rid of whitespace significance though :smiley:

Sam Mohr (Feb 12 2025 at 03:19):

I know this isn't quite what you're getting at Isaac, but it makes me wonder, maybe we can make braces get formatted away if there's only a single expression inside to force a single way of writing these

Sam Mohr (Feb 12 2025 at 03:19):

Isaac Van Doren (Feb 12 2025 at 03:20):

I do like the consistency, but that would be annoying if you know you need to write a multiline lambda, but you've just written one line so far, and then it gets formatted away

Sam Mohr (Feb 12 2025 at 03:21):

Sam Mohr (Feb 12 2025 at 03:22):

If you control when the formatter runs, it works, but if you have it run on auto save it'll get annoying

Richard Feldman (Feb 12 2025 at 03:26):

Sam Mohr (Feb 12 2025 at 03:26):

Joshua Warner (Feb 12 2025 at 03:27):

That said, I wouldn't be sad if we stripped away braces for expressions below some complexity level

Joshua Warner (Feb 12 2025 at 03:28):

e.g. constants and variables are fine to have outside of braces. Everything else probably deserves braces. (arguable!)

Richard Feldman (Feb 12 2025 at 03:28):

Joshua Warner (Feb 12 2025 at 03:29):

Actually constants could be a bit confusing with if, e.g. if foo 1 else 1 has the condition and then branch jumbled together. Technically not ambiguous in that case, but confusing.

Richard Feldman (Feb 12 2025 at 03:29):

we can see if there's demand for formatter intervention (e.g. because people are arguing over style preferences or something and want an authoritative resolution so they can stop arguing), but it doesn't seem obviously necessary, and might be annoying

Sam Mohr (Feb 12 2025 at 03:31):

Ray Myers (Feb 12 2025 at 06:35):

Loving this. It's exactly what I would do, and I know that because I recently went through the exercise of making a simplified Roc-ish syntax without indentation for EYG (mentioned in #off topic). Also liking when going to match.

My biggest worry about the end keyword was needing to keep track of which constructs need them. Whereas it's easy to understand that for every { there is a }.

"indentation as nesting" is nice for reading (though IMO not a clear win overall) and maybe someday that becomes more common as an editor-view so we can see look at shorter code without messing up copy-paste

Norbert Hajagos (Feb 12 2025 at 08:09):

Traditionally, you need some kind of delimiter after the condition within an if, be that ) (like in C, javascript), or { like in rust. That makes parsing easy. I can't think of an example that would cause a problem in current Roc though.
if ident -3 else 3 comes to mind, but that wouldn't be a problem in Roc because of how the unary operator works. I still think we should stick with the braces for ifs, just so that people don't have to think about it. It's more consistent, an extra } at the EoL isn't that bad.

Norbert Hajagos (Feb 12 2025 at 08:34):

Also, this match syntax would allow piping into match, if that's something we want later.

inverted_directions = directions.map(.match {
    Left => Right
    Right => Left
})

Sky Rose (Feb 12 2025 at 14:52):

Overall, seems good.
What if the formatter always added braces if the body was on a different line, and always removed braces from same-line functions and ifs?

Niclas Ahden (Feb 12 2025 at 15:06):

I'd prefer the status quo (to no surprise). However, I think this is a better solution than do/end. Even though I would much prefer to not have to type {}, and have the LOC growth, I'm used to it from Rust. I wonder what Roc would look like if the compiler was written in Haskell? Most of the changes seem Rust-inspired (?, ||, {}, discussion about parens in types which is like <>, semicolon sugar for {} = ). I really do think that these suggestions and our tendency to agree with them stem from everyone's daily Rust/Zig usage.

Niclas Ahden (Feb 12 2025 at 15:07):

There's probably something to the tooling ideas, like Sky's above. Perhaps that's a best of both worlds? I don't have to type all those braces, they just appear.

Niclas Ahden (Feb 12 2025 at 15:09):

Overall though, I think this may very well be the best decision for Roc. Everyone seems on board, it is familiar to a lot of developers, and it solves some issues (which to me are not huge, like copy/paste, but issues nonetheless). It's uncomfortable for me to be this contrarian here, as I generally just want the project to succeed and move forward. That's more important to me than exactly what syntax it'll have.

Niclas Ahden (Feb 12 2025 at 15:17):

My attraction to Roc is: "ML syntax, error-handling from heaven, can be used for anything, and it's fast (iteration + runtime)". That really feels like "a language for life" to me. This would kill the first point and of course that stings a bit. The others are still true though, and I get to start a new project! www.arewerustyet.com :joy:

Richard Feldman (Feb 12 2025 at 15:30):

I actually think it's more that we really explored the full range of options in that other thread - we started with the most Haskellish syntax, talked about do ... end from Ruby, talked about braces...really the only widely-used option we didn't seriously discuss is S-Expressions, and I don't think there was a need to discuss that one

Richard Feldman (Feb 12 2025 at 15:32):

to me, the main advantage of { compared to do .. end is that it's less visually noisy, and the main advantage compared to significant indentation is that it doesn't bring the drawbacks of significant indentation that motivated the other thread

Richard Feldman (Feb 12 2025 at 15:32):

I think ( instead of { would have all those same characteristics, but of course { is way more mainstream of a choice to use than ( and they're both equally concise, so { makes more sense to me as a choice because its weirdness budget cost is dramatically lower

Richard Feldman (Feb 12 2025 at 15:33):

so overall, I think it's more of a "this had the best tradeoffs" than "this looks Rusty/Ziggy" - although it's fair to observe that they do similar things! :big_smile:

Richard Feldman (Feb 12 2025 at 15:36):

btw I can't emphasize enough how much I appreciate your being up-front about your preferences but being on board with this even though it's not your first choice...if there's one thing I've learned from all our syntax discussions over the years, it's that every decision will always have some amount of support and some amount of opposition, and full consensus is never going to happen :sweat_smile:

Richard Feldman (Feb 12 2025 at 15:36):

the amount of consensus in this thread is definitely the most we've had on the subject, and I really appreciate your going with it! :heart:

jan kili (Feb 12 2025 at 20:53):

Elias Mulhall (Feb 13 2025 at 05:40):

Brendan Hansknecht (Feb 13 2025 at 06:01):

Agus Zubiaga (Feb 14 2025 at 02:52):

I haven’t kept up with Zulip, but I just wanted to mention I’d really like Roc to get braces. I think they just make code easier to navigate (with things like % in vim) and easier to edit in the era of formatters.

Richard Feldman (Feb 15 2025 at 13:56):

I think we can do a decent amount of "parser doesn't require braces and formatter adds them" - e.g. parser accepts if a b else c and the formatter changes it to if a { b } else { c }

Anthony Bullard (Feb 15 2025 at 16:30):

So at the end of the day - as I'm implementing this in the Parser this weekend - the upshot is that braces delimit blocks and whitespace (read: indentation) significance goes away.

Anthony Bullard (Feb 15 2025 at 16:31):

Joshua Warner (Feb 15 2025 at 16:32):

Anthony Bullard (Feb 15 2025 at 16:32):

So any def in a lambda body, or any "statement" like a null def, has to be in a block

Anthony Bullard (Feb 15 2025 at 16:32):

Anthony Bullard (Feb 15 2025 at 16:34):

Anthony Bullard (Feb 15 2025 at 16:35):

Joshua Warner (Feb 15 2025 at 16:36):

Anthony Bullard (Feb 15 2025 at 16:36):

Richard Feldman (Feb 15 2025 at 16:36):

Richard Feldman (Feb 15 2025 at 16:37):

I'm ok if so, I just thought the parser could treat them as equivalent with no problem

Anthony Bullard (Feb 15 2025 at 16:38):

|a| { b = 1 c = 2 d = 4 a + b + c + d }

Richard Feldman (Feb 15 2025 at 16:38):

and more significantly, if I'm right that they can be considered equivalent, I think that makes the grammar both simpler to implement and also simpler to understand

Richard Feldman (Feb 15 2025 at 16:38):

Anthony Bullard (Feb 15 2025 at 16:38):

Joshua Warner (Feb 15 2025 at 16:38):

There are some key places where we absolutely have to still pay attention to new lines (unless we make further syntax changes)

Anthony Bullard (Feb 15 2025 at 16:38):

Joshua Warner (Feb 15 2025 at 16:39):

Richard Feldman (Feb 15 2025 at 16:39):

Joshua Warner (Feb 15 2025 at 16:40):

Richard Feldman (Feb 15 2025 at 16:41):

Joshua Warner (Feb 15 2025 at 16:41):

Richard Feldman (Feb 15 2025 at 16:41):

Joshua Warner (Feb 15 2025 at 16:42):

There are a couple of interesting places where, with indent insensitive parsing, we need to disallow line breaks at that point in the expression.

For example, you can't put a line break after the function and before the parentheses, and you can't put a line break between the ? operator and its right operand.

# If we allowed this:
foo
     (1, 2) # user intends these to be args of a function

# ... then we'd have trouble with this:
y = 1 + x
(1, 2) # returning a tuple

# If we allowed this:
text = File.readUtf8!(path) ?
                    ErrorReadingConfig # user intends this to be a binary '?'

# ... then we'd have trouble with this:
text = File.readUtf8!(path)? # Unary '?' is intended
MyTag value = give_me_a_tagged_return_value(text)

Richard Feldman (Feb 15 2025 at 16:51):

ah! So in these cases, I think it's more about whether any whitespace at all is allowed than newlines vs spaces. Specifically, I think:

Joshua Warner (Feb 15 2025 at 17:02):

Joshua Warner (Feb 15 2025 at 17:03):

I still think we should warn if you don’t have a new line between statements in a block

Richard Feldman (Feb 15 2025 at 17:05):

Richard Feldman (Feb 15 2025 at 17:06):

but I do like the idea of all whitespace being interchangeable if we can get away with it

Richard Feldman (Feb 15 2025 at 17:07):

I think it makes it a bit easier to teach if you never have to think about what particular type of whitespace you're dealing with, but also I think it helps simplify the mental model

Richard Feldman (Feb 15 2025 at 17:10):

|a, b| { c = a + b d = c + 1 d * 2 }

...even if you would never write it that way, I think it helps in understanding where the boundaries are

Richard Feldman (Feb 15 2025 at 17:11):

because you only get to have whitespace-separated expressions and statements inside braces

Joshua Warner (Feb 15 2025 at 17:13):

We can easily add an assert in tests that if you replace all new lines with spaces, that it still parses to the same thing.
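A sketch of what such a test helper could look like, written in Python for illustration. The names assert_newline_insensitive and toy_parse are hypothetical, and toy_parse is only a stand-in that splits on whitespace; the real check would call the actual parser and compare syntax trees.

```python
def assert_newline_insensitive(parse, source: str) -> None:
    """Check that replacing every newline with a space does not change
    the parse result.

    Caveat: line comments and multiline string literals depend on line
    ends, so a real harness would need to skip or rewrite those first.
    """
    flattened = source.replace("\n", " ")
    if parse(source) != parse(flattened):
        raise AssertionError("replacing newlines with spaces changed the parse")


def toy_parse(source: str) -> list[str]:
    # Stand-in parser for the example: just whitespace-separated tokens.
    return source.split()
```

Running the helper with toy_parse on a multi-line lambda body passes, while a deliberately line-sensitive stand-in such as str.splitlines fails, which is the kind of regression the assertion is meant to catch.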

Richard Feldman (Feb 15 2025 at 17:13):

Richard Feldman (Feb 15 2025 at 17:14):

because it can be taught that blocks ({ ... }) can be used anywhere an expression is accepted

Richard Feldman (Feb 15 2025 at 17:15):

and then we still have the property that else if is not special - it's just if a b else if c else d

Richard Feldman (Feb 15 2025 at 17:15):

and we choose to require braces as a matter of formatting style to make things read better

jan kili (Feb 15 2025 at 17:18):

Does this unlock anything useful via alternative formattings? Not necessarily by third parties or for source code at rest, but perhaps... shrinking hints in CLI output? shrinking types in tooltips? Those sorts of sneaky spots

Richard Feldman (Feb 15 2025 at 17:24):

Joshua Warner (Feb 15 2025 at 17:38):

If you have short variables or literals, if foo 1 else 2 has less visual noise than if foo { 1 } else { 2 }

Joshua Warner (Feb 15 2025 at 17:39):

jan kili (Feb 15 2025 at 19:19):

I thought about proposing preserving the then for that reason, but if we're really always going to multiline it then it's not worth it.

Anthony Bullard (Feb 16 2025 at 12:10):

test {
    try moduleFmtsSame(
        \\app [main!] { pf: platform "../basic-cli/platform.roc" }
        \\
        \\import pf.Stdout
        \\
        \\main! = Stdout.line!("Hello, world!")
    );

    try moduleFmtsSame(
        \\app [main!] { pf: platform "../basic-cli/platform.roc" }
        \\
        \\import pf.Stdout
        \\
        \\main! = {
        \\    world = "World"
        \\    Stdout.line!("Hello, world!")
        \\}
    );
    try moduleFmtsTo(
        \\app [main!] { pf: platform "../basic-cli/platform.roc" }
        \\
        \\import pf.Stdout
        \\
        \\main! = {world = "World" Stdout.line!("Hello, world!")}
    ,
        \\app [main!] { pf: platform "../basic-cli/platform.roc" }
        \\
        \\import pf.Stdout
        \\
        \\main! = {
        \\    world = "World"
        \\    Stdout.line!("Hello, world!")
        \\}
    );
}

❯ zig build test --summary all
Build Summary: 5/5 steps succeeded; 9/9 tests passed
test success
└─ run test 9 passed 256ms MaxRSS:2M
   └─ zig test Debug native success 1s MaxRSS:368M
      └─ run gencat (gencat.bin.z) cached
         └─ zig build-exe gencat Debug native cached 27ms MaxRSS:32M

Richard Feldman (Feb 16 2025 at 12:43):

Anthony Bullard (Feb 18 2025 at 15:09):

Anthony Bullard (Feb 18 2025 at 15:10):

One question for you @Richard Feldman because I haven't seen it in the thread (I also have like 1000 unread Zulip messages right now). Are we allowing arbitrary block expressions?

Anthony Bullard (Feb 18 2025 at 15:10):

I ask because it makes parsing records slightly more annoying, but I totally understand we may want it

Anthony Bullard (Feb 18 2025 at 15:12):

foo = {
    some_fn!()?
    some_other_fn!()?
    some_expr
}

foo = {
    bar: {
        some_fn!()?
        some_other_fn!()?
        some_expr
    },
}

Richard Feldman (Feb 18 2025 at 15:42):

Anthony Bullard (Feb 18 2025 at 15:50):

Joshua Warner (Feb 18 2025 at 17:31):

Hmm, I am a bit concerned that this means we have to look ahead an unbounded number of tokens to determine if we should be parsing a type decl or a record field

Joshua Warner (Feb 18 2025 at 17:32):

Richard Feldman (Feb 18 2025 at 17:36):

Richard Feldman (Feb 18 2025 at 17:37):

Anthony Bullard (Feb 18 2025 at 17:40):

Anthony Bullard (Feb 18 2025 at 17:41):

That's a fair amount of lookahead - especially when you add in potential newlines

Anthony Bullard (Feb 18 2025 at 17:41):

I think I'm only slightly worried due to the importance of records in the language

Anthony Bullard (Feb 18 2025 at 17:43):

I know what to do here, just calling it out. There's similar problems with tuples and parenthesized expressions (luckily tuples are typically pretty rare)

Joshua Warner (Feb 18 2025 at 17:48):

Joshua Warner (Feb 18 2025 at 17:56):

Here is a record and a block, where we won't be able to determine whether we should be parsing a type or an expr until we see the thing after qux::

foo = {
    bar: {
        baz: {
            qux: 42,
        }
    }
}

foo = {
    bar: {
        baz: {
            qux: Str -> Str,
        }
    }
    bar = {
        baz: {
            qux: |s| s,
        }
    }
    bar
}

I can construct other such examples that further delay that distinction arbitrarily.

Joshua Warner (Feb 18 2025 at 17:57):

Just reading that, it's hard to tell what's going on, which is IMO a readability problem

Richard Feldman (Feb 18 2025 at 17:57):

Richard Feldman (Feb 18 2025 at 17:58):

well the reason it's confusing to read is that it's bar: whereas type annotations are always written as bar :

Joshua Warner (Feb 18 2025 at 17:59):

Richard Feldman (Feb 18 2025 at 17:59):

Richard Feldman (Feb 18 2025 at 18:00):

Joshua Warner (Feb 18 2025 at 18:00):

That doesn't simplify the parser, unless we're allowed to give an error and bail out if we don't see that.

Richard Feldman (Feb 18 2025 at 18:00):

Joshua Warner (Feb 18 2025 at 18:00):

Joshua Warner (Feb 18 2025 at 18:19):

Allowing/needing backtracking is IMO a sign of a poorly-designed language grammar

Joshua Warner (Feb 18 2025 at 18:20):

It is indicative of issues that affect not just whether the machine can parse the language, but how easy it is for humans to parse as well

Joshua Warner (Feb 18 2025 at 18:20):

And furthermore, it's easy for it to become a performance blackhole that can make fuzzing difficult

Richard Feldman (Feb 18 2025 at 18:23):

Richard Feldman (Feb 18 2025 at 19:07):

but yeah, overall this reminds me of type inference on local declarations in general: technically Hindley-Milner type inference has bad asymptotics on them, so if you made a gigantic number of local variables in a row it would really hurt performance, but in practice nobody notices because people don't write real-world code that way

Richard Feldman (Feb 18 2025 at 19:08):

foo :
foo =

Richard Feldman (Feb 18 2025 at 19:09):

jan kili (Feb 18 2025 at 19:15):

Braces seem to be the consensus solution for implementing #ideas > insignificant whitespace ! Any objection to me resolving that ~~centithread~~ kilothread?

Joshua Warner (Feb 18 2025 at 19:33):

It sounds like the path forward here is to create a NoSpaceColon token that we output if there's no whitespace before the colon. Use that one for records, and the normal Colon for types. Give an error if you use the wrong one in a context that's unambiguous, and if it's otherwise ambiguous, use it for disambiguation.
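A minimal Python sketch of the two-token idea, assuming the only thing that matters is whether whitespace comes directly before the colon. The function name colon_kinds is hypothetical, and a real lexer would emit these as part of a full token stream rather than scanning characters in isolation.

```python
def colon_kinds(src: str) -> list[str]:
    """Return 'Colon' or 'NoSpaceColon' for each ':' in src, in order.

    Assumed rule (for illustration): a colon with whitespace immediately
    before it is a plain Colon (type annotation style, `foo : Str`);
    a colon that touches the preceding token is a NoSpaceColon
    (record field style, `bar: 42`).
    """
    kinds = []
    for i, ch in enumerate(src):
        if ch == ":":
            preceded_by_space = i == 0 or src[i - 1].isspace()
            kinds.append("Colon" if preceded_by_space else "NoSpaceColon")
    return kinds
```

For foo : Str this yields a single Colon, and for { bar: 42 } a single NoSpaceColon, so the parser can pick the type or record-field path at the colon itself instead of looking further ahead.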

Joshua Warner (Feb 18 2025 at 19:34):

(side note: I'm still skeptical of the readability here, but that's a larger discussion that we definitely need more evidence for...)

Anthony Bullard (Feb 18 2025 at 22:36):

Joshua Warner (Feb 18 2025 at 23:14):

Yes, but we can't tell whether the rhs of that first ':' is a type or a record expr, without this rule about NoSpaceColon vs Colon.

Richard Feldman (Feb 18 2025 at 23:43):

hm, will that rule be a problem if people write a record type as { name: Str } without the space, because that's what they're used to from other languages?

Joshua Warner (Feb 18 2025 at 23:44):

If they do so in a context where, up-to-that-point, it's ambiguous whether that's a type or a record literal, they'll get an error (so, yes)

Joshua Warner (Feb 18 2025 at 23:45):

Richard Feldman (Feb 18 2025 at 23:45):

Kiryl Dziamura (Feb 19 2025 at 12:28):

Kiryl Dziamura (Feb 20 2025 at 09:26):

fn = |x| {
    |y| { x + y }
}

fn = |x| |y| { x + y }

Sam Mohr (Feb 20 2025 at 09:44):

Brendan Hansknecht (Feb 25 2025 at 17:39):

Ben (Feb 25 2025 at 20:41):

Sorry for the basic question, but why are braces needed? Why is it insufficient to parse a function as a series of assignments ending in an expression? Is it because of calling an effectual function that doesn't assign to anything? Are there other ambiguous cases?

Brendan Hansknecht (Feb 25 2025 at 21:30):

Braces are being used for multiline functions to remove whitespace significance from the language. This enables the parser to be much more tolerant of various code formats and makes copy and paste just work. Those are at least the top two things that come to mind for me.

Richard Feldman (Feb 25 2025 at 21:37):

a way to think of braces is that they're a way to add statements to an expression

Richard Feldman (Feb 25 2025 at 21:38):

so if I have an expression like foo I can add statements in front of it using braces, e.g.

{
    x = bar * 2
    expect bar == baz
    foo + x
}

Richard Feldman (Feb 25 2025 at 21:38):

Brendan Hansknecht (Feb 25 2025 at 21:43):

fn! = || {
    {
        x = read!()
        write!(x)
    }
    {
        x = read!()
        write!(x)
    }
}

Richard Feldman (Feb 25 2025 at 21:44):

I was writing up docs for them and I think :point_up: is the actual definition we want

Richard Feldman (Feb 25 2025 at 21:46):

like "if you want to add statements in front of an expression, surround both the statements and the expression in { ... }. That whole { ... } is called a block, and it is an expression."

Richard Feldman (Feb 25 2025 at 21:47):

then, separately, there's the rule that "aside from the expression at the very end of a block, anything inside that block which looks like a standalone expression rather than a statement desugars to having {} = in front of it" - e.g. a write!(x) in the middle of a bunch of statements desugars into {} = write!(x)
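The desugaring rule can be sketched in Python with a toy representation of block items. Everything here is hypothetical scaffolding for the example (the Assign and Expr classes, the desugar_block name, and storing expressions as plain strings); it only models the rule as stated, not the compiler's actual data structures.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Assign:
    pattern: str  # left-hand side, e.g. "x" or "{}"
    value: str    # right-hand side expression, kept as source text


@dataclass(frozen=True)
class Expr:
    text: str     # a standalone expression, kept as source text


def desugar_block(items: list) -> list:
    """Rewrite every standalone expression except the last item
    into `{} = <expr>`; the final item is the block's value and is kept."""
    if not items:
        return []
    *body, last = items
    rewritten = [
        Assign("{}", item.text) if isinstance(item, Expr) else item
        for item in body
    ]
    return rewritten + [last]
```

For a block containing x = read!(), then write!(x), then a final x, the middle item becomes {} = write!(x) while the assignment and the final expression are left untouched.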

Richard Feldman (Feb 25 2025 at 21:48):

(of course { .... } also comes up in records, as well as delimiting the list of patterns in a match, as well as in some module headers)

Anthony Bullard (Feb 26 2025 at 14:18):

This is working in my latest PR. A block is just an expression that contains a series of statements

Anthony Bullard (Feb 26 2025 at 14:19):

So you can pass them as function args, have them as list items, even have them as the predicate of an if

Brendan Hansknecht (Feb 26 2025 at 18:07):

Brendan Hansknecht (Feb 26 2025 at 18:08):

That sounds like terrible syntax, but I guess it makes sense it can work anywhere

Anthony Bullard (Feb 26 2025 at 18:53):

Brendan Hansknecht (Feb 26 2025 at 19:23):

I guess I could see someone doing something like this (just with a different context).

out = my_list.map({
    calculation_to_cache = something_super_slow!(...)
    |x| update(x, calculation_to_cache)
})
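
A rough Rust analogue of this pattern, assuming a hypothetical something_super_slow that just returns a constant and a hypothetical update that is plain addition: the block passed to map runs its statement once, and the block's value is the closure that captures the cached result.

```rust
/// Hypothetical stand-in for `something_super_slow!(...)`; returns a fixed value.
fn something_super_slow() -> i32 {
    40
}

/// Maps over the list with a closure produced by a block expression.
fn map_with_cached(my_list: &[i32]) -> Vec<i32> {
    my_list
        .iter()
        .copied()
        .map({
            // Evaluated once, when the argument to `map` is built.
            let calculation_to_cache = something_super_slow();
            // The block's value is this closure, which captures the cached result.
            move |x: i32| x + calculation_to_cache
        })
        .collect()
}
```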

Richard Feldman (Feb 26 2025 at 19:34):

Richard Feldman (Feb 26 2025 at 19:35):

Brendan Hansknecht (Feb 26 2025 at 19:40):

Simon Taeter (Feb 27 2025 at 11:59):

These brackets, as Brendan Hansknecht defined them, look an awful lot like a fusion of let-in and () from Elm.

fn =
    fn1
        ( let a = b + 123
           in fn2 a b
        )
        c

fn =
    fn1
        ( a = b + 123
           fn2 a b
        )
        c

Wouldn't it make sense to simply extend the definition of those parentheses instead of using the brackets?

Anthony Bullard (Feb 27 2025 at 12:05):

We talked about that, but it ends up being semantic overload on (): it would be apply args, tuples, parenthesized expressions, AND blocks.

Anthony Bullard (Feb 27 2025 at 12:05):

jan kili (Feb 27 2025 at 15:55):

fn =
    fn1(
        (
            a = b + 123
            fn2(a, b)
        ),
        c
    )

fn =
    fn1(
        {
            a = b + 123
            fn2(a, b)
        },
        c
    )

Kiryl Dziamura (Feb 27 2025 at 16:01):

I just realized. the following is possible as well, right? kinda funny but why not :D

x = 42 - { 21 * 2 }

Kiryl Dziamura (Feb 27 2025 at 16:08):

on the other hand... block expressions feel like a multi-statement version of parens. does it make sense to require more than a single statement in the block? or maybe fmt may help?
yeah, I know, no one sane would write this kind of code anyway

jan kili (Feb 27 2025 at 16:21):

Maybe the formatter should convert braces to parens if they only contain a single expression.

jan kili (Feb 27 2025 at 16:22):

Anthony Bullard (Feb 27 2025 at 16:41):

Yes a block with a single expression and no newlines/comments will have the braces removed by the formatter
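
As a minimal sketch of that rule, assuming a toy Block type that is not the real formatter's data structure (its fields and the format_block function are made up for illustration):

```rust
/// Toy model of a block; the real Roc formatter's types will differ.
struct Block {
    statements: Vec<String>,        // statements before the final expression
    final_expr: String,             // the expression the block evaluates to
    has_newlines_or_comments: bool, // whether the source had newlines/comments inside
}

/// Drops the braces when the block is a lone single-line expression;
/// otherwise prints one item per line inside braces.
fn format_block(block: &Block) -> String {
    if block.statements.is_empty() && !block.has_newlines_or_comments {
        // Single expression, no newlines or comments: braces are removed.
        return block.final_expr.clone();
    }
    let mut out = String::from("{\n");
    for stmt in &block.statements {
        out.push_str("    ");
        out.push_str(stmt);
        out.push('\n');
    }
    out.push_str("    ");
    out.push_str(&block.final_expr);
    out.push_str("\n}");
    out
}
```

This sketch ignores operator precedence around the removed braces, which a real formatter would have to consider.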

Richard Feldman (Feb 27 2025 at 16:44):

Kiryl Dziamura (Feb 27 2025 at 16:51):

it feels like a body (of either an if/else or a function) is not the same as a block expr at the IR level, but both can be records ofc

Richard Feldman (Feb 27 2025 at 17:31):

Derin Eryilmaz (Feb 27 2025 at 18:49):

# function that returns one
returns_one = { 1 }

# traditional function
identity = { x -> x }
add = { a: I32, b: I32 -> a + b }

# function with multiple "cases", replaces when-is / match:
is_ok = {
  Ok _ -> Bool.true,
  Err _ -> Bool.false
}
divide = { a, b ->
  b |> {
    0 -> crash "can't.",
    _ -> a / b
  }
}

Simon Taeter (Mar 02 2025 at 10:01):

I haven't read the whole discussion here, so I might be repeating someone, but is the aim here only to make whitespace insignificant? If that's the only goal, I personally think that enforcing readability is more important.

In JS, you can stuff any amount of code onto a single line and make it completely unreadable, because whitespace is insignificant in that language. That also gives malicious code a place to hide.

For JS that makes sense, because you want your code to be as tiny as possible and remove every possible character when sending it to clients' browsers. But I think that doesn't apply to Roc, as it is compiled.

Completely leaning on indentation and new lines for the language syntax might also be a viable approach. It would force both users and code generators to make code with a structure that can be understood at first glance. I think that actually would be a feature.

Sam Mohr (Mar 02 2025 at 10:02):

I agree that readability is more important than write-ability, but I don't think this is a drop in readability. I would prefer the aesthetic of whitespace over braces, but I think braces are actually very easy to read

Sam Mohr (Mar 02 2025 at 10:03):

And since it doesn't seem like a drop in readability, a less frustrating experience when copy-pasting code, working with LLMs, and writing unindented code (braces are better for all three) seems like a good trade-off

Sam Mohr (Mar 02 2025 at 10:03):

If it was a drop in readability, then we'd want to maybe reconsider more strongly

Brendan Hansknecht (Mar 02 2025 at 18:36):

Note: we also have an opinionated formatter that will stop the super giant single line of code thing.

Luke Boswell (Jul 09 2025 at 22:49):

I've been thinking about if-then-else ... I keep getting tripped up on not having then and I see other people making the same mistake.

I understand that we decided to try not having it because then that is one less keyword, and then is free for people to use as a variable name, and it isn't needed with the braces design.

But I'm wondering if the cost to our strangeness budget is larger than we thought when we considered this design.

I also think since we had that discussion we also decided that the braces are optional because they're for block expressions.

Basically after writing a fair amount of 0.1 roc (in the snapshots) I'm definitely feeling like we should re-consider then.

Anthony Bullard (Jul 09 2025 at 22:55):

i agree with this. I never struggle with this in other languages that don't require parens around the condition because they require {} around the expressions

Anthony Bullard (Jul 09 2025 at 22:56):

Brendan Hansknecht (Jul 09 2025 at 23:27):

Anthony Bullard (Jul 10 2025 at 00:03):

it seems the human needs a keyword or punctuation even if the machine doesn't :rolling_on_the_floor_laughing:

Richard Feldman (Jul 10 2025 at 00:42):

Richard Feldman (Jul 10 2025 at 00:44):

the only mainstream programming languages that have then are Bash and Ruby, so I don't think "lack of then" can possibly be the problem here

Richard Feldman (Jul 10 2025 at 00:45):

if lack of curly braces is the problem, we can just have a convention of putting curly braces around the branches, right?

Anthony Bullard (Jul 10 2025 at 00:45):

i don't know why it is, but i keep putting parens around the condition or braces around the expression or I as a person have trouble reading it

Richard Feldman (Jul 10 2025 at 00:45):

Anthony Bullard (Jul 10 2025 at 00:45):

Richard Feldman (Jul 10 2025 at 00:46):

right, but to me this seems like a formatter question and not a syntax design question

Richard Feldman (Jul 10 2025 at 00:46):

like I don't see a case for reintroducing then or switching to 1 if foo or anything like that

Anthony Bullard (Jul 10 2025 at 00:46):

Richard Feldman (Jul 10 2025 at 00:46):

yeah the lowest strangeness budget way to address these concerns is to have the formatter use one or more of these interventions that are already supported

Anthony Bullard (Jul 10 2025 at 00:46):

As long as the thing most people will do just works, then it's all about the conventional style

Anthony Bullard (Jul 10 2025 at 00:47):

Anthony Bullard (Jul 10 2025 at 00:48):

Brendan Hansknecht (Jul 10 2025 at 00:49):

Richard Feldman (Jul 10 2025 at 00:49):

I think doing it the way Go and Rust do, where we have braces around the branches and no parens around the conditionals, should be uncontroversial

Brendan Hansknecht (Jul 10 2025 at 00:49):

Anthony Bullard (Jul 10 2025 at 00:49):

Richard Feldman (Jul 10 2025 at 00:49):

Richard Feldman (Jul 10 2025 at 00:50):

Brendan Hansknecht (Jul 10 2025 at 00:50):

Richard Feldman (Jul 10 2025 at 00:50):

e.g. this is valid Rust code that does the same thing as what it would do in Roc:

if foo {
    a
} else {
    b
}
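
To make the Rust side of that comparison concrete, here is the same snippet wrapped in a function so it compiles on its own; the name pick and the parameter types are assumptions added for illustration.

```rust
/// `if`/`else` with braced branches is an expression in Rust,
/// so its value can be returned directly.
fn pick(foo: bool, a: i32, b: i32) -> i32 {
    if foo {
        a
    } else {
        b
    }
}
```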

Richard Feldman (Jul 10 2025 at 00:50):

Brendan Hansknecht (Jul 10 2025 at 00:50):

Richard Feldman (Jul 10 2025 at 00:51):

honestly my only hesitation for requiring that the formatter add braces is that it creates an inconsistency

Richard Feldman (Jul 10 2025 at 00:52):

if foo { 1 } else { 2 }

uses braces but is single-line, so the simple formatter rule of "braces == multiline" would turn this into multiline

Luke Boswell (Jul 10 2025 at 00:52):

is that because you're used to the old Roc syntax though?

This is probably a major factor then. For some reason I thought if-then-else was the norm in most languages... but I haven't really touched anything besides Rust/Zig or Roc for a while now. I'm definitely a minority here, I just wanted to flag it because I've been tripping up on it.

Luke Boswell (Jul 10 2025 at 00:55):

Ironically Rust and Zig don't have then... so I'm definitely getting this from some place else

Luke Boswell (Jul 10 2025 at 00:57):

It must be current Roc's influence... or maybe even I'm having a stroke and think I'm writing VB

Tobias Steckenborn (Jul 10 2025 at 02:18):

I think as usual it comes down to what you’re used to. See my other thread where - well unhelpful error message aside - I first tripped on the bracketing, then on needing then. Simply due to me being used to if (condition) {code block when true}. I personally would opt for the curly ones even when single line (also in js) just to have consistency. Also when copying something over where the formatting is partially lost that gives another visual cue.

Personally I really don’t grasp the conciseness factor in a lot of things (take e.g. something like arr for array or err for error) given modern tooling can autocomplete easily. Having worked more on the enterprise side of things with larger teams but also quite a lot of business users all of these are possible places for different understanding or confusion :sweat_smile:

TL;DR: I would prefer a uniform approach, not "here's the single-line expression variant, here's the multiline one, perhaps here's the single-condition one" or the like
