number literals for custom number types · ideas

x = 1u8

x = 1.U8

Richard Feldman (Aug 21 2025 at 14:13):

so you can put whatever module you want there, including user-defined ones and then this Just Works:

x = 1.BigInt

Richard Feldman (Aug 21 2025 at 14:14):

I think doing it as a suffix makes more sense than as a prefix, because 1.U8 is a lot less confusing to me than U8.1, which I can't help but read as 8.1 :sweat_smile:

Richard Feldman (Aug 21 2025 at 16:26):

duration = 5.Seconds

earlier = 2.Days->ago!()

Richard Feldman (Aug 21 2025 at 16:26):

as long as we have Seconds.from_digits and Days.from_digits in those modules

Richard Feldman (Aug 21 2025 at 16:27):

for testability, could even have ago be in the module itself and take a Clock, which could be mocked:

duration = 5.Seconds

earlier = 2.Days.ago!(clock)

Richard Feldman (Aug 21 2025 at 16:29):

Richard Feldman (Aug 21 2025 at 16:30):

Richard Feldman (Aug 21 2025 at 16:41):

a related idea: if this were done with static dispatch on types (e.g. it's not the Days module but rather the Days type, assuming that type is in scope), then you could have type aliases for singular vs plural, e.g. Day.roc exposes Days : Day, and then you can make them more readable:

duration = 5.Seconds
short = 1.Second

earlier = 2.Days.ago!(clock)
yesterday = 1.Day.ago!(clock)

Richard Feldman (Aug 21 2025 at 16:44):

actually I guess you can do that yourself with modules using import with as:

import date.Day as Days
import date.Day

# ...

yesterday = 1.Day.ago!(clock)
earlier = 2.Days.ago!(clock)

Richard Feldman (Aug 21 2025 at 16:53):

Kiryl Dziamura (Aug 21 2025 at 19:35):

It looks amazing aesthetically; it would be interesting to explore how it affects naming conventions.

E.g. I see how this literal syntax affects not only the API but also variable naming. Like, 1.Day.ago looks great, but an abstract duration.ago looks weird, and timespan.ago is a bit better but still meh.

Also, the singular/plural take is too much for me :smile: Importing and aliasing multiple variants will definitely annoy me, but I will do that anyway because of my stupid perfectionism :sweat_smile:

Richard Feldman (Aug 21 2025 at 19:42):

I like the idea of it just being a technique the user of the library can use (or not) vs. something the library changes

Fabian Schmalzried (Aug 21 2025 at 19:43):

1.U8 should be rather quick to get used to, even though it looks strange to me at first.

2.Days.ago!(clock) looks awesome. I have a question about implementing a Time package like this. Would this require a module for each supported time interval, or can this be done by types within one module somehow? Can there then be several from_digits functions in a module? I think I'm just not up to date with the latest dispatch plans.

Richard Feldman (Aug 21 2025 at 19:43):

put another way: it is a fact that you can do import with as to make module names plural, so then there's a stylistic preference as to whether you choose to do that :smile:

Richard Feldman (Aug 21 2025 at 19:44):

ago! : Duration, Clock => Instant

Richard Feldman (Aug 21 2025 at 19:44):

Richard Feldman (Aug 21 2025 at 19:45):

ago! : amount, Clock => Instant
    where module(amount).to_duration : amount -> Duration

Richard Feldman (Aug 21 2025 at 19:46):

so that way it accepts any nominal type with a to_duration method, which in turn means you can define Day to be a nominal type with a to_duration method

Richard Feldman (Aug 21 2025 at 19:47):

that way you can define a function that says it takes a Day count as input specifically, and if you try to give it like minutes or something, it errors

Richard Feldman (Aug 21 2025 at 19:47):

(or if that seems like an antipattern in practice, can always do the type alias approach instead)
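
The two signatures for ago! above can be modeled roughly as follows. This is a sketch in Python rather than Roc, since the Roc syntax here is still a proposal; HasToDuration, FakeClock, Minute, and schedule_backup are hypothetical names introduced only for illustration. It shows ago accepting anything with a to_duration method, while a function annotated to take a Day specifically would be expected to be rejected by a static type checker if given minutes.

```python
from dataclasses import dataclass
from typing import Protocol


@dataclass(frozen=True)
class Duration:
    seconds: int


class HasToDuration(Protocol):
    """Hypothetical stand-in for `where module(amount).to_duration : amount -> Duration`."""

    def to_duration(self) -> Duration: ...


@dataclass(frozen=True)
class Day:
    count: int

    def to_duration(self) -> Duration:
        return Duration(self.count * 86_400)


@dataclass(frozen=True)
class Minute:
    count: int

    def to_duration(self) -> Duration:
        return Duration(self.count * 60)


class FakeClock:
    """A mockable clock that always reports the same instant (in seconds)."""

    def __init__(self, now_seconds: int) -> None:
        self._now = now_seconds

    def now(self) -> int:
        return self._now


def ago(amount: HasToDuration, clock: FakeClock) -> int:
    # Accepts any value with a to_duration method, not just Duration itself.
    return clock.now() - amount.to_duration().seconds


def schedule_backup(every: Day) -> int:
    # Annotated to take a Day count specifically; passing a Minute here
    # is the kind of mistake a static type checker would flag.
    return every.to_duration().seconds
```

Because the clock is passed in, ago(Day(2), FakeClock(...)) is deterministic in tests, which is the testability point made earlier in the thread.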

Jasper Woudenberg (Aug 21 2025 at 19:54):

Yeah, personally I think having all the durations use the same type seems more powerful to me. Otherwise you wouldn't be able to write 3.Weeks + 2.Days, which I think is reasonable code?

Fabian Schmalzried (Aug 21 2025 at 19:56):

Thanks for the explanation, this could be useful for a lot of stuff. item.shift(2.Pixels.left) or something.

Jasper Woudenberg (Aug 21 2025 at 21:14):

## Durations.roc
days : number -> Duration where module(number).to_number : number -> Number
hours: number -> Duration where module(number).to_number : number -> Number
seconds: number -> Duration where module(number).to_number : number -> Number
# ... etc

## Roc standard library
Number : [Digits, Int U64, Float F64]
Digits := List(U8)

## Usage in application code
duration = 4=>hours().ago!(clock)

There's a bit of weird wrapping going on behind the scenes: taking a primitive number type, adding a tag on it to produce a Number, then immediately casing on it again to get back the primitive number type. I imagine that would reliably get optimized out again.

Jasper Woudenberg (Aug 21 2025 at 21:42):

Separately, I like Rails' 3.hours.ago syntax, but given we have to pass a clock I wonder if this would be clearer:

.before(clock.now!())
.after(clock.now!())

Luke Boswell (Aug 21 2025 at 22:51):

This would mean we can remove the whole complexity around parsing/can for those literals, which is another simplification I imagine.

Richard Feldman (Aug 21 2025 at 22:52):

Luke Boswell (Aug 21 2025 at 22:55):

Richard Feldman (Aug 21 2025 at 22:56):

Brendan Hansknecht (Aug 23 2025 at 17:07):

Seems pretty reasonable.... Is it only for raw numbers? Also, I guess it really depends on compile-time evaluation to not have a perf hit.

How does it deal with hex or binary, for example? Like a hex number can be positive or negative without a numeric sign (I guess these are issues from any custom numbers and not this syntax proposal).

Also this looks pretty solid: 27.Complex + 12.Complex.i()... Definitely noisy, but maybe reasonable.... idk...

Richard Feldman (Aug 23 2025 at 17:08):

Sky Rose (Aug 25 2025 at 22:20):

I didn't quite follow everything, but would this require every suffix to define its own from_digits? Like, Day.from_digits and Pixels.from_digits and ... That could be a lot, especially if each one has to handle hex inputs as well.

Sky Rose (Aug 25 2025 at 22:21):

Sky Rose (Aug 25 2025 at 22:22):

Richard Feldman (Aug 25 2025 at 22:35):

I think it would make more sense to use a string, since strings will be able to do the same thing

Richard Feldman (Aug 25 2025 at 22:37):

hex inputs would get converted automatically. I actually suspect we'll want them to be base-2 digits because that's the most efficient for the compiler to both store and operate on.

it would be trivial to implement these from_digits functions for wrapped integers because they'd just delegate to a builtin from_digits - e.g.

from_digits : Iter(U8) -> Result(Day, OutOfRange)
from_digits = |iter| U32.from_digits(iter).map_ok(Day.from_u32)
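
As a rough model of that delegation (in Python rather than Roc, since the Roc syntax here is still a proposal): a wrapped-integer type reuses the builtin conversion and only rewraps the success case. The names u32_from_digits and day_from_digits are hypothetical, Results are modeled as ("Ok", value) / ("Err", reason) tuples, and base-10 digits are an assumption made for readability (the message above suggests base-2 may be what the compiler actually passes).

```python
from dataclasses import dataclass

U32_MAX = 2**32 - 1


@dataclass(frozen=True)
class Day:
    count: int


def u32_from_digits(digits):
    """Stand-in for a builtin U32.from_digits over base-10 digits (assumed)."""
    value = 0
    for digit in digits:
        value = value * 10 + digit
        if value > U32_MAX:
            return ("Err", "OutOfRange")
    return ("Ok", value)


def day_from_digits(digits):
    """Delegate to the builtin conversion, then wrap the Ok payload in Day."""
    tag, payload = u32_from_digits(digits)
    if tag == "Ok":
        return ("Ok", Day(payload))
    return (tag, payload)
```

Under this model each suffix type only needs the small wrapper, not its own digit parser.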

Brendan Hansknecht (Aug 26 2025 at 00:01):

I guess my biggest concern is that I feel like it always needs to fold. Like that should be a requirement at comptime

Brendan Hansknecht (Aug 26 2025 at 00:01):

Cause it would be really bad perf otherwise and really awkward ux if the result manifests

Richard Feldman (Aug 26 2025 at 00:59):

Brendan Hansknecht (Aug 26 2025 at 01:01):

I guess my comment doesn't apply to this syntax specifically but to the original design.

Joshua Warner (Aug 26 2025 at 01:02):

Possibly silly idea: add a comptime designator like zig that requires that arg to be constant folded at compile time, else failing the compilation

Richard Feldman (Aug 31 2025 at 12:45):

I just realized we could make record builder syntax use this same metaphor, e.g.

color = {
    r: Random.u8(),
    g: Random.u8(),
    b: Random.u8(),
}.Random

Richard Feldman (Aug 31 2025 at 12:46):

so just like how 1.U8 would desugar to U8.from_digits([1]), { ... }.Random would desugar to using Random.map_both (or whatever we decide to call it) to build up all the fields

Richard Feldman (Aug 31 2025 at 12:47):

color = { Random.map_both <-
    r: Random.u8(),
    g: Random.u8(),
    b: Random.u8(),
}

Richard Feldman (Aug 31 2025 at 12:48):

conceptually we're doing the same thing in both cases - we have a literal (either a number literal or a record literal) and we want to use a pure function to concisely transform it in a particular way

Richard Feldman (Aug 31 2025 at 12:50):

also this made me realize that in both cases (if desired) we could let you customize the exact function, e.g. 1.(U8.something_other_than_from_digits)

Richard Feldman (Aug 31 2025 at 14:22):

we could do it for list literals too, and use the fact that the conversion functions return a Result to validate things at compile time - so for example this could give you an error at build time:

[1, 2, 2, 3].Set

basically telling you about the duplicate entry in the Set literal at compile time!

Richard Feldman (Aug 31 2025 at 14:47):

[
    ("a", 1),
    ("b", 1),
    ("b", 1),
    ("c", 1),
].Dict

Richard Feldman (Aug 31 2025 at 15:09):

there was a thing we used to do in Elm where we had a => operator that just made a tuple, so it was like a => b was the same as (a, b), so you could do things like:

[
    "a" => 1,
    "b" => 2,
    "b" => 3,
    "c" => 4,
].Dict

Brendan Hansknecht (Aug 31 2025 at 18:29):

Brendan Hansknecht (Aug 31 2025 at 18:33):

I can see folks being confused and asking:
Why is {...}.U8 trying to call U8.map_both (which doesn't exist), but 0b1101.U8 is calling U8.from_digits? What is map_both (given record builders are rare) and why does it have special syntax as opposed to any other function?

Brendan Hansknecht (Aug 31 2025 at 18:33):

Also, this: [1, 2, 2, 3].Set just makes me really want a normal constructor Set([1, 2, 2, 3]). Same with the Dict example.

Richard Feldman (Aug 31 2025 at 18:51):

Richard Feldman (Aug 31 2025 at 18:52):

Brendan Hansknecht (Aug 31 2025 at 19:06):

Richard Feldman (Aug 31 2025 at 19:30):

the difference is that the latter can't tell you at compile time that you had a duplicate in there

Richard Feldman (Aug 31 2025 at 19:31):

or rather, the only way it could tell you at compile time would be if it had a crash on duplicates, which would be a really bad API because then it would potentially crash in production at runtime :sweat_smile:

Richard Feldman (Aug 31 2025 at 19:31):

the cool part about the "literal suffix" is that the conversion function returns a Result, so it's always safe to use in production at runtime, and yet if it returns Err when the compiler is evaluating it with the literal at compile time, the compiler can give you an error
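
A small model of that idea, in Python rather than Roc: the conversion function is total and returns a Result, and a separate step that stands in for the compiler turns an Err into a build failure when the input is a literal. set_from_literal, CompileError, and eval_literal_at_compile_time are hypothetical names, and Results are modeled as ("Ok", value) / ("Err", reason) tuples.

```python
class CompileError(Exception):
    """Stands in for a build-time error reported by the compiler."""


def set_from_literal(items):
    """Total conversion: never crashes, reports duplicates as an Err."""
    seen = set()
    for item in items:
        if item in seen:
            return ("Err", ("DuplicateEntry", item))
        seen.add(item)
    return ("Ok", frozenset(seen))


def eval_literal_at_compile_time(convert, literal):
    """What the compiler would do for `literal.Suffix`: run the conversion
    on the literal and unwrap the Result, failing the build on Err."""
    tag, payload = convert(literal)
    if tag == "Err":
        raise CompileError(f"invalid literal: {payload}")
    return payload
```

Calling set_from_literal directly at runtime stays safe because it only ever returns a Result; only the compile-time step treats Err as fatal.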

Luke Boswell (Aug 31 2025 at 23:12):

Luke Boswell (Aug 31 2025 at 23:14):

Is it ever going to make sense on other literals... like have you thought about string literals?

Brendan Hansknecht (Aug 31 2025 at 23:22):

To me, this just sounds like some form of comptime crash is missing and we are band-aiding over it. Like we could make Set.new have a comptime crash if wanted.

Brendan Hansknecht (Aug 31 2025 at 23:23):

Richard Feldman (Sep 01 2025 at 00:58):

Richard Feldman (Sep 01 2025 at 00:59):

Richard Feldman (Sep 01 2025 at 02:25):

{
    "a": 1,
    "b": 2,
    "b": 3,
    "c": 4,
}.Dict

the design could be that if you make a record literal where instead of fields like foo: they are expressions, e.g. 1: or "a": (or if you really wanted to put a lookup in there, (foo):) and all the keys have to have the same type, and all the values have to have the same type, and it's sugar for an Iter((key, val))

Brendan Hansknecht (Sep 01 2025 at 02:30):

Brendan Hansknecht (Sep 01 2025 at 02:31):

Feels a bit odd for Roc, but not crazy or anything. I guess a lot of it very fundamentally depends on comptime, but that's fine

Luke Boswell (Sep 01 2025 at 02:35):

"and it's sugar for an Iter((key, val))" can you humour me and give me a couple of use cases for this feature?

Richard Feldman (Sep 01 2025 at 02:39):

another is if you want to write a JSON object where the keys aren't valid Roc syntax so you can't just use the normal auto-encoding

Richard Feldman (Sep 01 2025 at 02:40):

that was one we used to use => for in Elm - Json.Encode.object takes a list of key/value pairs where the key is a string

Richard Feldman (Sep 01 2025 at 02:41):

but of course { "_": underscore_thing, "$": dollar } reads nicer for JSON in particular (since it's exactly JSON syntax for objects) than [("_" => underscore_thing), ("$" => dollar)]

Luke Boswell (Sep 01 2025 at 02:41):

This is just more convenient API wise than the alternative. Because we planned to comp-time eval top-levels anyway

Richard Feldman (Sep 01 2025 at 02:42):

Luke Boswell (Sep 01 2025 at 02:43):

One thing I really like about the record literal syntax { .. }.RecordThing is that we are not using the <- back arrow. That arrow just always seemed a little out of place there, especially with backpassing removed now.

Brendan Hansknecht (Sep 01 2025 at 02:55):

So do we feel this adds a lot of weirdness/beginner complexity? I feel like Roc as a language has been transitioning to be more complex for new users, with more things to know about.

Richard Feldman (Sep 01 2025 at 02:58):

Brendan Hansknecht (Sep 01 2025 at 03:53):

Yeah, it isn't any form of deep or nestable complexity. It is just more surface-level things in Roc that the user is required to know. Kinda like adding more sugar/more ways to do similar things. Just very non-obvious without reading about it in a tutorial or looking it up.

Richard Feldman (Sep 02 2025 at 01:40):

{ http_result, read_result, write_result } = {
    http_result: || Http.get!(url, Json.utf8),
    read_result: || File.read!(path1),
    write_result: || Fs.write!(path2, data),
}.Task.timeout(500.Ms).run!()

Richard Feldman (Sep 02 2025 at 01:41):

(without going on a huge tangent, I realized we do need a Task module and Task wrapper type for this specific case, where you want to say "run all of these concurrently, and tell me when they're all done" - but the wrapper type is kinda useful anyway so you can put a timeout on the whole batched operation and/or make it cancelable etc.)

Brendan Hansknecht (Sep 02 2025 at 01:46):

I wonder if "Task" is the right name in this context...vs like Future or something.....

Brendan Hansknecht (Sep 02 2025 at 01:47):

I guess explicit threading may use a different system or just direct calls....idk

Richard Feldman (Sep 02 2025 at 01:50):

yeah the || are necessary because otherwise you'd just be evaluating them immediately and sequentially :laughing:

Richard Feldman (Sep 02 2025 at 01:52):

we should definitely explore additional concurrency primitives to see what makes sense (threads? channels? other stuff?) but mainly for the purposes of this thread, it's relevant how the specific use case of "I have some effectful functions I want to run concurrently" would look :smile:

Richard Feldman (Sep 02 2025 at 02:31):

side note to the side note: I just realized timeout is probably something that shouldn't be in a builtin, because some platforms will be single-threaded and couldn't possibly support it!

Richard Feldman (Sep 02 2025 at 02:33):

but if Tasks are cancelable, then any platform can offer a timeout operation which takes a task, makes it cancelable, and then cancels it if it hasn't completed by the specified time
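
The "take a task, make it cancelable, cancel it if it hasn't completed in time" recipe can be sketched with Python's asyncio, used here only as a stand-in for a platform's task system. with_timeout, slow, and fast are hypothetical names, and the ("Ok", ...) / ("Err", "TimedOut") tuples model a Result.

```python
import asyncio


async def with_timeout(coro, seconds):
    """Run `coro` as a cancelable task; cancel it if it misses the deadline."""
    task = asyncio.ensure_future(coro)
    done, _pending = await asyncio.wait({task}, timeout=seconds)
    if task in done:
        return ("Ok", task.result())
    # Deadline passed: request cancellation and wait for it to take effect.
    task.cancel()
    try:
        await task
    except asyncio.CancelledError:
        pass
    return ("Err", "TimedOut")


async def slow():
    await asyncio.sleep(10)
    return "done"


async def fast():
    return "done"
```

For example, asyncio.run(with_timeout(slow(), 0.01)) gives up after the deadline instead of waiting the full ten seconds.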

Isaac Van Doren (Sep 02 2025 at 15:22):

Isaac Van Doren (Sep 02 2025 at 15:24):

I personally would not want to have an error because of a duplicate entry in a set, I would rather the set drop duplicates automatically. That's the behavior I normally want from a set

Brendan Hansknecht (Sep 02 2025 at 16:23):

Note: this is explicitly for literals. Like if you have a literal written out in your source code that happens to have duplication in it. Which I think is likely to be accidental/a bug.

Richard Feldman (Sep 02 2025 at 18:30):

if I'm inserting a variable at runtime, e.g. set.insert(foo) and it's a duplicate, then yeah it should silently drop it

Richard Feldman (Sep 02 2025 at 18:30):

but if I write out a literal that is just flat-out incorrect, and there is a 100% chance it's that I made a mistake, then sure, I'd prefer to know about it! :smile:

Brendan Hansknecht (Sep 02 2025 at 19:30):

Other note, any user could opt out by just using Set.new or Dict.new instead of literals

Kiryl Dziamura (Sep 03 2025 at 13:31):

div = |content|
  """
  <div>${content}</div>
  """.Html

pattern = "([A-Z])\w+".RegExp

Kiryl Dziamura (Sep 05 2025 at 09:23):

div = |content|
  """
  <div>${content}</div>
  """.Html

Anton (Sep 05 2025 at 09:33):

Kiryl Dziamura (Sep 05 2025 at 10:11):

you're probably right. my assumption is that it's a literal, not a string itself. it would be great to be able to validate html syntax this way and transform it to a sequence of calls

div = |content|
  """
  <div>${content}</div>
  """.Html

div = |content|
  Html.div([], [content])

div = |content|
  Html.div([], [Html.text(content)])

Module.from_interpolation : List([String(Str), Arg(Module)]) -> Result(Module)

Kiryl Dziamura (Sep 05 2025 at 10:16):

Brendan Hansknecht (Sep 05 2025 at 22:42):

I think the oddity for me is that it feels weird to do interpolation in a multiline string. Especially so if it is done before applying the Html constructor. The order of operations is unclear, and I think interpolation is less common in multiline strings in general

Brendan Hansknecht (Sep 05 2025 at 22:43):

As I read it by default, Html is a custom literal, so why should it run interpolation first? Or at least why should it run string interpolation and not custom HTML interpolation?

Luke Boswell (Sep 05 2025 at 22:54):

I would have expected the .Html to run at compile time, and the interpolation to run at runtime

Brendan Hansknecht (Sep 05 2025 at 23:34):

I expect the same, but I feel like it could make sense that the HTML literal, which runs at compile time, decides how interpolation works at runtime (like maybe automatically sanitizing).

Brendan Hansknecht (Sep 05 2025 at 23:34):

Richard Feldman (Sep 06 2025 at 00:25):

for this design to work, the entire expression before the .Html would have to be evaluated at compile time

Richard Feldman (Sep 06 2025 at 00:25):

Richard Feldman (Sep 06 2025 at 00:42):

but it's important for the design that the function being called on the string's contents is evaluated at compile time, so that it can return a Result that gets unwrapped at compile time

Richard Feldman (Sep 06 2025 at 00:43):

so that it can give you a compile-time error if the string literal wasn't valid for that purpose (e.g. invalid regex from the example right after the html one), but you don't need to deal with the Result at runtime because it was handled at compile time instead

Fabian Schmalzried (Sep 06 2025 at 11:52):

Would it then make sense to have some kind of ComptimeResult type that makes sure it is evaluated at comptime? That would make it clearer that a function is only meant for comptime evaluation. Or would that result in unnecessary double implementation of those from_digits functions, because they might also be useful at runtime?

Richard Feldman (Sep 06 2025 at 12:32):

Kiryl Dziamura (Sep 06 2025 at 20:59):

"<div>${content}</div>".Html

"<div>".Html.concat(content).concat("</div>".Html)

concat is what interpolation expects the module to implement, a generalization of Str.concat : Str, Str -> Str so Html.concat : Html, Html -> Html. From this perspective we don't need to comptime evaluate the function argument but only "...".Html parts that are constants.

Now, "<div>".Html is a string literal overloading (Html.from_str : Str -> Result.Html). It may have the implementation Html.openTag("div"). So the whole expression is desugared (with $ for comptime evaluation) to

$(Html.openTag("div")).concat(content).concat($(Html.closeTag("div")))

It looks pretty sound and consistent to me. I don't know if it's a great power or a heavy responsibility tho.
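
A rough Python model of this desugaring, where only the constant pieces are converted up front and the interpolated argument is joined with concat at runtime. The Html class, its toy bracket-count validation, and the comptime helper are all hypothetical; module-level constants stand in for values the compiler would evaluate at compile time.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Html:
    markup: str

    @staticmethod
    def from_str(literal):
        # Toy validation only: reject unbalanced angle brackets.
        if literal.count("<") != literal.count(">"):
            return ("Err", "UnbalancedAngleBrackets")
        return ("Ok", Html(literal))

    def concat(self, other):
        # Generalization of Str.concat: Html, Html -> Html.
        return Html(self.markup + other.markup)


def comptime(result):
    """Stands in for $(...): unwrap a Result once, failing early on Err."""
    tag, payload = result
    if tag == "Err":
        raise ValueError(f"invalid Html literal: {payload}")
    return payload


# The constant parts of "<div>${content}</div>".Html, converted once.
OPEN_DIV = comptime(Html.from_str("<div>"))
CLOSE_DIV = comptime(Html.from_str("</div>"))


def div(content):
    # Only the interpolated argument is handled at runtime.
    return OPEN_DIV.concat(content).concat(CLOSE_DIV)
```

In this model div never touches a Result at runtime, because both literal fragments were validated before it was ever called.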

Kiryl Dziamura (Sep 06 2025 at 21:06):

I'm still leaning towards inconvenience of it. It's a funny code golf, but in reality string parsing makes more sense when it's context aware, which is not possible here. On the other hand, what's the other way to have string interpolation overloading?

Kiryl Dziamura (Sep 06 2025 at 21:25):

Brendan Hansknecht (Sep 06 2025 at 21:44):

It definitely is an intriguing possibility. I could see it as quite helpful for html templating with smart escaping, but that would need to be at runtime for at least some of the work

Kiryl Dziamura (Sep 07 2025 at 08:07):

An obvious use of the string literal overload is comptime tokenization. You still have to parse the resulting sequence of tokens at runtime, but with this approach it's slightly more optimal and gives tokenization errors at comptime. E.g. HTML or SQL may be used in string form, but comptime would validate their tokens

Kiryl Dziamura (Sep 07 2025 at 08:16):

Or, by analogy with custom numbers, it leads to custom strings. E.g. "abc${var}".Hex or Base64, or whatever else where you want a subset of graphemes (or tokens) in the string.

Kiryl Dziamura (Sep 07 2025 at 08:25):

Kiryl Dziamura (Sep 07 2025 at 08:26):

Kiryl Dziamura (Sep 11 2025 at 06:00):

TypeB := TypeA

itemA : TypeA

itemB : TypeB
itemB = itemA.TypeB

Kiryl Dziamura (Sep 11 2025 at 06:59):

Id := U32

nextId = |id| id.U32.add(1).Id

So type casting from a number literal may be done like this: the number literal is passed to U32 and then cast to Id:

id = 42.U32.Id

Not sure at which level it should be accessible. Probably everywhere, since it's very explicit.

Richard Feldman (Sep 11 2025 at 11:09):

Kiryl Dziamura (Sep 11 2025 at 11:09):

Richard Feldman (Sep 11 2025 at 11:10):

if I'm making a type nominal and exposing the type but not a way to get its internal representation, it's very important that the details of its internal representation don't get leaked like this

Richard Feldman (Sep 11 2025 at 11:10):

I need to be free to change what TypeA's internal representation is without breaking anyone's code

Richard Feldman (Sep 11 2025 at 11:11):

if this exists, I can never do that in Roc anymore because anyone who has called .TypeA to construct something that used to have the same internal representation will break

Kiryl Dziamura (Sep 11 2025 at 11:12):

Richard Feldman (Sep 11 2025 at 11:12):

Kiryl Dziamura (Sep 11 2025 at 11:13):

Email := Str

empty : {} -> Email
empty = \{} -> "".Str.Email

from_str : Str -> Email
from_str = \str -> str.Email

to_str : Email -> Str
to_str = \email -> email.Str

Richard Feldman (Sep 11 2025 at 11:13):

so within the same scope where the type is already defined, you'd have to be referring to the same module you're already inside, and at that point I'm not even sure if it's more concise :sweat_smile:

Kiryl Dziamura (Sep 11 2025 at 11:14):

Kiryl Dziamura (Sep 11 2025 at 11:18):

Could you please show what the example I wrote above looks like in Roc without such type casting? I'm not sure what it would look like; I've been away from Roc syntax for quite a bit

Richard Feldman (Sep 11 2025 at 11:25):

TypeB := TypeA

itemA : TypeA

itemB : TypeB
itemB = TypeB.(itemA)

Kiryl Dziamura (Sep 11 2025 at 11:45):

Kiryl Dziamura (Sep 11 2025 at 11:46):

Kiryl Dziamura (Sep 11 2025 at 11:48):

Email := Str

empty : {} -> Email
empty = \{} -> Email.("")

from_str : Str -> Email
from_str = \str -> Email.(str)

to_str : Email -> Str
to_str = \Email.(str) -> str

Kiryl Dziamura (Sep 11 2025 at 11:57):

Ok, I'm convinced. Much better :smile:
I just noticed .Type may be used not only for literals. But agree, unsafe and verbose

Kiryl Dziamura (Sep 19 2025 at 15:44):

What do you think about that? I feel it got lost in the discussion. I find this design very obvious and don't think there is a reason why a single quote is needed for that. Is char needed so often that single quote has better ergonomics?

Richard Feldman (Sep 19 2025 at 16:25):

Richard Feldman (Sep 19 2025 at 16:26):

and I'm not sure what the advantage would be of "r".U8 if we already have single quote

Kiryl Dziamura (Sep 20 2025 at 09:31):

The advantage is that if strings are only double-quoted, there are no misreads or misuse of them by people coming from JS and Python. Also, if it's used with a comptime constructor, it explicitly shows the type U8, so it's clear there's no char concept in Roc. It would also slightly simplify parsing and error logic in the compiler, but ofc would move the complexity to the Roc implementation; however, it would be a good example of how custom strings may be implemented. I'm likely biased, so I don't see single quotes for U8 as something really important. It's basically a U8 literal that looks like a string. So why not use an overloaded string literal?
It's also not clear why the pattern match ".".U8 => wouldn't work if it's a static U8. Like, if pattern matching works for numbers, why wouldn't it work for custom numbers?

But, I'm not a savage, I understand the ergonomics advantage of single quote for parser implementations. And I also understand that once you learnt about single quote and double quote difference - it's with you forever.

Richard Feldman (Sep 20 2025 at 11:37):

Brendan Hansknecht (Sep 20 2025 at 16:23):

Richard Feldman (Sep 20 2025 at 16:29):

Brendan Hansknecht (Sep 20 2025 at 16:31):

Kiryl Dziamura (Sep 20 2025 at 16:32):

Richard Feldman (Sep 20 2025 at 17:01):

Kiryl Dziamura (Sep 20 2025 at 18:13):

Yes, but current Roc has 42u32 or 42u8 for specifying the numeric type in a literal, but there's no analog for 'x'u32 or 'x'u8

Richard Feldman (Sep 21 2025 at 00:16):

Norbert Hajagos (Sep 23 2025 at 11:37):

Got an idea from reading this thread. It has amazing misuse opportunities, but it's a fun idea :)
What if the single quotes meant for the compiler: "I don't know what this is right now, but based on usage, I see that later it is used as a U8, so I will call U8.from_str_literal on it." This will enable the current behaviour:
new_list = bytes.set(0, 'A')
But also more exotic ones, like:

#CodePoint is a U32 backed nominal type
code_points : List(CodePoint)
code_points = ['h', 'i', '!', '😀']

But ofc, it can do any computation at compile time. It's good, when the verbosity would be too much, like "h".CodePoint, but I would not want to see things like this in my codebase:

# User.from_str_literal is basically a specialized JSON parser
user = '{"name": "Jon"}'
expect user == {name: "Jon"} # true

Brendan Hansknecht (Sep 23 2025 at 15:55):

Kiryl Dziamura (Sep 24 2025 at 13:05):

why does Roc need single quotes at all? I'm not even talking about an alternative. the only justification I can come up with is a smaller memory footprint for e.g. U8. afaiu, single quote aka char is needed for C interop and is a matter of legacy and tradition, no?

Brendan Hansknecht (Sep 24 2025 at 13:43):

Single quotes are just a convenient way to define ASCII/Unicode characters. That definitely comes in handy at times (like pattern matching on a list of bytes).

Richard Feldman (Sep 24 2025 at 14:54):

Richard Feldman (Sep 24 2025 at 14:55):

the specific reason we added them was that there was a really nasty tension between wanting to do certain tasks (e.g. writing a JSON parser) in a performant vs readable way

Richard Feldman (Sep 24 2025 at 14:55):

if you can only pattern match on double-quoted strings, then you have to convert a single byte into a Str just to pattern match on it

Richard Feldman (Sep 24 2025 at 14:55):

otherwise you have to hardcode the Unicode Code Point number to compare to the U8, which people did, and it was terrible for readability

Richard Feldman (Sep 24 2025 at 14:56):

it's important to be able to write high-performance parsers that are readable, so we added single quote to the language to fix that problem

albx (Sep 24 2025 at 17:57):

Hmm I understood the question not as "do we need character literals?" (we clearly do), but as "do we need the ' to express a character literal?" (which is the most obvious choice because that's what most languages use). But now that we have the <literal>.<type> syntax to express literals of any type, do we still need the single quote for chars? Maybe that syntactic space can be freed for some other use. (I'm not pushing for this change btw, just saying how I've read the question)

Richard Feldman (Sep 24 2025 at 18:28):

Richard Feldman (Sep 24 2025 at 19:06):

Richard Feldman (Sep 24 2025 at 19:07):

e.g. make the new compiler not support ' once we have "x".U8 in patterns, and see if even with that option available we still have sufficient demand for (re)adding ' to the language

Richard Feldman (Sep 24 2025 at 19:07):

we certainly know how to do it if desired, and obviously "x".U8 patterns can be used for a lot more

Brendan Hansknecht (Sep 24 2025 at 19:38):

Feels like a ton of noise in pattern matching, but maybe that would get glazed over

Richard Feldman (Sep 24 2025 at 20:16):

it's definitely noisier, but I wonder if it's ok in practice...here's an old example

Richard Feldman (Sep 24 2025 at 20:20):

Numbers only

# Prepend an "\" escape byte
escaped_byte_to_json : U8 -> List U8
escaped_byte_to_json = |b|
    match b {
        0x22 => [0x5c, 0x22] # U+0022 Quotation mark
        0x5c => [0x5c, 0x5c] # U+005c Reverse solidus
        0x0a => [0x5c, 'n'] # U+000a Line feed
        0x0d => [0x5c, 'r'] # U+000d Carriage return
        0x09 => [0x5c, 't'] # U+0009 Tab
        _ => [b]
    }

Richard Feldman (Sep 24 2025 at 20:20):

Single quotes

# Prepend an "\" escape byte
escaped_byte_to_json : U8 -> List U8
escaped_byte_to_json = |b|
    match b {
        '"' => ['\\', '"']
        '\\' => ['\\', '\\']
        '\n' => ['\\', 'n']
        '\r' => ['\\', 'r']
        '\t' => ['\\', 't']
        _ => [b]
    }

Richard Feldman (Sep 24 2025 at 20:21):

.U8 suffix

# Prepend an "\" escape byte
escaped_byte_to_json : U8 -> List U8
escaped_byte_to_json = |b|
    match b {
        "\"".U8 => "\\\"".to_utf8()
        "\\".U8 => "\\\\".to_utf8()
        "\n".U8 => "\\n".to_utf8()
        "\r".U8 => "\\r".to_utf8()
        "\t".U8 => "\\t".to_utf8()
        _ => [b]
    }

Richard Feldman (Sep 24 2025 at 20:21):

Richard Feldman (Sep 24 2025 at 20:22):

Karl (Sep 24 2025 at 20:44):

Kiryl Dziamura (Sep 24 2025 at 21:21):

Richard Feldman (Sep 24 2025 at 21:49):

Kiryl Dziamura (Sep 24 2025 at 21:50):

What's the default type of number literal and how inference would work for it? Would it infer built-in numeric types?
I was thinking about explicit inference operator (_) in different parts of the language, and was wondering how bad would it be to have such for the literal overloads:

# Prepend an "\" escape byte
escaped_byte_to_json : U8 -> List U8
escaped_byte_to_json = |b|
    match b {
        "\""._ => "\\\"".to_utf8()
        "\\"._ => "\\\\".to_utf8()
        "\n"._ => "\\n".to_utf8()
        "\r"._ => "\\r".to_utf8()
        "\t"._ => "\\t".to_utf8()
        _ => [b]
    }

So the explicit inference operator would mean "allow inference from any type, not only native ones".

I'm coming from the fact that any mention of a type name in code is virtually the same as a separate type annotation. I know, it's impractical to have a strict separation of types and logic in code. But maybe it makes sense to give the user ability to explicitly unlock inference (I'm not talking about this particular case, but in general). It also means more generic code, which is not always a great idea.

Richard Feldman (Sep 24 2025 at 22:22):

I think ._ should be a separate thread, seems like a potential rabbit hole discussion :smile:

Richard Feldman (Sep 24 2025 at 22:23):

num where [
    module(num).from_digits : List(U8) -> Result(num, BadDigits)
]

Richard Feldman (Sep 24 2025 at 22:24):

one of the ideas behind this design is to not show the types of number literals (or strings) in the repl anymore

Richard Feldman (Sep 24 2025 at 22:25):

Richard Feldman (Sep 24 2025 at 22:27):

1 + 1
2 : num where [num.Numeric]

Brendan Hansknecht (Sep 25 2025 at 01:20):

Brendan Hansknecht (Sep 25 2025 at 01:21):

Luke Boswell (Dec 31 2025 at 20:11):

Just had to go back and find this thread to remind myself what the design is. I'd even forgotten about the awesome Record Builder syntax... another one for the langref maybe?
