2023

Say hello to C3 0.5

Originally from: https://c3.handmade.network/blog/p/8824-say_hello_to_c3_0.5

C3 is a programming language that builds on the syntax and semantics of the C language, with the goal of evolving it while still retaining familiarity for C programmers. It's an evolution, not a revolution: the C-like language for programmers who like C.

It is finally time to release C3 0.5. This version is the first version of the C3 compiler (and by extension, the C3 language) which is feature-stable.

Before 0.5, the language could change between patch releases of the same minor version, so the 0.4.1 version of the compiler might not compile code written for 0.4.20, and vice versa.

From 0.5 and forward this changes: each future version will have its own branch where bug fixes will happen, but otherwise the features are frozen. New features will be reserved for the dev and master branches. Consequently, as we announce 0.5, work will actually move on to 0.6 which is where the active development will happen.

This allows people to pick a version to confidently work with, knowing that there will be no changes to language semantics or the standard library.

Feature complete

With 0.5, the C3 language itself can also be considered feature complete, and for 0.6, 0.7, 0.8 and 0.9 the focus will be on the standard library. A good standard library should address real-life use cases, solving the issues users commonly encounter.

In order to properly know what those use cases are, a diverse set of projects must be written in C3. And for people to build non-trivial projects in C3 without problems, there must be some stability guarantees for the compiler itself. This is what 0.5 provides, and why we now move on to refining the standard library.

Explore C3

Interested in trying out C3 0.5? Learn more on the language's official site: https://c3-lang.org. Obtain the compiler from GitHub at https://github.com/c3lang/c3c and join the community shaping the future of the C3 programming language.

Comments


Comment by Christoffer Lernö

The change list for 0.5:

Changes / improvements

  • Trackable allocator with leak allocation backtraces.
  • $defined can take a list of expressions.
  • $and: compile time "and" which does not check expressions after the first one that fails to check.
  • $is_const returns true if an expression is compile time const.
  • $assignable returns true if an expression may be implicitly cast to a type.
  • $checks and @checked removed, replaced by an improved $defined
  • Asm string blocks use AT&T syntax for better reliability.
  • Distinct methods changed to separate syntax.
  • 'exec' directive to run scripts at compile time.
  • Project key descriptions in --list command.
  • Added init-lib to simplify library creation.
  • Local const work like namespaced global const.
  • Added $$atomic_fetch_* builtins.
  • vectors may now contain pointers.
  • void! does not convert to anyfault.
  • $$masked_load / $$masked_store / $$gather / $$scatter for vector masked load/store.
  • $$select builtin for vector masked select.
  • Added builtin benchmarks via the benchmark and compile-benchmark commands and the @benchmark attribute.
  • Subtype matching in type switches.
  • Added parentof typeid property.
  • Slice assignment is expanded.
  • Enforced optional handling.
  • Better dead code analysis, and added dead code errors.
  • Exhaustive switches over enums have better analysis.
  • Globals may now be initialized with optional values.
  • New generic syntax.
  • Slice initialization.
  • $feature for feature flags.
  • Native stacktrace for Linux, MacOS and Windows.
  • Macro ref parameters are now of pointer type and ref parameters are not assignable.
  • Added nextcase default.
  • Added $embed to embed binary data.
  • Ad hoc generics are now allowed.
  • Allow inferred type on method first argument.
  • Fix to void expression blocks
  • Temporary objects may now invoke methods using ref parameters.
  • Delete object files after successful linking.
  • Compile time subscript of constant strings and bytes.
  • @if introduced, other top level conditional compilation removed.
  • Dynamically dispatched interfaces with optional methods.
  • $if now uses $if <expr>: syntax.
  • $assert now uses $assert <expr> : <optional message>
  • $error is syntax sugar for $assert false : "Some message"
  • $include and $echo no longer have mandatory () around the arguments.
  • $exec for including the output of files.
  • assert no longer allows "try unwrap"
  • Updated cpu arguments for x86
  • Removed support for ranged case statements that were floats or enums, or non-constant.
  • nextcase with a constant expression that does not match any case is an error.
  • Dropped support for LLVM 13-14.
  • Updated grammar and lexer definition.
  • Removal of $elif.
  • any / anyfault may now be aliased.
  • @stdcall etc removed in favor of @callconv
  • Empty fault definitions are now an error.
  • Better errors on incorrect bitstruct syntax.
  • Internally use a wildcard type rather than an optional wildcard.
  • Experimental scaled vector type removed.
  • Disallow parameterized attributes without parameters, e.g. define @Foo() = { @inline }.
  • Handle @optreturn contract, renamed @return!.
  • Restrict interface style functions.
  • Optional propagation and assignment '!' and '?' are flipped.
  • Add l suffix (alias for i64).
  • Allow getting the underlying type of anyfault.
  • De-duplicate string constants.
  • Change @extname => @extern.
  • define and typedef removed.
  • define is replaced by def.
  • LLVM "wrapper" library compilation is exception free.
  • private is replaced by attribute @private.
  • Addition of @local for file local visibility.
  • Addition of @public for overriding default visibility.
  • Default visibility can be overridden per module compile unit, e.g. module foo @private.
  • Optimized macro codegen for -O0.
  • Addition of unary +.
  • Remove possibility to elide length when using ':' for slices.
  • Remove the : and ; used in $if, $switch etc.
  • Faults have an ordinal.
  • Generic module contracts.
  • Type inference on enum comparisons, e.g. foo_enum == ABC.
  • Allow {} to initialize basic types.
  • String literals default to String.
  • More const modification detection.
  • C3L zip support.
  • Support printing object files.
  • Downloading of libraries using vendor "fetch".
  • Structural casts removed.
  • Added "native" option for vector capability.
  • $$shufflevector replaced with $$swizzle and $$swizzle2.
  • Builtin swizzle accessors.
  • Lambdas, e.g. a = int(x, y) => x + y.
  • $$FILEPATH builtin constant.
  • variant renamed any.
  • anyerr renamed anyfault.
  • Added $$wasm_memory_size and $$wasm_memory_grow builtins.
  • Add "link-args" for project.
  • Possible to suppress entry points using --no-entry.
  • Added memory-env option.
  • Use the .wasm extension on WASM binaries.
  • Update precedence clarification rules for ^|&.
  • Support for casting any expression to void.
  • Win 32-bit processor target removed.
  • Insert null-check for contracts declaring & params.
  • Support user defined attributes in generic modules.
  • --strip-unused directive for small binaries.
  • $$atomic_store and $$atomic_load added.
  • usz/isz replaces usize and isize.
  • @export attribute to determine what is visible in precompiled libraries.
  • Disallow obviously wrong code returning a pointer to a stack variable.
  • Add &^| operations for bitstructs.
  • @noinit replaces = void to opt-out of implicit zeroing.
  • Multiple declarations are now allowed in most places, e.g. int a, b;.
  • Allow simplified (boolean) bitstruct definitions.
  • Allow @test to be placed on module declarations.
  • Updated name mangling for non-exports.
  • defer catch and defer try statements added.
  • Better errors from $assert.
  • @deprecated attribute added.
  • Allow complex array length inference, eg int[*][2][*] a = ....
  • Cleanup of cast code.
  • Removal of generic keyword.
  • Remove implicit cast enum <-> int.
  • Allow enums to use a distinct type as the backing type.
  • Update addition and subtraction on enums.
  • @ensure checks only non-optional results.
  • assert may now take varargs for formatting.
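
Several of these changes show up together in ordinary code. Here is a minimal, hypothetical sketch: the def alias, lambda, and multiple-declaration forms follow the items listed above, while module and identifier names are invented for illustration and details may differ from the actual 0.5 syntax.

module demo;
import std::io;

// 'def' replaces 'define' for aliases; here aliasing a function type.
def BinOp = fn int(int, int);

fn void main()
{
    // Lambda syntax as listed above: a = int(x, y) => x + y
    BinOp add = int(x, y) => x + y;
    // Multiple declarations in one statement are now allowed: int a, b;
    int a, b;
    a = 20;
    b = 22;
    io::printfn("%d", add(a, b));
}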

Stdlib changes

  • Tracking allocator with location.
  • init_new/init_temp for allocating init methods.
  • DString.printf is now DString.appendf.
  • Tuple and Maybe types.
  • .as_str() replaced by .str_view()
  • Added math::log(x , base) and math::ln(x).
  • Hashmap keys implicitly copied if copy/free are defined.
  • Socket handling.
  • csv package.
  • Many random functions.
  • Updated posix/win32 stdlib namespacing
  • process stdlib
  • Stdlib updates to string.
  • Many additions to List: remove, array_view, add_all, compact etc
  • Added dstringwriter.
  • Improved printf formatting.
  • is_finite/is_nan/is_inf added.
  • OnStack allocator to easily allocate a stack buffer.
  • File enhancements: mkdir, rmdir, chdir.
  • Path type for file path handling.
  • Distinct String type.
  • VarString replaced by DString.
  • Removal of std::core::str.
  • JSON parser and general Object type.
  • Addition of EnumMap.
  • RC4 crypto.
  • Matrix identity macros.
  • compare_exchange added.
  • printfln and println renamed printfn and printn.
  • Support of roundeven.
  • Added easings.
  • Updated complex/matrix, added quaternion maths.
  • Improved support for freestanding.
  • Improved windows main support, with @winmain annotations.
  • SimpleHeapAllocator added.
  • Added win32 standard types.
  • Added saturated math.
  • Added @expect, @unlikely and @likely macros.
  • Temp allocator uses memory-env to determine starting size.
  • Temp allocator is now accessed using mem::temp(), heap allocator using mem::heap().
  • Float parsing added.
  • Additions to std::net, ipv4/ipv6 parsing.
  • Stream api.
  • Random api.
  • Sha1 hash function.
  • Extended enumset functionality.
  • Updated malloc/calloc/realloc/free removing old helper functions.
  • Added TrackingAllocator.
  • Add checks to prevent incorrect alignment on malloc.
  • Updated clamp.
  • Added Clock and DateTime.
  • Added posix socket functions.

Fixes

  • Structs returned from macros and then indexed into directly could previously be miscompiled.
  • Naked functions now correctly handle asm.
  • Indexing into arrays would not always widen the index safely.
  • Macros with implicit return didn't correctly deduct the return type.
  • Reevaluating a bitstruct (due to checked) would break.
  • Fix missing comparison between any.
  • Fix issue of designated initializers containing bitstructs.
  • Fix issue of designated initializers that had optional arguments.
  • Fixed ++ and -- for bitstructs.
  • Fix to bug where library source files were sometimes ignored.
  • Types of arrays and vectors are consistently checked to be valid.
  • Anonymous bitstructs check of duplicate member names fixed.
  • Assignment to anonymous bitstruct members in structs.
  • Fix casts on empty initializers.
  • Fix to DString reserve.
  • Fix where aliases did not do arithmetic promotion.
  • @local declarations in generic modules available by accident.
  • Fixes missing checks to body arguments.
  • Do not create debug declaration for value-only parameter.
  • Bug in alignment for atomics.
  • Fix to bug when comparing nested arrays.
  • Fix to bug when a macro is using rethrow.
  • Fixes bug initializing a const struct with a const struct value.
  • Fixes bug when void is passed to an "any"-vararg.
  • Fixed defer/return value ordering in certain cases.
  • Fixes to the x64 ABI.
  • Updates to how variadics are implemented.
  • Fixes to shift checks.
  • Fixes to string parsing.
  • Bug when rethrowing an optional from a macro which didn't return an optional.
  • Fixed issues with ranged cases.
  • Disallow trailing ',' in function parameter list.
  • Fixed errors on flexible array slices.
  • Fix of readdir issues on macOS.
  • Fix to slice assignment of distinct types.
  • Fix of issue casting subarrays to distinct types.
  • Fixes to split, rindex_of.
  • List no longer uses the temp allocator by default.
  • Remove test global when not in test mode.
  • Fix sum/product on floats.
  • Fix error on void! return of macros.
  • Removed too permissive casts on subarrays.
  • Using C files correctly places objects in the build folder.
  • Fix of overaligned deref.
  • Fix negating a float vector.
  • Fix where $typeof(x) { ... } would not be a valid compound literal.
  • Fix so that using var in if (var x = ...) works correctly.
  • Fix int[] -> void* casts.
  • Fix in utf8to16 conversions.
  • Updated builtin checking.
  • Reduce formatter register memory usage.
  • Fixes to the "any" type.
  • Fix bug in associated values.
  • More RISC-V tests and fixes to the ABI.
  • Fix issue with hex floats assumed being double despite f suffix.
  • Fix of the tan function.
  • Fixes to the aarch64 ABI when passing invalid vectors.
  • Fix creating typed compile time variables.
  • Fix bug in !floatval codegen.
  • Fix of visibility issues for generic methods.
  • Fixes to $include.
  • Fix of LLVM codegen for optionals in certain cases.
  • Fix of $vasplat when invoked repeatedly.
  • Fix to $$DATE.
  • Fix of attributes on nested bitstructs.
  • Fix comparing const values > 64 bits.
  • Defer now correctly invoked in expressions like return a > 0 ? Foo.ABC! : 1.
  • Fix conversion in if (int x = foo()).
  • Delay C ABI lowering until requested to prevent circular dependencies.
  • Fix issue with decls accidentally invalidated during $checked eval.
  • Fold optional when casting slice to pointer.
  • Fixed issue when using named arguments after varargs.
  • Fix bug initializing nested struct/unions.
  • Fix of bool -> vector cast.
  • Correctly widen C style varargs for distinct types and optionals.
  • Fix of too aggressive codegen in ternary codegen with array indexing.

Comment by Christoffer Lernö

It allows the language to be easily parsable. The classic problem in a C-like grammar is that it is ambiguous with respect to types vs variables. In C this is typically solved using the "lexer hack", where the parser feeds types back into the lexer. Other methods include outlawing certain types of expressions and using infinite lookahead, which is the method D uses, for example.

In C3, the distinct naming rules for types disambiguate the grammar, making it LL(1). Also see here: https://c3-lang.org/faq/#syntax-language-design

So to be clear, it's not about trying to enforce some arbitrary naming standard, but rather to simplify the grammar. Picking PascalCase for the types was pretty much the only possible choice. I might write a blog post about this some time.
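
To make the ambiguity concrete: in C, the statement Foo * x; can parse either as a declaration of x as a pointer to Foo, or as the expression Foo times x, and only a symbol table can tell the difference. In C3, the case rules decide it from the first token alone. This is an illustrative sketch, not compilable code:

// C: is this a declaration or a multiplication?
// The parser cannot know without consulting the symbol table.
Foo * x;

// C3: an identifier starting with an upper-case letter must be a type,
// a lower-case one must be a value, so each line is unambiguous:
Foo* x;    // always a declaration
foo * x;   // always a multiplication expression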

Too much power, too poor accuracy - the story of $checks in C3

Originally from: https://c3.handmade.network/blog/p/8810-too_much_power%252C_too_poor_accuracy_-_the_story_of_checks_in_c3

Recently C3 lost its $checks() function. It would take any sequence of declarations and expressions, and if it failed to semantically check anywhere, return false.

It was an extremely powerful and flexible way of testing pretty much anything at compile time. Some examples:

// Test if a value may be indexed:
$checks(a[0]);
// Test if something supports addition:
$checks(a + a);
// Test if you can assign something to the type of another variable
$checks(b = a);
// Test if you can call a function with the values of two variables
$checks(foo(a, b));
// Check if a type has a particular field
$checks(Foo x, x.my_field);
// Check if a type is ordered
$checks(Foo x, x < x);

In essence, $checks was a Swiss Army knife for compile-time validation, making it redundant to employ multiple compile-time functions like $defined(x). So, why did we part ways with $checks (and its contract counterpart @checked)?

Well, it turns out that with power also comes a lack of clarity. Take the $checks(foo(a, b)) call, for example; it could potentially fail for a multitude of reasons:

  1. foo might not be visible in the scope.
  2. foo might need to be called with the module name, e.g. my_module::foo.
  3. foo might not be a callable variable pointer or function.
  4. a might not be visible in the scope.
  5. b might not be visible in the scope.
  6. foo might take fewer or more than 2 arguments.
  7. There could be a type mismatch between a and the first parameter of foo.
  8. There could be a type mismatch between b and the second parameter of foo.

So while we might have wanted to test for only some of these, the call can fail for any of the listed reasons, and there is no way to determine which one, unless we move the expression out of $checks and test it separately so that it errors in the same way.

While this is a problem when writing the $checks call, it also poses a problem when refactoring: it is hard to tell when you accidentally change something that breaks inside a $checks, causing it to reject legitimate parameters.

So $checks unfortunately combines power with inexactness. In fact, its power comes from being inexact and just bundling all the implicit checks together.

The alternative solution

C3 already had $defined(...), which would do a lightweight check of whether a variable or a field was defined. Its functionality had almost completely been eclipsed by $checks(...), but it now got a new life: $defined would semantically check all but the outermost part of a nested expression. The final expression would then be conditionally checked.

The new behaviour was reminiscent of $checks, but would only have a single "tested" semantic check. For example, $defined(foo(a, b)) would return true if it checked correctly, and false only if "foo" wasn't callable or didn't accept 2 arguments.
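
As a sketch of how the reworked $defined might be used inside a macro (the macro name and message are invented; the $if and $error forms follow the 0.5 change list, and the exact syntax may differ):

macro sum_or_fail(a, b)
{
    // Only the outermost operation, the '+', is conditionally checked;
    // 'a' and 'b' themselves must still check correctly.
    $if $defined(a + b):
        return a + b;
    $else
        $error "arguments do not support +";
    $endif
}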

The downside is that $defined must be carefully crafted to correctly do each "test" it supports.

But all in all, this is a substantial upgrade to correct compile time checking, which is very important in C3.


Addition: without $checks the various examples instead become:

// Test if a value may be indexed:
$defined(a[0]);
// Test if something supports addition:
types::is_numerical($typeof(a))
// Test if you can assign something to the type of another variable
$assignable(a, $typeof(b));
// Test if you can call a function with the values of two variables
$defined(foo(a, b));
// Check if a type has a particular field
$defined(Foo{}.my_field);
// Check if a type is ordered
Foo.is_ordered

Some guidelines to new syntax design

Originally from: https://c3.handmade.network/blog/p/8778-some_guidelines_to_new_syntax_design

Syntax discussions tend to be highly contextual. The syntax of a language is not a standalone, separate entity, but rather interacts with what type of algorithmic solutions you envision users to employ. On top of that, one must be aware that syntax shapes the solutions users will prefer, in sometimes unpredictable ways.

This makes completely new syntax very hard to analyze. And also hard to write any guidelines for.

That said, I think there are some things we can say about syntax design, to form some very simple (and obvious) guidelines:

  1. In general, an easy-to-parse syntax tends to be easier for a user to read quickly than a complex-to-parse syntax.
  2. Newly invented syntax will initially be harder for people to grok than established syntax. So it works against you if you want experienced programmers to understand it "at a glance".
  3. Newly invented syntax makes the language feel more "different" (unique, inventive, etc.) than established syntax. So it is good if you want to make the language stand out as being different at a glance.
  4. It's harder to know the downsides of newly invented syntax. So much more research is needed, and it's important to be ready to change it down the line if it doesn't work out.
  5. One's personal opinion of what constitutes "nice-looking syntax" is very unlikely to be the objectively most accurate one, so be aware that the "beautiful" syntax might be hideous to someone else.

Happy hacking!

Compile-time and short-circuit evaluation

Originally from: https://c3.handmade.network/blog/p/8773-compile-time_and_short-circuit_evaluation

Recently a user had a problem with the following code in C3:

$if $foo != "" && $foo[0] != '_':
    ...
$endif

As a reminder, compile time evaluation is distinguished using a $ sigil, so in this case the idea was to check whether the compile time variable $foo was an empty string, and if it wasn't, compare the first character with '_'.

If $foo is indeed an empty string, this code will fail at compile time.

This is because constant folding in C3 follows semantic evaluation: a binary expression will first type check both sub-expressions before the && is evaluated. That is, at compile time there is no short-circuit evaluation.
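
Since there is no compile-time short-circuiting, the original check has to be split so that the indexing is only semantically checked once the string is known to be non-empty, for example by nesting:

$if $foo != "":
    $if $foo[0] != '_':
        ...
    $endif
$endif

(The 0.5 change list also adds a $and builtin that stops checking after the first failing expression, which could express the same thing in a single condition; its exact call syntax is not shown here.)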

The curious effect of short-circuit evaluation

We could say that for && we only evaluate the left-hand side, and if that one is false, we don't evaluate the rest. This is perfectly legitimate behaviour, BUT it would mean that this would pass semantic checking as well:

if (false && okeoefkepofke[3.141592])
{
    ...
}

Why? Because constant folding would need to work the same way: we evaluate the first part to false, so now we never check the expression okeoefkepofke[3.141592].

So now we have this big piece of code that is wrong and never checked...

But obviously no one would write that, right? Except that something like this is quite reasonable code:

macro foo($foo)
{
  if ($foo && abc()) { ... }
}

This problem is not unique: anyone using a dynamically typed scripting language will be familiar with this exact problem. And the solution – if you care about the code actually working – is to write more tests.

Trying to eat the cake and keep it

One possibility to consider is to have short-circuit behaviour only in compile-time constant environments:

// Const global? Don't evaluate the right hand side.
const bool FOO = false && foewkfoewkf[fefeji]; 

fn void test()
{
    // Compile time conditional? Don't evaluate the right hand side
    $if false && foofoekfe[kfiejfie]:
        ...
    $endif
    // And same with switch:
    $switch
        $case false && fokeokfe[ofkeofk]:
            ...
    $endswitch
    // But this would be an error:
    bool b = false && fokefoek[ofofke]; // Error!
}

But if "never short-circuiting" is annoying and unexpected, and "always short-circuiting" requires much more testing, this "a little of both" creates a corner in the language that can be just as problematic as the former two. Having expression evaluation behave differently depending on where it's evaluated is likely to confuse even experienced users.

As usual, language design is a trade-off

For C3, semantic checking is prioritized over compile time convenience. I think everyone who's been working with macros in C3 knows the lazy evaluation of macros can easily hide bugs already, and having short-circuiting constant evaluation would just magnify this problem.

There are languages that instead consistently use short-circuiting constant evaluation at compile time. This allows leveraging the feature for all conditional compilation. Where C3 uses $if or $switch with very clear "this is evaluated at compile time" blocks to make compile-time bugs easier to find, other languages may prefer to streamline the look of the code, letting compile-time and runtime evaluation blur together while consistently following the same rules. While this comes at the aforementioned added cost of testing, it might be a trade-off their users prefer.

Inspirations for C3's features

Originally from: https://c3.handmade.network/blog/p/8723-inspirations_for_c3%2527s_features

When designing a new programming language, research is incredibly important. While research can mean investigating new syntax and new semantics, most of it is actually looking at other languages' features and seeing if anything worked particularly well and whether it could be useful for your own language.

C3 is derived from C2, which in turn is an evolution of C, so the basis of the language itself is clear. But what about the features on top of C – where do they come from? I thought it might be amusing to list the features and where they originated.

Features and where they come from

Modules – Java was probably the primary inspiration for a lot of it, since it has a very simple and well understood system with packages. However, Java's imports are actually only about visibility, not about really importing anything, so there are very clear differences. I've written more in detail here.

Generic modules - This was inspired by macro based container libraries in C, as well as ASTEC's "@module" macro.

Faults/optionals - Originally this was similar to Zig's design, but took inspiration from Herbceptions, Haskell/Rust results, and C and Go error handling, blending them into something original.

Macros - This was based largely on ASTEC but added things like iteration.

Struct subtyping - This is a Plan9 feature that also ended up in Go. I got it from reading about the Plan9 C compiler.

Slices - This exists in many languages, it's hard to say what languages I based it on.

Slicing syntax - The ^1 syntax comes from C#, otherwise it's mostly D with some looks at Swift and Odin.

Contracts - While a lot of languages try to add a bit of contract support, Eiffel is the language I looked at. Placing the contracts in the docs was a change for C3.

Def - I started by looking at D. The inclusion of "distinct" types comes from Odin. The restriction that function types may only be accessed through def is from C2. The idea that generic modules are instantiated using def occurs in earlier languages; I remember looking at Ada in particular.

Reflection - I'd say Jai got the ball rolling here, with some additional inspiration from Odin. It was clear from the start that reflection like Java or Objective-C was out of the question, and that Jai's runtime information was more than I wanted. I read about reflection in other languages as well, with D having quite a bit of influence on the syntax.

Operator overloading - I certainly looked at overloading in C++, D and other languages, but in the end the result was a bit in between everything.

Dynamic calls - This is from Objective-C.

Undefined behaviour - The C3 attitude to UB is strongly influenced by Odin, but doesn't go quite that far.

Implicit conversions - Originally this borrowed from Zig, but after a lot of research, it ended as a unique blend of C and Java ideas, without the need for untyped literals.

Precedence rules - Just trying to avoid retaining the poor precedence rules of C.

Project files - Derived from C2, but modified.

Any and typeid types - mainly inspired by Odin.

Enum associated values - derived from Java enums.

Bitstructs - inspired by PacketC.

Extended switch - pattern matching in many languages.

Flowtyping to unwrap - Java / Kotlin in JetBrains' IDEs.

Foreach - ObjC and Java originally. The idea (and syntax!) to allow getting values by ref comes from PHP.

Base64 and hex literals - Inspired by language "wish lists" on the web :D.

Zero init by default - Ultimately Odin convinced me this was a good idea.

Array/slice arithmetics - A subset of Odin and D functionality.

Type methods - An extension of C2 struct functions.

Attributes - Based on C2 attributes.

Defer - Based on Swift and Jai defer. The extensions defer catch and defer try were added on top. While Zig has an errdefer which works like defer catch, the C3 feature was developed without knowledge of that Zig addition(!)

Special syntax for compile time - Mostly driven by a need to make compile time clearer than compile time code in Zig.

Visibility rules - I did lots of research on this, so it's hard to say where it comes from. Certainly some I made up for C3. "Public by default" comes from Odin. Some ideas for export and visibility came from D.

Raw strings - I experimented with a lot of different styles, ultimately I picked Go style from comparing with Odin. Escaping a single backtick by having two in a row is also from some language, but unfortunately I don't recall which one.

Ranges in initializers - This is a GCC extension.

Expression block - This is a variant of the GCC statement expression that I changed to be a self-contained block where return only jumps out of the block. So it's an evolution of the GCC feature.

Ranges in case statements - Yes, this is a GCC extension as well.

Named arguments - Probably borrowed from Swift originally.

Trailing macro body - This is a unique functionality, but it is somewhat similar to trailing body lambdas in Ruby and later Swift.

Lambdas - These are syntactically very similar to Java's lambdas. But of course C3 does not capture closures.

Static initializers and finalizers - Syntactically somewhat derived from Java static blocks.

Function syntax - This is from C2, but in shortened form (C2 uses func).

Allocators - Influences from Jai, Odin and Zig, but ultimately C3 picks its own trade off.

Temp allocators - Mostly based off Odin originally.

Inline asm - Mostly based on MSVC inline asm.

Final words

On top of the above, C3 is of course indebted to all the people I've engaged in language discussions with over the years. I should mention Jon Goodwin (Cone) and Andrey Penechko (Vox) in particular, but I want to thank everyone who helped with thoughts and feedback (and complaints!) over the years.

Thank you!


If you are curious about C3 you can try it at https://learn-c3.org or download the compiler from https://github.com/c3lang/c3c

P.S. A bonus tidbit: the use of printn and printfn instead of println and printfln comes from F#


Language design bullshitters

Originally from: https://c3.handmade.network/blog/p/8721-language_design_bullshitters

Inevitably people will ask "what language should I choose for my compiler?".

The answer is really: "you can use any language, so all things being equal, pick one you're good at."

Of course there are caveats:

  1. You want it to go really fast? Then C is better than Python.
  2. Are you making a DSL? Then you probably want to do it in the host language.
  3. Do you want to experiment with some parsing techniques? Then some languages might be a better fit than others

... and so on.

So when someone says something like "C is a bad choice for writing a compiler" as a general statement, you know they are just making it up as they go along.

The C3 compiler is written in C, and there is frankly no other language I could have picked that would have been a substantially better choice. Sure, writing it in C2 or Odin would certainly have avoided some of C's warts, but the difference would not have been significant. And doing an OO-style C++, or worse, Java, would just have pushed the compiler toward being slower and more bloated, with no additional benefit other than that there are more Java programmers than C programmers.

"Say you are bad at programming without saying you're bad at programming"

So what are the arguments against C?

"C memory management is hard".

My god, if you think your compiler has to have a lot of calls to free and that is the hard part about writing a compiler, then you have ABSOLUTELY ZERO business handing out advice on compilers - or programming.

(Memory allocation can be handled in different ways in a compiler, the simplest being arena allocators.)

"C doesn't have feature X, so it will be a nightmare writing a compiler for it"

This is the prime argument for people arguing for writing the compiler in Ocaml or some other functional language. "C doesn't have extensive support for pattern matching, how can I use [my preferred technique] without that??? IT'S IMPOSSIBLE AND NO ONE SHOULD TRY IT"

If you point out that there are plenty of compilers written in C, the argument becomes "yes, but they are old and it's not modern to use C".

A carnival of made up arguments

There is no lack of people who want to give advice on language design. Even language designers who actually know what they are talking about struggle to give good advice that is applicable to your particular design if you ask about it. It's just hard.

With that in mind, guess what the quality of advice is from people who just have some theoretical knowledge of compiler and language design? Yes, it is as bad as you might guess.

Also, somewhat unfortunately, the group with the least experience tends to have the most time to argue for things. Of course their arguments are made up of what they happen to think is true and what they read on blogs they liked.

To sum it up

So you want to write a compiler? Get some advice for language design? Well, do ask, but keep in mind that most of what you read is just trash advice made up by people who don't actually know what they are talking about. Especially on forums where there aren't many people who actually write compilers. You'll get bad advice in places dedicated to programming language design as well, but your odds of picking up some good advice are better.

And the proof is in the pudding: if you actually look at which compilers exist and what languages they're written in, you at least know what's been proven to be production ready. And do look at compiler performance too, because that will matter if you're serious about the project.

And to me, if I find out that someone is making things up, then clearly other things they say aren't trustworthy either. Language design seems to be one of those things people like to have opinions on because they know the risk of being called out for lying is low.

Comments




Comment by Christoffer Lernö

Absolutely, I don't know if I needed to add that. General purpose languages should all be easy to make languages in.

And while asm isn't exactly the nicest abstraction to write a compiler in, that used to be what a lot of (AST-less) compilers were written in back in the day.

Updating keywords for 0.5

Originally from: https://c3.handmade.network/blog/p/8685-updating_keywords_for_0.5

I’ve been working on shaving off the rough corners in the C3 syntax for version 0.5, and one of the changes I'm likely to make is replacing variant and anyerr with any and anyfault.

"variant" was originally chosen because it wasn't intended for frequent use – unlike most any types in other languages. In addition I liked the idea that "any" could be used as a variable name.

As for anyerr, it was chosen while I still called the failure result error. anyerr was shorter than Zig’s anyerror, and I've in general been happy with the name. The abbreviation doesn't affect readability or clarity.

As the optional/result semantics matured however, it became increasingly clear that error (or a shorter "err") made a bad keyword. With its novel semantics it doesn't quite represent an error, and it was important to highlight this. That was why the keyword was changed to fault instead of error.

I wasn't sure about fault, so I tried variants of it - including reusing enums (enum MyResult : anyerr { ... }), but everything I tried was in practice less clear than fault.

So to avoid too many different terms anyfault is likely going to replace anyerr. While I would have liked to shorten it, I've found no good way to abbreviate “fault” (unlike “error” -> "err"). Fortunately, anyerr/anyfault is not used frequently. Currently in the standard library it is just used in two locations. This is in contrast with Zig, where anyerror is a common return type.

The experiment using variant rather than any largely failed: I never really needed any as a variable name, and where the type was used variant felt less clear than any would have been.

This also gives the language a consistent pair:

any
anyfault

While consistency in name isn't a requirement, it's always nice to have when you can.

Most importantly, the lesson here is that it is fine to pick some keywords and try them out, and it's fine to change them. Neither anyfault nor any were choices I could know were "right" from the beginning. Rather, they are choices that only experience could reveal.

Don't expect your first syntax and keyword choices to be the best ones, but you also need to decide on something to get started. No matter how much bikeshedding you do, you can't really predict the feel of a choice until you try it for real.


Some language design lessons learned

Originally from: https://c3.handmade.network/blog/p/8682-some_language_design_lessons_learned

Language design lessons learned

As you work on a programming language, you'll come to realize things about language design that aren't easy to come by any other way than actually working on a language.

Here are some lessons I learned that were applicable to C3.

1. Make the language easy to parse for the compiler and it will be easy to read for the programmer

If you stop and think about it, this isn't strange: when we read we do so in a way similar to that of a parser, scanning ahead visually. So if the parser needs little lookahead, so does a human reader.

Lots of people approaching language design are obsessed with finding the parser algorithm that can accept the most types of grammars.

This is completely counterproductive: better to restrict your grammar to LL(1) to make it easy to read.

2. Lexing, parsing and codegen are all well covered by textbooks. But how to model types and do semantic analysis can only be learned by studying compilers.

This has a lot to do with the fact that semantic analysis and types are intrinsically linked to the language semantics, so it's not possible to establish general rules that apply to all languages.

This means that the best resources when starting out are actual compilers for similar languages. The design space here is huge, so lots of different designs are possible even for the same language, but having some references when starting out is invaluable.

For C3 I looked at Clang, TCC, C2 and Cone.

3. Inventing a completely new language construct should only be done if it is absolutely necessary.

This might seem controversial: why would one build a language that isn't inventing something?

But it turns out there is a lot of value in remixes: C++ is C + Simula, C is B + types, Kotlin is an evolved Java etc.

Value is in the combination of features, not in some perceived "new" functionality. Also, for established features it's often possible to make improvements because the problems are known, but with a new feature you will have to figure them out as you go along. The first language with a particular feature is rarely the language which implements it the best.

4. Don’t take advice from other language designers

What is good for one language might be a horrible idea in another. It is hard to describe a language's goals and ideas, so even if they take the time, they will not understand the nuances of your design.

I have seen so much bad advice over the years.

There is also a lot of unsolicited advice. People will tell you:

  1. What features must be included in a "modern" language.
  2. What type of parser you must use (everything else is old and bad)
  3. What programming language to write the compiler in (e.g. "it's impossible to write a compiler in C")
  4. What paradigm all new languages must be (OO, functional etc)

5. “Better syntax” is subjective and never a selling point.

What you see over and over again is people spending a lot of time building languages that are reskins of existing languages: e.g. "like Java but with better syntax".

What they have in common is that some superficial changes to syntax are argued to be a huge selling point for the language. Often these changes are things that most people would disagree with, but for the individual designer they are elegant or "simple".

6. Macros are easy to make powerful but hard to make readable.

The difficulty designing macros is not to make them flexible enough but rather to make them limited in the right way, so that they are readable while still being useful.

There is a difficult trade-off to be made, as greater flexibility makes it harder to know what the macro can do, which reduces readability. Different languages will naturally make different trade-offs, but "macros can do anything!" is rarely a good idea.

7. There will always be people who hate your language no matter what.

It's the wrong paradigm, it has the declarations in the wrong order, it doesn't have a GC or it has a GC, it has RAII or it doesn't have RAII. Anything may be a reason for others to dismiss any language.

Keep in mind that there are people who hate each of these languages as well: C, C++, Go, Rust, Pascal, Haskell, OCaml, Swift, Objective-C, Ruby, Python, Java, C#, JavaScript, Typescript, PHP, Kotlin, Scala, and any other popular language.

8. It is much easier to iterate semantics before they're implemented

Doing a writeup of some semantics allows you to iterate quickly on the design. Changing semantics often means lots of changes to a compiler, so it's painful to change them once they're already in the language. Writing code for your imagined semantics is a powerful tool to experiment with lots of variations.

9. It is much easier to evaluate syntax using it for a real task

In contrast to (8), no amount of bike-shedding of syntax can replace actually trying out syntax for some real examples. Often the conclusions are surprising, with the a priori "best" syntax having problems in real life scenarios.

Summary

These are some lessons I've learned while working on C3. Are they applicable in general or not? Maybe, maybe not. After all, (4) says not to take advice from other language designers, so if you're a language designer do keep in mind they might not apply. 😜


Four ways to go when you need a variably sized list in C3

Originally from: https://c3.handmade.network/blog/p/8654-four_ways_to_ways_when_you_need_a_variably_sized_list_in_c3

In this blog post we'll review four standard ways to handle the case when you need a list with a size which is only known at runtime.

Use a generic List allocated on the heap

import std::io;
import std::collections::list;

// We create a generic List that holds doubles:
define DoubleList = List(<double>);

fn double test_list_on_heap(int len)
{
  DoubleList list;   // By default will allocate on the heap
  defer list.free(); // Free memory at exit with a defer statement.
  for (int i = 0; i < len; i++)
  {
    // Append each element
    list.push(i + 1.0); 
  }
  double sum = 0;
  foreach (d : list) sum += d;
  return sum;
}

We can use list.init(len) if we have some default length in mind, otherwise it's not necessary.

Use a generic List allocated with the temp allocator

Here we instead use the temp allocator to allocate and manage memory. The @pool() { ... } construct will release all temporary allocations inside of the body block.

fn double test_list_on_temp_allocator(int len)
{
  @pool()
  {
    DoubleList list;
    list.temp_init();   // Init using the temp allocator
    for (int i = 0; i < len; i++)
    {
      list.push(i + 1.0);
    }
    double sum = 0;
    foreach (d : list) sum += d;
    // No need to free explicitly!
    return sum;
  };
}

Allocate an array on the heap

This is the conventional way to do it in C if the length is unknown. Note how we can use defer to write the allocation and the free together, to avoid forgetting to free if there are multiple exits.

fn double test_array_on_heap(int len)
{
  double[] arr = mem::new_array(double, len);
  defer free(arr); // Free at function exit.
  for (int i = 0; i < len; i++)
  {
    arr[i] = i + 1.0;
  }
  double sum = 0;
  foreach (d : arr) sum += d;
  return sum;
}

Allocate an array on the temp allocator

Using the temp allocator is about as close as we get to allocations being free when we need arbitrarily long lists:

fn double test_array_on_temp_allocator(int len)
{
  @pool()
  {
    // The array will be released when exiting `pool()`
    double[] arr = mem::temp_array(double, len); 
    for (int i = 0; i < len; i++)
    {
      arr[i] = i + 1.0;
    }
    double sum = 0;
    foreach (d : arr) sum += d;
    return sum;
  };
}

Summary

We looked at four standard ways to use arbitrarily long lists in C3. Two of them used a growable list, which is important if you might not know the exact length in advance. The other two use simple arrays.

This also contrasted using the temp allocator with the heap allocator. In a later blog post I'll discuss the allocators in more detail.

A gist with the full code can be found here.


A look at modules (in general + in the context of C3)

Originally from: https://c3.handmade.network/blog/p/8650-a_look_at_modules_in_general__in_the_context_of_c3

Despite being a general concept, modules are often very different from language to language. One major reason for this is that overall language semantics puts many constraints on how modules may work. However, despite these constraints there is a lot of specific design work required.

I'm going to look at modules in general and also talk a little about how C3 modules work.

An initial observation

When making a module system, one first has to decide whether a module is a separate concept or not, because if the language has static variables and functions attached to a type, there is actually already a sort of module system present.

Here is a short snippet written in the C2 language to illustrate this:

// File bar.c2
module bar;
// Plain function
func int get_one() {
    return 1;
}  

// File foo.c2
module foo;
import bar;
type Bar struct {
  int x;
}  
// Static function
func int Bar.get_one() {
    return 1;
}

func void test() {
    int a = Bar.get_one();
    int b = bar.get_one();
}

The type here acts as a namespace in itself. If we extend the type with static variables, we can similarly emulate namespaced global variables.

Most languages with methods on their types gladly accept this ambiguity, but one can draw the conclusion that modules are not needed and only structs are necessary. This is the approach taken by Zig. The downside is that it also leads to counter-intuitive things such as "a file is a struct" and having to explicitly arrange sub-modules in a hierarchy.

The other way to resolve the ambiguity is to have type methods, but abolish static methods and globals. This is the approach of C3. The downside is that some things that are naturally static, such as a method Foo.new_instance() or a constant Foo.MAX_VALUE, can't be expressed.

We can also note that Java, while having "packages", uses classes as the primary namespacing mechanism for free functions and constants, which is a bit more relaxed than Zig's approach, since the hierarchy is external.

Sub-modules and paths

Flat vs hierarchical

The module namespace can be flat, with a single module name, or hierarchical, where modules have sub-modules. While flat modules are nice to work with and easy to implement, there is much more contention for unique names. This can mean that module names need to be longer to ensure uniqueness, e.g. mylib_io for the flat case versus mylib::io for the hierarchical one. But hierarchical modules in general have an even worse problem with length: e.g. std.debug.print("Hello, world!\n", .{}); (with apologies to Zig).

Aliasing and import

The obvious solutions to long names are aliasing and namespace imports. Here is again a C2 example:

import networking as net; // Aliasing
import filesystem local; // Namespace import


// Equivalent:
doSomething(); // Namespace import
filesystem.doSomething();

// Equivalent:
net.connect(); // Aliased
networking.connect();

The downside of aliasing is that aliases may differ between authors and implementations. So while someone might alias networking to net, someone else uses nw. This together with the difficulty of naming aliases makes it a less attractive solution. Full namespace import avoids naming issues, but makes it much less clear what are local functions and what is implemented elsewhere.

C3 path shortening

C3 has a hierarchical module system but employs path shortening: the first part of a module path may be elided, so std::net::sockets::new_from_url(url) can be used as sockets::new_from_url(url) as long as it is not ambiguous.

Requiring at least the sub-module name in the path is a design decision to avoid the readability problems mentioned with namespace imports. In the example "new_from_url(url)" on its own lacks the context that the "sockets::" prefix gives.

Surveying other languages, it's clear that types usually contain sufficient context in their names. For this reason types are exempt from the prefix requirement in C3.

Note how something similar happens in Java in practice: java.math.BigInteger is the import, you then use BigInteger, but you call static "functions" namespaced: BigInteger prime = BigInteger.probablePrime(128, rnd);

In the Java case this comes from import java.math.BigInteger being an actual namespace import, but the classes themselves then provide a second layer of namespacing.
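The two layers are easy to see in runnable Java (the class name below is made up for the example):

```java
import java.math.BigInteger;
import java.util.Random;

public class TwoLayers {
    public static void main(String[] args) {
        // Layer 1: the import folds the package namespace, so we can
        // write BigInteger instead of java.math.BigInteger from here on.
        Random rnd = new Random();
        // Layer 2: the class acts as a namespace for its static methods;
        // probablePrime is reached through the class name.
        BigInteger prime = BigInteger.probablePrime(16, rnd);
        // probablePrime returns a prime with exactly the requested bit length.
        System.out.println(prime.bitLength());
    }
}
```

The import removes the package prefix, but the class-level prefix always remains, which is the "second layer" of namespacing.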

Visibility

The other major component of modules is visibility between modules. Note that nothing says explicit imports are necessary: with full paths, the correct types, functions and variables can be found anyway.

With "import" statements the most common scheme is this:

  • Modules not imported: no visibility.
  • Module imported: public declarations are visible.

Hierarchical visibility

As a complement to the above, in hierarchical module systems a module may see non-public declarations in sub-modules and/or parent modules.

The desire for this feature arises from wanting to separate the visible "API layer" module from the internal "implementation layer" modules, which contain implementation details that may change over time.

The downside of this method of letting modules peek into other modules is the need to build this into the hierarchy.

"Friend" visibility

An alternative to hierarchical visibility is to declare "friend" modules that may access the module. This has fewer constraints than trying to fit modules neatly into the right sort of hierarchy just to get the correct visibility between them.

There is still the drawback that in order to "friend" another module, the module needs to know of that other module.

Becoming a "friend"

Often the concept of visibility is conflated with some idea of "internal safety": "I make this private to make it safe from other modules". This is stretching the metaphor too far. Visibility and access modifiers are there to help the user of the types use and override functionality in the correct way. "Public" communicates that a function is made for general consumption; "private" means internal consumption, and that it is not part of the surface API of the functionality.

However, if one knows what one is doing then circumventing these protections can be useful. For example:

  • There may be a bug that can be circumvented by calling private methods.
  • One may want to exploit the particular functionality of a specific version of a library.
  • One may want to modify behaviour for some other reason that the author did not foresee.

Languages often have convoluted ways of circumventing visibility in these cases, e.g. calling functions using reflection in Java, precisely because the need does arise.

The obvious way is then for a module to be able to declare itself a friend of another module. A C3 example:

module test;
fn void fn_private() @private {}

module foo;
import test @public; // Override visibility

fn void main()
{
    // This is not an error due to the "@public" import.
    test::fn_private();
}

We can note that C3 has public by default. It is possible to set a different default:

module test2 @private;

fn void fn_private() {}
fn void fn_public() @public {} // Explicitly needs @public!

Visibility levels

To talk about visibility at all we need at least two levels to differentiate between. Usually these are public and private, where public means visible outside of the module and private means visible only inside of it.

In fact, we could stop here, because in most cases this is all we need. For this reason it's possible to encode this not in a keyword but in the name itself: Go's "uppercase means public" and Dart's "leading underscore means private" (note: I considered the latter for C3).

Between "private" and "public"

If we want hierarchical visibility, then we need another level above private but below public, indicating that something is available to other modules (below or above) in the hierarchy.

Similarly, for the "friend" module visibility we need a visibility level for this behaviour. As an example Rust has pub(in path) and pub(crate) (although note that both of those are somewhat constrained).

Below "private"

If modules may span multiple source files, there is the possibility of another visibility level, where visibility is restricted to the file with the declaration. This is C's static, Swift's fileprivate and C3's @local (Note: while C3 could have used static for globals and functions, it's a poor name for type visibility. This is why @local was chosen instead).

This is not exhaustive: depending on language features, more visibility levels might be possible. For C3, with the import @public override, having "public", "private" and "local" seems to cover most use cases.

Imports

While imports are usually a good way to determine dependencies, this is not guaranteed. As an example: while most Java programmers may think of Java's import as importing classes, all it actually does is fold namespaces.

The point here is that while import may roughly correspond to the dependency graph, it's not guaranteed to do so exactly. This means that imports are usually simply a way to limit the pollution of the current namespace.

This is very valuable though; in fact it is a variant of the public/private division: importing picks the set of modules that can be accessed (= is public to the current module).

Narrow imports

In the Java world, wildcard imports (e.g. import java.util.*) are by tradition considered bad. Instead Java source files often contain a litany of single-class imports. This is such a problem that most IDEs offer both to hide the list of imports and to manage it for you.

In the Java case the tangible benefit claimed is that if you do something like this:

import java.util.*;
import java.sql.*;

You have a problem if you try to use Date, since it's now ambiguous.

Having written a lot of Java code that works with databases, I can confidently say that the problem here is not the imports but the reuse of Date in both Java packages. If the java.sql class had a reasonable name like SqlDate, this import would not have been a problem AND there would be no confusion when trying to use java.util.Date and java.sql.Date in the same code, which happens quite often.

So the fact that the above is touted as a reason just shows how weak the arguments are for narrow imports in Java.
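To make the clash concrete, here is a minimal Java file (hypothetical class name) showing both the ambiguity and the fully qualified escape hatch:

```java
// Both wildcard imports compile fine on their own; only the *use* of an
// ambiguous simple name is an error.
import java.util.*;
import java.sql.*;

public class DateClash {
    public static void main(String[] args) {
        // Date d = new Date(0); // would not compile: "reference to Date is ambiguous"
        // The only way out is fully qualified names:
        java.util.Date utilDate = new java.util.Date(0);
        java.sql.Date sqlDate = new java.sql.Date(0);
        System.out.println(utilDate.getClass().getName());
        System.out.println(sqlDate.getClass().getName());
    }
}
```

Note that even with narrow imports, code mixing both types ends up writing the fully qualified names anyway, which is why the naming, not the wildcard, is the real culprit.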

HOWEVER, if a language uses import to actually pull in dependencies, then narrow imports are likely better. It's important to note that this isn't necessarily the case: it's not true in Java, nor in C3.

No imports?

One might think that dumping all modules into the current namespace would be unworkable, but if we use the full path to types and functions, there are no ambiguities. Even C3's abbreviated paths work fine in general.

The downside is that things like code completion are now going to match EVERYTHING in all modules, which makes for a much worse experience. This also affects things like error messages. Imports help the compiler (and an IDE) make better guesses and in general just be more friendly.

A middle ground

In C3 imports are implicitly wildcard, so import std::io will also import sub-modules of std::io. It's also possible to have more than one import on a single line, e.g. import std::io, std::math;. To me this seems like a reasonable compromise.

More controversially, C3 modules will implicitly import parent and child modules. So std::io::socket could implicitly import std::io, std and the child module std::io::socket::channel. I am not sure of this feature and it might go away. That said, because there is no sibling module import (e.g. std::io does not implicitly import std::math), the namespace pollution is still fairly low.

Dependency resolution

If imports do not resolve the actual dependency graph, then all code must at least be parsed and analysed. For the C3 compiler this is not a problem, since lexing, parsing and semantic analysis are a fraction of the total compilation time. However, it's desirable to generate code only for the parts that are in use.

Exports

We have one more problem: just because a function is public doesn't mean it should be exported in a library.

We can illustrate this with a simple example: let's say we want to build a simple web scraper which creates a list of all the image URLs on a web page. To do so we use a module which handles http + https and write a thin layer on top with a single function that takes a string and returns a list of strings with the URLs. In other words, we only have a single function that we want to export.

But if we create a static library with this functionality and naively export the public functions, we will get not just our single function, but the public functions of the http module as well... plus the public functions of anything the http module uses!

While the linker might strip unused code when creating an executable, even in this case we will still generate code that is not used.

Explicit exports

The first necessary feature is to be able to mark functions and globals as exported. Note that being exported is orthogonal to public/private: public and private are about source-level visibility, while export is about library and linker visibility.

Because exported functions are usually public, some languages conflate public and export, making export simply a variant of "public". (In C3, @export makes a function or global exported; it has no effect on visibility between modules.)

Entry points => dependency graph

With export we're now able to build a real dependency graph. For a regular executable the main function can be considered the entry point; otherwise we use the functions marked @export to trace dependencies.
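The tracing itself is ordinary graph reachability from the entry points. Here is a sketch in Java over an invented toy call graph for the scraper example above; this illustrates the idea, not the C3 compiler's actual code:

```java
import java.util.*;

public class Reachability {
    // Invented toy call graph: function name -> functions it calls.
    static Map<String, List<String>> calls = Map.of(
        "scrape_urls", List.of("http_get", "parse_html"),
        "http_get",    List.of("tcp_connect"),
        "parse_html",  List.of(),
        "tcp_connect", List.of(),
        "http_post",   List.of("tcp_connect") // public, but unreachable from the entry point
    );

    // Breadth-first walk from the exported entry points; only the
    // functions we reach need code generation.
    static Set<String> reachable(List<String> entryPoints) {
        Set<String> seen = new LinkedHashSet<>(entryPoints);
        Deque<String> work = new ArrayDeque<>(entryPoints);
        while (!work.isEmpty()) {
            for (String callee : calls.getOrDefault(work.poll(), List.of())) {
                if (seen.add(callee)) work.add(callee);
            }
        }
        return seen;
    }

    public static void main(String[] args) {
        // "scrape_urls" plays the role of the single @export function.
        System.out.println(reachable(List.of("scrape_urls")));
    }
}
```

Anything outside the returned set, like http_post above, never needs code generation even though it is public.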

Summary

  • We have looked at how static methods and members overlap with module-namespaced functions and globals. This means namespacing can be done with modules, with static methods and members, or with a combination thereof. C3 uses modules only.
  • Modules may be flat or hierarchical. C3 uses a hierarchical module namespace.
  • Various methods may be used to reduce repetitive module prefixing. Aliasing and namespace inlining are common. C3 uses path shortening.
  • The simplest visibility semantics have only public and private.
  • Accessing "private" functions is useful, and there are various solutions.
  • One method is adding a special visibility level to let a parent or child module access private functions.
  • Another method is defining other modules as "friends" that may access private functions as if they were public.
  • C3 allows a module to import private functions of other modules.
  • C3 has three visibility levels: @public, @private and @local. "Local" means it is local to the current module section.
  • Imports can be narrow or wide. C3 prefers wildcard imports. Narrow imports are mostly useful when imports directly infer the dependency graph.
  • Exports need to be different from "all of the public functions".
  • C3 uses @export to mark declarations to export.

If you want to try out C3, you can test it here: https://learn-c3.org.

Comments


Comment by Christoffer Lernö

Despite being a general concept, modules are often very different from language to language. One major reason for this is that overall language semantics puts many constraints on how modules may work. However, despite these constraints there is a lot of specific design work required.

I'm going to look at the modules in general and also talk a little about how C3 modules work.

An initial observation

When making a module system one first have to decide whether a module is a separate concept or not. Because if the language has the idea of static variables and functions attached to a type there is actually already a sort of module system present.

Here is a short snippet written in the C2 language to illustrate this:

// File bar.c2
module bar;
// Plain function
func int get_one() {
    return 1;
}  

// File foo.c2
module foo;
import bar;
type Bar struct {
  int x;
}  
// Static function
func int Bar.get_one() {
    return 1;
}

func void test() {
    int a = Bar.get_one();
    int b = bar.get_one();
}

The type here acts as namespace in itself. If we extend the type with static variable we can similarly emulate namespaced global variables.

Most languages with methods on their types gladly accept this ambiguity, but one can draw the conclusion that modules are not needed and only structs are necessary. This is the approach taken by Zig. The downside is that it also leads to counter-intuitive things such as "a file is a struct" and having to explicitly arrange sub-modules in a hierarchy.

The other way to resolve the ambiguity is to have type methods, but abolish static methods and globals. This is the approach of C3. The downside is that some methods that are naturally static, such as Foo.new_instance() or constants Foo.MAX_VALUE can't be expressed.

We can also note that Java, while having "packages" use classes as the primary namespacing mechanism for free functions and constants, which is a bit more relaxed than Zig's approach, since the hierarchy is external.

Sub-modules and paths

Flat vs hierarchal

The module namespace can be flat with a single module name or hierarchal, where modules have sub-modules. While flat modules are nice to work with and easy to implement, there is much more contention for unique names. This can mean that module names may need to have longer names to require uniqueness, e.g. mylib_io for the flat module and mylib::io for the hierarchal. But hierarchal modules in general have an even worse problem with length: e.g. std.debug.print("Hello, world!\n", .{}); (with apologies to Zig).

Aliasing and import

The obvious solutions to long names are aliasing and namespace imports. Here is again a C2 example:

import networking as net; // Aliasing
import filesystem local; // Namespace import


// Equivalent:
doSomething(); // Namespace import
filesystem.doSomething();

// Equivalent:
net.connect(); // Aliased
networking.connect();

The downside of aliasing is that aliases may differ between authors and implementations. So while someone might alias networking to net, someone else uses nw. This together with the difficulty of naming aliases makes it a less attractive solution. Full namespace import avoids naming issues, but makes it much less clear what are local functions and what is implemented elsewhere.

C3 path shortening

C3 has a hierarchal module system but employs path shortening. This is basically that the first part of a module path may be elided: std::net::sockets::new_from_url(url) can be used as sockets::new_from_url(url) as long as it is not ambiguous.

Requiring at least the sub-module name in the path is a design decision to avoid the readability problems mentioned with namespace imports. In the example "new_from_url(url)" on its own lacks the context that the "sockets::" prefix gives.

Surveying other languages it's clear that usually contain sufficient context in their names. For this reason they are exempt from the prefix requirement in C3.

Note how something similar happens in Java in practice: java.math.BigInteger is the import, you then use BigInteger, but call static "functions" namespaced: BigInteger prime = BigInteger.probablePrime(128, rnd);

In the Java case this comes from import java.math.BigInteger being an actual namespace import, but then the classes themselves provide a second layer or namespacing.

Visibility

The other major component to modules is visibility between modules. Note that nothing is saying that explicit imports are necessary: with full paths the correct types, functions and variables may be found anyway.

With "import" statements the most common scheme is this:

  • Modules not imported: no visibility.
  • Module imported: public declarations are visible.

Hierarchal visibility

As a complement to the above in hierarchal module systems, a module may see non-public declarations in sub modules and/or parent modules.

The desire to have this feature arise from wanting to separate the visible "api layer" module and the internal "implementation layer" modules that which contains implementation details that may change over time.

The downside of this method for modules to peek into other modules is the need to build this into the hierarchy.

"Friend" visibility

As an alternative to the above hierarchal visibility above is to declare "friend" modules that may access the module. This has fewer constraints than trying to fit modules neatly into the right sort of hierarchy just to get the correct visibility between modules.

There is still the drawback that in order to "friend" another module, the module needs to know of that other module.

Becoming a "friend"

Often the concept of visibility is conflated with some idea of "internal safety": "I make this private to make it safe from other modules". This is trying to interpolate the metaphor too far. Visibility and access modifiers are there to help the user of the types to use / override functionality in the correct way. "Public" communicates that this function is made for general consumption, "private" means internal consumption and it not being part of the surface API of the functionality.

However, if one knows what one is doing then circumventing these protections can be useful. For example:

  • There may be a bug that can be circumvented by calling private methods.
  • One may want to exploit the particular functionality of a specific version of a library.
  • One may want to modify behaviour for some other reason that the author did not foresee.

Often languages have convoluted ways of circumventing visibility in these cases, e.g. calling functions using reflection in Java, just because the need does arise.
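As a sketch of the Java case mentioned above: reflection can invoke a private method from outside its class (the Library class and internalDetail method here are hypothetical names):

```java
import java.lang.reflect.Method;

class Library {
    // Internal detail, not part of the surface API.
    private static String internalDetail() { return "secret"; }
}

public class FriendViaReflection {
    public static void main(String[] args) throws Exception {
        // Look up the private method and explicitly disable access checks.
        Method m = Library.class.getDeclaredMethod("internalDetail");
        m.setAccessible(true);
        System.out.println(m.invoke(null)); // prints "secret"
    }
}
```

Convoluted indeed, compared to simply declaring the intent in an import.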

The obvious way is then for a module to be able to declare itself the friend of a module. A C3 example:

module test;
fn void fn_private() @private {}

module foo;
import test @private; // Override visibility

fn void main()
{
    // This is not an error due to the "@private" import.
    test::fn_private();
}

We can note that C3 has public by default. It is possible to set a different default:

module test2 @private;

fn void fn_private() {}
fn void fn_public() @public {} // Explicitly needs @public!

Visibility levels

To talk about visibility at all we need at least two levels to differentiate between. Usually these are public and private, where public means visible outside of the module and private being visible only inside of the module.

In fact, we could stop here, because in most cases this will be all we need. For this reason it's possible to not encode visibility in a keyword but in the name itself: Go's "uppercase means public" and Dart's "leading underscore means private" (note: I considered the latter for C3).

Between "private" and "public"

If we want hierarchical visibility, then we need another level above private but below public, indicating that something is available to other modules (below or above) in the hierarchy.

Similarly, for the "friend" module visibility we need a visibility level for this behaviour. As an example Rust has pub(in path) and pub(crate) (although note that both of those are somewhat constrained).

Below "private"

If modules may span multiple source files, there is the possibility of another visibility level, where visibility is restricted to the file with the declaration. This is C's static, Swift's fileprivate and C3's @local (Note: while C3 could have used static for globals and functions, it's a poor name for type visibility. This is why @local was chosen instead).

This is not exhaustive: depending on language features more visibility levels might be possible. For C3 with import @private, having "public", "private" and "local" seems to cover most use cases.

Imports

While imports are usually a good way to determine dependencies, this is not guaranteed. As an example: while most Java programmers may think of Java's import as importing classes, all it actually does is fold namespaces.

The point here is that while import may roughly correspond to the dependency graph, it's not guaranteed to do so exactly. This means that imports are usually simply a way to limit the pollution of the current namespace.

This is very valuable though; in fact it is a variant of the public / private division: importing is picking a set of modules that can be accessed (= is public to the current module).

Narrow imports

In the Java world, wildcard imports (e.g. import java.util.*) are by tradition considered bad. Instead, Java source files often contain a litany of single-class imports. This is such a problem that most IDEs offer to both hide the list of imports and manage it for you.

In the Java case the tangible benefit claimed is that if you do something like this:

import java.util.*;
import java.sql.*;

You have a problem if you try to use Date, since it's now ambiguous.

Having written a lot of Java code that works with databases, I can confidently say that the problem here is not the imports, but the reuse of the name Date in both Java packages. If the java.sql class had had a reasonable name like SqlDate, this import would not have been a problem AND there would be no confusion when trying to use a java.util.Date and a java.sql.Date in the same code, which happens quite often.

So the fact that the above is touted as a reason just shows how weak the arguments are for narrow imports in Java.
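For completeness, here is how the clash plays out: with both wildcard imports the simple name Date is ambiguous, but fully qualified names always resolve it:

```java
import java.util.*;
import java.sql.*;

public class DateClash {
    public static void main(String[] args) {
        // Writing just "Date" here would be a compile error (ambiguous),
        // but fully qualified names pick out each class unambiguously.
        java.util.Date utilDate = new java.util.Date(0);
        java.sql.Date sqlDate = new java.sql.Date(0);
        System.out.println(utilDate.getTime() == sqlDate.getTime()); // prints true
    }
}
```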

HOWEVER, if a language uses import to actually pull in dependencies, then narrow imports are likely better. It's important to note that this isn't necessarily the case: it's not true in Java, nor is it true in C3.

No imports?

One might think that dumping all modules into the current namespace would be unworkable, but if we already use the full path to types and functions, there are no ambiguities. Even C3's abbreviated paths work fine in general.

The downside is that now things like code completion are going to match EVERYTHING in all modules, which makes for a much worse experience. This also affects things like error messages. Imports help the compiler (and an IDE) make better guesses and in general just be more friendly.

A middle ground

In C3 imports are implicitly wildcard, so import std::io will also import sub modules to std::io. It's also possible to have more than one import in a single row, e.g. import std::io, std::math;. To me this seems like a reasonable compromise.

More controversially, C3 modules will implicitly import parent and child modules. So std::io::socket could implicitly import std::io, std and the child module std::io::socket::channel. I am not sure of this feature and it might go away. That said, because there is no sibling module import (e.g. std::io does not implicitly import std::math), the namespace pollution is still fairly low.

Dependency resolution

If imports do not resolve the actual dependency graph, then all code must be at least parsed and analysed. For the C3 compiler this is not a problem, since lexing, parsing and semantic analysis are a fraction of the total compilation time. However, it's desirable to emit code only for the parts that are actually in use.

Exports

We have one more problem: just because a function is public doesn't mean it should be exported in a library.

We can illustrate this with a simple example: let's say we want to build a simple web scraper which creates a list of all the image URLs on a web page. To do so we use a module which handles http + https and write a thin layer on top with a single function that takes a string and returns a list of strings with the URLs. In other words, we only have a single function that we want to export.

But if we create a static library with this functionality and naively export the public functions, we will get not just our single function, but the public functions of the http module as well... plus the public functions of anything the http module uses!

While the linker might strip unused code when creating an executable, even in this case we will still generate code that is not used.

Explicit exports

The first necessary feature is to be able to mark functions and globals as being exported. Note that being exported is orthogonal to public / private: public and private are about source-level visibility, while export is about library and linker visibility.

Because exported functions are usually public, some languages conflate public and export, making export simply a variant of "public". (In C3, @export makes a function or global exported; it has no effect on visibility between modules.)

Entry points => dependency graph

With export we're now able to build a real dependency graph. For a regular executable the main function can be considered the entry point; otherwise we use the functions marked @export to trace dependencies.
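The idea can be sketched as a simple reachability walk over the call graph. This is a hedged illustration, not the C3 compiler's implementation; the call graph and the function names (scrape, httpGet, httpPost) are invented to mirror the web scraper example above:

```java
import java.util.*;

public class Reachability {
    // Hypothetical call graph: function name -> functions it calls.
    static Map<String, List<String>> calls = Map.of(
        "main", List.of("scrape"),
        "scrape", List.of("httpGet"),
        "httpGet", List.of(),
        "httpPost", List.of()  // public in the http module, but never used
    );

    // Walk the call graph from the entry points (main and/or @export
    // functions) and collect everything reachable.
    static Set<String> reachable(Collection<String> entryPoints) {
        Set<String> seen = new HashSet<>();
        Deque<String> work = new ArrayDeque<>(entryPoints);
        while (!work.isEmpty()) {
            String fn = work.pop();
            if (seen.add(fn)) work.addAll(calls.getOrDefault(fn, List.of()));
        }
        return seen;
    }

    public static void main(String[] args) {
        // Only reachable functions need code generation:
        // httpPost is public, yet excluded.
        System.out.println(reachable(List.of("main")).contains("httpPost")); // prints false
    }
}
```

Everything outside the reachable set can be skipped during code generation, which is exactly what exporting "all public functions" fails to achieve.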

Summary

  • We have looked at how static methods and members overlap with module-namespaced functions and globals. This means namespacing can be done with modules, static methods and members, or a combination thereof. C3 uses modules only.
  • Modules may be flat or hierarchical. C3 uses a hierarchical module namespace.
  • Various methods may be used to reduce repetitive module prefixing. Aliasing and namespace inlining are common. C3 uses path shortening.
  • The simplest visibility semantics only has public and private.
  • Accessing "private" functions is useful, and there are various solutions.
  • One method is adding a special visibility level to let a parent or child module access private functions.
  • Another method is defining other modules as "friends" that may access private functions as if they were public.
  • C3 allows a module to import private functions of other modules.
  • C3 has three visibility levels: @public, @private and @local. "Local" means local to the current module section.
  • Imports can be narrow or wide. C3 prefers wildcard imports. Narrow imports are mostly useful when imports directly determine the dependency graph.
  • Exports need to be different from "all of the public functions".
  • C3 uses @export to mark declarations to export.

If you want to try out C3, you can test it here: https://learn-c3.org.