Wrap your NIF with Zig

Posted on Aug 9, 2023

If you’ve heard about Zig, you’ve probably heard about comptime, its mechanism for doing introspection, metaprogramming, and more. I’m probably in my honeymoon phase with comptime and I like to throw it at almost everything. The great thing about it is that you write some code and think “This makes sense, will it Just Work™?” and it actually does.

Today I’m going to show you an example of how it can be used to abstract away some common code that appears at the beginning of all Elixir¹ NIFs, allowing the resulting code to be more clear in its intentions by removing some mechanical noise.

Note that you usually don’t have to implement this manually if you want to write a NIF using Zig, since Zigler takes care of this for you. This is mainly meant as an example that shows some comptime cool features.

NIF Interface

As a quick reminder, in the Elixir world NIFs are functions that are implemented in C (or in any language with an FFI to C, e.g. Zig or Rust). From the caller perspective, they appear like normal Elixir functions. NIFs have their pros and cons (the biggest con being that they can bring down the whole Erlang VM if they crash) but we won’t go too much into the details about those here.

The important thing for this example is the interface a NIF must adhere to, which in C is

1ERL_NIF_TERM (*fptr)(ErlNifEnv* env, int argc, const ERL_NIF_TERM argv[])

or, using Zig²

1*const fn (env: beam.Env, argc: c_int, argv: [*c]const beam.Term) callconv(.C) beam.Term

A couple of definitions can make the signature clearer:

A term represents a value in the Erlang VM. It is an opaque type and the values can be extracted from a term using functions in the erl_nif API. Conversely, “native” values have to be converted to a term before being returned from the NIF.
An environment is a structure that hosts Erlang terms. Terms cannot be destructed individually, their lifetime is bound to the one of their environment.

From these definitions we can see our NIF accepts an environment as first parameter, then an argument count and then a pointer to an array of argument terms, and it returns a term. All the terms are hosted in the environment that is passed as first parameter, including the returned term.

The Issue

Let’s show an example to highlight what the problem is. We will start implementing 2 different NIFs: add_two_ints, multiply_three_doubles.

We will assume to have some helpers to convert terms to and from Zig values³. Note that these helpers already help us reduce some of the noise thanks to the fact that Zig allows returning errors from functions.

Here’s the implementation of our NIFs:

 1pub fn add_two_ints(env: beam.Env, argc: c_int, argv: [*c]const beam.Term) callconv(.C) beam.Term {
 2    assert(argc == 2);
 3
 4    // Convert to a slice to leverage Zig out of bound checks
 5    const argv_slice = @as([*]const beam.Term, @ptrCast(argv))[0..@intCast(argc)];
 6
 7    const a = beam.get_u32(env, argv_slice[0]) catch {
 8        return beam.raise_badarg(env);
 9    };
10
11    const b = beam.get_u32(env, argv_slice[1]) catch {
12        return beam.raise_badarg(env);
13    };
14
15    const result = a + b;
16
17    return beam.make_u32(env, result);
18}
19
20pub fn multiply_three_doubles(env: beam.Env, argc: c_int, argv: [*c]const beam.Term) callconv(.C) beam.Term {
21    assert(argc == 3);
22
23    const argv_slice = @as([*]const beam.Term, @ptrCast(argv))[0..@intCast(argc)];
24
25    const a = beam.get_f64(env, argv_slice[0]) catch {
26        return beam.raise_badarg(env);
27    };
28
29    const b = beam.get_f64(env, argv_slice[1]) catch {
30        return beam.raise_badarg(env);
31    };
32
33    const c = beam.get_f64(env, argv_slice[2]) catch {
34        return beam.raise_badarg(env);
35    };
36
37    const result = a * b * c;
38
39    return beam.make_f64(env, result);
40}

As you can see, there’s a lot of mechanical noise in the way. Moreover, you can’t easily tell the type and number of parameters by looking at the function signature, you have to parse its implementation to extract this information.

Wouldn’t it be nice to expose the actual signature of the function and magically get a NIF wrapper for it?

`comptime` to the Rescue

First, we’re going to tackle the input arguments, since they’re the ones causing most of the noise. Our new functions will just be:

 1pub const add_two_ints = make_nif_wrapper(add_two_ints_impl);
 2
 3fn add_two_ints_impl(env: beam.Env, a: u32, b: u32) beam.Term {
 4    const result = a + b;
 5
 6    return beam.make_u32(env, result);
 7}
 8
 9pub const multiply_three_doubles = make_nif_wrapper(multiply_three_doubles_impl);
10
11fn multiply_three_doubles_impl(env: beam.Env, a: f64, b: f64, c: f64) beam.Term {
12    const result = a * b * c;
13
14    return beam.make_f64(env, result);
15}

That’s already much better! Of course the interesting part is the implementation of make_nif_wrapper. Since there’s a lot to unpack there, let’s go through it bit by bit.

1const Nif = *const fn (beam.Env, argc: c_int, argv: [*c]const beam.Term) callconv(.C) beam.Term;

In the beginning, we just define a handy Nif type that that is a shorthand for the NIF interface we talked about earlier.

 1fn make_nif_wrapper(comptime fun: anytype) Nif {
 2    const Function = @TypeOf(fun);
 3
 4    const function_info = switch (@typeInfo(Function)) {
 5        .Fn => |f| f,
 6        else => @compileError("Only functions can be wrapped"),
 7    };
 8
 9    const params = function_info.params;
10    if (params[0].type != beam.Env) {
11        @compileError("The type of the first parameter of a NIF must be `beam.Env`");
12    }
13    // Env is not counted in argc, so subtract one
14    const expected_argc = params.len - 1;

The make_nif_wrapper function takes a comptime parameter of type anytype. This is because the different functions we’re going to pass to make_nif_wrapper will have different types. Inside the function, we assign the actual type of the function to the Function constant, we verify that it’s actually a function, and, if it is, we extract its type info.

From the info, we extract the parameter info, which is useful to ensure that the first parameter has the beam.Env type (that’s a requirement of our current interface since the function must have an environment to make its return value) and we save the expected argc.

 1    return struct {
 2        pub fn wrapper(
 3            env: beam.Env,
 4            argc: c_int,
 5            argv: [*c]const beam.Term,
 6        ) callconv(.C) beam.Term {
 7            if (argc != expected_argc) @panic("NIF called with the wrong number of arguments");
 8
 9            const argv_slice = @as([*]const beam.Term, @ptrCast(argv))[0..@intCast(argc)];
10            var args: std.meta.ArgsTuple(Function) = undefined;
11            inline for (&args, 0..) |*arg, arg_idx| {
12                if (arg_idx == 0) {
13                    arg.* = env;
14                    continue;
15                }
16
17                // Adjust for the abscence of env in argv
18                const argv_idx = arg_idx - 1;
19                const ArgType = @TypeOf(arg.*);
20                arg.* = get_arg_from_term(ArgType, env, argv_slice[argv_idx]) catch
21                    return beam.raise_badarg(env);
22            }
23
24            return @call(.auto, fun, args);
25        }
26    }.wrapper;
27}

After we have all this information, we can create our NIF wrapper. We start by creating an anonymous struct, which contains a wrapper function definition. If you skip at the and, you can see we then return just the function with .wrapper. This is the usual pattern to create a pointer to an “anonymous” function in Zig.

The wrapper function implements our NIF interface, and it starts by checking that argc actually matches expected_argc. Since this should be ensured by the Erlang VM, we @panic if it does not.

After creating the argv_slice as before, we have to deal with calling our impl function. To do that, we use the @call builtin, which allows calling a function given its address and a tuple containing its arguments. The std.meta.ArgsTuple conveniently returns the tuple type of the correct size and types to contain the arguments of a given function type, so we just have to iterate through all the elements of the tuple with inline for and assigning all the arguments after extracting them from their beam.Term.

The only piece missing is the implementation of get_arg_from_term. This basically just switches on the type of the argument and calls the appropriate beam.get function. The implementation also includes a useful compilation error on missing types so the compiler can guide us if we add a new NIF with an unsupported argument type.

1fn get_arg_from_term(comptime T: type, env: beam.Env, term: beam.Term) !T {
2    return switch (T) {
3        u32 => try beam.get_u32(env, term),
4        f64 => try beam.get_f64(env, term),
5        else => @compileError("Type " ++ @typeName(T) ++ " is not handled by get_arg_from_term"),
6    };
7}

Being Dynamic

This works well with functions that map to static types, but the Elixir is dynamically typed, so we might want to support that in our NIF. Say, for example, we want to implement a term_burrito NIF that takes an arbitrary term and wraps it in a list.

1pub const term_burrito = make_nif_wrapper(term_burrito_impl);
2
3fn term_burrito_impl(env: beam.Env, term: beam.Term) beam.Term {
4    return e.enif_make_list1(env, term);
5}

In this case we still unwrap the argv into single args, but our input is a beam.Term. The change to support that is literally one line.

1  fn get_arg_from_term(comptime T: type, env: beam.Env, term: beam.Term) !T {
2     return switch (T) {
3+        beam.Term => term,
4         u32 => try beam.get_u32(env, term),
5         f64 => try beam.get_f64(env, term),
6         else => @compileError("Type " ++ @typeName(T) ++ " is not handled by get_arg_from_term"),

If we encounter a beam.Term in get_arg_from_term, we simply return it as-is.

`comptime` Returns

As the last step in this example, let’s also try to include in our NIF wrapper the conversion of the return value back to a beam.Term.

term_burrito_impl remains the same since it’s accepting and returning a beam.Term, while the first two impls become just

1fn add_two_ints_impl(a: u32, b: u32) u32 {
2    return a + b;
3}
4
5fn multiply_three_doubles_impl(a: f64, b: f64, c: f64) f64 {
6    return a * b * c;
7}

Note that we don’t need the env anymore since we’re not using it inside the function, but we do still need it for term_burrito, so we have to handle both cases. Let’s see how the implementation of make_nif_wrapper changes.

1-    if (function_info.params[0].type != beam.Env) {
2-        @compileError("The type of the first parameter of a NIF must be `beam.Env`");
3-    }
4+    const with_env = function_info.params[0].type == beam.Env;
5+    const env_offset = if (with_env) 1 else 0;
6-    const expected_argc = params.len - 1;
7+    const expected_argc = params.len - env_offset;

First, we determine if our function accepts env as first parameter (removing the previous check), and based on that we also set the offset between args and argv to 0 or 1. Once we have the offset, we can calculate expected_argc.

 1-                if (arg_idx == 0) {
 2+                if (with_env and arg_idx == 0) {
 3                     arg.* = env;
 4                     continue;
 5                 }
 6
 7-                const argv_idx = arg_idx - 1;
 8+                const argv_idx = arg_idx - env_offset;
 9                 const ArgType = @TypeOf(arg.*);
10                 arg.* = get_arg_from_term(ArgType, env, argv_slice[argv_idx]) catch
11                     return beam.raise_badarg(env);
12             }

We use again the information about the presence of env and the offset in the loop that populates the args.

1-            return @call(.auto, fun, args);
2+            const result = @call(.auto, fun, args);
3+            return make_result_term(env, result);

Finally, we call make_result_term on the result before returning it.

1fn make_result_term(env: beam.Env, result: anytype) beam.Term {
2    return switch (@TypeOf(result)) {
3        beam.Term => result,
4        u32 => beam.make_u32(env, result),
5        f64 => beam.make_f64(env, result),
6        else => |T| @compileError("Type " ++ @typeName(T) ++ " is not handled by make_result_term"),
7    };
8}

Note that we don’t have to pass the type explicitly here, since we can extract it from the type of the result, while in get_args_from_term we couldn’t because the input was always a beam.Term. If we had needed the return type from the function type info, we could have used function_info.return_type.

Conclusion

This concludes our comptime journey. You can find the working code (with a separate commit for each section) on GitHub.

While this is a minimal example, and you probably end up writing more code than in the beginning, this is amortized once you have many different NIFs, and in fact this exact technique helped me reduce mechanical noise by a lot in my Elixir TigerBeetle client (I will explore that a little more in a future blog post).

Finally, if you’re interested in a more complete example of this technique used to handle any possible type conversion between Zig and BEAM types, make sure to checkout Zigler (start here for the argument parsing, here for term to type conversion and here for the inverse).

NIFs can also be used in Erlang and all other languages based on the BEAM. Since for my example I’m using Elixir, I will just say “Elixir” throughout the post. ↩︎
I’m going to be using these type definitions throughout the post, assuming they’re in the beam.zig module that gets imported in our code:
```
1pub const e = @cImport(@cInclude("erl_nif.h"));
2
3pub const Env = ?*e.ErlNifEnv;
4pub const Term = e.ERL_NIF_TERM;
```
These are not strictly needed but they help keep the code more compact. Note that Env is actually defined to be an (optional) pointer to ErlNifEnv, since it’s always used as opaque and passed around as a pointer. ↩︎

Here’s their implementation (again in beam.zig):

 1pub fn get_u32(env: Env, term: Term) !u32 {
 2    var result: c_uint = undefined;
 3    if (e.enif_get_uint(env, term, &result) == 0) {
 4        return error.ArgumentError;
 5    }
 6    return @intCast(result);
 7}
 8
 9pub fn get_f64(env: Env, term: Term) !f64 {
10    var result: f64 = undefined;
11    if (e.enif_get_double(env, term, &result) == 0) {
12        return error.ArgumentError;
13    }
14    return result;
15}
16
17pub fn make_u32(env: Env, value: u32) Term {
18    return e.enif_make_uint(env, @intCast(value));
19}
20
21pub fn make_f64(env: Env, value: f64) Term {
22    return e.enif_make_double(env, value);
23}
24
25pub fn raise_badarg(env: Env) Term {
26    return e.enif_make_badarg(env);
27}

Note that the getters can return error.ArgumentError since Elixir is dynamically typed and the caller could always call the NIF with an argument of the wrong type. raise_badarg is there exactly to signal this in the standard BEAM way. ↩︎