Skip to content

NativeLink

License OpenSSF Scorecard OpenSSF Best Practices Slack

NativeLink is an extremely (blazingly?) fast and efficient build cache and remote executor for systems that communicate using the Remote execution protocol such as Bazel, Buck2, Goma and Reclient. NativeLink powers over one billion requests per month for customers using the system for their production workloads.

Supports Unix-based operating systems and Windows.

Below, you will find a few different options for getting started with NativeLink.

  1. Go to the NativeLink repository on GitHub. Clone the repository via SSH or HTTPS. In this example the repository is cloned via SSH:
Terminal window
git clone git@github.com:TraceMachina/nativelink.git

📦 Installing with Cargo

  1. First install Rust, but skip to step 2 if you have it already.
Terminal window
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
  1. Install NativeLink with Cargo.
Terminal window
cargo install --git https://github.com/TraceMachina/nativelink --tag v0.4.0

The nativelink executable reads a JSON file as it’s only parameter, --config. See nativelink-config for more details and examples.

To grab the example in your current working directory, run:

Terminal window
curl -O https://raw.githubusercontent.com/TraceMachina/nativelink/main/nativelink-config/examples/basic_cas.json
### you can modify the example above to replace the filesystem store with the memory store if you favor speed over data durability.
nativelink basic_cas.json
  1. Once you’ve built NativeLink and have an instance running with the basic_cas.json configuration, launch a separate terminal session.
  2. Navigate to where you cloned the NativeLink repository:
Terminal window
cd $HOME/nativelink
  1. In the new terminal, run the following command to connect the running server launched above to Bazel or another RBE client:
Terminal window
bazel test //... \
--remote_instance_name=main \
--remote_cache=grpc://127.0.0.1:50051 \
--remote_executor=grpc://127.0.0.1:50051 \
--remote_default_exec_properties=cpu_count=1

For Windows PowerShell;

Terminal window
bazel test //... `
--remote_instance_name=main `
--remote_cache=grpc://127.0.0.1:50051 `
--remote_executor=grpc://127.0.0.1:50051 `
--remote_default_exec_properties=cpu_count=1

This causes Bazel to run the commands through an all-in-one CAS, scheduler and worker.

If you’re using MacOS, encountering errors is anticipated at this stage. Our team is actively working on enhancing support for executing remoteable Bazel builds with MacOS. For now, you can run with or a Linux virtual machine. If you have any questions, reach out on the slack.

How it Works

This diagram is a high-level overview of the data flow in the NativeLink system. It refers to NativeLink concepts like Scheduler pool, Worker pool, and CAS rather than the cloud concepts like functions, compute nodes, and object storage to which they correspond.

❄️ Installing with Nix

Installation requirements:

This build doesn’t require cloning the repository, but you need to provide a configuration file, for instance the one at nativelink-config/examples/basic_cas.json.

The following command builds and runs NativeLink in release (optimized) mode:

Terminal window
nix run github:TraceMachina/nativelink ./basic_cas.json

For use in production pin the executable to a specific revision:

Terminal window
nix run github:TraceMachina/nativelink/<revision> ./basic_cas.json

🌱 Building with Bazel

Build requirements:

  • Bazel 7.0.2
  • A recent C++ toolchain with LLD as linker

The following commands places an executable in ./bazel-bin/nativelink and starts the service:

Terminal window
# Unoptimized development build on Unix
bazel run nativelink -- $(pwd)/nativelink-config/examples/basic_cas.json
# Optimized release build on Unix
bazel run -c opt nativelink -- $(pwd)/nativelink-config/examples/basic_cas.json
# Unoptimized development build on Windows
bazel run --config=windows nativelink -- $(pwd)/nativelink-config/examples/basic_cas.json
# Optimized release build on Windows
bazel run --config=windows -c opt nativelink -- $(pwd)/nativelink-config/examples/basic_cas.json

The Rust compiler rustc generates numerous artifacts during compilation, including dependencies, macros, and intermediate files. When compiling programs from source, be mindful of the associated files’ impact on your disk usage in the bazel-bin/ directory. This directory can grow substantially in size. If the facing issues due to this, run the following command to clear cache files: bazel clean --expunge

📦 Building with Cargo

Build requirements:

  • Cargo 1.74.0+
  • A recent C++ toolchain with LLD as linker
Terminal window
# Unoptimized development build
cargo run --bin nativelink -- ./nativelink-config/examples/basic_cas.json
# Optimized release build
cargo run --release --bin nativelink -- ./nativelink-config/examples/basic_cas.json

The Rust compiler rustc generates numerous artifacts during compilation, including dependencies, macros, and intermediate files. When compiling programs from source, be mindful of the associated files’ impact on your disk usage in the target/ directory. This directory can grow substantially in size. If the facing issues due to this, run the following command to clear cache files: cargo clean

🚀 Example Deployments

You can find a few example deployments in the deployment-examples directory.

🏺 History

This project was first created due to frustration with similar projects not working or being extremely inefficient. Rust was chosen as the language to write it in because at the time Rust was going through a revolution in the new-ish feature async-await. This made making multi-threading simpler when paired with a runtime like Tokio while still giving all the lifetime and other protections that Rust gives. This pretty much guarantees that we will never have crashes due to race conditions. This kind of project seemed perfect, since there is so much asynchronous activity happening and running them on different threads is most preferable. Other languages like Go are good candidates, but other similar projects rely heavily on channels and mutex locks which are cumbersome and have to be carefully designed by the developer. Rust doesn’t have these issues, since the compiler will always tell you when the code you are writing might introduce undefined behavior. The last major reason is because Rust is extremely fast and has no garbage collection (like C++, but unlike Java, Go, or Typescript).

📜 License

Copyright 2020–2024 Trace Machina, Inc.

Licensed under the Apache 2.0 License, SPDX identifier Apache-2.0.