rust/src/bootstrap
Michael Goulet 7124590dcc
Rollup merge of #104046 - RalfJung:run-miri-run, r=oli-obk
bootstrap: add support for running Miri on a file

This enables:
```
./x.py run src/tools/miri --stage 0 --args src/tools/miri/tests/pass/hello.rs
```
That can be super helpful for debugging.

Also avoid sharing the Miri sysroot dir with a system-wide (rustup-managed) installation of Miri.

Fixes https://github.com/rust-lang/rust/issues/76666
2022-11-09 21:53:35 -08:00
..
bin Place config.toml in current working directory if config not found 2022-11-05 15:07:10 -06:00
builder Add a new component, rust-json-docs, to distribute the JSON-formatted documentation for std crates in nightly toolchains. 2022-09-17 21:58:34 +02:00
defaults Enable compiler-docs by default for compiler, codegen, and tools profiles. 2022-05-05 20:05:12 -05:00
mk Use an extensionless x script for non-Windows 2022-08-12 15:39:26 -07:00
bolt.rs Use BOLT in x64 dist CI to optimize LLVM 2022-10-09 11:31:09 +02:00
bootstrap_test.py
bootstrap.py Auto merge of #101949 - matthiaskrgr:rollup-xu5cqnd, r=matthiaskrgr 2022-09-17 22:04:28 +00:00
build.rs Don't set RUSTC in the bootstrap build script 2022-09-25 09:42:01 -05:00
builder.rs Rollup merge of #104046 - RalfJung:run-miri-run, r=oli-obk 2022-11-09 21:53:35 -08:00
cache.rs Allow building single crates for the compiler and standard library 2022-07-02 19:29:39 -05:00
Cargo.lock Update several crates for improved support of the new targets 2022-11-05 12:19:55 +01:00
Cargo.toml Update several crates for improved support of the new targets 2022-11-05 12:19:55 +01:00
cc_detect.rs Update CI to use Android NDK r25b 2022-10-04 12:14:14 -07:00
CHANGELOG.md add changelog lines for the remote-test-server changes 2022-09-23 15:43:01 +02:00
channel.rs re-add git-commit-hash file to tarballs 2022-10-02 23:58:40 -04:00
check.rs Detect unused files in src/test/mir-opt and error on them in tidy. 2022-10-31 21:45:41 -07:00
clean.rs
compile.rs Revert "Make the c feature for compiler-builtins opt-in instead of inferred" 2022-10-29 10:49:05 -04:00
config.rs Only set config.config to None when using default path 2022-11-07 15:27:42 -07:00
configure.py Remove unsupported options in configure.py 2022-07-10 16:28:57 -05:00
dist.rs merge JsonStd and Std steps 2022-11-02 11:23:42 +01:00
doc.rs Rollup merge of #103851 - viandoxdev:103816_bootstrap_fix_json_doc, r=jyn514 2022-11-06 08:35:26 +01:00
download-ci-llvm-stamp Bump download-ci-llvm-stamp 2022-10-07 11:29:24 -07:00
dylib_util.rs
flags.rs bootstrap: add support for running Miri on a file 2022-11-06 09:52:31 +01:00
format.rs Auto merge of #98526 - jyn514:download-llvm-outside-checkout, r=Mark-Simulacrum 2022-07-11 01:15:47 +00:00
install.rs Remove miri from the submodule list and require it for CI to pass 2022-09-21 15:35:53 +00:00
job.rs
lib.rs check lld version to choose correct flag for tests 2022-10-26 11:18:14 +03:00
metadata.rs Simplify make_run for test::Crate by introducing crate_paths instead of calculating them after the fact 2022-03-30 18:01:23 -05:00
metrics.rs bump sysinfo version 2022-06-04 18:59:07 +02:00
native.rs Fix artifact version/channel detection for stable 2022-11-03 08:18:25 -04:00
README.md Remove trailing whitespaces 2022-10-03 05:14:55 +05:30
run.rs bootstrap: add support for running Miri on a file 2022-11-06 09:52:31 +01:00
sanity.rs add FIXME to replace this env var in the future 2022-11-07 09:14:49 +01:00
setup.rs Place config.toml in current working directory if config not found 2022-11-05 15:07:10 -06:00
tarball.rs re-add git-commit-hash file to tarballs 2022-10-02 23:58:40 -04:00
test.rs Rollup merge of #104046 - RalfJung:run-miri-run, r=oli-obk 2022-11-09 21:53:35 -08:00
tool.rs Auto merge of #102950 - oli-obk:check_miri, r=RalfJung 2022-10-31 23:03:39 +00:00
toolstate.rs miri is not in toolstate any more 2022-09-21 15:35:53 +00:00
util.rs check lld version to choose correct flag for tests 2022-10-26 11:18:14 +03:00

rustbuild - Bootstrapping Rust

This is an in-progress README which is targeted at helping to explain how Rust is bootstrapped and in general, some of the technical details of the build system.

Using rustbuild

The rustbuild build system has a primary entry point, a top level x.py script:

$ python ./x.py build

Note that if you're on Unix, you should be able to execute the script directly:

$ ./x.py build

The script accepts commands, flags, and arguments to determine what to do:

  • build - a general purpose command for compiling code. Alone, build will bootstrap the entire compiler, and otherwise, arguments passed indicate what to build. For example:

    # build the whole compiler
    ./x.py build --stage 2
    
    # build the stage1 compiler
    ./x.py build
    
    # build stage0 libstd
    ./x.py build --stage 0 library/std
    
    # build a particular crate in stage0
    ./x.py build --stage 0 library/test
    

    If files that would normally be rebuilt from stage 0 are dirty, the rebuild can be overridden using --keep-stage 0. Using --keep-stage n will skip all steps that belong to stage n or earlier:

    # build stage 1, keeping old build products for stage 0
    ./x.py build --keep-stage 0
    
  • test - a command for executing unit tests. Like the build command, this will execute the entire test suite by default, and otherwise, it can be used to select which test suite is run:

    # run all unit tests
    ./x.py test
    
    # execute tool tests
    ./x.py test tidy
    
    # execute the UI test suite
    ./x.py test src/test/ui
    
    # execute only some tests in the UI test suite
    ./x.py test src/test/ui --test-args substring-of-test-name
    
    # execute tests in the standard library in stage0
    ./x.py test --stage 0 library/std
    
    # execute tests in the core and standard library in stage0,
    # without running doc tests (thus avoid depending on building the compiler)
    ./x.py test --stage 0 --no-doc library/core library/std
    
    # execute all doc tests
    ./x.py test src/doc
    
  • doc - a command for building documentation. Like above, can take arguments for what to document.

Configuring rustbuild

There are currently two methods for configuring the rustbuild build system.

First, rustbuild offers a TOML-based configuration system with a config.toml file. An example of this configuration can be found at config.toml.example, and the configuration file can also be passed as --config path/to/config.toml if the build system is being invoked manually (via the python script).

Next, the ./configure options serialized in config.mk will be parsed and read. That is, if any ./configure options are passed, they'll be handled naturally. ./configure should almost never be used for local installations, and is primarily useful for CI. Prefer to customize behavior using config.toml.

Finally, rustbuild makes use of the cc-rs crate which has its own method of configuring C compilers and C flags via environment variables.

Build stages

The rustbuild build system goes through a few phases to actually build the compiler. What actually happens when you invoke rustbuild is:

  1. The entry point script, x.py is run. This script is responsible for downloading the stage0 compiler/Cargo binaries, and it then compiles the build system itself (this folder). Finally, it then invokes the actual bootstrap binary build system.
  2. In Rust, bootstrap will slurp up all configuration, perform a number of sanity checks (whether compilers exist, for example), and then start building the stage0 artifacts.
  3. The stage0 cargo, downloaded earlier, is used to build the standard library and the compiler, and then these binaries are then copied to the stage1 directory. That compiler is then used to generate the stage1 artifacts which are then copied to the stage2 directory, and then finally, the stage2 artifacts are generated using that compiler.

The goal of each stage is to (a) leverage Cargo as much as possible and failing that (b) leverage Rust as much as possible!

Incremental builds

You can configure rustbuild to use incremental compilation with the --incremental flag:

$ ./x.py build --incremental

The --incremental flag will store incremental compilation artifacts in build/<host>/stage0-incremental. Note that we only use incremental compilation for the stage0 -> stage1 compilation -- this is because the stage1 compiler is changing, and we don't try to cache and reuse incremental artifacts across different versions of the compiler.

You can always drop the --incremental to build as normal (but you will still be using the local nightly as your bootstrap).

Directory Layout

This build system houses all output under the build directory, which looks like this:

# Root folder of all output. Everything is scoped underneath here
build/

  # Location where the stage0 compiler downloads are all cached. This directory
  # only contains the tarballs themselves, as they're extracted elsewhere.
  cache/
    2015-12-19/
    2016-01-15/
    2016-01-21/
    ...

  # Output directory for building this build system itself. The stage0
  # cargo/rustc are used to build the build system into this location.
  bootstrap/
    debug/
    release/

  # Output of the dist-related steps like dist-std, dist-rustc, and dist-docs
  dist/

  # Temporary directory used for various input/output as part of various stages
  tmp/

  # Each remaining directory is scoped by the "host" triple of compilation at
  # hand.
  x86_64-unknown-linux-gnu/

    # The build artifacts for the `compiler-rt` library for the target that
    # this folder is under. The exact layout here will likely depend on the
    # platform, and this is also built with CMake, so the build system is
    # also likely different.
    compiler-rt/
      build/

    # Output folder for LLVM if it is compiled for this target
    llvm/

      # build folder (e.g. the platform-specific build system). Like with
      # compiler-rt, this is compiled with CMake
      build/

      # Installation of LLVM. Note that we run the equivalent of 'make install'
      # for LLVM, to setup these folders.
      bin/
      lib/
      include/
      share/
      ...

    # Output folder for all documentation of this target. This is what's filled
    # in whenever the `doc` step is run.
    doc/

    # Output for all compiletest-based test suites
    test/
      ui/
      debuginfo/
      ...

    # Location where the stage0 Cargo and Rust compiler are unpacked. This
    # directory is purely an extracted and overlaid tarball of these two (done
    # by the bootstrap python script). In theory, the build system does not
    # modify anything under this directory afterwards.
    stage0/

    # These to-build directories are the cargo output directories for builds of
    # the standard library and compiler, respectively. Internally, these may also
    # have other target directories, which represent artifacts being compiled
    # from the host to the specified target.
    #
    # Essentially, each of these directories is filled in by one `cargo`
    # invocation. The build system instruments calling Cargo in the right order
    # with the right variables to ensure that these are filled in correctly.
    stageN-std/
    stageN-test/
    stageN-rustc/
    stageN-tools/

    # This is a special case of the above directories, **not** filled in via
    # Cargo but rather the build system itself. The stage0 compiler already has
    # a set of target libraries for its own host triple (in its own sysroot)
    # inside of stage0/. When we run the stage0 compiler to bootstrap more
    # things, however, we don't want to use any of these libraries (as those are
    # the ones that we're building). So essentially, when the stage1 compiler is
    # being compiled (e.g. after libstd has been built), *this* is used as the
    # sysroot for the stage0 compiler being run.
    #
    # Basically, this directory is just a temporary artifact used to configure the
    # stage0 compiler to ensure that the libstd that we just built is used to
    # compile the stage1 compiler.
    stage0-sysroot/lib/

    # These output directories are intended to be standalone working
    # implementations of the compiler (corresponding to each stage). The build
    # system will link (using hard links) output from stageN-{std,rustc} into
    # each of these directories.
    #
    # In theory, there is no extra build output in these directories.
    stage1/
    stage2/
    stage3/

Cargo projects

The current build is unfortunately not quite as simple as cargo build in a directory, but rather the compiler is split into three different Cargo projects:

  • library/std - the standard library
  • library/test - testing support, depends on libstd
  • compiler/rustc - the actual compiler itself

Each "project" has a corresponding Cargo.lock file with all dependencies, and this means that building the compiler involves running Cargo three times. The structure here serves two goals:

  1. Facilitating dependencies coming from crates.io. These dependencies don't depend on std, so libstd is a separate project compiled ahead of time before the actual compiler builds.
  2. Splitting "host artifacts" from "target artifacts". That is, when building code for an arbitrary target, you don't need the entire compiler, but you'll end up needing libraries like libtest that depend on std but also want to use crates.io dependencies. Hence, libtest is split out as its own project that is sequenced after std but before rustc. This project is built for all targets.

There is some loss in build parallelism here because libtest can be compiled in parallel with a number of rustc artifacts, but in theory, the loss isn't too bad!

Build tools

We've actually got quite a few tools that we use in the compiler's build system and for testing. To organize these, each tool is a project in src/tools with a corresponding Cargo.toml. All tools are compiled with Cargo (currently having independent Cargo.lock files) and do not currently explicitly depend on the compiler or standard library. Compiling each tool is sequenced after the appropriate libstd/libtest/librustc compile above.

Extending rustbuild

So, you'd like to add a feature to the rustbuild build system or just fix a bug. Great! One of the major motivational factors for moving away from make is that Rust is in theory much easier to read, modify, and write. If you find anything excessively confusing, please open an issue on this, and we'll try to get it documented or simplified, pronto.

First up, you'll probably want to read over the documentation above, as that'll give you a high level overview of what rustbuild is doing. You also probably want to play around a bit yourself by just getting it up and running before you dive too much into the actual build system itself.

After that, each module in rustbuild should have enough documentation to keep you up and running. Some general areas that you may be interested in modifying are:

  • Adding a new build tool? Take a look at bootstrap/tool.rs for examples of other tools.
  • Adding a new compiler crate? Look no further! Adding crates can be done by adding a new directory with Cargo.toml followed by configuring all Cargo.toml files accordingly.
  • Adding a new dependency from crates.io? This should just work inside the compiler artifacts stage (everything other than libtest and libstd).
  • Adding a new configuration option? You'll want to modify bootstrap/flags.rs for command line flags and then bootstrap/config.rs to copy the flags to the Config struct.
  • Adding a sanity check? Take a look at bootstrap/sanity.rs.

If you make a major change, please remember to:

  • Update VERSION in src/bootstrap/main.rs.
  • Update changelog-seen = N in config.toml.example.
  • Add an entry in src/bootstrap/CHANGELOG.md.

A 'major change' includes

  • A new option or
  • A change in the default options.

Changes that do not affect contributors to the compiler or users building rustc from source don't need an update to VERSION.

If you have any questions, feel free to reach out on the #t-infra channel in the Rust Zulip server or ask on internals.rust-lang.org. When you encounter bugs, please file issues on the rust-lang/rust issue tracker.