2020-08-02 07:37:27 -05:00
Our approach to "clean code" is two-fold:
* We generally don't block PRs on style changes.
* At the same time, all code in rust-analyzer is constantly refactored.
It is explicitly OK for a reviewer to flag only some nits in the PR, and then send a follow-up cleanup PR for things which are easier to explain by example, cc-ing the original author.
Sending small cleanup PRs (like renaming a single local variable) is encouraged.
2021-01-22 06:26:19 -06:00
When reviewing pull requests prefer extending this document to leaving
non-reusable comments on the pull request itself.
2020-10-07 05:50:46 -05:00
# General
## Scale of Changes
2020-08-02 07:37:27 -05:00
Everyone knows that it's better to send small & focused pull requests.
The problem is, sometimes you *have* to, eg, rewrite the whole compiler, and that just doesn't fit into a set of isolated PRs.
The main things to keep an eye on are the boundaries between various components.
There are three kinds of changes:
1. Internals of a single component are changed.
Specifically, you don't change any `pub` items.
A good example here would be an addition of a new assist.
2. API of a component is expanded.
Specifically, you add a new `pub` function which wasn't there before.
A good example here would be expansion of assist API, for example, to implement lazy assists or assists groups.
3. A new dependency between components is introduced.
Specifically, you add a `pub use` reexport from another crate or you add a new line to the `[dependencies]` section of `Cargo.toml` .
A good example here would be adding reference search capability to the assists crates.
For the first group, the change is generally merged as long as:
* it works for the happy case,
* it has tests,
* it doesn't panic for the unhappy case.
For the second group, the change would be subjected to quite a bit of scrutiny and iteration.
The new API needs to be right (or at least easy to change later).
The actual implementation doesn't matter that much.
It's very important to minimize the amount of changed lines of code for changes of the second kind.
2021-02-20 04:53:19 -06:00
Often, you start doing a change of the first kind, only to realize that you need to elevate to a change of the second kind.
2020-08-02 07:37:27 -05:00
In this case, we'll probably ask you to split API changes into a separate PR.
Changes of the third group should be pretty rare, so we don't specify any specific process for them.
That said, adding an innocent-looking `pub use` is a very simple way to break encapsulation, keep an eye on it!
Note: if you enjoyed this abstract hand-waving about boundaries, you might appreciate
https://www.tedinski.com/2018/02/06/system-boundaries.html
2020-10-07 05:50:46 -05:00
## Crates.io Dependencies
2020-08-02 07:37:27 -05:00
We try to be very conservative with usage of crates.io dependencies.
2021-04-06 10:37:41 -05:00
Don't use small "helper" crates (exception: `itertools` and `either` are allowed).
2020-08-02 07:37:27 -05:00
If there's some general reusable bit of code you need, consider adding it to the `stdx` crate.
2021-04-06 10:37:41 -05:00
A useful exercise is to read Cargo.lock and see if some *transitive* dependencies do not make sense for rust-analyzer.
2020-08-02 07:37:27 -05:00
2021-04-02 05:59:36 -05:00
**Rationale:** keep compile times low, create ecosystem pressure for faster compiles, reduce the number of things which might break.
2021-01-07 11:11:55 -06:00
2020-10-07 05:50:46 -05:00
## Commit Style
We don't have specific rules around git history hygiene.
Maintaining clean git history is strongly encouraged, but not enforced.
Use rebase workflow, it's OK to rewrite history during PR review process.
After you are happy with the state of the code, please use [interactive rebase ](https://git-scm.com/book/en/v2/Git-Tools-Rewriting-History ) to squash fixup commits.
Avoid @mentioning people in commit messages and pull request descriptions(they are added to commit message by bors).
Such messages create a lot of duplicate notification traffic during rebases.
2020-12-28 04:43:07 -06:00
If possible, write commit messages from user's perspective:
```
2021-01-07 11:11:55 -06:00
# GOOD
2020-12-28 04:43:07 -06:00
Goto definition works inside macros
2021-01-07 11:11:55 -06:00
# BAD
2020-12-28 04:43:07 -06:00
Use original span for FileId
```
This makes it easier to prepare a changelog.
2021-01-11 07:29:29 -06:00
If the change adds a new user-visible functionality, consider recording a GIF with [peek ](https://github.com/phw/peek ) and pasting it into the PR description.
2021-04-19 13:09:24 -05:00
To make writing the release notes easier, you can mark a pull request as a feature, fix, internal change, or minor.
Minor changes are excluded from the release notes, while the other types are distributed in their corresponding sections.
There are two ways to mark this:
* use a `feat: ` , `feature: ` , `fix: ` , `internal: ` or `minor: ` prefix in the PR title
* write `changelog [feature|fix|internal|skip] [description]` in a comment or in the PR description; the description is optional, and will replace the title if included.
These comments don't have to be added by the PR author.
Editing a comment or the PR description or title is also fine, as long as it happens before the release.
2021-01-07 11:27:22 -06:00
**Rationale:** clean history is potentially useful, but rarely used.
2021-01-07 11:11:55 -06:00
But many users read changelogs.
2021-04-19 13:09:24 -05:00
Including a description and GIF suitable for the changelog means less work for the maintainers on the release day.
2021-01-07 11:11:55 -06:00
2020-10-07 05:50:46 -05:00
## Clippy
We don't enforce Clippy.
A number of default lints have high false positive rate.
Selectively patching false-positives with `allow(clippy)` is considered worse than not using Clippy at all.
2021-03-14 08:24:34 -05:00
There's a `cargo lint` command which runs a subset of low-FPR lints.
Careful tweaking of `lint` is welcome.
2020-10-07 05:50:46 -05:00
Of course, applying Clippy suggestions is welcome as long as they indeed improve the code.
2021-01-07 11:27:22 -06:00
**Rationale:** see [rust-lang/clippy#5537 ](https://github.com/rust-lang/rust-clippy/issues/5537 ).
2021-01-07 11:11:55 -06:00
2020-10-07 05:50:46 -05:00
# Code
## Minimal Tests
2020-08-02 07:37:27 -05:00
Most tests in rust-analyzer start with a snippet of Rust code.
2021-02-20 04:53:19 -06:00
These snippets should be minimal -- if you copy-paste a snippet of real code into the tests, make sure to remove everything which could be removed.
2020-08-02 07:37:27 -05:00
It also makes sense to format snippets more compactly (for example, by placing enum definitions like `enum E { Foo, Bar }` on a single line),
as long as they are still readable.
2020-11-06 13:29:41 -06:00
When using multiline fixtures, use unindented raw string literals:
```rust
#[test]
fn inline_field_shorthand() {
check_assist(
inline_local_variable,
2020-12-15 02:49:22 -06:00
r#"
2020-11-06 13:29:41 -06:00
struct S { foo: i32}
fn main() {
2021-01-06 14:15:48 -06:00
let $0foo = 92;
2020-11-06 13:29:41 -06:00
S { foo }
}
2020-12-15 02:49:22 -06:00
"#,
r#"
2020-11-06 13:29:41 -06:00
struct S { foo: i32}
fn main() {
S { foo: 92 }
}
2020-12-15 02:49:22 -06:00
"#,
2020-11-06 13:29:41 -06:00
);
}
```
2021-01-07 11:27:22 -06:00
**Rationale:**
2021-01-07 11:11:55 -06:00
There are many benefits to this:
* less to read or to scroll past
* easier to understand what exactly is tested
* less stuff printed during printf-debugging
* less time to run test
Formatting ensures that you can use your editor's "number of selected characters" feature to correlate offsets with test's source code.
2020-11-06 13:29:41 -06:00
2021-01-22 08:58:00 -06:00
## Marked Tests
Use
2021-03-08 14:19:44 -06:00
[`cov_mark::hit! / cov_mark::check!` ](https://github.com/matklad/cov-mark )
2021-01-22 08:58:00 -06:00
when testing specific conditions.
Do not place several marks into a single test or condition.
Do not reuse marks between several tests.
**Rationale:** marks provide an easy way to find the canonical test for each bit of code.
This makes it much easier to understand.
2021-04-13 04:20:54 -05:00
More than one mark per test / code branch doesn't add significantly to understanding.
## `#[should_panic]`
Do not use `#[should_panic]` tests.
Instead, explicitly check for `None` , `Err` , etc.
2021-07-26 17:15:40 -05:00
**Rationale:** `#[should_panic]` is a tool for library authors to make sure that the API does not fail silently when misused.
2021-04-13 04:20:54 -05:00
`rust-analyzer` is not a library, we don't need to test for API misuse, and we have to handle any user input without panics.
Panic messages in the logs from the `#[should_panic]` tests are confusing.
2021-01-22 08:58:00 -06:00
2021-06-14 14:55:05 -05:00
## `#[ignore]`
Do not `#[ignore]` tests.
If the test currently does not work, assert the wrong behavior and add a fixme explaining why it is wrong.
**Rationale:** noticing when the behavior is fixed, making sure that even the wrong behavior is acceptable (ie, not a panic).
2021-01-07 11:11:55 -06:00
## Function Preconditions
2020-08-02 07:37:27 -05:00
2020-08-02 07:59:18 -05:00
Express function preconditions in types and force the caller to provide them (rather than checking in callee):
2020-08-02 07:37:27 -05:00
```rust
2021-01-07 11:11:55 -06:00
// GOOD
2021-02-03 06:40:24 -06:00
fn frobnicate(walrus: Walrus) {
2020-08-02 07:37:27 -05:00
...
}
2021-01-07 11:11:55 -06:00
// BAD
2020-08-02 07:37:27 -05:00
fn frobnicate(walrus: Option< Walrus > ) {
let walrus = match walrus {
Some(it) => it,
None => return,
};
...
}
```
2021-01-07 11:27:22 -06:00
**Rationale:** this makes control flow explicit at the call site.
2021-01-07 11:11:55 -06:00
Call-site has more context, it often happens that the precondition falls out naturally or can be bubbled up higher in the stack.
Avoid splitting precondition check and precondition use across functions:
2020-09-29 07:42:09 -05:00
```rust
2021-01-07 11:11:55 -06:00
// GOOD
2020-10-07 06:03:13 -05:00
fn main() {
let s: & str = ...;
if let Some(contents) = string_literal_contents(s) {
}
}
2020-09-29 07:42:09 -05:00
fn string_literal_contents(s: & str) -> Option< & str> {
if s.starts_with('"') & & s.ends_with('"') {
Some(& s[1..s.len() - 1])
} else {
None
}
}
2021-01-07 11:11:55 -06:00
// BAD
2020-10-07 06:03:13 -05:00
fn main() {
2020-09-29 07:42:09 -05:00
let s: & str = ...;
2020-10-07 06:03:13 -05:00
if is_string_literal(s) {
let contents = & s[1..s.len() - 1];
2020-09-29 07:42:09 -05:00
}
}
2020-10-01 13:02:39 -05:00
fn is_string_literal(s: & str) -> bool {
2020-09-29 07:42:09 -05:00
s.starts_with('"') & & s.ends_with('"')
}
```
2020-10-01 13:02:39 -05:00
In the "Not as good" version, the precondition that `1` is a valid char boundary is checked in `is_string_literal` and used in `foo` .
In the "Good" version, the precondition check and usage are checked in the same block, and then encoded in the types.
2020-09-29 07:42:09 -05:00
2021-01-07 11:27:22 -06:00
**Rationale:** non-local code properties degrade under change.
2021-01-07 11:11:55 -06:00
2020-10-07 05:57:49 -05:00
When checking a boolean precondition, prefer `if !invariant` to `if negated_invariant` :
```rust
2021-01-07 11:11:55 -06:00
// GOOD
2020-10-07 05:57:49 -05:00
if !(idx < len ) {
return None;
}
2021-01-07 11:11:55 -06:00
// BAD
2020-10-07 05:57:49 -05:00
if idx >= len {
return None;
}
```
2021-02-20 04:53:19 -06:00
**Rationale:** it's useful to see the invariant relied upon by the rest of the function clearly spelled out.
2021-01-07 11:11:55 -06:00
2021-01-14 09:25:19 -06:00
## Assertions
Assert liberally.
2021-01-26 13:11:12 -06:00
Prefer `stdx::never!` to standard `assert!` .
2021-01-14 09:25:19 -06:00
2020-10-07 05:50:46 -05:00
## Getters & Setters
2020-08-17 09:11:29 -05:00
If a field can have any value without breaking invariants, make the field public.
Conversely, if there is an invariant, document it, enforce it in the "constructor" function, make the field private, and provide a getter.
Never provide setters.
Getters should return borrowed data:
2020-08-24 05:19:12 -05:00
```rust
2020-08-17 09:11:29 -05:00
struct Person {
// Invariant: never empty
first_name: String,
middle_name: Option< String >
}
2021-01-07 11:11:55 -06:00
// GOOD
2020-08-17 09:11:29 -05:00
impl Person {
fn first_name(& self) -> & str { self.first_name.as_str() }
fn middle_name(& self) -> Option< & str> { self.middle_name.as_ref() }
}
2021-01-07 11:11:55 -06:00
// BAD
2020-08-17 09:11:29 -05:00
impl Person {
fn first_name(& self) -> String { self.first_name.clone() }
fn middle_name(& self) -> & Option< String > { & self.middle_name }
}
```
2021-01-07 11:27:22 -06:00
**Rationale:** we don't provide public API, it's cheaper to refactor than to pay getters rent.
2021-01-07 11:11:55 -06:00
Non-local code properties degrade under change, privacy makes invariant local.
2021-07-19 09:47:56 -05:00
Borrowed owned types (`& String`) disclose irrelevant details about internal representation.
2021-01-07 11:11:55 -06:00
Irrelevant (neither right nor wrong) things obscure correctness.
2021-02-02 06:41:57 -06:00
## Useless Types
More generally, always prefer types on the left
```rust
// GOOD BAD
& [T] & Vec< T >
& str & String
Option< & T> & Option< T >
2021-07-08 09:18:44 -05:00
& Path & PathBuf
2021-02-02 06:41:57 -06:00
```
**Rationale:** types on the left are strictly more general.
Even when generality is not required, consistency is important.
2020-10-14 13:02:03 -05:00
## Constructors
2021-07-12 12:14:58 -05:00
Prefer `Default` to zero-argument `new` function.
2020-10-14 13:02:03 -05:00
```rust
2021-01-07 11:11:55 -06:00
// GOOD
2020-10-14 13:02:03 -05:00
#[derive(Default)]
struct Foo {
bar: Option< Bar >
}
2021-01-07 11:11:55 -06:00
// BAD
2020-10-14 13:02:03 -05:00
struct Foo {
bar: Option< Bar >
}
impl Foo {
fn new() -> Foo {
Foo { bar: None }
}
}
```
2021-07-26 17:15:40 -05:00
Prefer `Default` even if it has to be implemented manually.
2020-10-14 13:02:03 -05:00
2021-01-07 11:27:22 -06:00
**Rationale:** less typing in the common case, uniformity.
2021-01-07 11:11:55 -06:00
2021-01-25 06:14:43 -06:00
Use `Vec::new` rather than `vec![]` .
**Rationale:** uniformity, strength reduction.
2021-01-20 05:30:50 -06:00
2021-07-12 12:14:58 -05:00
Avoid using "dummy" states to implement a `Default` .
If a type doesn't have a sensible default, empty value, don't hide it.
2021-07-26 17:15:40 -05:00
Let the caller explicitly decide what the right initial state is.
2021-07-12 12:14:58 -05:00
2020-11-02 05:08:49 -06:00
## Functions Over Objects
Avoid creating "doer" objects.
That is, objects which are created only to execute a single action.
```rust
2021-01-07 11:11:55 -06:00
// GOOD
2020-11-02 05:08:49 -06:00
do_thing(arg1, arg2);
2021-01-07 11:11:55 -06:00
// BAD
2020-11-02 05:08:49 -06:00
ThingDoer::new(arg1, arg2).do();
```
Note that this concerns only outward API.
When implementing `do_thing` , it might be very useful to create a context object.
```rust
pub fn do_thing(arg1: Arg1, arg2: Arg2) -> Res {
2021-04-06 10:37:41 -05:00
let mut ctx = Ctx { arg1, arg2 };
2020-11-02 05:08:49 -06:00
ctx.run()
}
struct Ctx {
arg1: Arg1, arg2: Arg2
}
impl Ctx {
fn run(self) -> Res {
...
}
}
```
The difference is that `Ctx` is an impl detail here.
2020-11-02 05:17:34 -06:00
Sometimes a middle ground is acceptable if this can save some busywork:
2020-11-02 05:08:49 -06:00
```rust
ThingDoer::do(arg1, arg2);
pub struct ThingDoer {
arg1: Arg1, arg2: Arg2,
}
impl ThingDoer {
pub fn do(arg1: Arg1, arg2: Arg2) -> Res {
ThingDoer { arg1, arg2 }.run()
}
fn run(self) -> Res {
...
}
}
```
2021-01-07 11:27:22 -06:00
**Rationale:** not bothering the caller with irrelevant details, not mixing user API with implementor API.
2021-01-07 11:11:55 -06:00
2021-02-12 09:10:16 -06:00
## Functions with many parameters
Avoid creating functions with many optional or boolean parameters.
Introduce a `Config` struct instead.
```rust
// GOOD
pub struct AnnotationConfig {
pub binary_target: bool,
pub annotate_runnables: bool,
pub annotate_impls: bool,
}
pub fn annotations(
db: & RootDatabase,
file_id: FileId,
config: AnnotationConfig
) -> Vec< Annotation > {
...
}
// BAD
pub fn annotations(
db: & RootDatabase,
file_id: FileId,
binary_target: bool,
annotate_runnables: bool,
annotate_impls: bool,
) -> Vec< Annotation > {
...
}
```
**Rationale:** reducing churn.
If the function has many parameters, they most likely change frequently.
By packing them into a struct we protect all intermediary functions from changes.
Do not implement `Default` for the `Config` struct, the caller has more context to determine better defaults.
Do not store `Config` as a part of the `state` , pass it explicitly.
This gives more flexibility for the caller.
If there is variation not only in the input parameters, but in the return type as well, consider introducing a `Command` type.
```rust
// MAYBE GOOD
pub struct Query {
pub name: String,
pub case_sensitive: bool,
}
impl Query {
pub fn all(self) -> Vec< Item > { ... }
pub fn first(self) -> Option< Item > { ... }
}
// MAYBE BAD
fn query_all(name: String, case_sensitive: bool) -> Vec< Item > { ... }
fn query_first(name: String, case_sensitive: bool) -> Option< Item > { ... }
```
2021-05-04 14:20:04 -05:00
## Prefer Separate Functions Over Parameters
If a function has a `bool` or an `Option` parameter, and it is always called with `true` , `false` , `Some` and `None` literals, split the function in two.
```rust
// GOOD
fn caller_a() {
foo()
}
fn caller_b() {
foo_with_bar(Bar::new())
}
fn foo() { ... }
fn foo_with_bar(bar: Bar) { ... }
// BAD
fn caller_a() {
foo(None)
}
fn caller_b() {
foo(Some(Bar::new()))
}
fn foo(bar: Option< Bar > ) { ... }
```
**Rationale:** more often than not, such functions display "`false sharing`" -- they have additional `if` branching inside for two different cases.
Splitting the two different control flows into two functions simplifies each path, and remove cross-dependencies between the two paths.
If there's common code between `foo` and `foo_with_bar` , extract *that* into a common helper.
2021-01-22 06:26:19 -06:00
## Appropriate String Types
2021-01-22 08:58:00 -06:00
When interfacing with OS APIs, use `OsString` , even if the original source of data is utf-8 encoded.
**Rationale:** cleanly delineates the boundary when the data goes into the OS-land.
2021-01-22 06:26:19 -06:00
2021-01-22 08:58:00 -06:00
Use `AbsPathBuf` and `AbsPath` over `std::Path` .
**Rationale:** rust-analyzer is a long-lived process which handles several projects at the same time.
It is important not to leak cwd by accident.
2021-01-07 11:11:55 -06:00
2020-08-14 05:27:15 -05:00
# Premature Pessimization
2020-08-02 07:37:27 -05:00
2020-10-15 05:21:38 -05:00
## Avoid Allocations
2020-08-02 07:37:27 -05:00
Avoid writing code which is slower than it needs to be.
Don't allocate a `Vec` where an iterator would do, don't allocate strings needlessly.
```rust
2021-01-07 11:11:55 -06:00
// GOOD
2020-08-02 07:37:27 -05:00
use itertools::Itertools;
let (first_word, second_word) = match text.split_ascii_whitespace().collect_tuple() {
Some(it) => it,
None => return,
}
2021-01-07 11:11:55 -06:00
// BAD
2020-08-02 07:37:27 -05:00
let words = text.split_ascii_whitespace().collect::< Vec < _ > >();
if words.len() != 2 {
return
}
```
2021-07-26 17:15:40 -05:00
**Rationale:** not allocating is almost always faster.
2021-01-07 11:11:55 -06:00
2020-10-15 05:21:38 -05:00
## Push Allocations to the Call Site
2020-09-29 07:42:09 -05:00
If allocation is inevitable, let the caller allocate the resource:
```rust
2021-01-07 11:11:55 -06:00
// GOOD
2020-09-29 07:42:09 -05:00
fn frobnicate(s: String) {
...
}
2021-01-07 11:11:55 -06:00
// BAD
2020-09-29 07:42:09 -05:00
fn frobnicate(s: & str) {
let s = s.to_string();
...
}
```
2021-01-07 11:27:22 -06:00
**Rationale:** reveals the costs.
2020-10-15 05:21:38 -05:00
It is also more efficient when the caller already owns the allocation.
2021-01-20 05:47:42 -06:00
## Collection Types
2020-08-24 05:49:36 -05:00
2020-10-07 05:50:46 -05:00
Prefer `rustc_hash::FxHashMap` and `rustc_hash::FxHashSet` instead of the ones in `std::collections` .
2021-01-07 11:11:55 -06:00
2021-01-07 11:27:22 -06:00
**Rationale:** they use a hasher that's significantly faster and using them consistently will reduce code size by some small amount.
2020-10-07 05:50:46 -05:00
2021-01-20 05:47:42 -06:00
## Avoid Intermediate Collections
When writing a recursive function to compute a sets of things, use an accumulator parameter instead of returning a fresh collection.
Accumulator goes first in the list of arguments.
```rust
// GOOD
pub fn reachable_nodes(node: Node) -> FxHashSet< Node > {
let mut res = FxHashSet::default();
go(& mut res, node);
res
}
fn go(acc: & mut FxHashSet< Node > , node: Node) {
acc.insert(node);
for n in node.neighbors() {
go(acc, n);
}
}
// BAD
pub fn reachable_nodes(node: Node) -> FxHashSet< Node > {
let mut res = FxHashSet::default();
res.insert(node);
for n in node.neighbors() {
res.extend(reachable_nodes(n));
}
res
}
```
2021-01-25 06:14:43 -06:00
**Rationale:** re-use allocations, accumulator style is more concise for complex cases.
2021-01-20 05:47:42 -06:00
2021-07-05 16:00:39 -05:00
## Avoid Monomorphization
Avoid making a lot of code type parametric, *especially* on the boundaries between crates.
```rust
// GOOD
fn frobnicate(f: impl FnMut()) {
frobnicate_impl(& mut f)
}
fn frobnicate_impl(f: & mut dyn FnMut()) {
// lots of code
}
// BAD
fn frobnicate(f: impl FnMut()) {
// lots of code
}
```
Avoid `AsRef` polymorphism, it pays back only for widely used libraries:
```rust
// GOOD
fn frobnicate(f: & Path) {
}
// BAD
fn frobnicate(f: impl AsRef< Path > ) {
}
```
**Rationale:** Rust uses monomorphization to compile generic code, meaning that for each instantiation of a generic functions with concrete types, the function is compiled afresh, *per crate* .
This allows for exceptionally good performance, but leads to increased compile times.
Runtime performance obeys 80%/20% rule -- only a small fraction of code is hot.
Compile time **does not** obey this rule -- all code has to be compiled.
2020-10-07 05:50:46 -05:00
# Style
## Order of Imports
Separate import groups with blank lines.
Use one `use` per crate.
2021-01-07 11:11:55 -06:00
Module declarations come before the imports.
Order them in "suggested reading order" for a person new to the code base.
2020-10-07 05:50:46 -05:00
```rust
mod x;
mod y;
// First std.
use std::{ ... }
// Second, external crates (both crates.io crates and other rust-analyzer crates).
use crate_foo::{ ... }
use crate_bar::{ ... }
// Then current crate.
use crate::{}
// Finally, parent and child modules, but prefer `use crate::` .
use super::{}
2021-05-13 05:44:47 -05:00
// Re-exports are treated as item definitions rather than imports, so they go
// after imports and modules. Use them sparingly.
pub use crate::x::Z;
2020-10-07 05:50:46 -05:00
```
2021-01-07 11:27:22 -06:00
**Rationale:** consistency.
2021-01-07 11:11:55 -06:00
Reading order is important for new contributors.
2021-04-06 10:37:41 -05:00
Grouping by crate allows spotting unwanted dependencies easier.
2020-10-07 05:50:46 -05:00
## Import Style
Qualify items from `hir` and `ast` .
2020-08-24 05:49:36 -05:00
```rust
2021-01-07 11:11:55 -06:00
// GOOD
2020-10-07 05:50:46 -05:00
use syntax::ast;
2021-01-07 11:11:55 -06:00
fn frobnicate(func: hir::Function, strukt: ast::Struct) {}
2020-10-07 05:50:46 -05:00
2021-01-07 11:11:55 -06:00
// BAD
2020-10-07 05:50:46 -05:00
use hir::Function;
2021-01-07 11:11:55 -06:00
use syntax::ast::Struct;
2020-10-07 05:50:46 -05:00
2021-01-07 11:11:55 -06:00
fn frobnicate(func: Function, strukt: Struct) {}
2020-10-07 05:50:46 -05:00
```
2021-01-07 11:27:22 -06:00
**Rationale:** avoids name clashes, makes the layer clear at a glance.
2020-10-07 05:50:46 -05:00
2021-01-07 10:27:47 -06:00
When implementing traits from `std::fmt` or `std::ops` , import the module:
2020-10-07 05:50:46 -05:00
```rust
2021-01-07 11:11:55 -06:00
// GOOD
2020-10-07 05:50:46 -05:00
use std::fmt;
impl fmt::Display for RenameError {
fn fmt(& self, f: & mut fmt::Formatter< '_>) -> fmt::Result { .. }
2020-08-24 05:49:36 -05:00
}
2021-01-07 11:11:55 -06:00
// BAD
2020-10-07 05:50:46 -05:00
impl std::fmt::Display for RenameError {
fn fmt(& self, f: & mut std::fmt::Formatter< '_>) -> std::fmt::Result { .. }
2020-08-24 05:49:36 -05:00
}
2021-01-07 10:27:47 -06:00
2021-01-07 11:11:55 -06:00
// BAD
2021-01-07 10:27:47 -06:00
use std::ops::Deref;
impl Deref for Widget {
type Target = str;
fn deref(& self) -> & str { .. }
}
2020-08-24 05:49:36 -05:00
```
2021-01-07 11:27:22 -06:00
**Rationale:** overall, less typing.
2021-01-07 11:11:55 -06:00
Makes it clear that a trait is implemented, rather than used.
Avoid local `use MyEnum::*` imports.
2021-01-07 11:27:22 -06:00
**Rationale:** consistency.
2021-01-07 11:11:55 -06:00
Prefer `use crate::foo::bar` to `use super::bar` or `use self::bar::baz` .
2021-01-07 11:27:22 -06:00
**Rationale:** consistency, this is the style which works in all cases.
2021-01-07 11:11:55 -06:00
2021-05-13 05:44:47 -05:00
By default, avoid re-exports.
**Rationale:** for non-library code, re-exports introduce two ways to use something and allow for inconsistency.
2020-10-07 05:50:46 -05:00
## Order of Items
Optimize for the reader who sees the file for the first time, and wants to get a general idea about what's going on.
People read things from top to bottom, so place most important things first.
Specifically, if all items except one are private, always put the non-private item on top.
2020-08-24 05:49:36 -05:00
```rust
2021-01-07 11:11:55 -06:00
// GOOD
2020-10-16 05:50:09 -05:00
pub(crate) fn frobnicate() {
Helper::act()
}
#[derive(Default)]
struct Helper { stuff: i32 }
impl Helper {
fn act(& self) {
}
}
2021-01-07 11:11:55 -06:00
// BAD
2020-10-16 05:50:09 -05:00
#[derive(Default)]
struct Helper { stuff: i32 }
pub(crate) fn frobnicate() {
Helper::act()
2020-08-24 05:49:36 -05:00
}
2020-10-16 05:50:09 -05:00
impl Helper {
fn act(& self) {
}
}
2020-10-07 05:50:46 -05:00
```
2020-10-16 05:50:09 -05:00
If there's a mixture of private and public items, put public items first.
2021-01-07 11:11:55 -06:00
Put `struct` s and `enum` s first, functions and impls last. Order type declarations in top-down manner.
2020-10-07 05:50:46 -05:00
```rust
2021-01-07 11:11:55 -06:00
// GOOD
2020-10-16 05:50:09 -05:00
struct Parent {
children: Vec< Child >
}
struct Child;
impl Parent {
}
impl Child {
}
2021-01-07 11:11:55 -06:00
// BAD
2020-10-16 05:50:09 -05:00
struct Child;
2020-10-07 05:50:46 -05:00
2020-10-16 05:50:09 -05:00
impl Child {
}
struct Parent {
children: Vec< Child >
}
impl Parent {
2020-08-24 05:49:36 -05:00
}
```
2021-01-07 11:27:22 -06:00
**Rationale:** easier to get the sense of the API by visually scanning the file.
2021-01-07 11:11:55 -06:00
If function bodies are folded in the editor, the source code should read as documentation for the public API.
2021-07-05 16:00:39 -05:00
## Context Parameters
Some parameters are threaded unchanged through many function calls.
They determine the "context" of the operation.
Pass such parameters first, not last.
If there are several context parameters, consider packing them into a `struct Ctx` and passing it as `&self` .
```rust
// GOOD
fn dfs(graph: & Graph, v: Vertex) -> usize {
let mut visited = FxHashSet::default();
return go(graph, & mut visited, v);
fn go(graph: & Graph, visited: & mut FxHashSet< Vertex > , v: usize) -> usize {
...
}
}
// BAD
fn dfs(v: Vertex, graph: & Graph) -> usize {
fn go(v: usize, graph: & Graph, visited: & mut FxHashSet< Vertex > ) -> usize {
...
}
let mut visited = FxHashSet::default();
go(v, graph, & mut visited)
}
```
**Rationale:** consistency.
Context-first works better when non-context parameter is a lambda.
2020-10-07 05:50:46 -05:00
## Variable Naming
2020-08-02 07:37:27 -05:00
2020-10-07 05:50:46 -05:00
Use boring and long names for local variables ([yay code completion](https://github.com/rust-analyzer/rust-analyzer/pull/4162#discussion_r417130973)).
The default name is a lowercased name of the type: `global_state: GlobalState` .
Avoid ad-hoc acronyms and contractions, but use the ones that exist consistently (`db`, `ctx` , `acc` ).
2020-12-10 08:41:57 -06:00
Prefer American spelling (color, behavior).
2020-08-02 07:37:27 -05:00
2020-10-07 05:50:46 -05:00
Default names:
2020-08-02 07:37:27 -05:00
2020-10-07 05:50:46 -05:00
* `res` -- "result of the function" local variable
* `it` -- I don't really care about the name
* `n_foo` -- number of foos
* `foo_idx` -- index of `foo`
2020-08-02 07:37:27 -05:00
2020-10-15 11:14:30 -05:00
Many names in rust-analyzer conflict with keywords.
We use mangled names instead of `r#ident` syntax:
```
crate -> krate
enum -> enum_
2021-05-17 10:37:06 -05:00
fn -> func
impl -> imp
2021-08-03 09:36:06 -05:00
macro -> mac
2020-10-15 11:14:30 -05:00
mod -> module
2021-05-17 10:37:06 -05:00
struct -> strukt
trait -> trait_
type -> ty
2020-10-15 11:14:30 -05:00
```
2020-10-02 03:13:58 -05:00
2021-01-07 11:11:55 -06:00
**Rationale:** consistency.
2020-10-07 05:50:46 -05:00
## Early Returns
2020-10-02 03:13:58 -05:00
2020-10-07 05:50:46 -05:00
Do use early returns
```rust
2021-01-07 11:11:55 -06:00
// GOOD
2020-10-07 05:50:46 -05:00
fn foo() -> Option< Bar > {
if !condition() {
return None;
}
Some(...)
}
2021-01-07 11:11:55 -06:00
// BAD
2020-10-07 05:50:46 -05:00
fn foo() -> Option< Bar > {
if condition() {
Some(...)
} else {
None
}
}
```
2021-02-03 06:40:24 -06:00
**Rationale:** reduce cognitive stack usage.
2021-01-07 11:11:55 -06:00
2020-10-07 05:57:49 -05:00
## Comparisons
2021-03-03 02:23:05 -06:00
When doing multiple comparisons use `<` /`< =`, avoid `>` /`>=`.
2020-10-07 05:57:49 -05:00
2020-10-07 06:11:33 -05:00
```rust
2021-01-07 11:11:55 -06:00
// GOOD
2020-10-07 05:57:49 -05:00
assert!(lo < = x & & x < = hi);
2021-03-03 02:23:05 -06:00
assert!(r1 < l2 | | r2 < l1 ) ;
assert!(x < y ) ;
assert!(x > 0);
2020-10-07 05:57:49 -05:00
2021-01-07 11:11:55 -06:00
// BAD
2021-04-06 10:37:41 -05:00
assert!(x >= lo & & x < = hi);
2021-03-03 02:23:05 -06:00
assert!(r1 < l2 | | l1 > r2);
assert!(y > x);
assert!(0 > x);
2020-10-07 05:57:49 -05:00
```
2021-01-07 11:27:22 -06:00
**Rationale:** Less-then comparisons are more intuitive, they correspond spatially to [real line ](https://en.wikipedia.org/wiki/Real_line ).
2021-01-07 11:11:55 -06:00
2021-03-16 02:51:05 -05:00
## If-let
Avoid `if let ... { } else { }` construct, use `match` instead.
```rust
// GOOD
match ctx.expected_type.as_ref() {
Some(expected_type) => completion_ty == expected_type & & !expected_type.is_unit(),
None => false,
}
// BAD
if let Some(expected_type) = ctx.expected_type.as_ref() {
completion_ty == expected_type & & !expected_type.is_unit()
} else {
false
}
```
2021-04-02 06:52:00 -05:00
**Rationale:** `match` is almost always more compact.
2021-03-16 02:51:05 -05:00
The `else` branch can get a more precise pattern: `None` or `Err(_)` instead of `_` .
2021-01-07 11:11:55 -06:00
2021-08-02 07:59:28 -05:00
## Match Ergonomics
Don't use the `ref` keyword.
**Rationale:** consistency & simplicity.
`ref` was required before [match ergonomics ](https://github.com/rust-lang/rfcs/blob/master/text/2005-match-ergonomics.md ).
Today, it is redundant.
Between `ref` and mach ergonomics, the latter is more ergonomic in most cases, and is simpler (does not require a keyword).
2021-08-10 06:34:05 -05:00
## Functional Combinators
Use high order monadic combinators like `map` , `then` , only when they are a natural choice, don't bend the code to fit into some combinator.
If writing a chain of combinators creates friction, replace them with control flow constructs: `for` , `if` , `match` .
Mostly avoid `bool::then` and `Option::filter` .
```rust
// GOOD
if !x.cond() {
return None;
}
Some(x)
// BAD
Some(x).filter(|it| it.cond())
```
This rule is more "soft" then others, and boils down mostly to taste.
The guiding principle behind this rule is that code should be dense in computation, and sparse in the number of expressions per line.
The second example contains *less* computation -- `filter` function is an indirection for `if` , it doesn't do any useful work by itself.
At the same time, it is more crowded -- it takes more time to visually scan it.
**Rationale:** consistency, playing to language's strengths.
Rust has first-class support for imperative control flow constructs like `for` and `if` , while functions are less first-class due to lack of universal function type, currying, and non-first-class effects (`?`, `.await` ).
2021-04-02 06:52:00 -05:00
## Helper Functions
Avoid creating singe-use helper functions:
```rust
// GOOD
let buf = {
let mut buf = get_empty_buf(& mut arena);
buf.add_item(item);
buf
};
// BAD
let buf = prepare_buf(& mut arena, item);
...
fn prepare_buf(arena: & mut Arena, item: Item) -> ItemBuf {
let mut res = get_empty_buf(& mut arena);
res.add_item(item);
res
}
```
Exception: if you want to make use of `return` or `?` .
**Rationale:** single-use functions change frequently, adding or removing parameters adds churn.
A block serves just as well to delineate a bit of logic, but has access to all the context.
Re-using originally single-purpose function often leads to bad coupling.
2021-07-04 03:12:41 -05:00
## Local Helper Functions
Put nested helper functions at the end of the enclosing functions
(this requires using return statement).
Don't nest more than one level deep.
```rust
// GOOD
fn dfs(graph: & Graph, v: Vertex) -> usize {
let mut visited = FxHashSet::default();
return go(graph, & mut visited, v);
fn go(graph: & Graph, visited: & mut FxHashSet< Vertex > , v: usize) -> usize {
...
}
}
// BAD
fn dfs(graph: & Graph, v: Vertex) -> usize {
fn go(graph: & Graph, visited: & mut FxHashSet< Vertex > , v: usize) -> usize {
...
}
let mut visited = FxHashSet::default();
go(graph, & mut visited, v)
}
```
**Rationale:** consistency, improved top-down readability.
2021-04-02 06:52:00 -05:00
## Helper Variables
2021-04-05 05:02:47 -05:00
Introduce helper variables freely, especially for multiline conditions:
```rust
// GOOD
let rustfmt_not_installed =
captured_stderr.contains("not installed") || captured_stderr.contains("not available");
match output.status.code() {
Some(1) if !rustfmt_not_installed => Ok(None),
_ => Err(format_err!("rustfmt failed:\n{}", captured_stderr)),
};
// BAD
match output.status.code() {
Some(1)
if !captured_stderr.contains("not installed")
& & !captured_stderr.contains("not available") => Ok(None),
_ => Err(format_err!("rustfmt failed:\n{}", captured_stderr)),
};
```
2021-04-02 06:52:00 -05:00
2021-07-26 17:15:40 -05:00
**Rationale:** Like blocks, single-use variables are a cognitively cheap abstraction, as they have access to all the context.
2021-04-02 06:52:00 -05:00
Extra variables help during debugging, they make it easy to print/view important intermediate results.
2021-07-26 17:15:40 -05:00
Giving a name to a condition inside an `if` expression often improves clarity and leads to nicely formatted code.
2021-04-02 06:52:00 -05:00
2021-01-10 09:47:12 -06:00
## Token names
Use `T![foo]` instead of `SyntaxKind::FOO_KW` .
```rust
// GOOD
match p.current() {
T![true] | T![false] => true,
_ => false,
}
// BAD
match p.current() {
SyntaxKind::TRUE_KW | SyntaxKind::FALSE_KW => true,
_ => false,
}
```
**Rationale:** The macro uses the familiar Rust syntax, avoiding ambiguities like "is this a brace or bracket?".
2020-10-07 05:50:46 -05:00
## Documentation
2021-05-10 11:04:41 -05:00
Style inline code comments as proper sentences.
Start with a capital letter, end with a dot.
```rust
// GOOD
// Only simple single segment paths are allowed.
MergeBehavior::Last => {
tree.use_tree_list().is_none() & & tree.path().map(path_len) < = Some(1)
}
// BAD
// only simple single segment paths are allowed
MergeBehavior::Last => {
tree.use_tree_list().is_none() & & tree.path().map(path_len) < = Some(1)
}
```
**Rationale:** writing a sentence (or maybe even a paragraph) rather just "a comment" creates a more appropriate frame of mind.
It tricks you into writing down more of the context you keep in your head while coding.
2020-10-07 05:50:46 -05:00
For `.md` and `.adoc` files, prefer a sentence-per-line format, don't wrap lines.
If the line is too long, you want to split the sentence in two :-)
2021-01-07 11:11:55 -06:00
2021-05-06 09:40:36 -05:00
**Rationale:** much easier to edit the text and read the diff, see [this link ](https://asciidoctor.org/docs/asciidoc-recommended-practices/#one-sentence-per-line ).