bors 00978a9879 Auto merge of #24383 - avdi:patch-1, r=steveklabnik
"Dynamically typed" didn't seem like a relevant distinction; there are statically-compiled dynamically-typed languages. Another term that might work here (despite being notoriously vague) is "scripting languages".
2015-04-18 23:35:50 +00:00
..
2015-04-19 00:54:42 +05:30

% The Rust Programming Language

Welcome! This book will teach you about the Rust Programming Language. Rust is a systems programming language focused on three goals: safety, speed, and concurrency. It maintains these goals without having a garbage collector, making it a useful language for a number of use cases other languages arent good at: embedding in other languages, programs with specific space and time requirements, and writing low-level code, like device drivers and operating systems. It improves on current languages targeting this space by having a number of compile-time safety checks that produce no runtime overhead, while eliminating all data races. Rust also aims to achieve zero-cost abstrations even though some of these abstractions feel like those of a high-level language. Even then, Rust still allows precise control like a low-level language would.

“The Rust Programming Language” is split into seven sections. This introduction is the first. After this:

After reading this introduction, youll want to dive into either Learn Rust or Syntax and Semantics, depending on your preference: Learn Rust if you want to dive in with a project, or Syntax and Semantics if you prefer to start small, and learn a single concept thoroughly before moving onto the next. Copious cross-linking connects these parts together.

A brief introduction to Rust

Is Rust a language you might be interested in? Lets examine a few small code samples to show off a few of its strengths.

The main concept that makes Rust unique is called ownership. Consider this small example:

fn main() {
    let mut x = vec!["Hello", "world"];
}

This program makes a variable binding named x. The value of this binding is a Vec<T>, a vector, that we create through a macro defined in the standard library. This macro is called vec, and we invoke macros with a !. This follows a general principle of Rust: make things explicit. Macros can do significantly more complicated things than function calls, and so theyre visually distinct. The ! also helps with parsing, making tooling easier to write, which is also important.

We used mut to make x mutable: bindings are immutable by default in Rust. Well be mutating this vector later in the example.

Its also worth noting that we didnt need a type annotation here: while Rust is statically typed, we didnt need to explicitly annotate the type. Rust has type inference to balance out the power of static typing with the verbosity of annotating types.

Rust prefers stack allocation to heap allocation: x is placed directly on the stack. However, the Vec<T> type allocates space for the elements of the vector on the heap. If youre not familiar with this distinction, you can ignore it for now, or check out The Stack and the Heap. As a systems programming language, Rust gives you the ability to control how your memory is allocated, but when were getting started, its less of a big deal.

Earlier, we mentioned that ownership is the key new concept in Rust. In Rust parlance, x is said to own the vector. This means that when x goes out of scope, the vectors memory will be de-allocated. This is done deterministically by the Rust compiler, rather than through a mechanism such as a garbage collector. In other words, in Rust, you dont call functions like malloc and free yourself: the compiler statically determines when you need to allocate or deallocate memory, and inserts those calls itself. To err is to be human, but compilers never forget.

Lets add another line to our example:

fn main() {
    let mut x = vec!["Hello", "world"];

    let y = &x[0];
}

Weve introduced another binding, y. In this case, y is a reference to the first element of the vector. Rusts references are similar to pointers in other languages, but with additional compile-time safety checks. References interact with the ownership system by borrowing what they point to, rather than owning it. The difference is, when the reference goes out of scope, it will not deallocate the underlying memory. If it did, wed de-allocate twice, which is bad!

Lets add a third line. It looks innocent enough, but causes a compiler error:

fn main() {
    let mut x = vec!["Hello", "world"];

    let y = &x[0];

    x.push("foo");
}

push is a method on vectors that appends another element to the end of the vector. When we try to compile this program, we get an error:

error: cannot borrow `x` as mutable because it is also borrowed as immutable
    x.push(4);
    ^
note: previous borrow of `x` occurs here; the immutable borrow prevents
subsequent moves or mutable borrows of `x` until the borrow ends
    let y = &x[0];
             ^
note: previous borrow ends here
fn main() {

}
^

Whew! The Rust compiler gives quite detailed errors at times, and this is one of those times. As the error explains, while we made our binding mutable, we still cannot call push. This is because we already have a reference to an element of the vector, y. Mutating something while another reference exists is dangerous, because we may invalidate the reference. In this specific case, when we create the vector, we may have only allocated space for three elements. Adding a fourth would mean allocating a new chunk of memory for all those elements, copying the old values over, and updating the internal pointer to that memory. That all works just fine. The problem is that y wouldnt get updated, and so wed have a dangling pointer. Thats bad. Any use of y would be an error in this case, and so the compiler has caught this for us.

So how do we solve this problem? There are two approaches we can take. The first is making a copy rather than using a reference:

fn main() {
    let mut x = vec!["Hello", "world"];

    let y = x[0].clone();

    x.push("foo");
}

Rust has move semantics by default, so if we want to make a copy of some data, we call the clone() method. In this example, y is no longer a reference to the vector stored in x, but a copy of its first element, "hello". Now that we dont have a reference, our push() works just fine.

If we truly want a reference, we need the other option: ensure that our reference goes out of scope before we try to do the mutation. That looks like this:

fn main() {
    let mut x = vec!["Hello", "world"];

    {
        let y = &x[0];
    }

    x.push("foo");
}

We created an inner scope with an additional set of curly braces. y will go out of scope before we call push(), and so were all good.

This concept of ownership isnt just good for preventing danging pointers, but an entire set of related problems, like iterator invalidation, concurrency, and more.