rust/src/doc/trpl/functions.md
Pascal Hertleif 6f69cd6387 TRPL: Add rust Marker to Some Code Block
This adds strictly more information to the source files and reduces the
need for customized tooling to render the book.

(While this should not change the output of _rustbook_, it is very
useful when rendering the sources with external tools like Pandoc.)
2015-05-18 20:56:00 +02:00

6.0 KiB
Raw Blame History

% Functions

Every Rust program has at least one function, the main function:

fn main() {
}

This is the simplest possible function declaration. As we mentioned before, fn says this is a function, followed by the name, some parentheses because this function takes no arguments, and then some curly braces to indicate the body. Heres a function named foo:

fn foo() {
}

So, what about taking arguments? Heres a function that prints a number:

fn print_number(x: i32) {
    println!("x is: {}", x);
}

Heres a complete program that uses print_number:

fn main() {
    print_number(5);
}

fn print_number(x: i32) {
    println!("x is: {}", x);
}

As you can see, function arguments work very similar to let declarations: you add a type to the argument name, after a colon.

Heres a complete program that adds two numbers together and prints them:

fn main() {
    print_sum(5, 6);
}

fn print_sum(x: i32, y: i32) {
    println!("sum is: {}", x + y);
}

You separate arguments with a comma, both when you call the function, as well as when you declare it.

Unlike let, you must declare the types of function arguments. This does not work:

fn print_sum(x, y) {
    println!("sum is: {}", x + y);
}

You get this error:

expected one of `!`, `:`, or `@`, found `)`
fn print_number(x, y) {

This is a deliberate design decision. While full-program inference is possible, languages which have it, like Haskell, often suggest that documenting your types explicitly is a best-practice. We agree that forcing functions to declare types while allowing for inference inside of function bodies is a wonderful sweet spot between full inference and no inference.

What about returning a value? Heres a function that adds one to an integer:

fn add_one(x: i32) -> i32 {
    x + 1
}

Rust functions return exactly one value, and you declare the type after an arrow, which is a dash (-) followed by a greater-than sign (>). The last line of a function determines what it returns. Youll note the lack of a semicolon here. If we added it in:

fn add_one(x: i32) -> i32 {
    x + 1;
}

We would get an error:

error: not all control paths return a value
fn add_one(x: i32) -> i32 {
     x + 1;
}

help: consider removing this semicolon:
     x + 1;
          ^

This reveals two interesting things about Rust: it is an expression-based language, and semicolons are different from semicolons in other curly brace and semicolon-based languages. These two things are related.

Expressions vs. Statements

Rust is primarily an expression-based language. There are only two kinds of statements, and everything else is an expression.

So what's the difference? Expressions return a value, and statements do not. Thats why we end up with not all control paths return a value here: the statement x + 1; doesnt return a value. There are two kinds of statements in Rust: declaration statements and expression statements. Everything else is an expression. Lets talk about declaration statements first.

In some languages, variable bindings can be written as expressions, not just statements. Like Ruby:

x = y = 5

In Rust, however, using let to introduce a binding is not an expression. The following will produce a compile-time error:

let x = (let y = 5); // expected identifier, found keyword `let`

The compiler is telling us here that it was expecting to see the beginning of an expression, and a let can only begin a statement, not an expression.

Note that assigning to an already-bound variable (e.g. y = 5) is still an expression, although its value is not particularly useful. Unlike other languages where an assignment evaluates to the assigned value (e.g. 5 in the previous example), in Rust the value of an assignment is an empty tuple ():

let mut y = 5;

let x = (y = 6);  // x has the value `()`, not `6`

The second kind of statement in Rust is the expression statement. Its purpose is to turn any expression into a statement. In practical terms, Rust's grammar expects statements to follow other statements. This means that you use semicolons to separate expressions from each other. This means that Rust looks a lot like most other languages that require you to use semicolons at the end of every line, and you will see semicolons at the end of almost every line of Rust code you see.

What is this exception that makes us say "almost"? You saw it already, in this code:

fn add_one(x: i32) -> i32 {
    x + 1
}

Our function claims to return an i32, but with a semicolon, it would return () instead. Rust realizes this probably isnt what we want, and suggests removing the semicolon in the error we saw before.

Early returns

But what about early returns? Rust does have a keyword for that, return:

fn foo(x: i32) -> i32 {
    return x;

    // we never run this code!
    x + 1
}

Using a return as the last line of a function works, but is considered poor style:

fn foo(x: i32) -> i32 {
    return x + 1;
}

The previous definition without return may look a bit strange if you havent worked in an expression-based language before, but it becomes intuitive over time.

Diverging functions

Rust has some special syntax for diverging functions, which are functions that do not return:

fn diverges() -> ! {
    panic!("This function never returns!");
}

panic! is a macro, similar to println!() that weve already seen. Unlike println!(), panic!() causes the current thread of execution to crash with the given message.

Because this function will cause a crash, it will never return, and so it has the type !, which is read diverges. A diverging function can be used as any type:

# fn diverges() -> ! {
#    panic!("This function never returns!");
# }
let x: i32 = diverges();
let x: String = diverges();