2015-01-08 10:27:03 -08:00
|
|
|
% Strings
|
2014-12-02 09:20:48 -05:00
|
|
|
|
|
|
|
Strings are an important concept for any programmer to master. Rust's string
|
|
|
|
handling system is a bit different from other languages, due to its systems
|
|
|
|
focus. Any time you have a data structure of variable size, things can get
|
|
|
|
tricky, and strings are a re-sizable data structure. That being said, Rust's
|
|
|
|
strings also work differently than in some other systems languages, such as C.
|
|
|
|
|
2015-01-08 16:52:50 -08:00
|
|
|
Let's dig into the details. A *string* is a sequence of Unicode scalar values
|
2014-12-02 09:20:48 -05:00
|
|
|
encoded as a stream of UTF-8 bytes. All strings are guaranteed to be
|
|
|
|
validly encoded UTF-8 sequences. Additionally, strings are not null-terminated
|
|
|
|
and can contain null bytes.
|
|
|
|
|
|
|
|
Rust has two main types of strings: `&str` and `String`.
|
|
|
|
|
2015-01-08 16:52:50 -08:00
|
|
|
The first kind is a `&str`. These are called *string slices*. String literals
|
2014-12-02 09:20:48 -05:00
|
|
|
are of the type `&str`:
|
|
|
|
|
|
|
|
```{rust}
|
|
|
|
let string = "Hello there."; // string: &str
|
|
|
|
```
|
|
|
|
|
|
|
|
This string is statically allocated, meaning that it's saved inside our
|
|
|
|
compiled program, and exists for the entire duration it runs. The `string`
|
|
|
|
binding is a reference to this statically allocated string. String slices
|
|
|
|
have a fixed size, and cannot be mutated.
|
|
|
|
|
2015-02-19 18:39:38 -08:00
|
|
|
A `String`, on the other hand, is a heap-allocated string. This string
|
|
|
|
is growable, and is also guaranteed to be UTF-8. `String`s are
|
|
|
|
commonly created by converting from a string slice using the
|
|
|
|
`to_string` method.
|
2014-12-02 09:20:48 -05:00
|
|
|
|
|
|
|
```{rust}
|
|
|
|
let mut s = "Hello".to_string(); // mut s: String
|
|
|
|
println!("{}", s);
|
|
|
|
|
|
|
|
s.push_str(", world.");
|
|
|
|
println!("{}", s);
|
|
|
|
```
|
|
|
|
|
2015-02-02 00:46:34 +00:00
|
|
|
`String`s will coerce into `&str` with an `&`:
|
2014-12-02 09:20:48 -05:00
|
|
|
|
2015-01-30 11:17:26 -05:00
|
|
|
```
|
2014-12-02 09:20:48 -05:00
|
|
|
fn takes_slice(slice: &str) {
|
|
|
|
println!("Got: {}", slice);
|
|
|
|
}
|
|
|
|
|
|
|
|
fn main() {
|
|
|
|
let s = "Hello".to_string();
|
2015-01-30 11:17:26 -05:00
|
|
|
takes_slice(&s);
|
2014-12-02 09:20:48 -05:00
|
|
|
}
|
|
|
|
```
|
|
|
|
|
|
|
|
Viewing a `String` as a `&str` is cheap, but converting the `&str` to a
|
|
|
|
`String` involves allocating memory. No reason to do that unless you have to!
|
|
|
|
|
|
|
|
That's the basics of strings in Rust! They're probably a bit more complicated
|
|
|
|
than you are used to, if you come from a scripting language, but when the
|
|
|
|
low-level details matter, they really matter. Just remember that `String`s
|
|
|
|
allocate memory and control their data, while `&str`s are a reference to
|
|
|
|
another string, and you'll be all set.
|