add a test case for issue #32031
I propose a test case to finish the fix for issue #32031. Please review this commit thoroughly, as I have never written a codegen test before.
r? @eddyb
This commit is an implementation of [RFC 1513] which allows applications to
alter the behavior of panics at compile time. A new compiler flag, `-C panic`,
is added and accepts the values `unwind` or `abort`, with the default being
`unwind`. This model affects how code is generated for the local crate, skipping
generation of landing pads with `-C panic=abort`.
[RFC 1513]: https://github.com/rust-lang/rfcs/blob/master/text/1513-less-unwinding.md
Panic implementations are then provided by crates tagged with
`#![panic_runtime]` and lazily required by crates with
`#![needs_panic_runtime]`. The panic strategy (`-C panic` value) of the panic
runtime must match the final product, and if the panic strategy is not `abort`
then the entire DAG must have the same panic strategy.
With the `-C panic=abort` strategy, users can expect a stable method to disable
generation of landing pads, improving optimization in niche scenarios,
decreasing compile time, and decreasing output binary size. With the `-C
panic=unwind` strategy users can expect the existing ability to isolate failure
in Rust code from the outside world.
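As a rough illustration (a sketch, not code from this change): with `-C panic=unwind`
the call below needs a landing pad so that `_guard` is dropped during unwinding,
while with `-C panic=abort` no landing pad is generated for it.
```rust
fn may_panic(x: u32) -> u32 {
    if x == 0 {
        panic!("zero"); // with -C panic=abort this aborts instead of unwinding
    }
    100 / x
}

fn main() {
    let _guard = String::from("dropped during unwinding");
    // Under -C panic=unwind this call gets a landing pad that drops `_guard`;
    // under -C panic=abort it is a plain call with no landing pad.
    let n = may_panic(std::env::args().count() as u32);
    println!("{}", n);
}
```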
Organizationally, this commit dismantles the `sys_common::unwind` module,
moving some of it into `libpanic_unwind` and the rest into the
`panicking` module in libstd. The custom panic runtime support is pretty similar
to the custom allocator support with the only major difference being how the
panic runtime is injected (takes the `-C panic` flag into account).
`fast`, a.k.a. UnsafeAlgebra, is the flag for enabling all "unsafe"
(according to LLVM) float optimizations.
See the LangRef for more information: http://llvm.org/docs/LangRef.html#fast-math-flags
Providing these operations with less precise associativity rules (for
example) is useful for numerical applications.
For example, the summation loop:
    let mut sum = 0.;
    for element in data {
        sum += *element;
    }
Using the default floating point semantics, this loop expresses that the
floats must be added in a sequence, one after another. This constraint
is usually completely unintended, and it means that no autovectorization
is possible.
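As a hedged sketch, the unstable `fadd_fast` intrinsic (nightly-only, behind
`#![feature(core_intrinsics)]`) is the kind of operation this enables: it tells
LLVM that the additions may be reassociated, so the loop can be vectorized.
```rust
#![feature(core_intrinsics)]
use std::intrinsics::fadd_fast;

fn sum_fast(data: &[f64]) -> f64 {
    let mut sum = 0.;
    for element in data {
        // unsafe: "fast" float math may reorder operations and lose precision
        sum = unsafe { fadd_fast(sum, *element) };
    }
    sum
}

fn main() {
    println!("{}", sum_fast(&[1.0, 2.0, 3.0]));
}
```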
LLVM's memory dependence analysis doesn't properly account for calls
that could unwind and thus effectively act as a branching point. This
can lead to stores that are only visible when the call unwinds being
removed, possibly leading to calls to drop() functions with corrupted
memory contents.
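A hedged illustration of the kind of code affected (not a test from this
change): the push below is a store through a `&mut` argument that must remain
visible to the caller if the call unwinds.
```rust
use std::panic::{catch_unwind, AssertUnwindSafe};

fn push_then_panic(v: &mut Vec<i32>, fail: bool) {
    v.push(1); // store through a &mut argument
    if fail {
        panic!("unwind"); // the push above must stay visible after unwinding
    }
}

fn main() {
    let mut v = Vec::new();
    let _ = catch_unwind(AssertUnwindSafe(|| push_then_panic(&mut v, true)));
    assert_eq!(v, [1]); // observes the store even though the call unwound
}
```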
As there is no fix for this in LLVM yet and we want to keep
compatibility with current LLVM versions anyway, we have to work around
this bug by omitting the noalias attribute on &mut function arguments.
Benchmarks suggest that the performance loss by this change is very
small.
Thanks to @RalfJung for pushing me towards not removing too many
noalias annotations and @alexcrichton for helping out with the test for
this bug.
Fixes #29485
If a new cleanup is added to a cleanup scope, the cached exits for that
scope are cleared, so all previous cleanups have to be translated
again. In the worst case this means that we get N distinct landing pads
where the last one has N cleanups, then N-1 and so on.
As new cleanups are to be executed before older ones, we can instead
cache the number of already translated cleanups in addition to the
block that contains them, and then only translate the new ones, if any,
and jump to the cached ones, getting away with linear growth instead.
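A hedged sketch of the caching idea; the names (`CachedExit`, `get_exit`) are
illustrative stand-ins, not the actual rustc internals.
```rust
struct CachedExit {
    block: usize,         // stand-in for the previously emitted exit block
    cleanups_done: usize, // how many cleanups that block already covers
}

fn get_exit(cleanups: &[&str], cache: &mut Option<CachedExit>) -> usize {
    let done = cache.as_ref().map_or(0, |c| c.cleanups_done);
    if done == cleanups.len() {
        if let Some(c) = cache.as_ref() {
            return c.block; // nothing new was added: reuse the cached block
        }
    }
    // Translate only the cleanups added since the cached block was built,
    // newest first ...
    for c in cleanups[done..].iter().rev() {
        println!("translate cleanup: {}", c);
    }
    // ... and finish by jumping to the cached block holding the older ones.
    if let Some(c) = cache.as_ref() {
        println!("jump to cached block {}", c.block);
    }
    let block = cleanups.len(); // stand-in for the id of the new exit block
    *cache = Some(CachedExit { block, cleanups_done: cleanups.len() });
    block
}

fn main() {
    let mut cache = None;
    get_exit(&["drop a"], &mut cache);           // translates "drop a"
    get_exit(&["drop a", "drop b"], &mut cache); // translates only "drop b"
}
```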
For the crate in #31381 this reduces the compile time for an optimized
build from >20 minutes (I cancelled the build at that point) to about 11
seconds. Testing a few crates that come with rustc shows compile time
improvements somewhere between 1 and 8%. The "big" winner is
rustc_platform_intrinsics, which features code similar to that in #31381.
Fixes #31381
Since fat pointers do not qualify as structural types, they were copied
using load_ty and store_ty, which means that we load an FCA (first-class
aggregate) and use
extractvalue to get the components of the fat pointer. This breaks
certain optimizations in LLVM.
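For illustration (a sketch, not taken from the change): a plain fat-pointer
copy like the one below is the kind of operation that used to be translated as
an FCA load plus extractvalue.
```rust
fn main() {
    let src: &[u8] = b"abc";
    // Copying a fat pointer copies the (data pointer, length) pair.
    let dst: &[u8] = src;
    assert_eq!(dst.len(), 3);
}
```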
Found via apasel422/ref_count#13
For enum variants, the default alignment for a specific variant might be
lower than the alignment of the enum type itself. In such cases we, for
example, generate memcpy calls with an alignment that's higher than the
alignment of the constant we copy from.
To avoid that, we need to explicitly set the required alignment on
constants.
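A hedged illustration of the situation (the names are made up for the example):
the enum type below is 8-byte aligned because of the `u64` variant, while a
constant of the `Small` variant would naturally get a lower alignment.
```rust
enum Value {
    Small(u8),
    Big(u64),
}

// The constant only contains the `Small` variant, but copies of it may be
// performed with the alignment of `Value` as a whole, so the constant itself
// has to be emitted with that (higher) alignment.
const SMALL: Value = Value::Small(1);

fn main() {
    let copy = SMALL; // copying the constant into a local
    match copy {
        Value::Small(v) => println!("small: {}", v),
        Value::Big(v) => println!("big: {}", v),
    }
}
```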
Fixes #28912.
By putting an "unreachable" instruction into the default arm of a switch
instruction we can let LLVM know that the match is exhaustive, allowing
for better optimizations.
For example, this match:
```rust
pub enum Enum {
    One,
    Two,
    Three,
}

impl Enum {
    pub fn get_disc(self) -> u8 {
        match self {
            Enum::One => 0,
            Enum::Two => 1,
            Enum::Three => 2,
        }
    }
}
```
Currently compiles to this on x86_64:
```asm
    .cfi_startproc
    movzbl  %dil, %ecx
    cmpl    $1, %ecx
    setne   %al
    testb   %cl, %cl
    je      .LBB0_2
    incb    %al
    movb    %al, %dil
.LBB0_2:
    movb    %dil, %al
    retq
.Lfunc_end0:
```
But with this change we get:
```asm
    .cfi_startproc
    movb    %dil, %al
    retq
.Lfunc_end0:
```
Currently, we're generating adjustments, for example, to get from &[u8]
to &[u8], which is unneeded and kicks us out of trans_into() into
trans(), which means an additional stack slot and copy in the unoptimized
code.
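A hedged example of the kind of code involved (not from the change): `v`
already has the target type `&[u8]`, so no adjustment should be generated when
passing it on.
```rust
fn total_len(v: &[u8]) -> usize {
    v.len()
}

fn main() {
    let v: &[u8] = b"abc";
    // Passing `v` needs no &[u8] -> &[u8] adjustment at all; generating one
    // used to force an extra stack slot and copy in unoptimized builds.
    println!("{}", total_len(v));
}
```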
Unwinding across an FFI boundary is undefined behaviour, so we can mark
all external functions as nounwind. The obvious exceptions are those
functions that actually perform the unwinding.
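For illustration (a sketch, not from the change): a foreign declaration like
the one below can be marked `nounwind`, while the unwinding entry points
themselves cannot.
```rust
extern "C" {
    // `abs` comes from libc; it never unwinds back into Rust, so the
    // declaration can be marked nounwind.
    fn abs(input: i32) -> i32;
}

fn main() {
    println!("{}", unsafe { abs(-3) });
}
```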
This has a number of advantages compared to creating a copy in memory
and passing a pointer. The obvious one is that we don't have to put the
data into memory but can keep it in registers. Since we're currently
passing a pointer anyway (instead of using e.g. a known offset on the
stack, which is what the `byval` attribute would achieve), we only use a
single additional register for each fat pointer, but save at least two
pointers worth of stack in exchange (sometimes more because more than
one copy gets eliminated). On archs that pass arguments on the stack, we
save a pointer worth of stack even without considering the omitted
copies.
Additionally, LLVM can optimize the code a lot better, to a large degree
due to the fact that lots of copies are gone or can be optimized away.
Additionally, we can now emit attributes like nonnull on the data and/or
vtable pointers contained in the fat pointer, potentially allowing for
even more optimizations.
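A hedged illustration (not from the change): the slice argument below is a fat
pointer, and with this change it is passed as two immediate values (data
pointer and length) instead of as a pointer to a stack copy of the pair.
```rust
pub fn first_byte(s: &[u8]) -> Option<u8> {
    // `s` arrives as (data pointer, length) in two registers rather than as a
    // pointer to a (pointer, length) pair spilled to the caller's stack.
    s.first().cloned()
}

fn main() {
    assert_eq!(first_byte(b"abc"), Some(b'a'));
}
```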
This results in LLVM passes being about 3-7% faster (depending on the
crate), and the resulting code is also a few percent smaller, for
example:
   text      data   filename
5671479   3941461   before/librustc-d8ace771.so
5447663   3905745   after/librustc-d8ace771.so
1944425   2394024   before/libstd-d8ace771.so
1896769   2387610   after/libstd-d8ace771.so
I had to remove a call in the backtrace-debuginfo test, because LLVM can
now merge the tails of some blocks when optimizations are turned on,
which can't correctly preserve line info.
Fixes #22924
Cc #22891 (at least for fat pointers the code is good now)
Unlike coercing from reference to unsafe pointer, coercing between two
unsafe pointers doesn't need an AutoDerefRef, because there is no region
that regionck would need to know about.
In unoptimized libcore, this reduces the number of "auto_deref" allocas
from 174 to 4.
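For illustration (a sketch, not from the change): the `*mut u8` to `*const u8`
coercion below involves no region, unlike coercing a reference to an unsafe
pointer.
```rust
fn main() {
    let mut x = 0u8;
    let m: *mut u8 = &mut x;
    // Unsafe-pointer-to-unsafe-pointer coercion: no region is involved, so no
    // AutoDerefRef (and no "auto_deref" alloca) is needed.
    let c: *const u8 = m;
    unsafe {
        println!("{}", *c);
    }
}
```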
This commit updates the LLVM submodule in use to the current HEAD of the LLVM
repository. This is primarily being done to start picking up unwinding support
for MSVC, which is currently unimplemented in the revision of LLVM we are using.
Along the way a few changes had to be made:
* As usual, lots of C++ debuginfo bindings in LLVM changed, so there were some
  significant changes to our RustWrapper.cpp
* As usual, some pass management changed in LLVM, so clang was re-scrutinized to
  ensure that we're doing the same thing it does.
* Some optimization options are now passed directly into the
  `PassManagerBuilder` instead of through CLI switches to LLVM.
* The `NoFramePointerElim` option was removed from LLVM, favoring the
  `no-frame-pointer-elim` function attribute instead.
Additionally, LLVM has picked up some new optimizations which required fixing an
existing soundness hole in the IR we generate. It appears that the current LLVM
we use does not expose this hole. When an enum is moved, the previous slot in
memory is overwritten with a bit pattern corresponding to "dropped". When the
drop glue for this slot is run, however, the switch on the discriminant can
often start executing the `unreachable` block of the switch due to the
discriminant now being outside the normal range. This was patched over locally
for now by having the `unreachable` block just change to a `ret void`.
The current codegen tests only compare IR line counts between similar
Rust and C programs, the latter getting compiled with clang. That looked
like a good idea back then, but actually things like lifetime intrinsics
mean that less IR isn't always better, so the metric isn't really
helpful.
Instead, we can start doing tests that check specific aspects of the
generated IR, like attributes or metadata. To do that, we can use LLVM's
FileCheck tool which has a number of useful features for such tests.
To start off, I created some tests for a few things that were recently
added and/or broken.
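For reference, a hedged sketch of what such a FileCheck-based test can look
like (the flags and the exact CHECK pattern here are illustrative, not copied
from the tests added in this change):
```rust
// compile-flags: -C no-prepopulate-passes
#![crate_type = "lib"]

// The reference argument should get a `dereferenceable` attribute in the IR.
// CHECK: define{{.*}}@deref({{.*}}dereferenceable(8){{.*}})
#[no_mangle]
pub fn deref(x: &u64) -> u64 {
    *x
}
```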
This breaks code like:
    struct Foo {
        ...
    }

    pub fn make_foo() -> Foo {
        ...
    }
Change this code to:
    pub struct Foo { // note `pub`
        ...
    }

    pub fn make_foo() -> Foo {
        ...
    }
The `visible_private_types` lint has been removed, since it is now an
error to attempt to expose a private type in a public API. In its place
a `#[feature(visible_private_types)]` gate has been added.
Closes #16463.
RFC #48.
[breaking-change]
When there is only a single store to the ret slot that dominates the
load that gets the value for the "ret" instruction, we can elide the
ret slot and directly return the operand of the dominating store
instruction. This is the same thing that clang does, except for a
special case that doesn't seem to affect us.
Fixes #8238