mikros/rust - rust - Gitea.pterpstra.com

Author	SHA1	Message	Date
bors	42857db66d	Auto merge of #115232 - wesleywiser:revert_114643, r=tmiasko Revert "Use the same DISubprogram for each instance of the same inline function within the caller" This reverts commit `687bffa493`. Reverting to resolve ICEs reported on nightly. cc `@dpaoliello` Fixes #115156	2023-08-26 07:47:26 +00:00
Ralf Jung	0fde82fb97	codegen_llvm/llvm_type: avoid matching on the Rust type	2023-08-26 08:34:56 +02:00
Scott McMurray	84e305dd93	Stop emitting non-power-of-two vectors in basic LLVM codegen	2023-08-25 20:06:57 -07:00
Wesley Wiser	d0b2c4f727	Revert "Use the same DISubprogram for each instance of the same inlined function within the caller" This reverts commit `687bffa493`. Reverting to resolve ICEs reported on nightly.	2023-08-25 19:49:10 -04:00
bors	154ae32a55	Auto merge of #114643 - dpaoliello:inlinedebuginfo, r=wesleywiser Use the same DISubprogram for each instance of the same inlined function within a caller # Issue Details: The call to `panic` within a function like `Option::unwrap` is translated to LLVM as a `tail call` (as it will never return). When multiple calls to the same function like this are inlined, LLVM will notice the common `tail call` block (i.e., loading the same panic string + location info and then calling `panic`) and merge them together. When merging these instructions together, LLVM will also attempt to merge the debug locations, but this fails (i.e., debug info is dropped) because Rust emits a new `DISubprogram` at each inline site, so LLVM doesn't recognize that these are actually the same function and thinks that there isn't a common debug location. As an example of this when building for x86_64 Windows (note the lack of `.cv_loc` before the call to `panic`, thus it will be attributed to the same line as the `addq` instruction): ``` .cv_loc 0 1 23 0 # src\lib.rs:23:0 addq $40, %rsp retq leaq .Lalloc_f570dea0a53168780ce9a91e67646421(%rip), %rcx leaq .Lalloc_629ace53b7e5b76aaa810d549cc84ea3(%rip), %r8 movl $43, %edx callq _ZN4core9panicking5panic17h12e60b9063f6dee8E int3 ``` # Fix Details: Cache the `DISubprogram` emitted for each inlined function instance within a caller so that it can be reused if that instance is encountered again. This also requires caching the `DILexicalBlock` and `DIVariable` objects to avoid creating duplicates. 
After this change the above assembly now looks like: ``` .cv_loc 0 1 23 0 # src\lib.rs:23:0 addq $40, %rsp retq .cv_inline_site_id 5 within 0 inlined_at 1 0 0 .cv_inline_site_id 6 within 5 inlined_at 1 12 0 .cv_loc 6 2 935 0 # library\core\src\option.rs:935:0 leaq .Lalloc_5f55955de67e57c79064b537689facea(%rip), %rcx leaq .Lalloc_e741d4de8cb5801e1fd7a6c6795c1559(%rip), %r8 movl $43, %edx callq _ZN4core9panicking5panic17hde1558f32d5b1c04E int3 ```	2023-08-22 20:15:29 +00:00
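The caching idea described in the fix details above can be sketched roughly as follows. This is an illustration only, not the code from the commit (which this log shows was later reverted): `ScopeId`, `InlinedScopeCache`, and `scope_for` are hypothetical names, and a plain string stands in for whatever key identifies an inlined function instance within one caller.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for a handle to an emitted debug scope
/// (the role a `DISubprogram` plays in the description above).
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
struct ScopeId(u32);

/// Hypothetical per-caller cache: at most one scope per inlined callee instance.
#[derive(Default)]
struct InlinedScopeCache {
    scopes: HashMap<String, ScopeId>,
    /// Number of scopes actually created (cache misses).
    created: u32,
}

impl InlinedScopeCache {
    /// Returns the cached scope for `callee_instance`, creating it on first use.
    fn scope_for(&mut self, callee_instance: &str) -> ScopeId {
        if let Some(&id) = self.scopes.get(callee_instance) {
            return id; // Reuse: every inline site of this instance shares one scope.
        }
        let id = ScopeId(self.created);
        self.created += 1;
        self.scopes.insert(callee_instance.to_owned(), id);
        id
    }
}
```

With a cache like this, two inline sites of the same instance resolve to the same scope handle, which is the property the message says LLVM needs in order to merge debug locations.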
Wesley Wiser	1097e0957e	Fix spelling mistake	2023-08-22 15:30:26 -04:00
Arpad Borsos	2ceea9ae9d	Inline functions called from `add_coverage` This removes quite a bit of indirection and duplicated code related to getting the `FunctionCoverage`.	2023-08-22 10:59:19 +02:00
bors	2ceed0b6cb	Auto merge of #113814 - ChoKyuWon:master, r=davidtwco Replace the \01__gnu_mcount_nc to LLVM intrinsic for ARM The current `-Zinstrument-mcount` for ARM32 uses `\01__gnu_mcount_nc` directly as its instrumentation function. However, LLVM does not use this mcount function directly; it wraps it in an intrinsic, `llvm.arm.gnu.eabi.mcount`, and the transform pass handles only the intrinsic. As a result, `-Zinstrument-mcount` currently does not work on ARM32. See: https://github.com/namhyung/uftrace/issues/1764 This commit changes the mcount name from the native function to the LLVM intrinsic so that the transform pass can handle it.	2023-08-18 13:20:37 +00:00
ChoKyuWon	3bd54c14bc	Replace the \01__gnu_mcount_nc to LLVM intrinsic for ARM The current `-Zinstrument-mcount` for ARM32 uses `\01__gnu_mcount_nc` directly as its instrumentation function. However, LLVM does not use this mcount function directly; it wraps it in an intrinsic, `llvm.arm.gnu.eabi.mcount`, and the transform pass handles only the intrinsic. As a result, `-Zinstrument-mcount` currently does not work on ARM32. See: https://github.com/namhyung/uftrace/issues/1764 This commit changes the mcount name from the native function to the LLVM intrinsic so that the transform pass can handle it. Signed-off-by: ChoKyuWon <kyuwoncho18@gmail.com>	2023-08-18 01:52:40 +09:00
bors	c1699a79a6	Auto merge of #114467 - Amanieu:asm-unstable-features, r=davidtwco Use `unstable_target_features` when checking inline assembly This is necessary to properly validate register classes even when the relevant target feature name is still unstable.	2023-08-15 11:59:02 +00:00
dirreke	74817b7053	Upgrade Object and related deps	2023-08-14 23:05:45 +08:00
Dirreke	184a9afffb	add details for csky-unknown-linux-gnuabiv2 and add docs	2023-08-14 23:02:37 +08:00
Dirreke	d16409fe22	add a csky-unknown-linux-gnuabiv2 target	2023-08-14 23:02:36 +08:00
bors	644e8068cb	Auto merge of #113722 - bjorn3:allocator_shim_refactor, r=jackh726 Extract a create_wrapper_function for use in allocator shim writing This deduplicates some logic and makes it easier to follow what wrappers are produced. In the future it may allow moving the code to determine which wrappers to create to cg_ssa.	2023-08-13 16:49:49 +00:00
Daniel Paoliello	687bffa493	Use the same DISubprogram for each instance of the same inlined function within the caller	2023-08-11 10:21:52 -07:00
Michael Goulet	94533d924e	Rollup merge of #114622 - petrochenkov:noplugin, r=oli-obk rustc: Move `crate_types` and `stable_crate_id` from `Session` to `GlobalCtxt` Removes two pieces of mutable state. Follow up to https://github.com/rust-lang/rust/pull/114578.	2023-08-10 21:17:07 -07:00
bors	a9b2c6a0ce	Auto merge of #114005 - Zalathar:no-cstr, r=jackh726 coverage: Don't convert filename/symbol strings to `CString` for FFI LLVM APIs are usually perfectly happy to accept pointer/length strings, as long as we supply a suitable length value when creating a `StringRef` or `std::string`. This lets us avoid quite a few intermediate `CString` copies during coverage codegen. It also lets us use an `IndexSet<Symbol>` (instead of an `IndexSet<CString>`) when building the deduplicated filename table.	2023-08-10 23:06:10 +00:00
Matthias Krüger	7d78885a8e	Rollup merge of #111891 - rustbox:feat/riscv-isr-cconv, r=jackh726 feat: `riscv-interrupt-{m,s}` calling conventions Similar to prior support added for the msp430, avr, and x86 targets, this change implements the rough equivalent of clang's [`__attribute__((interrupt))`][clang-attr] for riscv targets, enabling e.g. ```rust static mut CNT: usize = 0; pub extern "riscv-interrupt-m" fn isr_m() { unsafe { CNT += 1; } } ``` to produce highly effective assembly like: ```asm pub extern "riscv-interrupt-m" fn isr_m() { 420003a0: 1141 addi sp,sp,-16 unsafe { CNT += 1; 420003a2: c62a sw a0,12(sp) 420003a4: c42e sw a1,8(sp) 420003a6: 3fc80537 lui a0,0x3fc80 420003aa: 63c52583 lw a1,1596(a0) # 3fc8063c <_ZN12esp_riscv_rt3CNT17hcec3e3a214887d53E.0> 420003ae: 0585 addi a1,a1,1 420003b0: 62b52e23 sw a1,1596(a0) } } 420003b4: 4532 lw a0,12(sp) 420003b6: 45a2 lw a1,8(sp) 420003b8: 0141 addi sp,sp,16 420003ba: 30200073 mret ``` (disassembly via `riscv64-unknown-elf-objdump -C -S --disassemble ./esp32c3-hal/target/riscv32imc-unknown-none-elf/release/examples/gpio_interrupt`) This outcome is superior to hand-coded interrupt routines which, lacking visibility into any non-assembly body of the interrupt handler, have to be very conservative and save the [entire CPU state to the stack frame][full-frame-save]. By instead asking LLVM to only save the registers that it uses, we defer the decision to the tool with the best context: it can more accurately account for the cost of spills if it knows that every additional register used is already at the cost of an implicit spill. At the LLVM level, this is apparently [implemented by] marking every register as "[callee-save]," matching the semantics of an interrupt handler nicely (it has to leave the CPU state just as it found it after its `{m\|s}ret`). This approach is not suitable for every interrupt handler, as it makes no attempt to e.g. save the state in a user-accessible stack frame. 
For a full discussion of those challenges and tradeoffs, please refer to [the interrupt calling conventions RFC][rfc]. Inside rustc, this implementation differs from prior art because LLVM does not expose the "all-saved" function flavor as a calling convention directly, instead preferring to use an attribute that allows for differentiating between "machine-mode" and "supervisor-mode" interrupts. Finally, some effort has been made to guide those who may not yet be aware of the differences between machine-mode and supervisor-mode interrupts as to why no `riscv-interrupt` calling convention is exposed through rustc, and similarly for why `riscv-interrupt-u` makes no appearance (as it would complicate future LLVM upgrades). [clang-attr]: https://clang.llvm.org/docs/AttributeReference.html#interrupt-risc-v [full-frame-save]: `9281af2ecf/src/lib.rs (L440-L469)` [implemented by]: `b7fb2a3fec/llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp (L61-L67)` [callee-save]: `973f1fe7a8/llvm/lib/Target/RISCV/RISCVCallingConv.td (L30-L37)` [rfc]: https://github.com/rust-lang/rfcs/pull/3246	2023-08-09 22:59:58 +02:00
Vadim Petrochenkov	0b89aac08d	rustc: Move `crate_types` from `Session` to `GlobalCtxt` Removes a piece of mutable state. Follow up to #114578.	2023-08-09 14:17:54 +08:00
Seth Pellegrino	897c7bb23b	feat: `riscv-interrupt-{m,s}` calling conventions Similar to prior support added for the msp430, avr, and x86 targets, this change implements the rough equivalent of clang's [`__attribute__((interrupt))`][clang-attr] for riscv targets, enabling e.g. ```rust static mut CNT: usize = 0; pub extern "riscv-interrupt-m" fn isr_m() { unsafe { CNT += 1; } } ``` to produce highly effective assembly like: ```asm pub extern "riscv-interrupt-m" fn isr_m() { 420003a0: 1141 addi sp,sp,-16 unsafe { CNT += 1; 420003a2: c62a sw a0,12(sp) 420003a4: c42e sw a1,8(sp) 420003a6: 3fc80537 lui a0,0x3fc80 420003aa: 63c52583 lw a1,1596(a0) # 3fc8063c <_ZN12esp_riscv_rt3CNT17hcec3e3a214887d53E.0> 420003ae: 0585 addi a1,a1,1 420003b0: 62b52e23 sw a1,1596(a0) } } 420003b4: 4532 lw a0,12(sp) 420003b6: 45a2 lw a1,8(sp) 420003b8: 0141 addi sp,sp,16 420003ba: 30200073 mret ``` (disassembly via `riscv64-unknown-elf-objdump -C -S --disassemble ./esp32c3-hal/target/riscv32imc-unknown-none-elf/release/examples/gpio_interrupt`) This outcome is superior to hand-coded interrupt routines which, lacking visibility into any non-assembly body of the interrupt handler, have to be very conservative and save the [entire CPU state to the stack frame][full-frame-save]. By instead asking LLVM to only save the registers that it uses, we defer the decision to the tool with the best context: it can more accurately account for the cost of spills if it knows that every additional register used is already at the cost of an implicit spill. At the LLVM level, this is apparently [implemented by] marking every register as "[callee-save]," matching the semantics of an interrupt handler nicely (it has to leave the CPU state just as it found it after its `{m\|s}ret`). This approach is not suitable for every interrupt handler, as it makes no attempt to e.g. save the state in a user-accessible stack frame. 
For a full discussion of those challenges and tradeoffs, please refer to [the interrupt calling conventions RFC][rfc]. Inside rustc, this implementation differs from prior art because LLVM does not expose the "all-saved" function flavor as a calling convention directly, instead preferring to use an attribute that allows for differentiating between "machine-mode" and "supervisor-mode" interrupts. Finally, some effort has been made to guide those who may not yet be aware of the differences between machine-mode and supervisor-mode interrupts as to why no `riscv-interrupt` calling convention is exposed through rustc, and similarly for why `riscv-interrupt-u` makes no appearance (as it would complicate future LLVM upgrades). [clang-attr]: https://clang.llvm.org/docs/AttributeReference.html#interrupt-risc-v [full-frame-save]: `9281af2ecf/src/lib.rs (L440-L469)` [implemented by]: `b7fb2a3fec/llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp (L61-L67)` [callee-save]: `973f1fe7a8/llvm/lib/Target/RISCV/RISCVCallingConv.td (L30-L37)` [rfc]: https://github.com/rust-lang/rfcs/pull/3246	2023-08-08 18:09:56 -07:00
Matthias Krüger	c097e48082	Rollup merge of #113593 - rcvalle:rust-cfi-fix-90546, r=wesleywiser CFI: Fix error compiling core with LLVM CFI enabled Fix #90546 by filtering out global value function pointer types from the type tests, and adding the LowerTypeTests pass to the rustc LTO optimization pipelines.	2023-08-08 21:44:43 +02:00
bors	f525bb4e2a	Auto merge of #114439 - Kobzol:remark-pgo-hotness, r=tmiasko Add hotness data to LLVM remarks Slight improvement of https://github.com/rust-lang/rust/pull/113040. This makes sure that if PGO is used, remarks generated using `-Zremark-dir` will include the `Hotness` attribute. r? `@tmiasko`	2023-08-08 15:41:44 +00:00
Jakub Beránek	9d417d7c86	Only enable hotness information when PGO is available	2023-08-08 15:36:55 +02:00
Ramon de C Valle	f837c48f0d	CFI: Fix error compiling core with LLVM CFI enabled Fix #90546 by filtering out global value function pointer types from the type tests, and adding the LowerTypeTests pass to the rustc LTO optimization pipelines.	2023-08-07 15:59:15 -07:00
Nikita Popov	ad7ea8b7e6	Update powerpc data layouts Function pointer alignment is specified since https://reviews.llvm.org/D147016.	2023-08-07 20:35:55 +02:00
Matthias Krüger	cbe2522652	Rollup merge of #114382 - scottmcm:compare-bytes-intrinsic, r=cjgillot Add a new `compare_bytes` intrinsic instead of calling `memcmp` directly As discussed in #113435, this lets the backends be the place that can have the "don't call the function if n == 0" logic, if it's needed for the target. (I didn't actually add those checks, though, since as I understood it we didn't actually need them on known targets?) Doing this also let me make it `const` (unstable), which I don't think `extern "C" fn memcmp` can be. cc `@RalfJung` `@Amanieu`	2023-08-07 05:29:12 +02:00
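As a rough illustration of the behaviour discussed above, the following is a safe model of a `memcmp`-style comparison with the optional "skip the call when n == 0" guard made explicit. It is a sketch under assumptions, not the intrinsic's implementation: `compare_bytes_model` is a hypothetical name, and it works on slices rather than the raw pointers a real intrinsic would take.

```rust
/// Hypothetical safe model of comparing the first `n` bytes of two buffers.
/// Returns a negative, zero, or positive value, like C's `memcmp`.
/// Panics if `n` exceeds the length of either slice.
fn compare_bytes_model(left: &[u8], right: &[u8], n: usize) -> i32 {
    // The guard a backend could insert if a target's `memcmp`
    // must not be called with a zero length.
    if n == 0 {
        return 0;
    }
    for (a, b) in left[..n].iter().zip(&right[..n]) {
        if a != b {
            // Bytes are compared as unsigned values.
            return i32::from(*a) - i32::from(*b);
        }
    }
    0
}
```

Keeping the zero-length decision inside one function mirrors the point of the change: callers do not need to know whether a given target needs the guard.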
scottmcm	75277a6606	Apply suggestions from code review Co-authored-by: Ralf Jung <post@ralfj.de>	2023-08-06 15:47:40 -07:00
Scott McMurray	502af03445	Add a new `compare_bytes` intrinsic instead of calling `memcmp` directly	2023-08-06 15:47:40 -07:00
David Tolnay	704aa56ba0	Generate better function argument names in global_allocator expansion	2023-08-06 07:36:05 -07:00
Matthias Krüger	a0fd747e38	Rollup merge of #114450 - chenyukang:yukang-fix-114435, r=compiler-errors Fix ICE failed to get layout for ReferencesError Fixes #114435 r? `@compiler-errors`	2023-08-04 21:31:57 +02:00
yukang	3d25b5c7e8	Fix ICE failed to get layout for ReferencesError	2023-08-05 01:38:14 +08:00
Amanieu d'Antras	a3ab31c0f9	Use `unstable_target_features` when checking inline assembly This is necessary to properly validate register classes even when the relevant target feature name is still unstable.	2023-08-04 16:09:54 +01:00
bjorn3	49ae3b7867	Extract a create_wrapper_function for use in allocator shim writing This deduplicates some logic and makes it easier to follow what wrappers are produced. In the future it may allow moving the code to determine which wrappers to create to cg_ssa.	2023-08-04 13:26:36 +00:00
bors	73dc6f03a2	Auto merge of #114350 - erikdesjardins:ident, r=tmiasko cg_llvm: stop identifying ADTs in LLVM IR This is an extension of https://github.com/rust-lang/rust/pull/94107. It may be a minor perf win. Fixes #96242. Now that we use opaque pointers, ADTs can no longer be recursive, so we do not need to name them. Previously, this would be necessary if you had a struct like ```rs struct Foo(Box<Foo>, u64, u64); ``` which would be represented with something like ```ll %Foo = type { %Foo*, i64, i64 } ``` which is now just ```ll { ptr, i64, i64 } ``` r? `@tmiasko`	2023-08-04 07:17:02 +00:00
Zalathar	4b154bc8e2	coverage: Don't convert symbol names to `CString` for FFI	2023-08-04 10:48:05 +10:00
Zalathar	e184118683	coverage: Don't convert filenames to `CString` for FFI	2023-08-04 10:48:05 +10:00
Oli Scherer	4457ef2c6d	Forbid old-style `simd_shuffleN` intrinsics	2023-08-03 09:29:00 +00:00
Nilstrieb	46f6b05eb7	Rollup merge of #114079 - compiler-errors:closure-upvars, r=oli-obk Use `upvar_tys` in more places, make it return a list Just a cleanup that fell out of a PR that I was gonna write, but that PR kinda got stuck.	2023-08-02 13:46:54 +02:00
Zalathar	d6ed6e3904	coverage: Consolidate FFI types into one module Coverage FFI types were historically split across two modules, because some of them were needed by code in `rustc_codegen_ssa`. Now that all of the coverage codegen code has been moved into `rustc_codegen_llvm` (#113355), it's possible to move all of the FFI types into a single module, making it easier to see all of them at once.	2023-08-02 15:26:47 +10:00
Michael Goulet	99969d282b	Use upvar_tys in more places, make it a list	2023-08-01 23:19:31 +00:00
bors	f77c624c03	Auto merge of #113339 - lqd:respect-filters, r=tmiasko Filter out short-lived LLVM diagnostics before they reach the rustc handler During profiling I saw remark passes being unconditionally enabled: for example `Machine Optimization Remark Emitter`. The diagnostic remarks enabled by default are [from missed optimizations and opt analyses](https://github.com/rust-lang/rust/pull/113339#discussion_r1259480303). They are created by LLVM, passed to the diagnostic handler on the C++ side, emitted to rust, where they are unpacked, C++ strings are converted to rust, etc. Then they are discarded the vast majority of the time (i.e. unless some kind of `-Cremark` has enabled some of these passes' output to be printed). These unneeded allocations are very short-lived, basically only lasting between the LLVM pass emitting them and the rust handler where they are discarded. So it doesn't hugely impact max-rss, and is only a slight reduction in instruction count (cachegrind reports a reduction between 0.3% and 0.5%) _on linux_. It's possible that targets without `jemalloc`, or with a worse allocator, may optimize these less. It is however significant in the aggregate, looking at the total number of allocated bytes: - it's the biggest source of allocations according to dhat, on the benchmarks I've tried e.g. `syn` or `cargo` - allocations on `syn` are reduced by 440MB, 17% (from 2440722647 bytes total, to 2030461328 bytes) - allocations on `cargo` are reduced by 6.6GB, 19% (from 35371886402 bytes total, to 28723987743 bytes) Some of these diagnostics objects [are allocated in LLVM](https://github.com/rust-lang/rust/pull/113339#discussion_r1252387484) before they're emitted to our diagnostic handler, where they'll be filtered out. So we could remove those in the future, but that will require changing a few LLVM call-sites upstream, so I left a FIXME.	2023-08-01 23:15:20 +00:00
Rémy Rakic	ca5a383fb6	remove remark filtering on the rust side now that remarks are filtered before cg_llvm's diagnostic handler callback is called, we don't need to do the filtering post c++-to-rust conversion of the diagnostic.	2023-08-01 21:01:20 +00:00
bors	abd3637e42	Auto merge of #105545 - erikdesjardins:ptrclean, r=bjorn3 cleanup: remove pointee types This can't be merged until the oldest LLVM version we support uses opaque pointers, which will be the case after #114148. (Also note `-Cllvm-args="-opaque-pointers=0"` can technically be used in LLVM 15, though I don't think we should support that configuration.) I initially hoped this would provide some minor perf win, but in https://github.com/rust-lang/rust/pull/105412#issuecomment-1341224450 it had very little impact, so this is only valuable as a cleanup. As a followup, this will enable #96242 to be resolved. r? `@ghost` `@rustbot` label S-blocked	2023-08-01 19:44:17 +00:00
Zalathar	3920e07f0b	Make coverage counter IDs count up from 0, not 1 Operand types are now tracked explicitly, so there is no need to reserve ID 0 for the special always-zero counter. As part of the renumbering, this change fixes an off-by-one error in the way counters were counted by the `coverageinfo` query. As a result, functions should now have exactly the number of counters they actually need, instead of always having an extra counter that is never used.	2023-08-01 11:29:55 +10:00
Zalathar	f103db894f	Make coverage expression IDs count up from 0, not down from `u32::MAX` Operand types are now tracked explicitly, so there is no need for expression IDs to avoid counter IDs by descending from `u32::MAX`. Instead they can just count up from 0, and can be used directly as indices when necessary.	2023-08-01 11:29:55 +10:00
Zalathar	1a014d42f4	Replace `ExpressionOperandId` with enum `Operand` Because the three kinds of operand are now distinguished explicitly, we no longer need fiddly code to disambiguate counter IDs and expression IDs based on the total number of counters/expressions in a function. This does increase the size of operands from 4 bytes to 8 bytes, but that shouldn't be a big deal since they are mostly stored inside boxed structures, and the current coverage code is not particularly size-optimized anyway.	2023-08-01 11:29:55 +10:00
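The three coverage commits above (explicit operand kinds, with counter and expression IDs both counting up from 0) can be illustrated with a small sketch. The variant names and the `operand_value` helper below are assumptions made for illustration; the commit messages only state that three kinds of operand are distinguished and that operands grow from 4 to 8 bytes.

```rust
use std::mem::size_of;

/// Hypothetical shape of an explicitly tagged coverage operand.
/// Because the kind is stored in the tag, counter IDs and expression IDs
/// may both start at 0 without colliding.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum Operand {
    /// The special always-zero operand (no reserved ID needed).
    Zero,
    /// Index into the function's counters.
    Counter(u32),
    /// Index into the function's expressions.
    Expression(u32),
}

/// Looks up an operand's value, using the ID directly as a slice index.
/// Panics if the ID is out of range.
fn operand_value(op: Operand, counters: &[u64], expressions: &[u64]) -> u64 {
    match op {
        Operand::Zero => 0,
        Operand::Counter(id) => counters[id as usize],
        Operand::Expression(id) => expressions[id as usize],
    }
}

/// Size of the tagged operand in bytes (a `u32` payload plus a tag).
fn operand_size() -> usize {
    size_of::<Operand>()
}
```

On current compilers a `u32` payload plus a tag comes to 8 bytes, which is the size increase the message describes.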
bors	5082281609	Auto merge of #113879 - nnethercote:codegen_ssa-cleanups, r=bjorn3 `codegen_ssa` cleanups Some clarifications I made when reading this code closely. r? `@tmiasko`	2023-07-31 08:18:19 +00:00
Nicholas Nethercote	3b44f5b0eb	Use standard Rust capitalization rules for names containing "LTO".	2023-07-31 16:21:02 +10:00
Nicholas Nethercote	4a120f33f7	Remove `ExtraBackendMethods::spawn_thread`. It's no longer used, and `spawn_named_thread` is preferable, because naming threads is helpful when profiling.	2023-07-31 16:21:02 +10:00
bors	3be07c1161	Auto merge of #114266 - calebzulawski:simd-bswap, r=compiler-errors Fix simd_bswap for i8/u8 #114156 missed this test case ☹️ cc `@workingjubilee`	2023-07-31 04:43:48 +00:00
Caleb Zulawski	77ed437de8	Fix simd_bswap for i8/u8	2023-07-30 15:40:32 -04:00
Matthias Krüger	3ce90b1649	inline format!() args up to and including rustc_codegen_llvm	2023-07-30 14:22:50 +02:00
Erik Desjardins	55800123b7	cg_llvm: simplify llvm.masked.gather/scatter naming with opaque pointers With opaque pointers, there's no longer a need to generate a chain of pointer types in the intrinsic name when arguments are pointers to pointers.	2023-07-29 16:56:27 -04:00
Erik Desjardins	cf7788d54b	cg_llvm: clean up match	2023-07-29 16:32:03 -04:00
Erik Desjardins	def44c5669	cg_llvm: inline check_store	2023-07-29 16:31:53 -04:00
Erik Desjardins	1d7f728901	cg_llvm: stop identifying ADTs in LLVM IR Now that we use opaque pointers, ADTs can no longer be recursive, so we do not need to name them. Previously, this would be necessary if you had a struct like ```rs struct Foo(Box<Foo>, u64, u64); ``` which would be represented with something like ```ll %Foo = type { %Foo*, i64, i64 } ``` which is now just ```ll { ptr, i64, i64 } ```	2023-07-29 16:12:27 -04:00
bors	03a57254b5	Auto merge of #114156 - calebzulawski:simd-bswap, r=compiler-errors Add simd_bswap, simd_bitreverse, simd_ctlz, and simd_cttz intrinsics cc `@workingjubilee`	2023-07-29 18:51:45 +00:00
Erik Desjardins	04303cfb3a	cg_ssa: remove pointee types and pointercast/bitcast-of-ptr	2023-07-29 13:18:20 -04:00
Erik Desjardins	b6540777fe	cg_llvm: remove pointee types and pointercast/bitcast-of-ptr	2023-07-29 13:18:17 -04:00
Caleb Zulawski	ce4a48f41f	Use i1 instead of bool	2023-07-28 09:46:16 -04:00
Caleb Zulawski	4c02b4cf4c	Add SIMD bitreverse, ctlz, cttz intrinsics	2023-07-27 23:53:45 -04:00
Caleb Zulawski	3ea0e6e3fb	Add simd_bswap intrinsic	2023-07-27 23:04:14 -04:00
Josh Stone	190ded8443	Update the minimum external LLVM to 15	2023-07-27 14:07:08 -07:00
Zalathar	01f3cc1272	coverage: Obtain the `__llvm_covfun` section name outside a per-function loop This section name is always constant for a given target, but obtaining it from LLVM requires a few intermediate allocations. There's no need to do so repeatedly from inside a per-function loop.	2023-07-24 21:58:00 +10:00
David Tolnay	5bbf0a8306	Revert "Auto merge of #113166 - moulins:ref-niches-initial, r=oli-obk" This reverts commit `557359f925`, reversing changes made to `1e6c09a803`.	2023-07-21 22:35:57 -07:00
Miguel Ojeda	74b8d324eb	Support `.comment` section like GCC/Clang (`!llvm.ident`) Both GCC and Clang write by default a `.comment` section with compiler information: ```txt $ gcc -c -xc /dev/null && readelf -p '.comment' null.o String dump of section '.comment': [ 1] GCC: (GNU) 11.2.0 $ clang -c -xc /dev/null && readelf -p '.comment' null.o String dump of section '.comment': [ 1] clang version 14.0.1 (https://github.com/llvm/llvm-project.git c62053979489ccb002efe411c3af059addcb5d7d) ``` They also implement the `-Qn` flag to avoid doing so: ```txt $ gcc -Qn -c -xc /dev/null && readelf -p '.comment' null.o readelf: Warning: Section '.comment' was not dumped because it does not exist! $ clang -Qn -c -xc /dev/null && readelf -p '.comment' null.o readelf: Warning: Section '.comment' was not dumped because it does not exist! ``` So far, `rustc` only does it for WebAssembly targets and only when debug info is enabled: ```txt $ echo 'fn main(){}' \| rustc --target=wasm32-unknown-unknown --emit=llvm-ir -Cdebuginfo=2 - && grep llvm.ident rust_out.ll !llvm.ident = !{!27} ``` In the RFC part of this PR it was decided to always add the information, which gets us closer to other popular compilers. An opt-out flag like GCC and Clang may be added later on if deemed necessary. Implementation-wise, this covers both `ModuleLlvm::new()` and `ModuleLlvm::new_metadata()` cases by moving the addition to `context::create_module` and adds a few test cases. ThinLTO also sees the `llvm.ident` named metadata duplicated (in temporary outputs), so this deduplicates it like it is done for `wasm.custom_sections`. The tests also check this duplication does not take place. Signed-off-by: Miguel Ojeda <ojeda@kernel.org>	2023-07-21 22:01:50 +02:00
bors	557359f925	Auto merge of #113166 - moulins:ref-niches-initial, r=oli-obk Prototype: Add unstable `-Z reference-niches` option MCP: rust-lang/compiler-team#641 Relevant RFC: rust-lang/rfcs#3204 This prototype adds a new `-Z reference-niches` option, controlling the range of valid bit-patterns for reference types (`&T` and `&mut T`), thereby enabling new enum niching opportunities. Like `-Z randomize-layout`, this setting is crate-local; as such, references to built-in types (primitives, tuples, ...) are not affected. The possible settings are (here, `MAX` denotes the all-1 bit-pattern):
| `-Z reference-niches=` | Valid range |
|:---:|:---:|
| `null` (the default) | `1..=MAX` |
| `size` | `1..=(MAX-size)` |
| `align` | `align..=MAX.align_down_to(align)` |
| `size,align` | `align..=(MAX-size).align_down_to(align)` |
------ This is very WIP, and I'm not sure the approach I've taken here is the best one, but stage 1 tests pass locally; I believe this is in a good enough state to unleash this upon unsuspecting 3rd-party code, and see what breaks.	2023-07-21 15:00:36 +00:00
Matthias Krüger	b1d1e99c22	Rollup merge of #113780 - dtolnay:printkindpath, r=b-naber Support `--print KIND=PATH` command line syntax As is already done for `--emit KIND=PATH` and `-L KIND=PATH`. In the discussion of #110785, it was pointed out that `--print KIND=PATH` is nicer than trying to apply the single global `-o` path to `--print`'s output, because in general there can be multiple print requests within a single rustc invocation, and anyway `-o` would already be used for a different meaning in the case of `link-args` and `native-static-libs`. I am interested in using `--print cfg=PATH` in Buck2. Currently Buck2 works around the lack of support for `--print KIND=PATH` by [indirecting through a Python wrapper script](`d43cf3a51a/prelude/rust/tools/get_rustc_cfg.py`) to redirect rustc's stdout into the location dictated by the build system. From skimming Cargo's usages of `--print`, it definitely seems like it would benefit from `--print KIND=PATH` too. Currently it is working around the lack of this by inserting `--crate-name=___ --print=crate-name` so that it can look for a line containing `___` as a delimiter between the 2 other `--print` informations it actually cares about. This is commented as a "HACK" and "abuse". `31eda6f7c3/src/cargo/core/compiler/build_context/target_info.rs (L242)` (FYI `@weihanglo` as you dealt with this recently in https://github.com/rust-lang/cargo/pull/11633.) Mentioning reviewers active in #110785: `@fee1-dead` `@jyn514` `@bjorn3`	2023-07-21 06:52:28 +02:00
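A minimal sketch of how a `KIND=PATH` argument might be split, assuming the simplest possible rule (everything after the first `=` is the path). `OutTarget` and `parse_print_request` are hypothetical names chosen for this example and are not taken from rustc's argument parser.

```rust
use std::path::PathBuf;

/// Hypothetical destination for one print request.
#[derive(Debug, PartialEq, Eq)]
enum OutTarget {
    /// No path given: print to standard output, as before.
    Stdout,
    /// `KIND=PATH` form: write this request's output to the file.
    File(PathBuf),
}

/// Splits a `--print` argument into its kind and output target.
/// Assumes the first `=` separates the kind from the path.
fn parse_print_request(arg: &str) -> (String, OutTarget) {
    match arg.split_once('=') {
        Some((kind, path)) => (kind.to_owned(), OutTarget::File(PathBuf::from(path))),
        None => (arg.to_owned(), OutTarget::Stdout),
    }
}
```

Storing a target per request, rather than one global output path, is what allows several print requests in a single invocation to go to different files.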
Matthias Krüger	2734b5ada9	Rollup merge of #113723 - khei4:khei4/llvm-stats, r=oli-obk,nikic Resurrect: rustc_llvm: Add a -Z `print-codegen-stats` option to expose LLVM statistics. This resurrects PR https://github.com/rust-lang/rust/pull/104000, which has sat idle for a while. And I want to see the effect of stack-move optimizations on LLVM (like https://reviews.llvm.org/D153453) :). I have applied the changes requested by `@oli-obk` and `@nagisa` https://github.com/rust-lang/rust/pull/104000#discussion_r1014625377 and https://github.com/rust-lang/rust/pull/104000#discussion_r1014642482 in the latest commits. r? `@oli-obk` ----- LLVM has a neat [statistics](https://llvm.org/docs/ProgrammersManual.html#the-statistic-class-stats-option) feature that tracks how often optimizations kick in. It's very handy for optimization work. Since we expose the LLVM pass timings, I thought it made sense to expose the LLVM statistics too. ----- (Edit: fix broken link) (Edit2: fix segmentation fault and use malloc) If `rustc` is built with ```toml [llvm] assertions = true ``` then you can see output like ``` rustc +stage1 -Z print-codegen-stats -C opt-level=3 tmp.rs ===-------------------------------------------------------------------------=== ... Statistics Collected ... ===-------------------------------------------------------------------------=== 3 aa - Number of MayAlias results 193 aa - Number of MustAlias results 531 aa - Number of NoAlias results ... ``` whereas the current default build emits only ``` $ rustc +stage1 -Z print-codegen-stats -C opt-level=3 tmp.rs ===-------------------------------------------------------------------------=== ... Statistics Collected ... ===-------------------------------------------------------------------------=== $ ``` It might be better to emit a message saying that the assertions flag is required, but for now I can't find how to do that...	2023-07-21 06:52:27 +02:00
Moulins	403f34b599	Don't treat ref. fields with non-null niches as `dereferenceable_or_null`	2023-07-21 03:31:46 +02:00
David Tolnay	815a114974	Implement printing to file in PassWrapper	2023-07-20 11:04:31 -07:00
David Tolnay	6e734fce63	Implement printing to file in llvm_util	2023-07-20 11:04:31 -07:00
David Tolnay	c80cbe4bae	Implement printing to file in codegen_backend.print	2023-07-20 11:04:31 -07:00
David Tolnay	c0dc0c6875	Store individual output file name with every PrintRequest	2023-07-20 11:04:30 -07:00
khei4	c7bf20dfdc	address feedback from nikic and oli-obk https://github.com/rust-lang/rust/pull/113723/files use slice memcpy rather than strcpy and write it on stdout use println on failure Co-authored-by: Oli Scherer <github35764891676564198441@oli-obk.de>	2023-07-20 16:53:06 +09:00
Dylan DPC	c1d6d322f4	Rollup merge of #113716 - DianQK:add-no_builtins-to-function, r=pnkfelix Add the `no-builtins` attribute to functions when `no_builtins` is applied at the crate level. When `no_builtins` is applied at the crate level, we should add the `no-builtins` attribute to each function to ensure it takes effect in LTO. This is also the reason why no_builtins does not take effect in LTO as mentioned in #35540. Now, `#![no_builtins]` should be similar to `-fno-builtin` in clang/gcc, see https://clang.godbolt.org/z/z4j6Wsod5. Next, we should make `#![no_builtins]` participate in LTO again. That makes sense, as LTO also takes into consideration function-level instruction optimizations, such as the MachineOutliner. More importantly, when a user writes a large `#![no_builtins]` crate, they would like this crate to participate in LTO as well. We should also add a function-level no_builtins attribute to allow users to have more control over it. This is similar to Clang's `__attribute__((no_builtin))` feature, see https://clang.godbolt.org/z/Wod6KK6eq. Before implementing this feature, maybe we should discuss whether to support more fine-grained control, such as `__attribute__((no_builtin("memcpy")))`. Related discussions: - #109821 - #35540 Next (a separate pull request?): - [ ] Revert #35637 - [ ] Add a function-level `no_builtin` attribute?	2023-07-19 22:37:06 +05:30
bors	77e24f90f5	Auto merge of #112591 - jfgoog:better-dlltool-diagnostics, r=WaffleLapkin Better diagnostics for dlltool errors. When dlltool fails, show the full command that was executed. In particular, llvm-dlltool is not very helpful, printing a generic usage message rather than what actually went wrong, so stdout and stderr aren't of much use when troubleshooting.	2023-07-19 07:27:50 +00:00
DianQK	cc08749df2	Add the `no-builtins` attribute to functions when `no_builtins` is applied at the crate level. When `no_builtins` is applied at the crate level, we should add the `no-builtins` attribute to each function to ensure it takes effect in LTO.	2023-07-18 22:15:47 +08:00
James Farrell	c59b82353d	Better diagnostics for dlltool errors. When dlltool fails, show the full command that was executed. In particular, llvm-dlltool is not very helpful, printing a generic usage message rather than what actually went wrong, so stdout and stderr aren't of much use when troubleshooting.	2023-07-17 20:20:01 +00:00
Nicholas Nethercote	b52f9eb6ca	Introduce `MonoItemData`. It replaces `(Linkage, Visibility)`, making the code nicer. Plus the next commit will add another field.	2023-07-17 08:44:48 +10:00
khei4	4d307c4822	print on rustc_codegen_llvm and rename malloc and cpy c_char	2023-07-17 00:37:52 +09:00
Patrick Walton	2d47816cba	rustc_llvm: Add a `-Z print-llvm-stats` option to expose LLVM statistics. LLVM has a neat [statistics] feature that tracks how often optimizations kick in. It's very handy for optimization work. Since we expose the LLVM pass timings, I thought it made sense to expose the LLVM statistics too. [statistics]: https://llvm.org/docs/ProgrammersManual.html#the-statistic-class-stats-option	2023-07-16 22:56:04 +09:00
bors	ffb9b61294	Auto merge of #113430 - Zalathar:hash, r=b-naber Remove `LLVMRustCoverageHashCString` Coverage has two FFI functions for computing the hash of a byte string. One takes a ptr/len pair (`LLVMRustCoverageHashByteArray`), and the other takes a NUL-terminated C string (`LLVMRustCoverageHashCString`). But on closer inspection, the C string version is unnecessary. The calling-side code converts a Rust `&str` into a `CString`, and the C++ code then immediately turns it back into a ptr/len string before actually hashing it. So we can just call the ptr/len version directly instead. --- This PR also fixes a bug in the C++ declaration of `LLVMRustCoverageHashByteArray`. Its length parameter should be `size_t`, since that's what is declared and passed on the Rust side, and it's what `StrRef`'s constructor expects to receive on the callee side.	2023-07-16 01:56:23 +00:00
Mahdi Dibaiee	e55583c4b8	refactor(rustc_middle): Substs -> GenericArg	2023-07-14 13:27:35 +01:00
Rémy Rakic	598acffa60	make opt diagnostic kinds printable	2023-07-14 11:11:59 +00:00
Mark Rousskov	cc907f80b9	Re-format let-else per rustfmt update	2023-07-12 21:49:27 -04:00
Zalathar	352d031599	Remove `LLVMRustCoverageHashCString` Coverage has two FFI functions for computing the hash of a byte string. One takes a ptr/len pair, and the other takes a NUL-terminated C string. But on closer inspection, the C string version is unnecessary. The calling-side code converts a Rust `&str` into a C string, and the C++ code then immediately turns it back into a ptr/len string before actually hashing it.	2023-07-13 11:31:15 +10:00
Zalathar	7a5ad35da4	Pass a byte slice to `coverageinfo::hash_bytes` instead of an owned vector The function body immediately treats it as a slice anyway, so this just makes it possible to call the hash function with arbitrary read-only byte slices.	2023-07-13 11:28:50 +10:00
Zalathar	29c53d8748	Don't clone symbol names for coverage hashing A symbol already contains a `&str`, and in this context there's no need to make an owned copy, so we can just use the original string reference.	2023-07-13 11:16:27 +10:00
Jubilee Young	0726c7826b	Reuse LLVMConstInBoundsGEP2 We have had LLVM 14 as our minimum for a bit now.	2023-07-10 00:20:56 -07:00
bors	4dd1719b34	Auto merge of #113377 - BoxyUwU:move_ty_ctors_to_ty, r=compiler-errors Move `TyCtxt::mk_x` to `Ty::new_x` where applicable Part of rust-lang/compiler-team#616 turns out there's a lot of places we construct `Ty` this is a ridiculously huge PR :S r? `@oli-obk`	2023-07-06 08:10:42 +00:00
fee1-dead	1830b80c2d	Rollup merge of #113334 - fmease:revert-lexing-c-str-lits, r=compiler-errors Revert the lexing of `c"…"` string literals Fixes [after beta-backport] #113235. Further progress is tracked in #113333. This PR manually reverts parts of #108801 (since a git-revert would've been too coarse-grained & messy) and git-reverts #111647. CC `@fee1-dead` (#108801) `@klensy` (#111647) r? `@compiler-errors` `@rustbot` label F-c_str_literals beta-nominated	2023-07-06 09:20:33 +08:00
fee1-dead	e461502e06	Rollup merge of #112791 - WaffleLapkin:wag_the_llvm, r=cuviper llvm ffi: Expose `CallInst->setTailCallKind` This is needed for the explicit tail calls experiment.	2023-07-06 09:20:31 +08:00
Boxy	12138b8e5e	Move `TyCtxt::mk_x` to `Ty::new_x` where applicable	2023-07-05 20:27:07 +01:00
León Orell Valerian Liehr	3788b7ab32	Revert "use new c literals instead of cstr! macro" This reverts commit `a17561ffc9`.	2023-07-05 13:11:27 +02:00
Zalathar	cb570d6bc1	Move `coverageinfo::ffi` and `coverageinfo::map` out of SSA	2023-07-05 20:40:40 +10:00
Zalathar	9c430d38cf	Remove trait `CoverageInfoMethods`, since non-LLVM backends don't need it These methods are only ever called from within `rustc_codegen_llvm`, so they can just be declared there as well.	2023-07-05 20:40:40 +10:00
Zalathar	4169d0f756	Narrow trait `CoverageInfoBuilderMethods` down to just one method This effectively inlines most of `FunctionCx::codegen_coverage` into the LLVM implementation of `CoverageInfoBuilderMethods`.	2023-07-05 20:40:39 +10:00
bors	131a03664e	Auto merge of #113040 - Kobzol:llvm-remark-streamer, r=tmiasko Add `-Zremark-dir` unstable flag to write LLVM optimization remarks to YAML This PR adds an option for `rustc` to emit LLVM optimization remarks to a set of YAML files, which can then be digested by existing tools, like https://github.com/OfekShilon/optview2. When `-Zremark-dir` is passed, and remarks are enabled (`-Cremark=all`), the remarks will now be written to the specified directory, instead of being printed to standard error output. The files are named based on the CGU from which they are being generated. Currently, the remarks are written using the LLVM streaming machinery, directly in the diagnostics handler. It seemed easier than going back to Rust and then from there back to C++ to use the streamer from the diagnostics handler. But there are many ways to implement this, of course, so I'm open to suggestions :) I included some comments with questions in the code. Also, I'm not sure how to test this. r? `@tmiasko`	2023-07-02 12:48:44 +00:00
Jakub Beránek	62728c7aaf	Add `rustc` option to output LLVM optimization remarks to YAML files	2023-07-02 13:41:36 +02:00
Maybe Waffle	bf5eaa4550	llvm ffi: Expose `CallInst->setTailCallKind`	2023-06-30 16:31:45 +00:00
bors	56d507dc92	Auto merge of #109524 - bzEq:aix-embed-llvmbc, r=nagisa Support embedding LLVM bitcode on AIX	2023-06-30 11:39:58 +00:00
bors	97279e91d8	Auto merge of #113162 - matthiaskrgr:rollup-fct3wj7, r=matthiaskrgr Rollup of 7 pull requests Successful merges: - #111322 (Support for native WASM exceptions) - #112086 (resolve: Remove artificial import ambiguity errors) - #112234 (refactor `tool_doc!`) - #112300 (Convert `run-make/coverage-reports` tests to use a custom compiletest mode) - #112795 (Migrate some rustc_builtin_macros to SessionDiagnostic) - #113144 (Make the `Elaboratable` trait take clauses) - #113161 (Fix type privacy lints error message) r? `@ghost` `@rustbot` modify labels: rollup	2023-06-30 03:27:42 +00:00
bors	8aed93d912	Auto merge of #113116 - nnethercote:codegen-opts, r=oli-obk A mish-mash of micro-optimizations These were aimed at speeding up LLVM codegen, but ended up affecting other places as well. r? `@bjorn3`	2023-06-30 00:35:19 +00:00
Matthias Krüger	4696a92183	Rollup merge of #111322 - mirkootter:master, r=davidtwco Support for native WASM exceptions ### Motivation Currently, rustc does not support native WASM exceptions. It does support JavaScript based exceptions for the wasm32-emscripten-target, but this requires back-and-forth with JavaScript for many calls, which is very slow. Native wasm support for exceptions is quite common: Clang+LLVM implemented them years ago, and all major browsers support them by now. They enable zero-cost exceptions, at least with regard to runtime-performance-cost. They may increase startup-time and code size, though. ### Important: This PR does not change default behaviour Exceptions usually add a lot of code in the form of unwinding blocks, increasing the binary size. Most users probably do not want that, especially with regard to web development. Therefore, wasm exceptions play a similar role as WASM-threads: rustc should support them, like clang does, but users who want to use them have to use some command-line magic like rustflags to opt in. ### What does this PR do? As stated above, the default behaviour is not changed. It is already possible to opt in to wasm exceptions using the command line. Unfortunately, the LLVM IR is invalid and the LLVM backend crashes. ``` rustc <sourcefile> --target wasm32-unknown-unknown -C panic=unwind -C llvm-args=-wasm-enable-eh -C target-feature=+exception-handling ``` As it turns out, LLVM is quite picky when it comes to IR for exception handling. If the IR does not look exactly like it should, some LLVM-assertions fail and the code generation crashes. This PR adds the necessary modifications to the code generator to make it work. It also adds `exception-handling` as a wasm target feature. ### What this PR does not / what is missing This PR is not a fully fledged solution. It is the first step. A few parts are still missing; however, it is already usable (see next section). Currently missing: * The std library has to be adapted. Currently, only [no_std] crates work * Usually, nested exceptions abort the program (i.e. a panic during the cleanup of another panic). This is not done yet. - Currently, code inside cleanup handlers does not unwind - Fixing this requires a little more work: The code generator currently maintains a single terminate block per function for this. Unfortunately, WASM requires funclet based exception handling. Therefore, we need to create a terminate block per funclet. This is probably not a big problem, but I want to keep this PR simple. ### How to use the compiler given this PR? This PR does not add any command line flags or features. It uses those which are already there. To compile with exceptions enabled, you need * to set the panic strategy to unwind, i.e. `-C panic=unwind` * to enable the exception-handling target feature, i.e. `-C target-feature=+exception-handling` * to tell LLVM about the exception handling, i.e. `-C llvm-args=-wasm-enable-eh` Since the standard library has not been adapted, you can only use it in [no_std] crates as of now. The intrinsic `core::intrinsics::r#try` works. To throw exceptions, you need the ```@llvm.wasm.throw``` intrinsic. I created a sample application which works for me: https://github.com/mirkootter/rust-wasm-demos This example can be run at https://webassembly.sh	2023-06-29 16:36:30 +02:00
Nicholas Nethercote	8d7084d65f	Simplify the `bundles` vectors. After the last commit, they contain `Option<&OperandBundleDef<'a>>` but the values are always `Some(_)`. This commit removes the needless `Option` wrapper. This also simplifies the type signatures of `LLVMRustBuild{Invoke,Call}`, which were relying on the fact that the representation of `Option<&T>` is the same as `&T` for non-`None` values.	2023-06-29 11:51:00 +10:00
Nicholas Nethercote	81436ebd55	Use `SmallVec` for the `bundles` vectors. They never have a length of more than two. So this commit changes them to `SmallVec<[_; 2]>`. Also, we possibly push `None` values and then filter those `None` values out again with `retain`. So this commit removes the `retain` and instead only pushes the values if they are `Some(_)`.	2023-06-29 11:47:39 +10:00
Nicholas Nethercote	d20b1a8f6b	Set capacity of the string passed to `push_item_name`. Other callsites already do this, but these two were missed. This avoids some allocations.	2023-06-29 11:46:25 +10:00
Nicholas Nethercote	de1914af34	Avoid an unnecessary use of `SmallStr`. I don't know why `SmallStr` was used here; some ad hoc profiling showed this code is not that hot, the string is usually empty, and when it's not empty it's usually very short. However, the use of a `SmallStr<1024>` does result in a 1024-byte `memcpy` call on each execution, which shows up when I do `memcpy` profiling. So using a normal string makes the code both simpler and very slightly faster.	2023-06-29 11:37:12 +10:00
Nicholas Nethercote	b4c6e19ade	Replace a `lookup_debug_loc` call. `lookup_debug_loc` finds a file, line, and column, which requires two binary searches. But this call site only needs the file. This commit replaces the call with `lookup_source_file`, which does a single binary search.	2023-06-29 11:31:43 +10:00
Nicholas Nethercote	a13be655a5	Avoid unnecessary line lookup. `lookup_debug_loc` calls `SourceMap::lookup_line`, which does a binary search over the files, and then a binary search over the lines within the found file. It then calls `SourceFile::line_begin_pos`, which redoes the binary search over the lines within the found file. This commit removes the second binary search over the lines, instead getting the line starting pos directly using the result of the first binary search over the lines. (And likewise for `get_span_loc`, in the cranelift backend.)	2023-06-29 11:26:39 +10:00
bors	3c554f5cb4	Auto merge of #112516 - erikdesjardins:loop, r=davidtwco cg_llvm: use index-based loop in write_operand_repeatedly This should be easier for LLVM to analyze. Fixes #111603 This needs a perf run. [cc](https://github.com/rust-lang/rust/issues/111603#issuecomment-1567531178) `@caojoshua`	2023-06-27 15:01:56 +00:00
Camille GILLOT	3a1edd8212	Store generator field names in GeneratorLayout.	2023-06-19 16:50:52 +00:00
Camille GILLOT	7d5b2e4926	Make closure_saved_names_of_captured_variables a query.	2023-06-19 16:50:52 +00:00
Michael Goulet	3eb8c2ae10	Rollup merge of #112474 - ldm0:ldm_enum_debuginfo_128_support, r=compiler-errors Support 128-bit enum variant in debuginfo codegen fixes #111600	2023-06-16 12:53:22 -07:00
Nicholas Nethercote	7c3ce02a11	Introduce a minimum CGU size in non-incremental builds. Because tiny CGUs make compilation less efficient and result in worse generated code. We don't do this when the number of CGUs is explicitly given, because there are times when the requested number is very important, as described in some comments within the commit. So the commit also introduces a `CodegenUnits` type that distinguishes between default values and user-specified values. This change has a roughly neutral effect on walltimes across the rustc-perf benchmarks; there are some speedups and some slowdowns. But it has significant wins for most other metrics on numerous benchmarks, including instruction counts, cycles, binary size, and max-rss. It also reduces parallelism, which is good for reducing jobserver competition when multiple rustc processes are running at the same time. It's smaller benchmarks that benefit the most; larger benchmarks already have CGUs that are all larger than the minimum size. Here are some example before/after CGU sizes for opt builds. - html5ever - CGUs: 16, mean size: 1196.1, sizes: [3908, 2992, 1706, 1652, 1572, 1136, 1045, 948, 946, 938, 579, 471, 443, 327, 286, 189] - CGUs: 4, mean size: 4396.0, sizes: [6706, 3908, 3490, 3480] - libc - CGUs: 12, mean size: 35.3, sizes: [163, 93, 58, 53, 37, 8, 2 (x6)] - CGUs: 1, mean size: 424.0, sizes: [424] - tt-muncher - CGUs: 5, mean size: 1819.4, sizes: [8508, 350, 198, 34, 7] - CGUs: 1, mean size: 9075.0, sizes: [9075] Note that CGUs of size 100,000+ aren't unusual in larger programs.	2023-06-14 10:57:44 +10:00
Erik Desjardins	bd0aae92dc	cg_llvm: use index-based loop in write_operand_repeatedly This is easier for LLVM to analyze.	2023-06-11 00:04:53 -04:00
DonoughLiu	204bfb6a8c	Support 128-bit enum variant in debuginfo codegen	2023-06-10 03:39:24 +08:00
bors	a0df04c0f2	Auto merge of #110040 - ndrewxie:issue-84447-partial-1, r=lcnr,michaelwoerister Removed use of iteration through a HashMap/HashSet in rustc_incremental and replaced with IndexMap/IndexSet This allows for the `#[allow(rustc::potential_query_instability)]` in rustc_incremental to be removed, moving towards fixing #84447 (although a LOT more modules have to be changed to fully resolve it). Only HashMaps/HashSets that are being iterated through have been modified (although many structs and traits outside of rustc_incremental had to be modified as well, as they had fields/methods that involved a HashMap/HashSet that would be iterated through) I'm making a PR for just 1 module changed to test for performance regressions and such, for future changes I'll either edit this PR to reflect additional modules being converted, or batch multiple modules of changes together and make a PR for each group of modules.	2023-06-08 07:30:03 +00:00
Kai Luo	7b79cb1759	Use `c`-prefixed string	2023-06-08 13:24:35 +08:00
Kai Luo	5725561e16	Support embedding bitcode on AIX	2023-06-08 13:22:57 +08:00
Jan-Mirko Otter	744ec64c93	fix comment (review change) Co-authored-by: bjorn3 <17426603+bjorn3@users.noreply.github.com>	2023-06-07 17:48:33 +02:00
Jan-Mirko Otter	12ad6622ad	add comment regarding `__gxx_wasm_personality_v0`	2023-06-07 17:48:33 +02:00
Jan-Mirko Otter	82730b4521	wasm exception handling	2023-06-07 17:48:28 +02:00
Jan-Mirko Otter	00ce5e8fca	add wasm eh intrinsics	2023-06-07 17:46:34 +02:00
Andrew Xie	17412bae30	Removed use of iteration through a HashMap/HashSet in rustc_incremental and replaced with IndexMap/IndexSet	2023-06-04 21:54:28 -04:00
Scott McMurray	e1b020df9f	Use `load`-`store` instead of `memcpy` for short integer arrays	2023-06-04 00:51:49 -07:00
Matthias Krüger	a3b639ce43	Rollup merge of #111647 - klensy:cstr, r=oli-obk use c literals in compiler and library Use c literals #108801 in compiler and library currently blocked on: * <strike>rustfmt: don't know how to format c literals</strike> nope, nightly one works. * <strike>bootstrap</strike> r? `@ghost` `@rustbot` blocked	2023-06-02 18:12:45 +02:00
Deadbeef	4f83717cf7	Use translatable diagnostics in `rustc_const_eval`	2023-06-01 14:45:18 +00:00
Scott McMurray	bf36193ef6	Add a distinct `OperandValue::ZeroSized` variant for ZSTs These tend to have special handling in a bunch of places anyway, so the variant helps remember that. And I think it's easier to grok than non-Scalar Aggregates sometimes being `Immediates` (like I got wrong and caused 109992). As a minor bonus, it means we don't need to generate poison LLVM values for them to pass around in `OperandValue::Immediate`s.	2023-05-31 19:10:28 -07:00
klensy	a17561ffc9	use new c literals instead of cstr! macro	2023-05-31 19:40:24 +03:00
bors	a9251b6ce1	Auto merge of #112102 - Nilstrieb:rollup-ivu1hmc, r=Nilstrieb Rollup of 7 pull requests Successful merges: - #107916 (fix comment on Allocator trait) - #111543 (Uplift `clippy::invalid_utf8_in_unchecked` lint) - #111872 (fix: dedup `static_candidates` before report) - #111955 (bootstrap: Various Step refactors) - #112060 (`EarlyBinder::new` -> `EarlyBinder::bind`) - #112064 (Migrate GUI colors test to original CSS color format) - #112100 (Don't typecheck recovered method call from suggestion) r? `@ghost` `@rustbot` modify labels: rollup	2023-05-30 13:25:42 +00:00
bors	3266c36624	Auto merge of #111768 - oli-obk:pair_const_llvm, r=cjgillot Optimize scalar and scalar pair representations loaded from ByRef in llvm in https://github.com/rust-lang/rust/pull/105653 I noticed that we were generating suboptimal LLVM IR if we had a `ConstValue::ByRef` that could be represented by a `ScalarPair`. Before https://github.com/rust-lang/rust/pull/105653 this is probably rare, but after it, every slice will go down this suboptimal code path that requires LLVM to untangle a bunch of indirections and translate static allocations that are only used once to read a scalar pair from.	2023-05-30 10:31:10 +00:00
lcnr	08d149ca85	EarlyBinder::new -> EarlyBinder::bind	2023-05-29 13:46:10 +02:00
Kyle Matsuda	03534ac8b7	Replace EarlyBinder(x) with EarlyBinder::new(x)	2023-05-28 10:44:50 -06:00
Wesley Wiser	019d75b44e	Add SafeStack support to rustc Adds support for LLVM [SafeStack] which provides backward edge control flow protection by separating the stack into two parts: data which is only accessed in provably safe ways is allocated on the normal stack (the "safe stack") and all other data is placed in a separate allocation (the "unsafe stack"). SafeStack support is enabled by passing `-Zsanitizer=safestack`. [SafeStack]: https://clang.llvm.org/docs/SafeStack.html	2023-05-26 15:18:54 -04:00
Oli Scherer	164d041e30	Stop creating intermediate places just to immediately convert them to operands	2023-05-26 15:01:29 +00:00
clubby789	f97fddab91	Ensure Fluent messages are in alphabetical order	2023-05-25 23:49:35 +00:00
bors	a2b1646c59	Auto merge of #86844 - bjorn3:global_alloc_improvements, r=pnkfelix Support #[global_allocator] without the allocator shim This makes it possible to use liballoc/libstd in combination with `--emit obj` if you use `#[global_allocator]`. This is what rust-for-linux uses right now and systemd may use in the future. Currently they have to depend on the exact implementation of the allocator shim to create one themselves as `--emit obj` doesn't create an allocator shim. Note that currently the allocator shim also defines the oom error handler, which is normally required too. Once `#![feature(default_alloc_error_handler)]` becomes the only option, this can be avoided. In addition when using only fallible allocator methods and either `--cfg no_global_oom_handling` for liballoc (like rust-for-linux) or `--gc-sections` no references to the oom error handler will exist. To avoid this feature being insta-stable, you will have to define `__rust_no_alloc_shim_is_unstable` to avoid linker errors. (Labeling this with both T-compiler and T-lang as it originally involved both an implementation detail and had an insta-stable user facing change. As noted above, the `__rust_no_alloc_shim_is_unstable` symbol requirement should prevent unintended dependence on this unstable feature.)	2023-05-25 16:59:57 +00:00
Maybe Waffle	307799a711	Use `is_some_and`/`is_ok_and` in less obvious spots	2023-05-24 14:33:43 +00:00
Maybe Waffle	fb0f74a8c9	Use `Option::is_some_and` and `Result::is_ok_and` in the compiler	2023-05-24 14:20:41 +00:00
Jamie Cunliffe	a059e68d11	Create a structure to define the features from to_llvm_features. Rather than returning an array of features from to_llvm_features, return a structure that contains the dependencies. This also contains metadata on how the features depend on each other to allow for the correct enabling and disabling.	2023-05-22 14:46:40 +01:00
Jamie Cunliffe	aab0757c66	Only disable folded features when it makes sense. Some features that are tied together only make sense to be folded together when enabling the feature. For example on AArch64 sve and neon are tied together, however it doesn't make sense to disable neon when disabling sve.	2023-05-22 14:27:14 +01:00
Jamie Cunliffe	4cca436e30	Tie neon with fp-armv8. In #91608 the fp-armv8 feature was removed as it's tied to the neon feature. However disabling neon didn't actually disable the use of floating point registers and instructions, for this `-fp-armv8` is required.	2023-05-22 14:27:14 +01:00
bors	4eb5225cdf	Auto merge of #111413 - workingjubilee:bump-object-0-31-1, r=MarkSimulacrum Bump object and thorin-dwp Required to fix watchOS breakage.	2023-05-20 13:19:37 +00:00
bors	17a681000b	Auto merge of #111641 - michaelwoerister:debugger-visualizer-fixes, r=cjgillot Fix dependency tracking for debugger visualizers This PR fixes dependency tracking for debugger visualizer files by changing the `debugger_visualizers` query to an `eval_always` query that scans the AST while it is still available. This way the set of visualizer files is already available when dep-info is emitted. Since the query is turned into an `eval_always` query, dependency tracking will now reliably detect changes to the visualizer script files themselves. TODO: - [x] perf.rlo - [x] Needs a bit more documentation in some places - [x] Needs regression test for the incr. comp. case Fixes https://github.com/rust-lang/rust/issues/111226 Fixes https://github.com/rust-lang/rust/issues/111227 Fixes https://github.com/rust-lang/rust/issues/111295 r? `@wesleywiser` cc `@gibbyfree`	2023-05-19 11:30:44 +00:00
bors	c9dc55d05c	Auto merge of #111345 - jyn514:cfg-release-caching, r=cjgillot,est31 Only depend on CFG_VERSION in rustc_interface This avoids having to rebuild the whole compiler on each commit when `omit-git-hash = false`. cc https://github.com/rust-lang/rust/issues/76720 - this won't fix it, and I'm not suggesting we turn this on by default, but it will make it less painful for people who do have `omit-git-hash` on as a workaround.	2023-05-18 21:45:02 +00:00
jyn	d5f2b8e5c6	Only depend on CFG_VERSION in rustc_interface this avoids having to rebuild the whole compiler on each commit when `omit-git-hash = false`.	2023-05-17 23:54:21 -05:00
bors	77fb0cd3aa	Auto merge of #111364 - cuviper:unhack-thinlto, r=nikic Remove the ThinLTO CU hack This reverts #46722, commit `e0ab5d5feb`. Since #111167, commit `10b69dde3f`, we are generating DWARF subprograms in a way that is meant to be more compatible with LLVM's expectations, so hopefully we don't need this workaround rewriting CUs anymore.	2023-05-18 01:35:41 +00:00
Michael Woerister	d623668551	Move DebuggerVisualizerFile types from rustc_span to rustc_middle	2023-05-16 21:03:28 +02:00

1548 Commits