mikros/rust - rust - Gitea.pterpstra.com

Author	SHA1	Message	Date
Jed Brown	270fb2130c	intrinsics.fmuladdf{16,32,64,128}: expose llvm.fmuladd.* semantics Add intrinsics `fmuladd{f16,f32,f64,f128}`. This computes `(a * b) + c`, to be fused if the code generator determines that (i) the target instruction set has support for a fused operation, and (ii) that the fused operation is more efficient than the equivalent, separate pair of `mul` and `add` instructions. https://llvm.org/docs/LangRef.html#llvm-fmuladd-intrinsic MIRI support is included for f32 and f64. The codegen_cranelift uses the `fma` function from libc, which is a correct implementation, but without the desired performance semantic. I think this requires an update to cranelift to expose a suitable instruction in its IR. I have not tested with codegen_gcc, but it should behave the same way (using `fma` from libc).	2024-10-11 15:32:56 -06:00
Jubilee Young	48504415cf	cg_clif: Factor out rustc_target::abi	2024-10-08 18:24:56 -07:00
Folkert de Vries	2811ce715d	various fixes for `naked_asm!` implementation - fix for divergence - fix error message - fix another cranelift test - fix some cranelift things - don't set the NORETURN option for naked asm - fix use of naked_asm! in doc comment - fix use of naked_asm! in run-make test - use `span_bug` in unreachable branch	2024-10-06 19:00:09 +02:00
Urgau	0bd0b99729	Use wide pointers consistenly across the compiler	2024-10-04 14:06:48 +02:00
Michael Goulet	d10a16ae40	Relax a debug assertion in codegen	2024-09-30 12:18:02 -04:00
Ralf Jung	9b31c68c0e	fix cranelift CI	2024-09-29 07:32:08 +02:00
Michael Goulet	6fdc133daa	Add a debug assertion in codegen that unsize casts of the same principal trait def id are truly NOPs	2024-09-25 11:13:59 -04:00
Lukas Markeffsky	25b66d8091	be even more precise about "cast" vs "coercion"	2024-09-24 23:12:02 +02:00
Lukas Markeffsky	1efad22339	unify dyn* coercions with other pointer coercions	2024-09-24 22:17:55 +02:00
Michael Goulet	631ab21ff0	Check vtable projections for validity in miri	2024-09-23 19:38:26 -04:00
bjorn3	9d8572d33b	Rustfmt	2024-09-23 11:28:15 +00:00
bjorn3	f3ae4237ab	Revert change to apply_patches to fix panic	2024-09-23 11:20:56 +00:00
bjorn3	32b608a439	Merge commit '6d35b4c9a04580366fd800692a5b5db79d766530' into sync_cg_clif-2024-09-22	2024-09-23 11:20:46 +00:00
Michael Goulet	02b0f3b5ab	Reformat using the new identifier sorting from rustfmt	2024-09-22 19:11:29 -04:00
Folkert de Vries	90300ef40b	add `C-cmse-nonsecure-entry` ABI	2024-09-21 13:04:14 +02:00
Michael Goulet	460abead20	Do not unnecessarily eval consts in codegen	2024-09-20 20:38:11 -04:00
Nicholas Nethercote	37204ee563	Use associative type defaults in `{Layout,FnAbi}OfHelpers`. This avoids some repetitive boilerplate code.	2024-09-17 10:25:06 +10:00
Ralf Jung	15848f3f08	simd_shuffle: require index argument to be a vector	2024-09-14 14:43:24 +02:00
Scott McMurray	fe5183e627	Fix the examples in cg_clif	2024-09-09 19:39:43 -07:00
bors	cf7c0554f1	Auto merge of #128939 - bjorn3:windows_cg_clif_component, r=albertlarsan68 Distribute rustc_codegen_cranelift for Windows With support for raw-dylib recently added to cg_clif, and inline assembly support working on Windows for quite a while now, all blockers for distributing cg_clif on Windows that I mentioned in https://github.com/rust-lang/rust/pull/81746#issuecomment-1774099637 are fixed now.	2024-09-09 16:09:06 +00:00
Rémy Rakic	0fba9b4dbb	introduce `PrettyPrintMirOptions` for cosmetic MIR dump options initially starting with `-Z mir-include-spans` because we want them in the NLL mir dump pass	2024-08-30 07:07:28 +00:00
Trevor Gross	8a26d21ce2	Rollup merge of #128731 - RalfJung:simd-shuffle-vector, r=workingjubilee simd_shuffle intrinsic: allow argument to be passed as vector See https://github.com/rust-lang/rust/issues/128738 for context. I'd like to get rid of this hack (`6c0b89dfac/compiler/rustc_codegen_ssa/src/mir/block.rs`, L922-L935). https://github.com/rust-lang/rust/pull/128537 almost lets us do that since constant SIMD vectors will then be passed as immediate arguments. However, simd_shuffle for some reason actually takes an array as argument, not a vector, so the hack is still required to ensure that the array becomes an immediate (which then later stages of codegen convert into a vector, as that's what LLVM needs). This PR prepares simd_shuffle to also support a vector as the `idx` argument. Once this lands, stdarch can hopefully be updated to pass `idx` as a vector, and then support for arrays can be removed, which finally lets us get rid of that hack.	2024-08-27 01:46:50 -05:00
Ralf Jung	8e7b3b5c2b	rename AddressOf -> RawBorrow inside the compiler	2024-08-18 19:46:53 +02:00
Ralf Jung	f1fadb8ba2	stabilize raw_ref_op	2024-08-18 19:46:53 +02:00
Chris Denton	d7198a541d	Always use ar_archive_writer for import libs	2024-08-17 19:10:46 +00:00
bors	e2acda7447	Auto merge of #128812 - nnethercote:shrink-TyKind-FnPtr, r=compiler-errors Shrink `TyKind::FnPtr`. By splitting the `FnSig` within `TyKind::FnPtr` into `FnSigTys` and `FnHeader`, which can be packed more efficiently. This reduces the size of the hot `TyKind` type from 32 bytes to 24 bytes on 64-bit platforms. This reduces peak memory usage by a few percent on some benchmarks. It also reduces cache misses and page faults similarly, though this doesn't translate to clear cycles or wall-time improvements on CI. r? `@compiler-errors`	2024-08-14 00:56:53 +00:00
Ralf Jung	6906793db8	simd_shuffle intrinsic: allow argument to be passed as vector (not just as array)	2024-08-13 07:51:17 +02:00
Guillaume Gomez	ef74616e44	Rollup merge of #128149 - RalfJung:nontemporal_store, r=jieyouxu,Amanieu,Jubilee nontemporal_store: make sure that the intrinsic is truly just a hint The `!nontemporal` flag for stores in LLVM sounds like it is just a hint, but actually, it is not -- at least on x86, non-temporal stores need very special treatment by the programmer or else the Rust memory model breaks down. LLVM still treats these stores as-if they were normal stores for optimizations, which is [highly dubious](https://github.com/llvm/llvm-project/issues/64521). Let's avoid all that dubiousness by making our own non-temporal stores be truly just a hint, which is possible on some targets (e.g. ARM). On all other targets, non-temporal stores become regular stores. ~~Blocked on https://github.com/rust-lang/stdarch/pull/1541 propagating to the rustc repo, to make sure the `_mm_stream` intrinsics are unaffected by this change.~~ Fixes https://github.com/rust-lang/rust/issues/114582 Cc `@Amanieu` `@workingjubilee`	2024-08-12 17:09:14 +02:00
bjorn3	060811a602	Distribute rustc_codegen_cranelift for Windows	2024-08-10 21:16:44 +00:00
Nadrieril	3de829e4e5	Fixes in various places	2024-08-10 12:08:46 +02:00
bjorn3	0174cb1932	Use ar_archive_writer from sysroot for cg_clif	2024-08-09 17:23:50 +00:00
bjorn3	3ea9313de3	Merge commit '69b3f5a426a5c1c05236a45b36f6679d95fbe01b' into sync_cg_clif-2024-08-09	2024-08-09 17:18:46 +00:00
Nicholas Nethercote	b8b3a9328f	Shrink `TyKind::FnPtr`. By splitting the `FnSig` within `TyKind::FnPtr` into `FnSigTys` and `FnHeader`, which can be packed more efficiently. This reduces the size of the hot `TyKind` type from 32 bytes to 24 bytes on 64-bit platforms. This reduces peak memory usage by a few percent on some benchmarks. It also reduces cache misses and page faults similarly, though this doesn't translate to clear cycles or wall-time improvements on CI.	2024-08-09 14:33:25 +10:00
Michael Goulet	e96ece7c0b	Rename struct_tail_erasing_lifetimes to struct_tail_for_codegen	2024-08-08 12:15:16 -04:00
Ralf Jung	80a7d5f61d	nontemporal_store: make sure that the intrinsic is truly just a hint	2024-08-05 10:57:14 +02:00
bjorn3	37da264107	Move computation of decorated names out of the create_dll_import_lib method	2024-07-30 10:32:32 +00:00
bjorn3	161d2476d5	Move temp file name generation out of the create_dll_import_lib method	2024-07-30 10:10:41 +00:00
bors	d35f63e303	Auto merge of #125016 - nicholasbishop:bishop-cb-112, r=tgross35 Update compiler_builtins to 0.1.114 The `weak-intrinsics` feature was removed from compiler_builtins in https://github.com/rust-lang/compiler-builtins/pull/598, so dropped the `compiler-builtins-weak-intrinsics` feature from alloc/std/sysroot. In https://github.com/rust-lang/compiler-builtins/pull/593, some builtins for f16/f128 were added. These don't work for all compiler backends, so add a `compiler-builtins-no-f16-f128` feature and disable it for cranelift and gcc.	2024-07-29 07:41:33 +00:00
Nicholas Bishop	5cd1e8d9ab	Update compiler_builtins to 0.1.114 The `weak-intrinsics` feature was removed from compiler_builtins in https://github.com/rust-lang/compiler-builtins/pull/598, so dropped the `compiler-builtins-weak-intrinsics` feature from alloc/std/sysroot. In https://github.com/rust-lang/compiler-builtins/pull/593, some builtins for f16/f128 were added. These don't work for all compiler backends, so add a `compiler-builtins-no-f16-f128` feature and disable it for cranelift and gcc. Also disable it for LLVM targets that don't support it.	2024-07-28 20:43:07 -04:00
Nicholas Nethercote	ea9f491696	Reformat `use` declarations. The previous commit updated `rustfmt.toml` appropriately. This commit is the outcome of running `x fmt --all` with the new formatting options.	2024-07-29 08:26:52 +10:00
Nicholas Nethercote	b00404e1a1	Update `use` declarations formatting options. As decided in rust-lang/compiler-team#750. Use declarations are currently wildly inconsistent because rustfmt is quite unopinionated about how they should be formatted. The `rustfmt.toml` additions makes rustfmt more opinionated, which avoids the need for any decision when adding new use declarations to a file. This commit only updates `rustfmt.toml` and `compiler/rustc_codegen_cranelift/rustfmt.toml`. The next commit will do the reformatting.	2024-07-29 08:26:08 +10:00
Slanterns	62a0c22ca9	stabilize `is_sorted`	2024-07-28 03:11:54 +08:00
GnomedDev	486f5b5059	Use Cow<'static, str> for InlineAsmTemplatePiece::String	2024-07-24 21:11:55 +01:00
许杰友 Jieyou Xu (Joe)	f85d741d6f	Rollup merge of #127177 - bjorn3:arm64_macos_cg_clif, r=Mark-Simulacrum Distribute rustc_codegen_cranelift for arm64 macOS Support for arm64 macOS has been added to rustc_codegen_cranelift recently. Fixes https://github.com/rust-lang/rustc_codegen_cranelift/issues/1502	2024-07-22 16:44:02 +08:00
Matthias Krüger	0c9ba01011	Rollup merge of #127779 - momvart:should_codegen_hook, r=cjgillot Add a hook for `should_codegen_locally` This PR lifts the module-local function `should_codegen_locally` to `TyCtxt` as a hook. In addition to monomorphization, this function is used for checking the dependency of `compiler_builtins` on other libraries. Moving this function to the hooks also makes overriding it possible for the tools that use the rustc interface.	2024-07-20 13:24:52 +02:00
Yuri Astrakhan	c1c13bd07c	Avoid ref when using format! in compiler Clean up a few minor refs in `format!` macro, as it has a performance cost. Apparently the compiler is unable to inline `format!("{}", &variable)`, and does a run-time double-reference instead (format macro already does one level referencing). Inlining format args prevents accidental `&` misuse.	2024-07-19 14:52:07 -04:00
Trevor Gross	25dce8f92b	Rollup merge of #124033 - bjorn3:ar_archive_writer_0_3_0, r=davidtwco Sync ar_archive_writer to LLVM 18.1.3 From LLVM 15.0.0-rc3. This adds support for COFF archives containing Arm64EC object files and has various fixes for AIX big archive files.	2024-07-16 16:15:13 -05:00
Mohammad Omidvar	c01fe3ceac	Move compiler_builtin check to the use case	2024-07-15 23:43:52 +00:00
bjorn3	d0d2e60886	Merge commit '659243d85c7489412bd0faa1c068d904a6042941' into sync_cg_clif-2024-07-13	2024-07-13 18:39:03 +00:00
bors	776ab988c1	Auto merge of #113128 - WaffleLapkin:become_trully_unuwuable, r=oli-obk,RalfJung Support tail calls in mir via `TerminatorKind::TailCall` This is one of the interesting bits in tail call implementation — MIR support. This adds a new `TerminatorKind` which represents a tail call: ```rust TailCall { func: Operand<'tcx>, args: Vec<Operand<'tcx>>, fn_span: Span, }, ``` Structurally this is very similar to a normal `Call` but is missing a few fields: - `destination` — tail calls don't write to destination, instead they pass caller's destination to the callee (such that eventual `return` will write to the caller of the function that used tail call) - `target` — similarly to `destination` tail calls pass the caller's return address to the callee, so there is nothing to do - `unwind` — I _think_ this is applicable too, although it's a bit confusing - `call_source` — `become` forbids operators and is not created as a lowering of something else; tail calls always come from HIR (at least for now) It might be helpful to read the interpreter implementation to understand what `TailCall` means exactly, although I've tried documenting it too. ----- There are a few `FIXME`-questions still left, ideally we'd be able to answer them during review ':) ----- r? `@oli-obk` cc `@scottmcm` `@DrMeepster` `@JakobDegen`	2024-07-08 04:35:04 +00:00

2518 Commits
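
Note on commit 270fb2130c (`fmuladd`): the message describes `(a * b) + c` as something that may or may not be fused, depending on the code generator. The sketch below is not the intrinsic itself. It assumes the stable `f64::mul_add` method as a stand-in for the fused case and the plain expression `a * b + c` for the separate `mul` and `add` pair, to show why the two can give different results. The function names `unfused` and `fused` are hypothetical and exist only for this illustration.

```rust
/// Separate multiply and add: the product is rounded to f64
/// before the addition takes place.
fn unfused(a: f64, b: f64, c: f64) -> f64 {
    a * b + c
}

/// Fused multiply-add via the stable `f64::mul_add`: the exact
/// product is added to `c` and the result is rounded only once.
fn fused(a: f64, b: f64, c: f64) -> f64 {
    a.mul_add(b, c)
}

fn main() {
    // The exact product is 1 - EPSILON^2, which is not representable
    // as an f64 and rounds to 1.0 when computed on its own.
    let a = 1.0 + f64::EPSILON;
    let b = 1.0 - f64::EPSILON;
    let c = -1.0;

    println!("unfused: {:e}", unfused(a, b, c));
    println!("fused:   {:e}", fused(a, b, c));
}
```

With these inputs the unfused form loses the small term when the product is rounded, while the fused form keeps it. Code that calls an operation with `fmuladd` semantics should therefore be prepared for either result, since the commit message leaves the choice to the code generator.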