mikros/rust - rust - Gitea.pterpstra.com

Author	SHA1	Message	Date
bors	f4a7ce997a	Auto merge of #95519 - oli-obk:tait_ub2, r=compiler-errors Enforce well formedness for type alias impl trait's hidden type fixes #84657 This was not an issue with return-position-impl-trait because the generic bounds of the function are the same as those of the opaque type, and the hidden type must already be well formed within the function. With type-alias-impl-trait the hidden type could be defined in a function that has more lifetime bounds than the type alias. This is fine, but the hidden type must still be well formed without those additional bounds.	2022-04-08 20:45:16 +00:00
Rémy Rakic	7585269673	add `generic_activity_with_arg_recorder` to the self-profiler This allows profiling costly arguments to be recorded only when `-Zself-profile-events=args` is on: using a closure that takes an `EventArgRecorder` and call its `record_arg` or `record_args` methods.	2022-04-07 15:47:20 +02:00
Rémy Rakic	1c4ae7aa4a	turn `exec` comment into doc comment	2022-04-07 15:47:19 +02:00
Oli Scherer	2e0ef701c2	Document and rename the new wrapper type	2022-04-07 13:01:48 +00:00
Oli Scherer	b30bcfae38	Fix some fallout around type alias impl trait in associated types	2022-04-06 12:56:22 +00:00
bors	168a020900	Auto merge of #92686 - saethlin:unsafe-debug-asserts, r=Amanieu Add debug assertions to some unsafe functions As suggested by https://github.com/rust-lang/rust/issues/51713 ~~Some similar code calls `abort()` instead of `panic!()` but aborting doesn't work in a `const fn`, and the intrinsic for doing dispatch based on whether execution is in a const is unstable.~~ This picked up some invalid uses of `get_unchecked` in the compiler, and fixes them. I can confirm that they do in fact pick up invalid uses of `get_unchecked` in the wild, though the user experience is less-than-awesome: ``` Running unittests (target/x86_64-unknown-linux-gnu/debug/deps/rle_decode_fast-04b7918da2001b50) running 6 tests error: test failed, to rerun pass '--lib' Caused by: process didn't exit successfully: `/home/ben/rle-decode-helper/target/x86_64-unknown-linux-gnu/debug/deps/rle_decode_fast-04b7918da2001b50` (signal: 4, SIGILL: illegal instruction) ``` ~~As best I can tell these changes produce a 6% regression in the runtime of `./x.py test` when `[rust] debug = true` is set.~~ Latest commit (`6894d559bd`) brings the additional overhead from this PR down to 0.5%, while also adding a few more assertions. I think this actually covers all the places in `core` that it is reasonable to check for safety requirements at runtime. Thoughts?	2022-04-03 16:04:47 +00:00
Oli Scherer	6ffd654683	Check that the cached stable hash is the right one if debug assertions are enabled	2022-03-31 14:54:04 +00:00
Oli Scherer	00c24dd8ce	Move stable hash from TyS into a datastructure that can be shared with other interned types.	2022-03-31 14:54:04 +00:00
bors	05142a7e44	Auto merge of #95466 - Dylan-DPC:rollup-g7ddr8y, r=Dylan-DPC Rollup of 5 pull requests Successful merges: - #95294 (Document Linux kernel handoff in std::io::copy and std::fs::copy) - #95443 (Clarify how `src/tools/x` searches for python) - #95452 (fix since field version for termination stabilization) - #95460 (Spellchecking compiler code) - #95461 (Spellchecking some comments) Failed merges: r? `@ghost` `@rustbot` modify labels: rollup	2022-03-30 07:45:42 +00:00
Dylan DPC	03b3993ae8	Rollup merge of #95461 - nyurik:spelling, r=lcnr Spellchecking some comments This PR attempts to clean up some minor spelling mistakes in comments	2022-03-30 09:10:07 +02:00
Yuri Astrakhan	7e8201ae0a	Spellchecking some comments This PR attempts to clean up some minor spelling mistakes in comments	2022-03-30 01:39:38 -04:00
bors	f132bcf3bd	Auto merge of #94081 - oli-obk:lazy_tait_take_two, r=nikomatsakis Lazy type-alias-impl-trait take two ### user visible change 1: RPIT inference from recursive call sites Lazy TAIT has an insta-stable change. The following snippet now compiles, because opaque types can now have their hidden type set from wherever the opaque type is mentioned. ```rust fn bar(b: bool) -> impl std::fmt::Debug { if b { return 42 } let x: u32 = bar(false); // this errors on stable 99 } ``` The return type of `bar` stays opaque, you can't do `bar(false) + 42`, you need to actually mention the hidden type. ### user visible change 2: divergence between RPIT and TAIT in return statements Note that `return` statements and the trailing return expression are special with RPIT (but not TAIT). So ```rust #![feature(type_alias_impl_trait)] type Foo = impl std::fmt::Debug; fn foo(b: bool) -> Foo { if b { return vec![42]; } std::iter::empty().collect() //~ ERROR `Foo` cannot be built from an iterator } fn bar(b: bool) -> impl std::fmt::Debug { if b { return vec![42] } std::iter::empty().collect() // Works, magic (accidentally stabilized, not intended) } ``` But when we are working with the return value of a recursive call, the behavior of RPIT and TAIT is the same: ```rust type Foo = impl std::fmt::Debug; fn foo(b: bool) -> Foo { if b { return vec![]; } let mut x = foo(false); x = std::iter::empty().collect(); //~ ERROR `Foo` cannot be built from an iterator vec![] } fn bar(b: bool) -> impl std::fmt::Debug { if b { return vec![]; } let mut x = bar(false); x = std::iter::empty().collect(); //~ ERROR `impl Debug` cannot be built from an iterator vec![] } ``` ### user visible change 3: TAIT does not merge types across branches In contrast to RPIT, TAIT does not merge types across branches, so the following does not compile. ```rust type Foo = impl std::fmt::Debug; fn foo(b: bool) -> Foo { if b { vec![42_i32] } else { std::iter::empty().collect() //~^ ERROR `Foo` cannot be built from an iterator over elements of type `_` } } ``` It is easy to support, but we should make an explicit decision to include the additional complexity in the implementation (it's not much, see a721052457cf513487fb4266e3ade65c29b272d2 which needs to be reverted to enable this). ### PR formalities previous attempt: #92007 This PR also includes #92306 and #93783, as they were reverted along with #92007 in #93893 fixes #93411 fixes #88236 fixes #89312 fixes #87340 fixes #86800 fixes #86719 fixes #84073 fixes #83919 fixes #82139 fixes #77987 fixes #74282 fixes #67830 fixes #62742 fixes #54895	2022-03-30 05:04:45 +00:00
Ben Kimock	6e6d0cbf83	Add debug assertions to some unsafe functions These debug assertions are all implemented only at runtime using `const_eval_select`, and in the error path they execute `intrinsics::abort` instead of being a normal debug assertion to minimize the impact of these assertions on code size, when enabled. Of all these changes, the bounds checks for unchecked indexing are expected to be most impactful (case in point, they found a problem in rustc).	2022-03-29 11:05:24 -04:00
Oli Scherer	264cd05b16	Revert "Auto merge of #93893 - oli-obk:sad_revert, r=oli-obk" This reverts commit `6499c5e7fc`, reversing changes made to `78450d2d60`.	2022-03-28 16:27:14 +00:00
klensy	008fc79dcd	Propagate `parallel_compiler` feature through rustc crates. Turned off feature gives change of builded crates: 238 -> 224.	2022-03-28 08:41:12 +03:00
lcnr	b8135fd5c8	add `#[rustc_pass_by_value]` to more types	2022-03-08 15:39:52 +01:00
Nicholas Nethercote	4f008e06c3	Clarify `Layout` interning. `Layout` is another type that is sometimes interned, sometimes not, and we always use references to refer to it so we can't take any advantage of the uniqueness properties for hashing or equality checks. This commit renames `Layout` as `LayoutS`, and then introduces a new `Layout` that is a newtype around an `Interned<LayoutS>`. It also interns more layouts than before. Previously layouts within layouts (via the `variants` field) were never interned, but now they are. Hence the lifetime on the new `Layout` type. Unlike other interned types, these ones are in `rustc_target` instead of `rustc_middle`. This reflects the existing structure of the code, which does layout-specific stuff in `rustc_target` while `TyAndLayout` is generic over the `Ty`, allowing the type-specific stuff to occur in `rustc_middle`. The commit also adds a `HashStable` impl for `Interned`, which was needed. It hashes the contents, unlike the `Hash` impl which hashes the pointer.	2022-03-07 13:41:47 +11:00
Nicholas Nethercote	4852291417	Introduce `ConstAllocation`. Currently some `Allocation`s are interned, some are not, and it's very hard to tell at a use point which is which. This commit introduces `ConstAllocation` for the known-interned ones, which makes the division much clearer. `ConstAllocation::inner()` is used to get the underlying `Allocation`. In some places it's natural to use an `Allocation`, in some it's natural to use a `ConstAllocation`, and in some places there's no clear choice. I've tried to make things look as nice as possible, while generally favouring `ConstAllocation`, which is the type that embodies more information. This does require quite a few calls to `inner()`. The commit also tweaks how `PartialOrd` works for `Interned`. The previous code was too clever by half, building on `T: Ord` to make the code shorter. That caused problems with deriving `PartialOrd` and `Ord` for `ConstAllocation`, so I changed it to build on `T: PartialOrd`, which is slightly more verbose but much more standard and avoided the problems.	2022-03-07 08:25:50 +11:00
bors	c38b8a8c62	Auto merge of #94579 - tmiasko:target-features, r=nagisa Always include global target features in function attributes This ensures that information about target features configured with `-C target-feature=...` or detected with `-C target-cpu=native` is retained for subsequent consumers of LLVM bitcode. This is crucial for linker plugin LTO, since this information is not conveyed to the plugin otherwise. <details><summary>Additional test case demonstrating the issue</summary> ```rust extern crate core; #[inline] #[target_feature(enable = "aes")] unsafe fn f(a: u128, b: u128) -> u128 { use core::arch::x86_64::*; use core::mem::transmute; transmute(_mm_aesenc_si128(transmute(a), transmute(b))) } pub fn g(a: u128, b: u128) -> u128 { unsafe { f(a, b) } } fn main() { let mut args = std::env::args(); let _ = args.next().unwrap(); let a: u128 = args.next().unwrap().parse().unwrap(); let b: u128 = args.next().unwrap().parse().unwrap(); println!("{}", g(a, b)); } ``` ```console $ rustc --edition=2021 a.rs -Clinker-plugin-lto -Clink-arg=-fuse-ld=lld -Ctarget-feature=+aes -O ... = note: LLVM ERROR: Cannot select: intrinsic %llvm.x86.aesni.aesenc ``` </details> r? `@nagisa`	2022-03-06 18:07:11 +00:00
Tomasz Miąsko	725c11ef3c	Add SmallStr	2022-03-04 16:57:34 +01:00
Tomasz Miąsko	ea0a31ff0c	Inline SmallCStr::deref	2022-03-04 16:57:34 +01:00
Loïc BRANSTETT	08e1e67b49	Remove invalid #[cfg(tests)] in index_map	2022-03-04 11:34:50 +01:00
bors	2a280de64f	Auto merge of #94514 - matthiaskrgr:rollup-pdzn82h, r=matthiaskrgr Rollup of 9 pull requests Successful merges: - #94464 (Suggest adding a new lifetime parameter when two elided lifetimes should match up for traits and impls.) - #94476 (7 - Make more use of `let_chains`) - #94478 (Fix panic when handling intra doc links generated from macro) - #94482 (compiler: fix some typos) - #94490 (Update books) - #94496 (tests: accept llvm intrinsic in align-checking test) - #94498 (9 - Make more use of `let_chains`) - #94503 (Provide C FFI types via core::ffi, not just in std) - #94513 (update Miri) Failed merges: r? `@ghost` `@rustbot` modify labels: rollup	2022-03-02 05:32:00 +00:00
cuishuang	eb2b9441e7	compiler: fix some typos	2022-03-01 20:02:47 +08:00
Simonas Kazlauskas	df701a292c	Querify `global_backend_features` At the very least this serves to deduplicate the diagnostics that are output about unknown target features provided via CLI.	2022-03-01 01:57:25 +02:00
bors	3b1fe7e7c9	Auto merge of #94084 - Mark-Simulacrum:drop-sharded, r=cjgillot Avoid query cache sharding code in single-threaded mode In non-parallel compilers, this is just adding needless overhead at compilation time (since there is only one shard statically anyway). This amounts to roughly ~10 seconds reduction in bootstrap time, with overall neutral (some wins, some losses) performance results. Parallel compiler performance should be largely unaffected by this PR; sharding is kept there.	2022-02-27 14:04:07 +00:00
Matthias Krüger	648a8e314a	Rollup merge of #94306 - Mark-Simulacrum:dom-fixups, r=jackh726 Avoid exhausting stack space in dominator compression Doesn't add a test case -- I ended up running into this while playing with the generated example from #43578, which we could do with a run-make test (to avoid checking a large code snippet into tree), but I suspect we don't want to wait for it to compile (locally it takes ~14s -- not terrible, but doesn't seem worth it to me). In practice stack space exhaustion is difficult to test for, too, since if we set the bound too low a different call structure above us (e.g., a nearer ensure_sufficient_stack call) would let the test pass even with the old impl, most likely. Locally it seems like this manages to perform approximately equivalently to the recursion, but will run perf to confirm.	2022-02-26 07:52:44 +01:00
Mark Rousskov	22c3a71de1	Switch bootstrap cfgs	2022-02-25 08:00:52 -05:00
Matthias Krüger	ae27c4ab1f	Rollup merge of #94288 - Mark-Simulacrum:ser-opt, r=nnethercote Cleanup a few Decoder methods This is just some simple follow up to #93839. r? `@nnethercote`	2022-02-24 07:48:09 +01:00
Mark Rousskov	4d89292785	Avoid exhausting stack space in dominator compression	2022-02-23 16:07:56 -05:00
bors	bafe8d06e0	Auto merge of #93984 - nnethercote:ChunkedBitSet, r=Mark-Simulacrum Introduce `ChunkedBitSet` and use it for some dataflow analyses. This reduces peak memory usage significantly for some programs with very large functions. r? `@ghost`	2022-02-23 01:26:07 +00:00
Nicholas Nethercote	36b495f3cf	Introduce `ChunkedBitSet` and use it for some dataflow analyses. This reduces peak memory usage significantly for some programs with very large functions, such as: - `keccak`, `unicode_normalization`, and `match-stress-enum`, from the `rustc-perf` benchmark suite; - `http-0.2.6` from crates.io. The new type is used in the analyses where the bitsets can get huge (e.g. 10s of thousands of bits): `MaybeInitializedPlaces`, `MaybeUninitializedPlaces`, and `EverInitializedPlaces`. Some refactoring was required in `rustc_mir_dataflow`. All existing analysis domains are either `BitSet` or a trivial wrapper around `BitSet`, and access in a few places is done via `Borrow<BitSet>` or `BorrowMut<BitSet>`. Now that some of these domains are `ClusterBitSet`, that no longer works. So this commit replaces the `Borrow`/`BorrowMut` usage with a new trait `BitSetExt` containing the needed bitset operations. The impls just forward these to the underlying bitset type. This required fiddling with trait bounds in a few places. The commit also: - Moves `static_assert_size` from `rustc_data_structures` to `rustc_index` so it can be used in the latter; the former now re-exports it so existing users are unaffected. - Factors out some common "clear excess bits in the final word" functionality in `bit_set.rs`. - Uses `fill` in a few places instead of loops.	2022-02-23 10:18:49 +11:00
Mark Rousskov	2098ea6eba	Provide copy-free access to raw Decoder bytes	2022-02-22 18:11:59 -05:00
lcnr	6a1f5eab83	obligation forest docs	2022-02-21 12:00:26 +01:00
Mark Rousskov	9deed6f74e	Move Sharded maps into each QueryCache impl	2022-02-20 12:10:46 -05:00
bors	c1aa85475c	Auto merge of #93934 - rusticstuff:inline_ensure_sufficient_stack, r=estebank Allow inlining of `ensure_sufficient_stack()` This functions is monomorphized a lot and allowing the compiler to inline it improves instructions count and max RSS significantly in my local tests.	2022-02-20 15:10:19 +00:00
est31	2ef8af6619	Adopt let else in more places	2022-02-19 17:27:43 +01:00
Nicholas Nethercote	80632de4a1	Address review comments.	2022-02-15 16:20:01 +11:00
Nicholas Nethercote	0c2ebbd412	Rename `PtrKey` as `Interned` and improve it. In particular, there's now more protection against incorrect usage, because you can only create one via `Interned::new_unchecked`, which makes it more obvious that you must be careful. There are also some tests.	2022-02-15 15:50:29 +11:00
Santiago Pastorino	f4bb4500dd	Call the method fork instead of clone and add proper comments	2022-02-14 12:57:20 -03:00
Hans Kratz	59536fb023	Allow inlining of ensure_sufficient_stack()	2022-02-12 11:30:04 +01:00
Jakub Beránek	5fc2e5623b	Use const generics in SipHasher128's short_write	2022-02-05 19:55:44 +01:00
Jakub Beránek	c21b8e12a4	Fix `isize` optimization in `StableHasher` for big-endian architectures	2022-02-03 11:47:41 +01:00
bors	1be5c8f909	Auto merge of #93432 - Kobzol:stable-hash-isize-hash-compression, r=the8472 Compress amount of hashed bytes for `isize` values in StableHasher This is another attempt to land https://github.com/rust-lang/rust/pull/92103, this time hopefully with a correct implementation w.r.t. stable hashing guarantees. The previous PR was [reverted](https://github.com/rust-lang/rust/pull/93014) because it could produce the [same hash](https://github.com/rust-lang/rust/pull/92103#issuecomment-1014625442) for different values even in quite simple situations. I have since added a basic [test](https://github.com/rust-lang/rust/pull/93193) that should guard against that situation, I also added a new test in this PR, specialised for this optimization. ## Why this optimization helps Since the original PR, I have tried to analyze why this optimization even helps (and why it especially helps for `clap`). I found that the vast majority of stable-hashing `i64` actually comes from hashing `isize` (which is converted to `i64` in the stable hasher). I only found a single place where is this datatype used directly in the compiler, and this place has also been showing up in traces that I used to find out when is `isize` being hashed. This place is `rustc_span::FileName::DocTest`, however, I suppose that isizes also come from other places, but they might not be so easy to find (there were some other entries in the trace). `clap` hashes about 8.5 million `isize`s, and all of them fit into a single byte, which is why this optimization has helped it [quite a lot](https://github.com/rust-lang/rust/pull/92103#issuecomment-1005711861). Now, I'm not sure if special casing `isize` is the correct solution here, maybe something could be done with that `isize` inside `DocTest` or in other places, but that's for another discussion I suppose. In this PR, instead of hardcoding a special case inside `SipHasher128`, I instead put it into `StableHasher`, and only used it for `isize` (I tested that for `i64` it doesn't help, or at least not for `clap` and other few benchmarks that I was testing). ## New approach Since the most common case is a single byte, I added a fast path for hashing `isize` values which positive value fits within a single byte, and a cold path for the rest of the values. To avoid the previous correctness problem, we need to make sure that each unique `isize` value will produce a unique hash stream to the hasher. By hash stream I mean a sequence of bytes that will be hashed (a different sequence should produce a different hash, but that is of course not guaranteed). We have to distinguish different values that produce the same bit pattern when we combine them. For example, if we just simply skipped the leading zero bytes for values that fit within a single byte, `(0xFF, 0xFFFFFFFFFFFFFFFF)` and `(0xFFFFFFFFFFFFFFFF, 0xFF)` would send the same hash stream to the hasher, which must not happen. To avoid this situation, values `[0, 0xFE]` are hashed as a single byte. When we hash a larger (treating `isize` as `u64`) value, we first hash an additional byte `0xFF`. Since `0xFF` cannot occur when we apply the single byte optimization, we guarantee that the hash streams will be unique when hashing two values `(a, b)` and `(b, a)` if `a != b`: 1) When both `a` and `b` are within `[0, 0xFE]`, their hash streams will be different. 2) When neither `a` and `b` are within `[0, 0xFE]`, their hash streams will be different. 3) When `a` is within `[0, 0xFE]` and `b` isn't, when we hash `(a, b)`, the hash stream will definitely not begin with `0xFF`. When we hash `(b, a)`, the hash stream will definitely begin with `0xFF`. Therefore the hash streams will be different. r? `@the8472`	2022-02-03 01:08:45 +00:00
Matthias Krüger	21ffe45631	Rollup merge of #92528 - tmiasko:combine-commutative, r=michaelwoerister Make `Fingerprint::combine_commutative` associative The previous implementation swapped lower and upper 64-bits of a result of modular addition, so the function was non-associative. r? `@Aaron1011`	2022-02-02 19:34:01 +01:00
lcnr	a1a30f7548	add a rustc::query_stability lint	2022-02-01 10:15:59 +01:00
Jakub Beránek	8de59be933	Compress amount of hashed bytes for `isize` values in StableHasher	2022-01-30 09:52:44 +01:00
Matthias Krüger	bc26f97394	Rollup merge of #93193 - Kobzol:stable-hash-permutation-test, r=the8472 Add test for stable hash uniqueness of adjacent field values This PR adds a simple test to check that stable hash will produce a different hash if the order of two values that have the same combined bit pattern changes. r? `@the8472`	2022-01-27 22:32:24 +01:00
bors	e7825f2b69	Auto merge of #90842 - pierwill:localdefid-indexmap, r=wesleywiser Use `indexmap` to avoid sorting `LocalDefId`s See discussion in https://github.com/rust-lang/rust/pull/90408#discussion_r745935459. Related to work on https://github.com/rust-lang/rust/issues/90317.	2022-01-24 22:04:55 +00:00
Jakub Beránek	1ffd043caf	Add test stable hash uniqueness of adjacent field values	2022-01-24 15:35:52 +01:00
Jakub Beránek	50f8062316	Revert "Do not hash leading zero bytes of i64 numbers in Sip128 hasher"	2022-01-24 09:07:47 +01:00
pierwill	4f89224f7f	Use an `indexmap` to avoid sorting `LocalDefId`s Update `indexmap` to 1.8.0. Bless test	2022-01-22 22:34:16 -06:00
Nicholas Nethercote	416399dc10	Make `Decodable` and `Decoder` infallible. `Decoder` has two impls: - opaque: this impl is already partly infallible, i.e. in some places it currently panics on failure (e.g. if the input is too short, or on a bad `Result` discriminant), and in some places it returns an error (e.g. on a bad `Option` discriminant). The number of places where either happens is surprisingly small, just because the binary representation has very little redundancy and a lot of input reading can occur even on malformed data. - json: this impl is fully fallible, but it's only used (a) for the `.rlink` file production, and there's a `FIXME` comment suggesting it should change to a binary format, and (b) in a few tests in non-fundamental ways. Indeed #85993 is open to remove it entirely. And the top-level places in the compiler that call into decoding just abort on error anyway. So the fallibility is providing little value, and getting rid of it leads to some non-trivial performance improvements. Much of this commit is pretty boring and mechanical. Some notes about a few interesting parts: - The commit removes `Decoder::{Error,error}`. - `InternIteratorElement::intern_with`: the impl for `T` now has the same optimization for small counts that the impl for `Result<T, E>` has, because it's now much hotter. - Decodable impls for SmallVec, LinkedList, VecDeque now all use `collect`, which is nice; the one for `Vec` uses unsafe code, because that gave better perf on some benchmarks.	2022-01-22 10:38:31 +11:00
bors	42852d7857	Auto merge of #92740 - cuviper:update-rayons, r=Mark-Simulacrum Update rayon and rustc-rayon This updates rayon for various tools and rustc-rayon for the compiler's parallel mode. - rayon v1.3.1 -> v1.5.1 - rayon-core v1.7.1 -> v1.9.1 - rustc-rayon v0.3.1 -> v0.3.2 - rustc-rayon-core v0.3.1 -> v0.3.2 ... and indirectly, this updates all of crossbeam-* to their latest versions. Fixes #92677 by removing crossbeam-queue, but there's still a lingering question about how tidy discovers "runtime" dependencies. None of this is truly in the standard library's dependency tree at all.	2022-01-16 08:12:23 +00:00
bors	2e2c86eba2	Auto merge of #92070 - rukai:replace_vec_into_iter_with_array_into_iter, r=Mark-Simulacrum Replace usages of vec![].into_iter with [].into_iter `[].into_iter` is idiomatic over `vec![].into_iter` because its simpler and faster (unless the vec is optimized away in which case it would be the same) So we should change all the implementation, documentation and tests to use it. I skipped: * `src/tools` - Those are copied in from upstream * `src/test/ui` - Hard to tell if `vec![].into_iter` was used intentionally or not here and not much benefit to changing it. * any case where `vec![].into_iter` was used because we specifically needed a `Vec::IntoIter<T>` * any case where it looked like we were intentionally using `vec![].into_iter` to test it.	2022-01-11 14:23:24 +00:00
Josh Stone	f3b8812f24	Update rayon and rustc-rayon	2022-01-10 11:34:07 -08:00
Lucas Kent	08829853d3	eplace usages of vec![].into_iter with [].into_iter	2022-01-09 14:09:25 +11:00
Aaron Hill	5580e5e1dd	Ensure that `Fingerprint` caching respects hashing configuration Fixes #92266 In some `HashStable` impls, we use a cache to avoid re-computing the same `Fingerprint` from the same structure (e.g. an `AdtDef`). However, the `StableHashingContext` used can be configured to perform hashing in different ways (e.g. skipping `Span`s). This configuration information is not included in the cache key, which will cause an incorrect `Fingerprint` to be used if we hash the same structure with different `StableHashingContext` settings. To fix this, the configuration settings of `StableHashingContext` are split out into a separate `HashingControls` struct. This struct is used as part of the cache key, ensuring that our caches always produce the correct result for the given settings. With this in place, we now turn off `Span` hashing during the entire process of computing the hash included in legacy symbols. This current has no effect, but will matter when a future PR starts hashing more `Span`s that we currently skip.	2022-01-05 10:13:28 -05:00
Jakub Beránek	65a3279f4a	Do not hash zero bytes of i64 and u32 in Sip128 hasher	2022-01-04 19:12:10 +01:00
Tomasz Miąsko	1d64b59664	Make `Fingerprint::combine_commutative` associative The previous implementation swapped lower and upper 64-bits of a result of modular addition, so the function was non-associative.	2022-01-03 19:07:29 +01:00
Jakub Beránek	3d8d3f1435	Rustdoc: use ThinVec for GenericArgs bindings	2022-01-01 11:29:14 +01:00
bors	41ce641a40	Auto merge of #92130 - Kobzol:stable-hash-str, r=cjgillot Use hash_stable for hashing str This seemed like an oversight. With this change the hash can go through the `HashStable` machinery directly.	2021-12-28 01:04:33 +00:00
pierwill	e6ff0bac1e	rustc `VecGraph`: require the index type to implement Ord	2021-12-22 10:50:57 -06:00
pierwill	8df9248591	Remove `PartialOrd` and `Ord` from `LocalDefId` Implement `Ord`, `PartialOrd` for SpanData	2021-12-22 10:50:57 -06:00
bors	87e8639d8d	Auto merge of #91903 - tmiasko:bit-set-hash, r=jackh726 Implement StableHash for BitSet and BitMatrix via Hash This fixes an issue where bit sets / bit matrices the same word content but a different domain size would receive the same hash.	2021-12-21 05:42:10 +00:00
Jakub Beránek	c695b6026c	Use hash_stable for hashing str	2021-12-20 18:46:34 +01:00
bors	daf2204aa4	Auto merge of #91837 - Kobzol:stable-hash-map-avoid-sort, r=the8472 Avoid sorting in hash map stable hashing Suggested by `@the8472` [here](https://github.com/rust-lang/rust/pull/89404#issuecomment-991813333). I hope that I understood it right, I replaced the sort with modular multiplication, which should be commutative. Can I ask for a perf. run? However, locally it didn't help at all. Creating the `StableHasher` all over again is probably slowing it down quite a lot. And using `FxHasher` is not straightforward, because the keys and values only implement `HashStable` (and probably they shouldn't be just hashed via `Hash` anyway for it to actually be stable). Maybe the `StableHash` interface could be changed somehow to better suppor these scenarios where the hasher is short-lived. Or the `StableHasher` implementation could have variants with e.g. a shorter buffer for these scenarios.	2021-12-18 21:23:37 +00:00
Tomasz Miąsko	d0281bcb25	Implement StableHash for BitSet and BitMatrix via Hash This fixes an issue where bit sets / bit matrices the same word content but a different domain size would receive the same hash.	2021-12-18 00:00:00 +00:00
Jakub Beránek	1f284b07ed	Add special case for length 1	2021-12-13 21:34:54 +01:00
Jakub Beránek	ac08f13948	Remove sort from hashing hashset, treeset and treemap	2021-12-13 16:11:28 +01:00
Jakub Beránek	6e33d3ecc2	Use modular arithmetic	2021-12-12 23:48:11 +01:00
bors	22f8bde876	Auto merge of #91549 - fee1-dead:const_env, r=spastorino Eliminate ConstnessAnd again Closes #91489. Closes #89432. Reverts #91491. Reverts #89450. r? `@spastorino`	2021-12-12 22:15:32 +00:00
Jakub Beránek	a5f5f6b689	Avoid sorting in hash map stable hashing	2021-12-12 20:57:24 +01:00
Deadbeef	83587e8d30	Small performance tweaks	2021-12-12 12:35:01 +08:00
bors	58457bbfd3	Auto merge of #89404 - Kobzol:hash-stable-sort, r=Mark-Simulacrum Slightly optimize hash map stable hashing I was profiling some of the `rustc-perf` benchmarks locally and noticed that quite some time is spent inside the stable hash of hashmaps. I tried to use a `SmallVec` instead of a `Vec` there, which helped very slightly. Then I tried to remove the sorting, which was a bottleneck, and replaced it with insertion into a binary heap. Locally, it yielded nice improvements in instruction counts and RSS in several benchmarks for incremental builds. The implementation could probably be much nicer and possibly extended to other stable hashes, but first I wanted to test the perf impact properly. Can I ask someone to do a perf run? Thank you!	2021-12-12 03:50:30 +00:00
Matthias Krüger	7e18b79c77	Rollup merge of #91426 - eggyal:idfunctor-panic-safety, r=lcnr Make IdFunctor::try_map_id panic-safe Addresses FIXME comment created in #78313 r? ```@lcnr```	2021-12-11 08:22:32 +01:00
est31	15de4cbc4b	Remove redundant [..]s	2021-12-09 00:01:29 +01:00
Alan Egerton	acd39ff0fe	Make IdFunctor::try_map_id panic-safe	2021-12-07 11:11:23 +00:00
Mark Rousskov	15483ccf9d	Annotate comments onto the LT algorithm	2021-12-06 20:30:15 -05:00
Mark Rousskov	3187480070	Avoid using Option where values are always Some	2021-12-06 15:05:22 -05:00
Mark Rousskov	2b63059772	Create newtype around the pre order index	2021-12-06 15:05:22 -05:00
Mark Rousskov	cc63ec32fb	Use variables rather than lengths directly	2021-12-06 15:05:22 -05:00
Mark Rousskov	345ada0e1b	Optimize: reuse the real-to-preorder mapping as the visited set	2021-12-06 15:05:22 -05:00
Mark Rousskov	8991002644	Remove separate RPO traversal This integrates the preorder and postorder traversals into one.	2021-12-06 15:05:22 -05:00
Mark Rousskov	7d12767dc5	Use preorder indices for data structures This largely avoids remapping from and to the 'real' indices, with the exception of predecessor lookup and the final merge back, and is conceptually better.	2021-12-06 15:05:22 -05:00
Mark Rousskov	92186cb5c9	Avoid inserting into buckets if not necessary	2021-12-06 15:05:22 -05:00
Mark Rousskov	7379d24ebc	Optimization: process buckets only once	2021-12-06 15:05:22 -05:00
Mark Rousskov	c82fe0efb4	Optimization: Merge parent and ancestor arrays As the paper indicates, the unprocessed vertices in the DFS tree and processed vertices are disjoint, and we can use them in the same space, tracking only the index of the split.	2021-12-06 15:05:22 -05:00
Mark Rousskov	e8d7248093	Implement the simple Lengauer-Tarjan algorithm This replaces the previous implementation with the simple variant of Lengauer-Tarjan, which performs better in the general case. Performance on the keccak benchmark is about equivalent between the two, but we don't see regressions (and indeed see improvements) on other benchmarks, even on a partially optimized implementation. The implementation here follows that of the pseudocode in "Linear-Time Algorithms for Dominators and Related Problems" thesis by Loukas Georgiadis. The next few commits will optimize the implementation as suggested in the thesis. Several related works are cited in the comments within the implementation, as well. Implement the simple Lengauer-Tarjan algorithm This replaces the previous implementation (from #34169), which has not been optimized since, with the simple variant of Lengauer-Tarjan which performs better in the general case. A previous attempt -- not kept in commit history -- attempted a replacement with a bitset-based implementation, but this led to regressions on perf.rust-lang.org benchmarks and equivalent wins for the keccak benchmark, so was rejected. The implementation here follows that of the pseudocode in "Linear-Time Algorithms for Dominators and Related Problems" thesis by Loukas Georgiadis. The next few commits will optimize the implementation as suggested in the thesis. Several related works are cited in the comments within the implementation, as well. On the keccak benchmark, we were previously spending 15% of our cycles computing the NCA / intersect function; this function is quite expensive, especially on modern CPUs, as it chases pointers on every iteration in a tight loop. With this commit, we spend ~0.05% of our time in dominator computation.	2021-12-06 15:03:09 -05:00
Scott McMurray	308fd59f42	Stop enabling `in_band_lifetimes` in rustc_data_structures There's a conversation in the tracking issue about possibly unaccepting `in_band_lifetimes`, but it's used heavily in the compiler, and thus there'd need to be a bunch of PRs like this if that were to happen. So here's one to see how much of an impact it has. (Oh, and I removed `nll` while I was here too, since it didn't seem needed. Let me know if I should put that back.)	2021-12-05 20:17:35 -08:00
Matthias Krüger	31003a3089	Rollup merge of #88906 - Kixunil:box-maybe-uninit-write, r=dtolnay Implement write() method for Box<MaybeUninit<T>> This adds method similar to `MaybeUninit::write` main difference being it returns owned `Box`. This can be used to elide copy from stack safely, however it's not currently tested that the optimization actually occurs. Analogous methods are not provided for `Rc` and `Arc` as those need to handle the possibility of sharing. Some version of them may be added in the future. This was discussed in #63291 which this change extends.	2021-12-03 06:24:11 +01:00
Martin Habovstiak	41e21aa1c2	Implement write() method for Box<MaybeUninit<T>> This adds method similar to `MaybeUninit::write` main difference being it returns owned `Box`. This can be used to elide copy from stack safely, however it's not currently tested that the optimization actually occurs. Analogous methods are not provided for `Rc` and `Arc` as those need to handle the possibility of sharing. Some version of them may be added in the future. This was discussed in #63291 which this change extends.	2021-12-02 17:18:34 +01:00
Alan Egerton	db7295fa96	Remove no-longer used `IdFunctor::map_id`	2021-12-02 15:45:40 +00:00
Alan Egerton	afa6f92c46	Use intrinsic pointer methods	2021-11-27 16:59:18 +00:00
Alan Egerton	9f714ef035	Delegate from `map_id` to `try_map_id`	2021-11-27 16:55:21 +00:00
Alan Egerton	04f1c09f90	Avoid UB when short-circuiting try_map_id for Vec	2021-11-27 15:06:06 +00:00
LeSeulArtichaut	6e3fa20b00	Make `TypeFoldable` implementors short-circuit on error Co-authored-by: Alan Egerton <eggyal@gmail.com>	2021-11-26 07:17:59 +00:00
Yuki Okushi	8d4fbc9a73	Add `#[inline]`s to `SortedIndexMultiMap`	2021-11-11 08:35:59 +09:00
Matthias Krüger	5c454551da	more clippy fixes	2021-11-07 16:59:05 +01:00
bors	88a5a984fe	Auto merge of #90380 - Mark-Simulacrum:revert-89558-query-stable-lint, r=lcnr Revert "Add rustc lint, warning when iterating over hashmaps" Fixes perf regressions introduced in https://github.com/rust-lang/rust/pull/90235 by temporarily reverting the relevant PR.	2021-10-29 04:55:51 +00:00

1 2 3 4 5 ...

376 Commits