Commit Graph

452 Commits

Author SHA1 Message Date
bors
f00c139998 Auto merge of #110050 - saethlin:better-u32-encoding, r=nnethercote
Use a specialized varint + bitpacking scheme for DepGraph encoding

The previous scheme here uses leb128 to encode the edge tables that represent the incr comp dependency graph. The problem with that scheme is that leb128 has overhead for larger values, and generally relies on the distribution of encoded values being heavily skewed towards smaller values. That is definitely not the case for a dep node index: since they are handed out sequentially and the whole range is covered, the distribution is actually biased in the opposite direction, and most dep node indices are large.

This PR implements a different varint encoding scheme. Instead of applying varint encoding to individual dep node indices (which is extremely branchy) we now apply it per node.

While being built, each node now stores its edges in a `SmallVec` with a bit of extra logic to track the max value of each edge. Then we varint encode the whole batch. This is a gamble: claiming only 2 bits per node instead of ~3 bits per edge is a nice saving, but it has to balance out against the space overhead of a node with many edges whose single large index forces unnecessary bytes into each of that node's edge indices.

Then, to keep the runtime overhead of this encoding scheme down, we deserialize our indices by loading 4 bytes for each and masking off the bytes that aren't ours. This is much less code and far fewer branches than leb128, but relies on having some readable bytes past the end of each edge list; we explicitly add such padding to the in-memory data during decoding. We also do this decoding lazily, turning a dense on-disk encoding into a peak memory reduction.

Then we apply a bit-packing scheme: since https://github.com/rust-lang/rust/pull/115391 left us with unused bits on `DepKind`, we use those bits (currently there are 7!) to store the 2 bits we need for the byte width of the edges in each node, and the remaining bits to store the length of the edge list, if it fits.
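
A rough sketch of the encoding idea (hypothetical helper names, not the code in this PR); the width byte below stands in for the 2 bits that are really packed into `DepKind`:

```rust
// Sketch only: every edge index of a node is written with the same byte
// width, chosen from the node's largest index. Here the width is stored in
// a plain byte; in the real scheme it lives in 2 unused `DepKind` bits.
fn encode_node_edges(edges: &[u32], out: &mut Vec<u8>) {
    let max = edges.iter().copied().max().unwrap_or(0);
    let bytes_per_index = ((32 - max.leading_zeros() as usize).max(1) + 7) / 8;
    out.push((bytes_per_index - 1) as u8); // 2 bits would suffice for 1..=4
    for &edge in edges {
        out.extend_from_slice(&edge.to_le_bytes()[..bytes_per_index]);
    }
}

// Decoding loads 4 bytes unconditionally and masks off the bytes that
// aren't ours, which relies on readable padding past the last edge list.
fn decode_index(data: &[u8], offset: usize, bytes_per_index: usize) -> u32 {
    let raw = u32::from_le_bytes(data[offset..offset + 4].try_into().unwrap());
    if bytes_per_index == 4 { raw } else { raw & ((1 << (bytes_per_index * 8)) - 1) }
}

fn main() {
    let mut out = Vec::new();
    encode_node_edges(&[3, 70_000, 12], &mut out);
    out.extend_from_slice(&[0; 3]); // padding so the 4-byte loads stay in bounds
    let width = out[0] as usize + 1;
    assert_eq!(decode_index(&out, 1 + width, width), 70_000);
}
```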

r? `@nnethercote`
2023-09-07 02:09:41 +00:00
Ben Kimock
469dc8f0fa Add comments with the same level of detail as the PR description 2023-09-06 21:15:03 -04:00
Ben Kimock
94fe18f84b Use a specialized varint + bitpacking scheme for DepGraph encoding 2023-09-04 12:16:50 -04:00
Camille GILLOT
258ace613d Use relative positions inside a SourceFile. 2023-09-03 12:56:10 +00:00
John Kåre Alsaker
90f5f94699 Use OnceLock for SingleCache 2023-09-01 03:11:51 +02:00
John Kåre Alsaker
d35179f665 Don't use wait_for_query without the Rayon thread pool 2023-08-30 06:10:02 +02:00
bors
6d32b298ed Auto merge of #114894 - Zoxc:sharded-cfg-cleanup2, r=cjgillot
Remove conditional use of `Sharded` from query state

`Sharded` is already a zero-cost abstraction, so it shouldn't affect the performance of the single-threaded compiler if LLVM does its job.

r? `@cjgillot`
2023-08-29 12:04:37 +00:00
Michael Goulet
e7b3c94b0e Pass ErrorGuaranteed to cycle error 2023-08-27 22:03:00 +00:00
John Kåre Alsaker
3040d92dc4 Fix waiting on a query that panicked 2023-08-25 03:34:36 +02:00
John Kåre Alsaker
f458b112f8 Optimize lock_shards 2023-08-24 23:29:48 +02:00
John Kåre Alsaker
b74cb78d63 Remove conditional use of Sharded from query state 2023-08-24 23:29:47 +02:00
bors
840ed5d133 Auto merge of #114860 - Zoxc:sharded-layout, r=SparrowLii
Make `Sharded` an enum and specialize it for the single thread case

This changes `Sharded` into an enum with a dedicated single-shard variant, reducing the size of `Sharded` for greater cache efficiency.

Performance improvement with 1 thread and `cfg(parallel_compiler)`:
| Benchmark | Before | After | % |
|---|---:|---:|---:|
| clap:check | 1.7009s | 1.6748s | 💚 -1.53% |
| hyper:check | 0.2525s | 0.2451s | 💚 -2.90% |
| regex:check | 0.9519s | 0.9353s | 💚 -1.74% |
| syn:check | 1.5504s | 1.5280s | 💚 -1.45% |
| syntex_syntax:check | 5.9536s | 5.8873s | 💚 -1.11% |
| Total | 10.4092s | 10.2706s | 💚 -1.33% |
| Summary | 1.0000s | 0.9825s | 💚 -1.75% |

I did see an unexpected 0.23% change for the serial compiler, so this could use a perf run to see if that reproduces.
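
Roughly the shape of the change, as a simplified sketch with stand-in field types rather than the real `Sharded`:

```rust
// Sketch only: a single-shard variant avoids the shard array entirely when
// the compiler runs on one thread, shrinking the type and the lookups.
use std::collections::HashMap;
use std::sync::Mutex;

const SHARDS: usize = 32;

enum Sharded<K, V> {
    Single(Mutex<HashMap<K, V>>),
    Shards(Box<[Mutex<HashMap<K, V>>; SHARDS]>),
}

impl<K, V> Sharded<K, V> {
    fn get_shard_by_hash(&self, hash: u64) -> &Mutex<HashMap<K, V>> {
        match self {
            Sharded::Single(shard) => shard,
            Sharded::Shards(shards) => &shards[hash as usize % SHARDS],
        }
    }
}
```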

cc `@SparrowLii`
2023-08-24 02:24:25 +00:00
John Kåre Alsaker
a4e55f140b Remove conditional use of Sharded from query caches 2023-08-16 14:16:05 +02:00
John Kåre Alsaker
81220c0ace Keep SHARDS fixed instead of a function of cfg!(parallel_compiler) 2023-08-16 10:00:25 +02:00
bjorn3
980143b50c Pass WorkProductMap to build_dep_graph instead of FxIndexMap
Constructing an FxIndexMap is useless work as the iteration order never
matters.
2023-08-13 16:07:48 +00:00
Matthias Krüger
3cd0a109a8
Rollup merge of #114566 - fmease:type-alias-laziness-is-crate-specific, r=oli-obk
Store the laziness of type aliases in their `DefKind`

Previously, we would treat paths referring to type aliases as *lazy* type aliases if the current crate had lazy type aliases enabled, regardless of whether the crate in which the alias was defined had the feature enabled.

With this PR, the laziness of a type alias depends on the crate it is defined in. This generally makes more sense to me especially if / once lazy type aliases become the default in a new edition and we need to think about *edition interoperability*:

Consider the hypothetical case where the dependency crate uses an older edition (and thus eager type aliases) and exports a type alias with bounds & a where-clause (which are void but technically valid), while the dependent crate uses the latest edition (and thus lazy type aliases) and uses that type alias. Arguably, the bounds should *not* be checked, since the dependency crate should be allowed to change the bounds at will at any time, with a *non*-major version bump and without negatively affecting downstream crates.
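
A hedged illustration of those void-but-valid bounds (made-up names, not from any real crate):

```rust
// Under eager type aliases the `T: Clone` bound parses but is never
// enforced at use sites; under lazy type aliases it would be checked.
type Pair<T: Clone> = (T, T);

struct NotClone;

fn main() {
    // Accepted today (with a warning that the bound is not enforced).
    let _pair: Pair<NotClone> = (NotClone, NotClone);
}
```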

As for the reverse case (dependency: lazy type aliases, dependent: eager type aliases), I guess it rules out everything from slight confusion to mild annoyance on the part of upstream crate authors, which would otherwise be caused by the compiler ignoring the bounds of their type aliases in downstream crates with older editions.

---

This fixes #114468: previously, my assumption that the type alias associated with a given weak projection was lazy (and therefore had its variances computed) did not necessarily hold in cross-crate scenarios (which [I kinda had a hunch about](https://github.com/rust-lang/rust/pull/114253#discussion_r1278608099)), as outlined above. Now it does hold.

`@rustbot` label F-lazy_type_alias
r? `@oli-obk`
2023-08-08 03:30:56 +02:00
León Orell Valerian Liehr
5468336d6b
Store the laziness of type aliases in the DefKind 2023-08-07 15:54:31 +02:00
klensy
e3700953c1 replace a few explicit uses of parking_lot with rustc_data_structures::sync ones 2023-08-04 18:41:56 +03:00
Matthias Krüger
23815467a2 inline format!() args up to and including rustc_middle 2023-07-30 13:18:33 +02:00
David Tolnay
5bbf0a8306
Revert "Auto merge of #113166 - moulins:ref-niches-initial, r=oli-obk"
This reverts commit 557359f925, reversing
changes made to 1e6c09a803.
2023-07-21 22:35:57 -07:00
Moulins
cb8b1d1bc9 add naive_layout_of query 2023-07-21 03:31:45 +02:00
bors
0646a5d1aa Auto merge of #113622 - RickleAndMortimer:issue-113184-fix, r=oli-obk
add links to query documentation for E0391

This PR adds links to https://rustc-dev-guide.rust-lang.org/overview.html#queries and https://rustc-dev-guide.rust-lang.org/query.html for `rustc --explain E0391` and within the compiler error itself.

Fixes: #113184
2023-07-20 03:18:41 +00:00
Esteban Küber
8eb5843a59 On nightly, dump ICE backtraces to disk
Implement rust-lang/compiler-team#578.

When an ICE is encountered on nightly releases, the new rustc panic
handler will also write the contents of the backtrace to disk. If any
`delay_span_bug`s are encountered, their backtrace is also added to the
file. The platform and rustc version will also be collected.
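
A minimal sketch of the mechanism, assuming a fixed dump path and standard-library hooks rather than the real handler's logic:

```rust
// Sketch only: a custom panic hook that appends the panic message and a
// captured backtrace to a file on disk, after running the default hook.
use std::backtrace::Backtrace;
use std::fs::OpenOptions;
use std::io::Write;

fn install_ice_hook(path: &'static str) {
    let default = std::panic::take_hook();
    std::panic::set_hook(Box::new(move |info| {
        default(info);
        let backtrace = Backtrace::force_capture();
        if let Ok(mut file) = OpenOptions::new().create(true).append(true).open(path) {
            let _ = writeln!(file, "{info}\n{backtrace}");
        }
    }));
}
```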
2023-07-19 14:10:07 +00:00
nxya
a54a66830d made the note an unspanned note and moved it to the bottom of the message 2023-07-18 21:53:34 -04:00
nxya
bef91ee687 added links as a note 2023-07-18 09:20:25 -04:00
Oli Scherer
97d831d008 Show which type was not specialized on query cycle misuse 2023-07-05 07:30:28 +00:00
Oli Scherer
d6b82ff761 Remove a redundant argument 2023-07-05 07:30:28 +00:00
Matthias Krüger
4f2e1df29d
Rollup merge of #112333 - Zoxc:try_collect_active_jobs-deadlock, r=cjgillot
Don't hold the active queries lock while calling `make_query`

This moves the call to `make_query` outside the parts that hold the active queries lock in `try_collect_active_jobs`. This should help remove the deadlock and borrow panic that have been observed when printing the query stack during an ICE.
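
Schematically (hypothetical types, not the actual query code), the fix is to stop calling back into query machinery while the guard is alive:

```rust
// Sketch only: copy the keys out under the lock, then call make_query after
// the guard is dropped so the call cannot deadlock on the same lock.
use std::sync::Mutex;

fn collect_active_jobs(active: &Mutex<Vec<u32>>) -> Vec<String> {
    let keys: Vec<u32> = active.lock().unwrap().clone();
    // The lock guard is dropped here, before the potentially re-entrant calls.
    keys.into_iter().map(make_query).collect()
}

fn make_query(key: u32) -> String {
    format!("query({key})")
}
```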

cc `@SparrowLii`
r? `@cjgillot`
2023-06-08 12:36:18 +02:00
Andrew Xie
54d7b327e5 Removed stable/unstable sort arg from into_sorted_stable_ord, fixed a few misc issues, added collect to UnordItems 2023-06-08 00:38:50 -04:00
John Kåre Alsaker
fd3d2d49f2 Don't hold the active queries lock while calling make_query 2023-06-06 04:51:34 +02:00
Andrew Xie
96b577860d Fixed failing test + minor cleanup 2023-06-04 21:55:32 -04:00
Andrew Xie
1be19f710c Switched some uses to UnordMap 2023-06-04 21:55:30 -04:00
Andrew Xie
17412bae30 Removed use of iteration through a HashMap/HashSet in rustc_incremental and replaced with IndexMap/IndexSet 2023-06-04 21:54:28 -04:00
Matthias Krüger
a9743e108a
Rollup merge of #111875 - WaffleLapkin:defer_on_drop, r=Nilstrieb
Don't leak the function that is called on drop

It probably wasn't causing problems anyway, but still, a `// this leaks, please don't pass anything that owns memory` is not sustainable.

I could implement a version which does not require `Option`, but it would require `unsafe`, at which point it's probably not worth it.
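
Roughly the shape of the non-leaking version (made-up names, not the actual helper):

```rust
// Sketch only: storing the closure in an Option lets Drop take and call it
// by value, so nothing is leaked and captured owned data is freed normally.
struct OnDrop<F: FnOnce()>(Option<F>);

impl<F: FnOnce()> Drop for OnDrop<F> {
    fn drop(&mut self) {
        if let Some(f) = self.0.take() {
            f();
        }
    }
}

fn main() {
    let message = String::from("cleanup ran"); // owned data, safe to capture
    let _guard = OnDrop(Some(move || println!("{message}")));
    println!("doing work");
}
```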
2023-05-25 08:01:08 +02:00
Maybe Waffle
fb0f74a8c9 Use Option::is_some_and and Result::is_ok_and in the compiler 2023-05-24 14:20:41 +00:00
Maybe Waffle
e2b953063d Don't leak the function that is called on drop 2023-05-23 14:53:36 +00:00
Camille GILLOT
3bb5d1dfc1 Delay a bug when overwriting fed value. 2023-05-17 20:42:03 +00:00
John Kåre Alsaker
882a9684f9 Specialize query execution for incremental and non-incremental 2023-05-14 21:53:05 +02:00
bors
8e8116cfe5 Auto merge of #108638 - Zoxc:erase-query-values-map, r=cjgillot
Use dynamic dispatch for queries

This replaces most concrete query values `V` with `MaybeUninit<[u8; { size_of::<V>() }]>`, reducing the code instantiated by queries. The compile time of `rustc_query_impl` is reduced by 27%. It is an alternative to https://github.com/rust-lang/rust/pull/107937, which uses unstable const generics, while this uses an `EraseType` trait which maps query values to their erased variants.

This is achieved by introducing an `Erased` type which does a sanity check under `cfg(debug_assertions)`. The query caches get instantiated with these erased types, leaving the code in `rustc_query_system` unaware of them. `rustc_query_system` is changed to use instances of `QueryConfig` so that `rustc_query_impl` can pass in `DynamicConfig`, which holds a pointer to a virtual table.
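
The core trick, sketched with made-up definitions (the real trait covers every query value type):

```rust
// Sketch only: each value type maps to a same-sized byte array, so generic
// cache code is instantiated once for the erased type rather than per V.
use std::mem::{size_of, transmute_copy, MaybeUninit};

trait EraseType: Sized {
    type Erased: Copy;
}

impl EraseType for u32 {
    type Erased = MaybeUninit<[u8; 4]>;
}

fn erase<V: EraseType>(value: V) -> V::Erased {
    // The assertion stands in for the PR's cfg(debug_assertions) checks.
    assert_eq!(size_of::<V>(), size_of::<V::Erased>());
    let erased = unsafe { transmute_copy(&value) };
    std::mem::forget(value); // the erased bytes now own the value
    erased
}

unsafe fn restore<V: EraseType>(erased: V::Erased) -> V {
    unsafe { transmute_copy(&erased) }
}

fn main() {
    let erased = erase(42u32);
    assert_eq!(unsafe { restore::<u32>(erased) }, 42);
}
```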

| Benchmark | Before | After | % |
|---|---:|---:|---:|
| clap:check | 1.7055s | 1.6949s | -0.62% |
| hyper:check | 0.2547s | 0.2528s | -0.73% |
| regex:check | 0.9590s | 0.9553s | -0.39% |
| syn:check | 1.5457s | 1.5440s | -0.11% |
| syntex_syntax:check | 5.9092s | 5.9009s | -0.14% |
| Total | 10.3741s | 10.3479s | -0.25% |
| Summary | 1.0000s | 0.9960s | -0.40% |

| Benchmark | Before | After | % |
|---|---:|---:|---:|
| clap:check:initial | 2.0605s | 2.0575s | -0.15% |
| hyper:check:initial | 0.3218s | 0.3216s | -0.07% |
| regex:check:initial | 1.1848s | 1.1839s | -0.07% |
| syn:check:initial | 1.9409s | 1.9376s | -0.17% |
| syntex_syntax:check:initial | 7.3105s | 7.2928s | -0.24% |
| Total | 12.8185s | 12.7935s | -0.20% |
| Summary | 1.0000s | 0.9986s | -0.14% |

| Benchmark | Before | After | % |
|---|---:|---:|---:|
| clap:check:unchanged | 0.4606s | 0.4617s | 0.24% |
| hyper:check:unchanged | 0.1335s | 0.1336s | 0.08% |
| regex:check:unchanged | 0.3324s | 0.3346s | 0.65% |
| syn:check:unchanged | 0.6268s | 0.6307s | 0.64% |
| syntex_syntax:check:unchanged | 1.8248s | 1.8508s | 💔 1.43% |
| Total | 3.3779s | 3.4113s | 0.99% |
| Summary | 1.0000s | 1.0061s | 0.61% |

It's based on https://github.com/rust-lang/rust/pull/108167.

r? `@cjgillot`
2023-05-14 13:47:01 +00:00
Nicholas Nethercote
6b62f37402 Restrict From<S> for {D,Subd}iagnosticMessage.
Currently a `{D,Subd}iagnosticMessage` can be created from any type that
impls `Into<String>`. That includes `&str`, `String`, and `Cow<'static,
str>`, which are reasonable. It also includes `&String`, which is pretty
weird, and results in many places making unnecessary allocations for
patterns like this:
```
self.fatal(&format!(...))
```
This creates a string with `format!`, takes a reference, passes the
reference to `fatal`, which does an `into()`, which clones the
reference, doing a second allocation. Two allocations for a single
string, bleh.

This commit changes the `From` impls so that you can only create a
`{D,Subd}iagnosticMessage` from `&str`, `String`, or `Cow<'static,
str>`. This requires changing all the places that currently create one
from a `&String`. Most of these are of the `&format!(...)` form
described above; each one removes an unnecessary static `&`, plus an
allocation when executed. There are also a few places where the existing
use of `&String` was more reasonable; these now just use `clone()` at
the call site.

As well as making the code nicer and more efficient, this is a step
towards possibly using `Cow<'static, str>` in
`{D,Subd}iagnosticMessage::{Str,Eager}`. That would require changing
the `From<&'a str>` impls to `From<&'static str>`, which is doable, but
I'm not yet sure if it's worthwhile.
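
The effect on callers, sketched with stand-in types rather than the real diagnostic ones:

```rust
// Sketch only: with no From<&String> impl, `emit(&format!(...))` stops
// compiling, so callers pass the String by value and allocate just once.
struct Message(String);

impl From<String> for Message {
    fn from(s: String) -> Self { Message(s) }
}

impl From<&str> for Message {
    fn from(s: &str) -> Self { Message(s.to_string()) }
}

fn emit(m: impl Into<Message>) {
    let _ = m.into();
}

fn main() {
    emit(format!("error: {}", 1)); // one allocation
    // emit(&format!("error: {}", 1)); // no longer compiles
}
```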
2023-05-03 08:44:39 +10:00
John Kåre Alsaker
2fe28ae0a4 Use dynamic dispatch for queries 2023-04-30 09:48:47 +02:00
bors
f5adff6bd8 Auto merge of #109611 - Zoxc:query-engine-rem, r=cjgillot
Remove `QueryEngine` trait

This removes the `QueryEngine` trait and `Queries` from `rustc_query_impl` and replaces them with function pointers and fields in `QuerySystem`. As a side effect, `OnDiskCache` is moved back into `rustc_middle` and the `OnDiskCache` trait is also removed.
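
Schematically (hypothetical queries, not the real `QuerySystem` fields), the trait object becomes a struct of plain function pointers:

```rust
// Sketch only: a struct of function pointers replaces a boxed trait object,
// so calls go through one pointer instead of a separate engine type's vtable.
struct QuerySystem {
    type_of: fn(u32) -> String,
    layout_of: fn(u32) -> usize,
}

fn type_of_impl(key: u32) -> String {
    format!("type of {key}")
}

fn layout_of_impl(key: u32) -> usize {
    key as usize * 8
}

fn main() {
    let system = QuerySystem { type_of: type_of_impl, layout_of: layout_of_impl };
    println!("{}", (system.type_of)(1));
    println!("{}", (system.layout_of)(2));
}
```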

This has a couple of benefits.
- `TyCtxt` is used in the query system instead of the removed `QueryCtxt`, which was larger.
- Function pointers are more flexible to work with. A variant of https://github.com/rust-lang/rust/pull/107802 is included which avoids the double indirection. For https://github.com/rust-lang/rust/pull/108938 we can name entry point `__rust_end_short_backtrace` to avoid some overhead. For https://github.com/rust-lang/rust/pull/108062 it avoids the duplicate `QueryEngine` structs.
- `QueryContext` now implements `DepContext` which avoids many `dep_context()` calls in `rustc_query_system`.
- The `rustc_driver` size is reduced by 0.33%, hopefully that means some bootstrap improvements.
- This avoids the unsafe code around the `QueryEngine` trait.

r? `@cjgillot`
2023-04-29 21:58:13 +00:00
Matthias Krüger
50ccd17db6
Rollup merge of #110886 - nnethercote:dep-graph-cleanups, r=cjgillot
`DepGraph` cleanups

r? `@cjgillot`
2023-04-27 21:34:17 +02:00
Nicholas Nethercote
793b2ffb67 Factor out common code in intern_node.
There are three very similar blocks in this function.
2023-04-27 17:50:02 +10:00
Nicholas Nethercote
207cec017f Clean up with_task.
Currently it creates an `Option` and then does `map`/`unwrap_or` and
`map_or_else` on it, which is hard to read.

This commit simplifies things by moving more code into the two arms of
the if/else.
2023-04-27 17:48:15 +10:00
John Kåre Alsaker
4440e8196a Add query accessor functions 2023-04-26 07:46:14 +02:00
bors
adaac6b166 Auto merge of #110634 - saethlin:pointy-decoder, r=cjgillot
Rewrite MemDecoder around pointers not a slice

This is basically https://github.com/rust-lang/rust/pull/109910 but I'm being a lot more aggressive. The pointer-based structure means that it makes a lot more sense to absorb more complexity into `MemDecoder`, most of the diff is just complexity moving from one place to another.

The primary argument for this structure is that we only incur a single bounds check when doing multi-byte reads from a `MemDecoder`. With the slice-based implementation we need to do those with `data[position..position + len]`, which needs to account for `position + len` wrapping. It would be possible to dodge the first bounds check if we stored a slice that starts at `position`, but that would require updating the pointer and length on every read.
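
A simplified sketch of the pointer-based shape (not the real `MemDecoder`):

```rust
// Sketch only: tracking a cursor and an end pointer means one remaining-length
// check per multi-byte read, with no `position + len` overflow to handle.
use std::marker::PhantomData;

struct Decoder<'a> {
    current: *const u8,
    end: *const u8,
    _marker: PhantomData<&'a [u8]>,
}

impl<'a> Decoder<'a> {
    fn new(data: &'a [u8]) -> Decoder<'a> {
        Decoder {
            current: data.as_ptr(),
            end: unsafe { data.as_ptr().add(data.len()) },
            _marker: PhantomData,
        }
    }

    fn read_bytes(&mut self, len: usize) -> Vec<u8> {
        // A single check against the remaining length replaces the slice
        // indexing and its wrap-around handling.
        let remaining = self.end as usize - self.current as usize;
        assert!(len <= remaining, "read past end of buffer");
        let mut out = vec![0u8; len];
        unsafe {
            std::ptr::copy_nonoverlapping(self.current, out.as_mut_ptr(), len);
            self.current = self.current.add(len);
        }
        out
    }
}

fn main() {
    let data = [1u8, 2, 3, 4, 5];
    let mut decoder = Decoder::new(&data);
    assert_eq!(decoder.read_bytes(2), vec![1, 2]);
    assert_eq!(decoder.read_bytes(3), vec![3, 4, 5]);
}
```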

This PR also embeds the failure path in a separate function, which means that this PR should subsume all the perf wins observed in https://github.com/rust-lang/rust/pull/109867.
2023-04-26 02:36:42 +00:00
Maybe Waffle
e496fbec92 Split {Idx, IndexVec, IndexSlice} into their own modules 2023-04-24 13:53:35 +00:00
Ben Kimock
1f67ba61a9 Rewrite MemDecoder around pointers not a slice 2023-04-23 17:25:11 -04:00
Camille GILLOT
76d573b656 Add info for no_hash panic. 2023-04-20 18:56:12 +00:00