mikros/rust - rust - Gitea.pterpstra.com

Author	SHA1	Message	Date
bors	8cfa7b4ec9	Auto merge of #78588 - HeroicKatora:sccc, r=nikomatsakis Reworks Sccc computation to iteration instead of recursion Linear graphs, producing as many scc's as nodes, would recurse once for every node when entered from the start of the list. This adds a test that exhausted the stack at least on my machine with error: ``` thread 'graph::scc::tests::test_deep_linear' has overflowed its stack fatal runtime error: stack overflow ``` This may or may not be connected to #78567. I was only reminded that I started this rework some time ago. It might be plausible as borrow checking a long function with many borrow regions around each other—((((((…))))))— may produce the linear list setup to trigger this stack overflow ? I don't know enough about borrow check to say for sure. This is best read in two separate commits. The first addresses only `find_state` internally. This is classical union phase from union-find. There's also a common solution of using the parent pointers in the (virtual) linked list to track the backreferences while traversing upwards and then following them backwards in a second path compression phase. The second is more involved as it rewrites the mutually recursive `walk_node` and `walk_unvisited_node`. Firstly, the caller is required to handle the unvisited case of `walk_node` so a new `start_walk_from` method is added to handle that by walking the unvisited node if necessary. Then `walk_unvisited_node`, where we would previously recurse into in the missing case, is rewritten to construct a manual stack of its frames. The state fields consist of the previous stack slots.	2020-11-21 01:30:26 +00:00
Tyson Nottingham	142932ab19	Set unaligned_references lint to deny in rustc_data_structures To detect misuse of private packed field in `PackedFingerprint`.	2020-11-20 01:13:15 -08:00
Tyson Nottingham	05dde137ca	Make PackedFingerprint's Fingerprint private	2020-11-18 15:10:43 -08:00
Tyson Nottingham	f09d474836	Use PackedFingerprint in DepNode to reduce memory consumption	2020-11-18 12:49:09 -08:00
Mara Bos	fa45fce0d3	Rollup merge of #78702 - wesleywiser:self_profile_cgu_sizes, r=Mark-Simulacrum [self-profiling] Include the estimated size of each cgu in the profile This is helpful when looking for CGUs where the size estimate isn't a good indicator of compilation time. I verified that moving the profiling timer call doesn't affect the results. Results: <img width="297" alt="Screen Shot 2020-11-03 at 7 25 04 AM" src="https://user-images.githubusercontent.com/831192/97985503-5901d100-1da6-11eb-9f10-f3e399702952.png"> `measureme` doesn't have support for custom arg names yet so `arg0` is the CGU name and `arg1` is the estimated size.	2020-11-17 16:13:49 +01:00
lcnr	a6cbd64dae	words	2020-11-16 22:42:09 +01:00
Bastian Kauschke	2bf93bd852	compiler: fold by value	2020-11-16 22:34:57 +01:00
Bastian Kauschke	3ec6720bf1	add IdFunctor to rustc_data_structures	2020-11-16 22:27:20 +01:00
Jonas Schievink	ae1916b3b4	Rollup merge of #79058 - dtolnay:likelymacro, r=Mark-Simulacrum Move likely/unlikely argument outside of invisible unsafe block The previous `likely!`/`unlikely!` macros were unsound because it permits the caller's expr to contain arbitrary unsafe code. ```rust pub fn huh() -> bool { likely!(std::ptr::read(&() as const () as const bool)) } ``` Before: compiles cleanly. After: ```console error[E0133]: call to unsafe function is unsafe and requires unsafe function or block \| 70 \| likely!(std::ptr::read(&() as const () as const bool)) \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ call to unsafe function \| = note: consult the function's documentation for information on how to avoid undefined behavior ```	2020-11-15 13:40:03 +01:00
David Tolnay	afb817054c	Move likely/unlikely argument outside of invisible unsafe block The previous `likely!`/`unlikely!` macros were unsound because it permits the caller's expr to contain arbitrary unsafe code. pub fn huh() -> bool { likely!(std::ptr::read(&() as const () as const bool)) } Before: compiles cleanly. After: error[E0133]: call to unsafe function is unsafe and requires unsafe function or block \| 70 \| likely!(std::ptr::read(&() as const () as const bool)) \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ call to unsafe function \| = note: consult the function's documentation for information on how to avoid undefined behavior	2020-11-14 14:03:57 -08:00
Camille GILLOT	41c44b498f	Move Steal to rustc_data_structures.	2020-11-14 01:30:56 +01:00
Andreas Molzer	eb597f5c4e	Remove recursion from sccc walking This allows constructing the sccc for large that visit many nodes before finding a single cycle of sccc, for example lists. When used to find dependencies in borrow checking the list case is what occurs in very long functions.	2020-11-08 18:07:45 +01:00
Andreas Molzer	355904dca0	Add test for sccc of a long list	2020-11-05 19:24:49 +01:00
Andreas Molzer	a41e2fd963	Convert the recursive find_state to a loop The basic conversion is a straightforward conversion of the linear recursion to a loop forwards and backwards propagation of the result. But this uses an optimization to avoid the need for extra space that would otherwise be necessary to store the stack of unfinished states as the function is not tail recursive. Observe that only non-root-nodes in cycles have a recursive call and that every such call overwrites their own node state. Thus we reuse the node state itself as temporary storage for the stack of unfinished states by inverting the links to a chain back to the previous state update. When we hit the root or end of the full explored chain we propagate the node state update backwards by following the chain until a node with a link to itself.	2020-11-05 19:24:49 +01:00
Wesley Wiser	efe703a01a	[self-profiling] Include the estimated size of each cgu in the profile This is helpful when looking for CGUs where the size estimate isn't a good indicator of compilation time. I verified that moving the profiling timer call doesn't affect the results.	2020-11-03 07:55:17 -05:00
Andreas Molzer	af72a70ee2	Move post order walk to iterative approach The previous recursive approach might overflow the stack when walking a particularly deep, list-like, graph. In particular, dominator calculation for borrow checking does such a traversal and very long functions might lead to a region dependency graph with in this problematic structure.	2020-10-31 18:52:00 +01:00
Andreas Molzer	4fdf8a5630	Add a benchmark test for sccc finding While a bit primitive, it should get us at least a better number than nothing.	2020-10-31 01:05:15 +01:00
Joshua Nelson	57c6ed0c07	Fix even more clippy warnings	2020-10-30 10:13:39 -04:00
Yuki Okushi	f8539221d0	Rollup merge of #78524 - tmiasko:source-files-borrow, r=Aaron1011 Avoid BorrowMutError with RUSTC_LOG=debug ```console $ touch empty.rs $ env RUSTC_LOG=debug rustc +stage1 --crate-type=lib empty.rs ``` Fails with a `BorrowMutError` because source map files are already borrowed while `features_query` attempts to format a log message containing a span. Release the borrow before the query to avoid the issue.	2020-10-30 18:00:54 +09:00
Dániel Buga	3fba948510	Fix typos	2020-10-29 16:51:46 +01:00
Tomasz Miąsko	79cc5099b1	Use RwLock instead of Lock for SourceMap::files	2020-10-29 18:09:53 +01:00
Dániel Buga	b01c74b73c	Fix typo in vec_graph	2020-10-27 18:37:43 +01:00
bors	5171cc76c2	Auto merge of #77476 - tgnottingham:buffered_siphasher128, r=nnethercote perf: buffer SipHasher128 This is an attempt to improve Siphasher128 performance by buffering input. Although it reduces instruction count, I'm not confident the effect on wall times, or lack-thereof, is worth the change. --- Additional notes not reflected in source comments: * Implementation choices were guided by a combination of results from rustc-perf and micro-benchmarks, mostly the former. * ~~I tried a couple of different struct layouts that might be more cache friendly with no obvious effect.~~ Update: a particular struct layout was chosen, but it's not critical to performance. See comments in source and discussion below. * I suspect that buffering would be important to a SIMD-accelerated algorithm, but from what I've read and my own tests, SipHash does not seem very amenable to SIMD acceleration, at least by SSE.	2020-10-25 09:23:45 +00:00
Wesley Wiser	5ac5556d63	Upgrade to measureme 9.0.0	2020-10-24 22:39:42 -04:00
Jonas Schievink	4d72939af1	Rollup merge of #77830 - cjgillot:remacro, r=oli-obk Simplify query proc-macros The query code generation is split between proc-macros and regular macros in `rustc_middle::ty::query`. This PR removes unused capabilities of the proc-macros, and tend to use regular macros for the logic.	2020-10-24 22:39:46 +02:00
Leonora Tindall	bc2317915f	Don't re-export std::ops::ControlFlow in the compiler.	2020-10-22 17:26:55 -07:00
Leonora Tindall	84daccc559	change the order of type arguments on ControlFlow This allows ControlFlow<BreakType> which is much more ergonomic for common iterator combinator use cases.	2020-10-22 17:26:48 -07:00
Camille GILLOT	0a4d948b4a	Remove unused ProfileCategory.	2020-10-22 22:35:32 +02:00
bors	f90e617305	Auto merge of #77908 - bugadani:obl-forest, r=nnethercote Try to make ObligationForest more efficient This PR tries to decrease the number of allocations in ObligationForest, as well as moves some cold path code to an uninlined function.	2020-10-19 15:14:15 +00:00
Chai T. Rex	c2de8fe294	Stabilize or_insert_with_key	2020-10-18 15:45:09 -04:00
Dániel Buga	8c7a8a62dd	Turn Outcome into an opaque type to remove some runtime checks	2020-10-15 08:32:41 +02:00
Dániel Buga	5f11e71721	Reuse memory for process_cycles	2020-10-15 00:41:09 +02:00
Dániel Buga	86e030391b	Make sure cold code is as small as possible	2020-10-15 00:41:08 +02:00
est31	215cd36e1c	Remove unused code from remaining compiler crates	2020-10-14 04:14:32 +02:00
est31	a0fc455d30	Replace absolute paths with relative ones Modern compilers allow reaching external crates like std or core via relative paths in modules outside of lib.rs and main.rs.	2020-10-13 14:16:45 +02:00
Tyson Nottingham	a602d155f0	SipHasher128: improve constant names and add more comments	2020-10-11 23:48:35 -07:00
bors	a1dfd2490a	Auto merge of #77080 - richkadel:llvm-coverage-counters-2, r=tmandry Working branch-level code coverage Add a generalized implementation for computing branch-level coverage spans. This iteration resolves some of the challenges I had identified a few weeks ago. I've tried to implement a solution that is general enough to work for a lot of different graphs/patterns. It's encouraging to see the results on fairly large and complex crates seem to meet my expectations. This may be a "functionally complete" implementation. Except for bug fixes or edge cases I haven't run into yet, the next and essentially final step, I think, is to replace some Counters with CounterExpressions (where their counter values can be computed by adding or subtracting other counters/expressions). Examples of branch-level coverage support enabled in this PR: * https://github.com/richkadel/rust/blob/llvm-coverage-counters-2/src/test/run-make-fulldeps/instrument-coverage-cov-reports-base/expected_show_coverage.coverage_of_drop_trait.txt * https://github.com/richkadel/rust/blob/llvm-coverage-counters-2/src/test/run-make-fulldeps/instrument-coverage-cov-reports-base/expected_show_coverage.coverage_of_if.txt * https://github.com/richkadel/rust/blob/llvm-coverage-counters-2/src/test/run-make-fulldeps/instrument-coverage-cov-reports-base/expected_show_coverage.coverage_of_if_else.txt * https://github.com/richkadel/rust/blob/llvm-coverage-counters-2/src/test/run-make-fulldeps/instrument-coverage-cov-reports-base/expected_show_coverage.coverage_of_simple_loop.txt * https://github.com/richkadel/rust/blob/llvm-coverage-counters-2/src/test/run-make-fulldeps/instrument-coverage-cov-reports-base/expected_show_coverage.coverage_of_simple_match.txt * ... _and others in the same directory_ Examples of coverage analysis results (MIR spanview files) used to inject counters in the right `BasicBlocks`: * https://htmlpreview.github.io/?https://github.com/richkadel/rust/blob/llvm-coverage-counters-2/src/test/run-make-fulldeps/instrument-coverage-mir-cov-html-base/expected_mir_dump.coverage_of_drop_trait/coverage_of_drop_trait.main.-------.InstrumentCoverage.0.html * https://htmlpreview.github.io/?https://github.com/richkadel/rust/blob/llvm-coverage-counters-2/src/test/run-make-fulldeps/instrument-coverage-mir-cov-html-base/expected_mir_dump.coverage_of_if/coverage_of_if.main.-------.InstrumentCoverage.0.html * https://htmlpreview.github.io/?https://github.com/richkadel/rust/blob/llvm-coverage-counters-2/src/test/run-make-fulldeps/instrument-coverage-mir-cov-html-base/expected_mir_dump.coverage_of_if_else/coverage_of_if_else.main.-------.InstrumentCoverage.0.html * https://htmlpreview.github.io/?https://github.com/richkadel/rust/blob/llvm-coverage-counters-2/src/test/run-make-fulldeps/instrument-coverage-mir-cov-html-base/expected_mir_dump.coverage_of_simple_loop/coverage_of_simple_loop.main.-------.InstrumentCoverage.0.html * https://htmlpreview.github.io/?https://github.com/richkadel/rust/blob/llvm-coverage-counters-2/src/test/run-make-fulldeps/instrument-coverage-mir-cov-html-base/expected_mir_dump.coverage_of_simple_match/coverage_of_simple_match.main.-------.InstrumentCoverage.0.html * ... _and others in the same directory_ Here is some sample coverage output after compiling a few real-world crates with the new branch-level coverage features: <img width="801" alt="Screen Shot 2020-09-25 at 1 03 11 PM" src="https://user-images.githubusercontent.com/3827298/94316848-fd882c00-ff39-11ea-9cff-0402d3abd1e7.png"> <img width="721" alt="Screen Shot 2020-09-25 at 1 00 36 PM" src="https://user-images.githubusercontent.com/3827298/94316886-11cc2900-ff3a-11ea-9d03-80b26c8a5173.png"> <img width="889" alt="Screen Shot 2020-09-25 at 12 54 57 PM" src="https://user-images.githubusercontent.com/3827298/94316900-18f33700-ff3a-11ea-8a80-58f67d84b8de.png"> r? `@tmandry` FYI: `@wesleywiser`	2020-10-05 19:34:44 +00:00
bors	ea7e131435	Auto merge of #77171 - VFLashM:better_sso_structures, r=oli-obk Better sso structures This change greatly expands interface of MiniSet/MiniMap and renames them because they are no longer "Mini".	2020-10-05 17:18:01 +00:00
Rich Kadel	f5aebad28f	Updates to experimental coverage counter injection This is a combination of 18 commits. Commit #2: Additional examples and some small improvements. Commit #3: fixed mir-opt non-mir extensions and spanview title elements Corrected a fairly recent assumption in runtest.rs that all MIR dump files end in .mir. (It was appending .mir to the graphviz .dot and spanview .html file names when generating blessed output files. That also left outdated files in the baseline alongside the files with the incorrect names, which I've now removed.) Updated spanview HTML title elements to match their content, replacing a hardcoded and incorrect name that was left in accidentally when originally submitted. Commit #4: added more test examples also improved Makefiles with support for non-zero exit status and to force validation of tests unless a specific test overrides it with a specific comment. Commit #5: Fixed rare issues after testing on real-world crate Commit #6: Addressed PR feedback, and removed temporary -Zexperimental-coverage -Zinstrument-coverage once again supports the latest capabilities of LLVM instrprof coverage instrumentation. Also fixed a bug in spanview. Commit #7: Fix closure handling, add tests for closures and inner items And cleaned up other tests for consistency, and to make it more clear where spans start/end by breaking up lines. Commit #8: renamed "typical" test results "expected" Now that the `llvm-cov show` tests are improved to normally expect matching actuals, and to allow individual tests to override that expectation. Commit #9: test coverage of inline generic struct function Commit #10: Addressed review feedback * Removed unnecessary Unreachable filter. * Replaced a match wildcard with remining variants. * Added more comments to help clarify the role of successors() in the CFG traversal Commit #11: refactoring based on feedback * refactored `fn coverage_spans()`. * changed the way I expand an empty coverage span to improve performance * fixed a typo that I had accidently left in, in visit.rs Commit #12: Optimized use of SourceMap and SourceFile Commit #13: Fixed a regression, and synched with upstream Some generated test file names changed due to some new change upstream. Commit #14: Stripping out crate disambiguators from demangled names These can vary depending on the test platform. Commit #15: Ignore llvm-cov show diff on test with generics, expand IO error message Tests with generics produce llvm-cov show results with demangled names that can include an unstable "crate disambiguator" (hex value). The value changes when run in the Rust CI Windows environment. I added a sed filter to strip them out (in a prior commit), but sed also appears to fail in the same environment. Until I can figure out a workaround, I'm just going to ignore this specific test result. I added a FIXME to follow up later, but it's not that critical. I also saw an error with Windows GNU, but the IO error did not specify a path for the directory or file that triggered the error. I updated the error messages to provide more info for next, time but also noticed some other tests with similar steps did not fail. Looks spurious. Commit #16: Modify rust-demangler to strip disambiguators by default Commit #17: Remove std::process::exit from coverage tests Due to Issue #77553, programs that call std::process::exit() do not generate coverage results on Windows MSVC. Commit #18: fix: test file paths exceeding Windows max path len	2020-10-05 08:02:58 -07:00
Tyson Nottingham	581cc4abf5	SipHasher128: use specific struct layout	2020-10-05 00:47:44 -07:00
Tyson Nottingham	b86161ad9c	SipHasher128: use more named constants, update comments	2020-10-05 00:47:26 -07:00
Tyson Nottingham	f6f96e2a87	perf: buffer SipHasher128	2020-10-03 10:03:30 -07:00
Valerii Lashmanov	d1d2184db4	SsoHashSet/Map - genericiy over Q removed Due to performance regression, see SsoHashMap comment.	2020-10-02 20:13:23 -05:00
Tyson Nottingham	d061fee177	Stable hashing: add comments and tests concerning platform-independence SipHasher128 implements short_write in an endian-independent way, yet its write_xxx Hasher trait methods undo this endian-independence by byte swapping the integer inputs on big-endian hardware. StableHasher then adds endian-independence back by also byte-swapping on big-endian hardware prior to invoking SipHasher128. This double swap may have the appearance of being a no-op, but is in fact by design. In particular, we really do want SipHasher128 to be platform-dependent, in order to be consistent with the libstd SipHasher. Try to clarify this intent. Also, add and update a couple of unit tests.	2020-09-30 00:57:35 -07:00
Valerii Lashmanov	92a0668c20	SsoHashMap minor refactoring, SSO_ARRAY_SIZE introduced	2020-09-27 23:48:19 -05:00
Valerii Lashmanov	41942fac7d	SsoHashSet reimplemented as a wrapper on top of SsoHashMap SsoHashSet::replace had to be removed because it requires missing API from SsoHashMap. It's not a widely used function, so I think it's ok to omit it for now. EitherIter moved into its own file. Also sprinkled code with #[inline] attributes where appropriate.	2020-09-26 18:42:26 -05:00
Valerii Lashmanov	0600b178aa	SsoHashSet/SsoHashMap API greatly expanded Now both provide almost complete API of their non-SSO counterparts.	2020-09-26 14:30:05 -05:00
Valerii Lashmanov	5c224a484d	MiniSet/MiniMap moved and renamed into SsoHashSet/SsoHashMap It is a more descriptive name and with upcoming changes there will be nothing "mini" about them.	2020-09-26 14:30:05 -05:00
est31	12187b7f86	Remove unused #[allow(...)] statements from compiler/	2020-09-26 01:25:55 +02:00
Jonas Schievink	6f3da3d53f	Rollup merge of #77121 - duckymirror:html-root-url, r=jyn514 Updated html_root_url for compiler crates Closes #77103 r? @jyn514	2020-09-25 02:29:45 +02:00
Erik Hofmayer	138a2e5eaa	/nightly/nightly-rustc	2020-09-23 21:51:56 +02:00
Erik Hofmayer	dd66ea2d3d	Updated html_root_url for compiler crates	2020-09-23 21:14:43 +02:00
Andreas Jonson	6586c37bec	Move MiniSet to data_structures remove the need for T to be copy from MiniSet as was done for MiniMap	2020-09-23 08:09:16 +02:00
bors	6d3acf5129	Auto merge of #76928 - lcnr:opaque-types-cache, r=tmandry cache types during normalization partially fixes #75992 reduces the following test from 14 to 3 seconds locally. cc `@Mark-Simulacrum` would it make sense to add that test to `perf`? ```rust #![recursion_limit="2048"] #![type_length_limit="112457564"] pub async fn h0(v: &String, x: &u64) { println!("{} {}", v, x) } pub async fn h1(v: &String, x: &u64) { h0(v, x).await } pub async fn h2(v: &String, x: &u64) { h1(v, x).await } pub async fn h3(v: &String, x: &u64) { h2(v, x).await } pub async fn h4(v: &String, x: &u64) { h3(v, x).await } pub async fn h5(v: &String, x: &u64) { h4(v, x).await } pub async fn h6(v: &String, x: &u64) { h5(v, x).await } pub async fn h7(v: &String, x: &u64) { h6(v, x).await } pub async fn h8(v: &String, x: &u64) { h7(v, x).await } pub async fn h9(v: &String, x: &u64) { h8(v, x).await } pub async fn h10(v: &String, x: &u64) { h9(v, x).await } pub async fn h11(v: &String, x: &u64) { h10(v, x).await } pub async fn h12(v: &String, x: &u64) { h11(v, x).await } pub async fn h13(v: &String, x: &u64) { h12(v, x).await } pub async fn h14(v: &String, x: &u64) { h13(v, x).await } pub async fn h15(v: &String, x: &u64) { h14(v, x).await } pub async fn h16(v: &String, x: &u64) { h15(v, x).await } pub async fn h17(v: &String, x: &u64) { h16(v, x).await } pub async fn h18(v: &String, x: &u64) { h17(v, x).await } pub async fn h19(v: &String, x: &u64) { h18(v, x).await } macro_rules! async_recursive { (29, $inner:expr) => { async { async_recursive!(28, $inner) }.await }; (28, $inner:expr) => { async { async_recursive!(27, $inner) }.await }; (27, $inner:expr) => { async { async_recursive!(26, $inner) }.await }; (26, $inner:expr) => { async { async_recursive!(25, $inner) }.await }; (25, $inner:expr) => { async { async_recursive!(24, $inner) }.await }; (24, $inner:expr) => { async { async_recursive!(23, $inner) }.await }; (23, $inner:expr) => { async { async_recursive!(22, $inner) }.await }; (22, $inner:expr) => { async { async_recursive!(21, $inner) }.await }; (21, $inner:expr) => { async { async_recursive!(20, $inner) }.await }; (20, $inner:expr) => { async { async_recursive!(19, $inner) }.await }; (19, $inner:expr) => { async { async_recursive!(18, $inner) }.await }; (18, $inner:expr) => { async { async_recursive!(17, $inner) }.await }; (17, $inner:expr) => { async { async_recursive!(16, $inner) }.await }; (16, $inner:expr) => { async { async_recursive!(15, $inner) }.await }; (15, $inner:expr) => { async { async_recursive!(14, $inner) }.await }; (14, $inner:expr) => { async { async_recursive!(13, $inner) }.await }; (13, $inner:expr) => { async { async_recursive!(12, $inner) }.await }; (12, $inner:expr) => { async { async_recursive!(11, $inner) }.await }; (11, $inner:expr) => { async { async_recursive!(10, $inner) }.await }; (10, $inner:expr) => { async { async_recursive!(9, $inner) }.await }; (9, $inner:expr) => { async { async_recursive!(8, $inner) }.await }; (8, $inner:expr) => { async { async_recursive!(7, $inner) }.await }; (7, $inner:expr) => { async { async_recursive!(6, $inner) }.await }; (6, $inner:expr) => { async { async_recursive!(5, $inner) }.await }; (5, $inner:expr) => { async { async_recursive!(4, $inner) }.await }; (4, $inner:expr) => { async { async_recursive!(3, $inner) }.await }; (3, $inner:expr) => { async { async_recursive!(2, $inner) }.await }; (2, $inner:expr) => { async { async_recursive!(1, $inner) }.await }; (1, $inner:expr) => { async { async_recursive!(0, $inner) }.await }; (0, $inner:expr) => { async { h19(&String::from("owo"), &0).await; $inner }.await }; } async fn f() { async_recursive!(14, println!("hello")); } fn main() { let _ = f(); } ``` r? `@eddyb` requires a perf run.	2020-09-22 22:52:07 +00:00
bors	b01326ab03	Auto merge of #76680 - Julian-Wollersberger:nongeneric_ensure_sufficient_stack, r=jyn514 Make `ensure_sufficient_stack()` non-generic, using cargo-llvm-lines Inspired by [this blog post](https://blog.mozilla.org/nnethercote/2020/08/05/how-to-speed-up-the-rust-compiler-some-more-in-2020/) from `@nnethercote,` I used [cargo-llvm-lines](https://github.com/dtolnay/cargo-llvm-lines/) on the rust compiler itself, to improve it's compile time. This PR contains only one low-hanging fruit, but I also want to share some measurements. The function `ensure_sufficient_stack()` was monomorphized 1500 times, and with it the `stacker` and `psm` crates, for a total of 1.5% of all llvm IR lines. With some trickery I convert the generic closure into a dynamic one, and thus all that code is only monomorphized once. # Measurements Getting these numbers took some fiddling with CLI flags and I [modified](https://github.com/Julian-Wollersberger/cargo-llvm-lines/blob/master/src/main.rs#L115) cargo-llvm-lines to read from a folder instead of invoking cargo. Commands I used: ``` ./x.py clean RUSTFLAGS="--emit=llvm-ir -C link-args=-fuse-ld=lld -Z self-profile=profile" CARGOFLAGS_BOOTSTRAP="-Ztimings" RUSTC_BOOTSTRAP=1 ./x.py build -i --stage 1 library/std # Then manually copy all .ll files into a folder I hardcoded in cargo-llvm-lines in main.rs#L115 cd ../cargo-llvm-lines cargo run llvm-lines ``` The result is this list (see [first 500 lines](https://github.com/Julian-Wollersberger/cargo-llvm-lines/blob/master/llvm-lines-rustc-before.txt) ), before the change: ``` Lines Copies Function name ----- ------ ------------- 16894211 (100%) 58417 (100%) (TOTAL) 2223855 (13.2%) 502 (0.9%) rustc_query_system::query::plumbing::get_query_impl::{{closure}} 1331918 (7.9%) 1287 (2.2%) hashbrown::raw::RawTable<T>::reserve_rehash 774434 (4.6%) 12043 (20.6%) core::ptr::drop_in_place 294170 (1.7%) 499 (0.9%) rustc_query_system::dep_graph::graph::DepGraph<K>::with_task_impl 245410 (1.5%) 1552 (2.7%) psm::on_stack::with_on_stack 210311 (1.2%) 1 (0.0%) rustc_target::spec::load_specific 200962 (1.2%) 513 (0.9%) rustc_query_system::query::plumbing::get_query_impl 190704 (1.1%) 1 (0.0%) rustc_middle::ty::query::<impl rustc_middle::ty::context::TyCtxt>::alloc_self_profile_query_strings 180272 (1.1%) 468 (0.8%) rustc_query_system::query::plumbing::load_from_disk_and_cache_in_memory 177396 (1.1%) 114 (0.2%) rustc_query_system::query::plumbing::force_query_impl 161134 (1.0%) 445 (0.8%) rustc_query_system::dep_graph::graph::DepGraph<K>::with_anon_task 141551 (0.8%) 186 (0.3%) rustc_query_system::query::plumbing::incremental_verify_ich 110191 (0.7%) 7 (0.0%) rustc_middle::ty::context::_DERIVE_rustc_serialize_Decodable_D_FOR_TypeckResults::<impl rustc_serialize::serialize::Decodable<__D> for rustc_middle::ty::context::TypeckResults>::decode::{{closure}} 108590 (0.6%) 420 (0.7%) core::ops::function::FnOnce::call_once 88488 (0.5%) 21 (0.0%) rustc_query_system::dep_graph::graph::DepGraph<K>::try_mark_previous_green 86368 (0.5%) 1 (0.0%) rustc_middle::ty::query::stats::query_stats 85654 (0.5%) 3973 (6.8%) <&T as core::fmt::Debug>::fmt 84475 (0.5%) 1 (0.0%) rustc_middle::ty::query::Queries::try_collect_active_jobs 81220 (0.5%) 862 (1.5%) <hashbrown::raw::RawIterHash<T> as core::iter::traits::iterator::Iterator>::next 77636 (0.5%) 54 (0.1%) core::slice::sort::recurse 66484 (0.4%) 461 (0.8%) <hashbrown::raw::RawIter<T> as core::iter::traits::iterator::Iterator>::next ``` All `.ll` files together had 4.4GB. After my change they had 4.2GB. So a few percent less code LLVM has to process. Hurray! Sadly, I couldn't measure an actual wall-time improvement. Watching YouTube while compiling added to much noise... Here is the top of the list after the change: ``` 16460866 (100%) 58341 (100%) (TOTAL) 1903085 (11.6%) 504 (0.9%) rustc_query_system::query::plumbing::get_query_impl::{{closure}} 1331918 (8.1%) 1287 (2.2%) hashbrown::raw::RawTable<T>::reserve_rehash 777796 (4.7%) 12031 (20.6%) core::ptr::drop_in_place 551462 (3.4%) 1519 (2.6%) rustc_data_structures::stack::ensure_sufficient_stack::{{closure}} ``` Note that the total was reduced by 430 000 lines and `psm::on_stack::with_on_stack` has disappeared. Instead `rustc_data_structures::stack::ensure_sufficient_stack::{{closure}}` appeared. I'm confused about that one, but it seems to consist of inlined calls to `rustc_query_system::*` stuff. Further note the other two big culprits in this list: `rustc_query_system` and `hashbrown`. These two are monomorphized many times, the query system summing to more than 20% of all lines, not even counting code that's probably inlined elsewhere. Assuming compile times scale linearly with llvm-lines, that means a possible 20% compile time reduction. Reducing eg. `get_query_impl` would probably need a major refactoring of the qery system though. _Everything_ in there is generic over multiple types, has associated types and passes generic Self arguments by value. Which means you can't simply make things `dyn`. --------------------------------------- This PR is a small step to make rustc compile faster and thus make contributing to rustc less painful. Nonetheless I love Rust and I find the work around rustc fascinating :)	2020-09-21 17:32:57 +00:00
Ralf Jung	e177757a04	Rollup merge of #76963 - est31:remove_static_assert, r=oli-obk Remove unused static_assert macro	2020-09-21 10:40:47 +02:00
Ralf Jung	048866bd6b	Rollup merge of #76958 - est31:ns, r=oli-obk Replace manual as_nanos and as_secs_f64 reimplementations	2020-09-21 10:40:39 +02:00
Julian Wollersberger	53aaa1e532	To avoid monomorphizing `psm::on_stack::with_on_stack` 1500 times, I made a change in `stacker` to wrap the callback in `dyn`.	2020-09-20 19:07:52 +02:00
Ralf Jung	50d56bc774	Rollup merge of #76825 - lcnr:array-windows-apply, r=varkor use `array_windows` instead of `windows` in the compiler I do think these changes are beautiful, but do have to admit that using type inference for the window length can easily be confusing. This seems like a general issue with const generics, where inferring constants adds an additional complexity which users have to learn and keep in mind.	2020-09-20 12:08:26 +02:00
Ralf Jung	4322e1b92d	Rollup merge of #76821 - est31:remove_redundant_nightly_features, r=oli-obk,Mark-Simulacrum Remove redundant nightly features Removes a bunch of redundant/outdated nightly features. The first commit removes a `core_intrinsics` use for which a stable wrapper has been provided since. The second commit replaces the `const_generics` feature with `min_const_generics` which might get stabilized this year. The third commit is the result of a trial/error run of removing every single feature and then adding it back if compile failed. A bunch of unused features are the result that the third commit removes.	2020-09-20 12:08:22 +02:00
est31	c2dad1c6b9	Remove unused static_assert macro	2020-09-20 11:40:51 +02:00
est31	43193dcb88	Use as_secs_f64 in profiling.rs	2020-09-20 10:27:14 +02:00
Bastian Kauschke	3435683fd5	use `array_windows` instead of `windows` in the compiler	2020-09-20 08:11:05 +02:00
Bastian Kauschke	1146c39da7	cache types during normalization	2020-09-19 17:27:13 +02:00
Mara Bos	1e2dba1e7c	Use `T::BITS` instead of `size_of::<T> * 8`.	2020-09-19 06:54:42 +02:00
est31	ebdea01143	Remove redundant #![feature(...)] 's from compiler/	2020-09-17 07:58:45 +02:00
est31	4fe6ca3789	Replace const_generics feature gate with min_const_generics The latter is on the path to stabilization.	2020-09-17 07:08:53 +02:00
Andreas Jonson	b8752fff19	update the version of itertools and parking_lot this is to avoid compiling multiple version of the crates in rustc	2020-09-12 08:26:53 +02:00
Flying-Toast	2799aec6ab	Capitalize safety comments	2020-09-08 22:37:18 -04:00
Scott McMurray	59e37332b0	Add `BREAK` too, and improve the comments	2020-09-04 16:28:23 -07:00
Scott McMurray	fac272688e	Use ops::ControlFlow in graph::iterate	2020-09-04 01:45:10 -07:00
bors	51f79b618d	Auto merge of #76233 - cuviper:unhasher, r=Mark-Simulacrum Avoid rehashing Fingerprint as a map key This introduces a no-op `Unhasher` for map keys that are already hash- like, for example `Fingerprint` and its wrapper `DefPathHash`. For these we can directly produce the `u64` hash for maps. The first use of this is `def_path_hash_to_def_id: Option<UnhashMap<DefPathHash, DefId>>`. cc #56308 r? @eddyb	2020-09-02 22:16:22 +00:00
Josh Stone	469ca379d6	Avoid rehashing Fingerprint as a map key This introduces a no-op `Unhasher` for map keys that are already hash- like, for example `Fingerprint` and its wrapper `DefPathHash`. For these we can directly produce the `u64` hash for maps. The first use of this is `def_path_hash_to_def_id: Option<UnhashMap<DefPathHash, DefId>>`.	2020-09-01 18:27:02 -07:00
marmeladema	1b650d0fea	datastructures: replace `lazy_static` by `SyncLazy` from std	2020-09-01 22:06:47 +01:00
marmeladema	68500ffacb	datastructures: replace `once_cell` crate with an impl from std	2020-08-30 20:06:14 +01:00
mark	9e5f7d5631	mv compiler to compiler/	2020-08-30 18:45:07 +03:00

... 4 5 6 7 8

376 Commits