mikros/rust - rust - Gitea.pterpstra.com

Author	SHA1	Message	Date
bors	76c500ec6c	Auto merge of #81635 - michaelwoerister:structured_def_path_hash, r=pnkfelix Let a portion of DefPathHash uniquely identify the DefPath's crate. This allows to directly map from a `DefPathHash` to the crate it originates from, without constructing side tables to do that mapping -- something that is useful for incremental compilation where we deal with `DefPathHash` instead of `DefId` a lot. It also allows to reliably and cheaply check for `DefPathHash` collisions which allows the compiler to gracefully abort compilation instead of running into a subsequent ICE at some random place in the code. The following new piece of documentation describes the most interesting aspects of the changes: ```rust /// A `DefPathHash` is a fixed-size representation of a `DefPath` that is /// stable across crate and compilation session boundaries. It consists of two /// separate 64-bit hashes. The first uniquely identifies the crate this /// `DefPathHash` originates from (see [StableCrateId]), and the second /// uniquely identifies the corresponding `DefPath` within that crate. Together /// they form a unique identifier within an entire crate graph. /// /// There is a very small chance of hash collisions, which would mean that two /// different `DefPath`s map to the same `DefPathHash`. Proceeding compilation /// with such a hash collision would very probably lead to an ICE and, in the /// worst case, to a silent mis-compilation. The compiler therefore actively /// and exhaustively checks for such hash collisions and aborts compilation if /// it finds one. /// /// `DefPathHash` uses 64-bit hashes for both the crate-id part and the /// crate-internal part, even though it is likely that there are many more /// `LocalDefId`s in a single crate than there are individual crates in a crate /// graph. Since we use the same number of bits in both cases, the collision /// probability for the crate-local part will be quite a bit higher (though /// still very small). /// /// This imbalance is not by accident: A hash collision in the /// crate-local part of a `DefPathHash` will be detected and reported while /// compiling the crate in question. Such a collision does not depend on /// outside factors and can be easily fixed by the crate maintainer (e.g. by /// renaming the item in question or by bumping the crate version in a harmless /// way). /// /// A collision between crate-id hashes on the other hand is harder to fix /// because it depends on the set of crates in the entire crate graph of a /// compilation session. Again, using the same crate with a different version /// number would fix the issue with a high probability -- but that might be /// easier said then done if the crates in questions are dependencies of /// third-party crates. /// /// That being said, given a high quality hash function, the collision /// probabilities in question are very small. For example, for a big crate like /// `rustc_middle` (with ~50000 `LocalDefId`s as of the time of writing) there /// is a probability of roughly 1 in 14,750,000,000 of a crate-internal /// collision occurring. For a big crate graph with 1000 crates in it, there is /// a probability of 1 in 36,890,000,000,000 of a `StableCrateId` collision. ``` Given the probabilities involved I hope that no one will ever actually see the error messages. Nonetheless, I'd be glad about some feedback on how to improve them. Should we create a GH issue describing the problem and possible solutions to point to? Or a page in the rustc book? r? `@pnkfelix` (feel free to re-assign)	2021-03-07 23:45:57 +00:00
Guillaume Gomez	0db8349fff	Rollup merge of #81940 - jhpratt:stabilize-str_split_once, r=m-ou-se Stabilize str_split_once Closes #74773	2021-02-26 15:52:29 +01:00
Vadim Petrochenkov	0038eaee6b	rustc_span: Remove obsolete `allow_internal_unstable_backcompat_hack`	2021-02-14 19:42:55 +03:00
Jacob Pratt	c28f2a8bee	Stabilize str_split_once	2021-02-09 23:17:11 -05:00
klensy	60cca83975	faster spans	2021-02-04 04:54:23 +03:00
Michael Woerister	22d489be76	Let a portion of DefPathHash uniquely identify the DefPath's crate. This allows to directly map from a DefPathHash to the crate it originates from, without constructing side tables to do that mapping. It also allows to reliably and cheaply check for DefPathHash collisions.	2021-02-02 17:40:29 +01:00
Aaron Hill	3540f9396a	Add disambiugator to ExpnData Due to macro expansion, its possible to end up with two distinct `ExpnId`s that have the same `ExpnData` contents. This violates the contract of `HashStable`, since two unequal `ExpnId`s will end up with equal `Fingerprint`s. This commit adds a `disambiguator` field to `ExpnData`, which is used to force two otherwise-equivalent `ExpnData`s to be distinct.	2021-01-23 15:41:17 -05:00
Aaron Hill	482a67d20f	Properly handle `SyntaxContext` of dummy spans in incr comp Fixes #80336 Due to macro expansion, we may end up with spans with an invalid location and non-root `SyntaxContext`. This commits preserves the `SyntaxContext` of such spans in the incremental cache, and ensures that we always hash the `SyntaxContext` when computing the `Fingerprint` of a `Span` Previously, we would discard the `SyntaxContext` during serialization to the incremental cache, causing the span's `Fingerprint` to change across compilation sessions.	2021-01-13 15:20:29 -05:00
bors	fe531d5a5f	Auto merge of #79012 - tgnottingham:span_data_to_lines_and_cols, r=estebank rustc_span: add span_data_to_lines_and_cols to caching source map view	2021-01-11 21:32:50 +00:00
Vadim Petrochenkov	3ff866ed7c	resolve: Scope visiting doesn't need an `Ident`	2021-01-07 16:09:47 +03:00
Mara Bos	f16ef7d7ce	Add edition 2021.	2020-12-31 19:06:09 +01:00
Yuki Okushi	6064be7ced	Rollup merge of #80358 - pierwill:edit_rustc_span, r=lcnr Edit rustc_span documentation Various changes to the `rustc_span` docs, including the following: - Additions to top-level docs - Edits to the source_map module docs - Edits to documentation for `Span` and `SpanData` - Added intra-docs links - Documentation for Levenshtein distances - Fixed missing punctuation	2020-12-30 18:15:11 +09:00
pierwill	a8775d44e9	Edit rustc_span documentation Various changes to the `rustc_span` docs, including the following: - Additions to top-level docs - Edits to the source_map module docs - Edits to documentation for `Span` and `SpanData` - Added intra-docs links - Documentation for Levenshtein distances - Fixed missing punctuation	2020-12-25 14:02:52 -08:00
Arpad Borsos	830ceaa419	Remap instrument-coverage line numbers in doctests This uses the `SourceMap::doctest_offset_line` method to re-map line numbers from doctests. Remapping columns is not yet done. Part of issue #79417.	2020-12-19 13:22:24 +01:00
Tyson Nottingham	75de8286c0	rustc_span: add span_data_to_lines_and_cols to caching source map view Gives a performance increase over calling byte_pos_to_line_and_col twice, partially because it decreases the function calling overhead, potentially because it doesn't populate the line cache with lines that turn out to belong to invalid spans, and likely because of some other incidental improvements made possible by having more context available.	2020-12-03 18:36:34 -08:00
Joshua Nelson	0ad3dce83a	Fix some clippy lints	2020-12-03 17:08:19 -05:00
Arlie Davis	5481c1bd6d	Move lev_distance to rustc_ast, make non-generic rustc_ast currently has a few dependencies on rustc_lexer. Ideally, an AST would not have any dependency its lexer, for minimizing unnecessarily design-time dependencies. Breaking this dependency would also have practical benefits, since modifying rustc_lexer would not trigger a rebuild of rustc_ast. This commit does not remove the rustc_ast --> rustc_lexer dependency, but it does remove one of the sources of this dependency, which is the code that handles fuzzy matching between symbol names for making suggestions in diagnostics. Since that code depends only on Symbol, it is easy to move it to rustc_span. It might even be best to move it to a separate crate, since other tools such as Cargo use the same algorithm, and have simply contain a duplicate of the code. This changes the signature of find_best_match_for_name so that it is no longer generic over its input. I checked the optimized binaries, and this function was duplicated at nearly every call site, because most call sites used short-lived iterator chains, generic over Map and such. But there's no good reason for a function like this to be generic, since all it does is immediately convert the generic input (the Iterator impl) to a concrete Vec<Symbol>. This has all of the costs of generics (duplicated method bodies) with no benefit. Changing find_best_match_for_name to be non-generic removed about 10KB of code from the optimized binary. I know it's a drop in the bucket, but we have to start reducing binary size, and beginning to tame over-use of generics is part of that.	2020-11-24 16:12:23 -08:00
bors	9722952f0b	Auto merge of #76256 - tgnottingham:issue-74890, r=nikomatsakis incr-comp: hash and serialize span end line/column Hash both the length and the end location (line/column) of a span. If we hash only the length, for example, then two otherwise equal spans with different end locations will have the same hash. This can cause a problem during incremental compilation wherein a previous result for a query that depends on the end location of a span will be incorrectly reused when the end location of the span it depends on has changed. A similar analysis applies if some query depends specifically on the length of the span, but we only hash the end location. So hash both. Fix #46744, fix #59954, fix #63161, fix #73640, fix #73967, fix #74890, fix #75900 --- See #74890 for a more in-depth analysis. I haven't thought about what other problems this root cause could be responsible for. Please let me know if anything springs to mind. I believe the issue has existed since the inception of incremental compilation.	2020-11-12 15:34:09 +00:00
Matthias Krüger	bcd2f2df67	fix a couple of clippy warnings: filter_next manual_strip redundant_static_lifetimes single_char_pattern unnecessary_cast unused_unit op_ref redundant_closure useless_conversion	2020-11-04 13:48:50 +01:00
Tyson Nottingham	b71e627b26	incr-comp: hash span end line/column Hash both the length and the end location (line/column) of a span. If we hash only the length, for example, then two otherwise equal spans with different end locations will have the same hash. This can cause a problem during incremental compilation wherein a previous result for a query that depends on the end location of a span will be incorrectly reused when the end location of the span it depends on has changed. A similar analysis applies if some query depends specifically on the length of the span, but we only hash the end location. So hash both. Fix #46744, fix #59954, fix #63161, fix #73640, fix #73967, fix #74890, fix #75900	2020-11-04 01:37:18 -08:00
Mara Bos	52405f7c0c	Rollup merge of #77950 - arlosi:sha256, r=eddyb Add support for SHA256 source file hashing Adds support for `-Z src-hash-algorithm sha256`, which became available in LLVM 11. Using an older version of LLVM will cause an error `invalid checksum kind` if the hash algorithm is set to sha256. r? `@eddyb` cc #70401 `@est31`	2020-11-03 19:32:26 +01:00
Jonas Schievink	151db25599	Rollup merge of #78423 - tgnottingham:caching_source_map_bounds_check, r=oli-obk rustc_span: improve bounds checks in byte_pos_to_line_and_col The effect of this change is to consider edge-case spans that start or end at the position one past the end of a file to be valid during span hashing and encoding. This change means that these spans will be preserved across incremental compilation sessions when they are part of a serialized query result, instead of causing the dummy span to be used.	2020-10-29 17:05:17 +01:00
bors	31ee872db5	Auto merge of #78415 - tgnottingham:expn_id_tag_hash, r=Aaron1011 rustc_span: avoid hashing ExpnId tag when using cached hash	2020-10-28 20:03:55 +00:00
Tyson Nottingham	47dad31a04	rustc_span: represent line bounds with Range	2020-10-27 15:47:29 -07:00
Dániel Buga	da64d07191	Fix typo in comment	2020-10-27 17:08:14 +01:00
Tyson Nottingham	df59a44fea	rustc_span: improve bounds checks in byte_pos_to_line_and_col The effect of this change is to consider edge-case spans that start or end at the position one past the end of a file to be valid during span hashing and encoding. This change means that these spans will be preserved across incremental compilation sessions when they are part of a serialized query result, instead of causing the dummy span to be used.	2020-10-26 16:34:04 -07:00
Tyson Nottingham	a3623e0542	rustc_span: avoid hashing ExpnId tag when using cached hash	2020-10-26 13:43:48 -07:00
Arlo Siemsen	3296d5ca7b	Add support for SHA256 source file hashing for LLVM 11+.	2020-10-14 15:09:51 -07:00
est31	58b3923ad3	Remove unused code from rustc_span	2020-10-14 04:14:32 +02:00
est31	7367cfef59	Use shorter path for std:#️⃣:Hash	2020-10-08 03:25:01 +02:00
Rich Kadel	f5aebad28f	Updates to experimental coverage counter injection This is a combination of 18 commits. Commit #2: Additional examples and some small improvements. Commit #3: fixed mir-opt non-mir extensions and spanview title elements Corrected a fairly recent assumption in runtest.rs that all MIR dump files end in .mir. (It was appending .mir to the graphviz .dot and spanview .html file names when generating blessed output files. That also left outdated files in the baseline alongside the files with the incorrect names, which I've now removed.) Updated spanview HTML title elements to match their content, replacing a hardcoded and incorrect name that was left in accidentally when originally submitted. Commit #4: added more test examples also improved Makefiles with support for non-zero exit status and to force validation of tests unless a specific test overrides it with a specific comment. Commit #5: Fixed rare issues after testing on real-world crate Commit #6: Addressed PR feedback, and removed temporary -Zexperimental-coverage -Zinstrument-coverage once again supports the latest capabilities of LLVM instrprof coverage instrumentation. Also fixed a bug in spanview. Commit #7: Fix closure handling, add tests for closures and inner items And cleaned up other tests for consistency, and to make it more clear where spans start/end by breaking up lines. Commit #8: renamed "typical" test results "expected" Now that the `llvm-cov show` tests are improved to normally expect matching actuals, and to allow individual tests to override that expectation. Commit #9: test coverage of inline generic struct function Commit #10: Addressed review feedback * Removed unnecessary Unreachable filter. * Replaced a match wildcard with remining variants. * Added more comments to help clarify the role of successors() in the CFG traversal Commit #11: refactoring based on feedback * refactored `fn coverage_spans()`. * changed the way I expand an empty coverage span to improve performance * fixed a typo that I had accidently left in, in visit.rs Commit #12: Optimized use of SourceMap and SourceFile Commit #13: Fixed a regression, and synched with upstream Some generated test file names changed due to some new change upstream. Commit #14: Stripping out crate disambiguators from demangled names These can vary depending on the test platform. Commit #15: Ignore llvm-cov show diff on test with generics, expand IO error message Tests with generics produce llvm-cov show results with demangled names that can include an unstable "crate disambiguator" (hex value). The value changes when run in the Rust CI Windows environment. I added a sed filter to strip them out (in a prior commit), but sed also appears to fail in the same environment. Until I can figure out a workaround, I'm just going to ignore this specific test result. I added a FIXME to follow up later, but it's not that critical. I also saw an error with Windows GNU, but the IO error did not specify a path for the directory or file that triggered the error. I updated the error messages to provide more info for next, time but also noticed some other tests with similar steps did not fail. Looks spurious. Commit #16: Modify rust-demangler to strip disambiguators by default Commit #17: Remove std::process::exit from coverage tests Due to Issue #77553, programs that call std::process::exit() do not generate coverage results on Windows MSVC. Commit #18: fix: test file paths exceeding Windows max path len	2020-10-05 08:02:58 -07:00
Erik Hofmayer	138a2e5eaa	/nightly/nightly-rustc	2020-09-23 21:51:56 +02:00
Erik Hofmayer	dd66ea2d3d	Updated html_root_url for compiler crates	2020-09-23 21:14:43 +02:00
James Whaley	9a1f1777d3	Remove cast to usize for BytePos and CharPos The case shouldn't be necessary and implicitly truncating BytePos is not desirable.	2020-09-21 19:42:43 +01:00
James Whaley	b4b4a2f092	Reduce boilerplate for BytePos and CharPos	2020-09-21 18:27:43 +01:00
Ralf Jung	50d56bc774	Rollup merge of #76825 - lcnr:array-windows-apply, r=varkor use `array_windows` instead of `windows` in the compiler I do think these changes are beautiful, but do have to admit that using type inference for the window length can easily be confusing. This seems like a general issue with const generics, where inferring constants adds an additional complexity which users have to learn and keep in mind.	2020-09-20 12:08:26 +02:00
Bastian Kauschke	3435683fd5	use `array_windows` instead of `windows` in the compiler	2020-09-20 08:11:05 +02:00
est31	ebdea01143	Remove redundant #![feature(...)] 's from compiler/	2020-09-17 07:58:45 +02:00
Ivan Tham	5dc9790e10	Add visualization of rustc span in doc It took me quite some time to figure out what Span::to means. A picture is worth a thousand words.	2020-09-13 20:48:15 +08:00
Rich Kadel	7225f66887	Adds two source span utility functions used in source-based coverage `span.is_empty()` - returns true if `lo()` and `hi()` are equal. This is not only a convenience, but makes it clear that a `Span` can be empty (that is, retrieving the source for an empty `Span` will return an empty string), and codifies the (otherwise undocumented--in the rustc_span package, at least) fact that `Span` is a half-open interval (where `hi()` is the open end). `source_map.lookup_file_span()` - returns an enclosing `Span` representing the start and end positions of the file enclosing the given `BytePos`. This gives developers a clear way to quickly determine if any any other `BytePos` or `Span` is also from the same file (for example, by simply calling `file_span.contains(span)`). This results in much simpler code and is much more runtime efficient compared with the obvious alternative: calling `source_map.lookup_line()` for any two `Span`'s byte positions, handle both arms of the `Result` (both contain the file), and then compare files. It is also more efficient than the non-public method `lookup_source_file_idx()` for each `BytePos`, because, while comparing the internal source file indexes would be efficient, looking up the source file index for every `BytePos` or `Span` to be compared requires a binary search (worst case performance being O(log n) for every lookup). `source_map.lookup_file_span()` performs the binary search only once, to get the `file_span` result that can be used to compare to any number of other `BytePos` or `Span` values and those comparisons are always O(1).	2020-08-31 18:41:57 -07:00
mark	9e5f7d5631	mv compiler to compiler/	2020-08-30 18:45:07 +03:00

1 2 3

141 Commits