rust/src at c8eff682fd486a405dfbfcc89e613dcbccac0904 - rust

History

bors c8eff682fd Auto merge of #33090 - bluss:special-zip-2, r=aturon Specialize .zip() for efficient slice and slice iteration The idea is to introduce a private trait TrustedRandomAccess and specialize .zip() for random access iterators into a counted loop. The implementation in the PR is internal and has no visible effect in the API Why a counted loop? To have each slice iterator compile to just a pointer, and both pointers are indexed with the same loop counter value in the generated code. When this succeeds, copying loops are readily recognized and replaced with memcpy and addition loops autovectorize well. The TrustedRandomAccess approach works very well on the surface. Microbenchmarks optimize well, following the ideas above, and that is a dramatic improvement of .zip()'s codegen. ```rust // old zip before this PR: bad, byte-for-byte loop // with specialized zip: memcpy pub fn copy_zip(xs: &[u8], ys: &mut [u8]) { for (a, b) in ys.iter_mut().zip(xs) { a = b; } } // old zip before this PR: single addition per iteration // with specialized zip: vectorized pub fn add_zip(xs: &[f32], ys: &mut [f32]) { for (a, b) in ys.iter_mut().zip(xs) { a += b; } } // old zip before this PR: single addition per iteration // with specialized zip: vectorized (!!) pub fn add_zip3(xs: &[f32], ys: &[f32], zs: &mut [f32]) { for ((a, b), c) in zs.iter_mut().zip(xs).zip(ys) { a += b * *c; } } ``` Yet in more complex situations, the .zip() loop can still fall back to its old behavior where phantom null checks throw in fake premature end of the loop conditionals. Remember that a NULL inside Option<(&T, &T)> makes it a `None` value and a premature (in this case) end of the loop. So even if we have 1) an explicit `Some` in the code and 2) the types of the pointers are `&T` or `&mut T` which are nonnull, we can still get a phantom null check at that point. One example that illustrates the difference is `copy_zip` with slice versus Vec arguments. The involved iterator types are exactly the same, but the Vec version doesn't compile down to memcpy. Investigating into this, the function argument metadata emitted to llvm plays the biggest role. As eddyb summarized, we need nonnull for the loop to autovectorize and noalias for it to replace with memcpy. There was an experiment to use `assume` to add a non-null assumption on each of the two elements in the specialized zip iterator, but this only helped in some of the test cases and regressed others. Instead I think the nonnull/noalias metadata issue is something we need to solve separately anyway. These have conditionally implemented TrustedRandomAccess - Enumerate - Zip These have not implemented it - Map is sideeffectful. The forward case would be workable, but the double ended case is complicated. - Chain, exact length semantics unclear - Filter, FilterMap, FlatMap and many others don't offer random access and/or exact length		2016-06-17 03:36:32 -07:00
..
bootstrap	Fix issue where rustbuild expected msvc to have ar	2016-06-16 08:38:06 -04:00
build_helper	Fix issue where rustbuild expected msvc to have ar	2016-06-16 08:38:06 -04:00
compiler-rt@57315f7e07
doc	reference: link to proper ffi attributes	2016-06-14 10:50:48 +08:00
driver
etc
grammar
jemalloc@aab1c0a0e0
liballoc
liballoc_jemalloc	Fix issue where rustbuild expected msvc to have ar	2016-06-16 08:38:06 -04:00
liballoc_system
libarena
libbacktrace
libcollections	Map::Entry::take() method to recover key and value together	2016-06-15 05:45:50 -07:00
libcollectionstest
libcore	Auto merge of #33090 - bluss:special-zip-2, r=aturon	2016-06-17 03:36:32 -07:00
libcoretest	specialize zip: Add benchmarks	2016-06-14 15:51:49 +02:00
libflate
libfmt_macros
libgetopts
libgraphviz
liblibc@45d85899e9
liblog
libpanic_abort
libpanic_unwind
librand
librbml
librustc	fix MirSource::Promoted handling	2016-06-16 17:30:09 +03:00
librustc_back
librustc_bitflags
librustc_borrowck	Auto merge of #34290 - arielb1:short-ladder, r=eddyb	2016-06-16 01:01:26 -07:00
librustc_const_eval	Rollup merge of #34207 - petrochenkov:nohyg, r=jseyfried	2016-06-16 01:39:43 +01:00
librustc_const_math
librustc_data_structures	Auto merge of #34221 - srinivasreddy:rm_redundant, r=alexcrichton	2016-06-14 13:42:28 -07:00
librustc_driver	Simplify gated cfg checking	2016-06-16 19:24:44 +00:00
librustc_incremental
librustc_lint
librustc_llvm
librustc_metadata	Add an abs_path member to FileMap, use it when writing debug info.	2016-06-16 18:08:46 +01:00
librustc_mir	Auto merge of #34306 - arielb1:mir-dump-fixes, r=eddyb	2016-06-16 21:57:06 -07:00
librustc_passes
librustc_platform_intrinsics
librustc_plugin
librustc_privacy
librustc_resolve
librustc_save_analysis	Rollup merge of #34298 - nrc:save-parent, r=eddyb	2016-06-17 00:12:04 +01:00
librustc_trans	Add an abs_path member to FileMap, use it when writing debug info.	2016-06-16 18:08:46 +01:00
librustc_typeck	Auto merge of #34000 - estebank:missingargs, r=jseyfried	2016-06-15 22:12:26 -07:00
librustc_unicode
librustdoc	Add an abs_path member to FileMap, use it when writing debug info.	2016-06-16 18:08:46 +01:00
libserialize
libstd	Fix issue where rustbuild expected msvc to have ar	2016-06-16 08:38:06 -04:00
libsyntax	Auto merge of #34315 - Manishearth:rollup, r=Manishearth	2016-06-16 19:07:41 -07:00
libsyntax_ext	Simplify gated cfg checking	2016-06-16 19:24:44 +00:00
libterm
libtest
libunwind
llvm@80ad955b60
rt
rtstartup
rust-installer@c37d3747da
rustc
rustllvm
test	Auto merge of #33090 - bluss:special-zip-2, r=aturon	2016-06-17 03:36:32 -07:00
tools	Auto merge of #34220 - srinivasreddy:rf_cargotest, r=brson	2016-06-15 13:11:19 -07:00
stage0.txt

No results found.