mikros/rust - rust - Gitea.pterpstra.com

Author	SHA1	Message	Date
Alex Crichton	a68f8866a7	Rollup merge of #38609 - alexcrichton:less-compress, r=japaric travis: Don't use -9 on gzip I timed this locally and plain old `gzip` took 2m06s while `gzip -9` took a whopping 6m23s to save a mere 4MB out of 1.2GB. Let's shave a few minutes off the Android builder by turning down the compression level.	2016-12-29 17:26:19 -08:00
Alex Crichton	fe80a1d014	Rollup merge of #38587 - GuillaumeGomez:arc_docs, r=frewsxcv Add missing urls in Arc docs r? @frewsxcv	2016-12-29 17:26:18 -08:00
Alex Crichton	bcfd504744	Rollup merge of #38559 - japaric:ptx2, r=alexcrichton

PTX support, take 2

- You can generate PTX using `--emit=asm` and the right (custom) target, which you can then run on an NVIDIA GPU.
- You can compile `core` to PTX. [Xargo] also works and it can compile some other crates like `collections` (but I doubt all of those make sense on a GPU).

[Xargo]: https://github.com/japaric/xargo

- You can create "global" functions, which can be "called" by the host, using the `"ptx-kernel"` ABI, e.g. `extern "ptx-kernel" fn kernel() { .. }`. Every other function is a "device" function and can only be called by the GPU.
- Intrinsics like `__syncthreads()` and `blockIdx.x` are available as `"platform-intrinsics"`. These intrinsics are not in the `core` crate but any Rust user can create "bindings" to them using an `extern "platform-intrinsics"` block. See example at the end.
- Emitting PTX with `-g` (debuginfo) produces an LLVM error. I don't think PTX can contain debuginfo anyway, so `-g` should be ignored and a warning should be printed ("`-g` doesn't work with this target" or something).
- "Single source" support. You can't write a single source file that contains both host and device code. I think it should be possible to implement that outside the compiler using compiler plugins / build scripts.
- The equivalent to CUDA `__shared__`, which is used to declare memory that's shared between the threads of the same block. This could be implemented using attributes: `#[shared] static mut SCRATCH_MEMORY: [f32; 64]` but hasn't been implemented yet.
- Built-in targets. This PR doesn't add targets to the compiler just yet but one can create custom targets to be able to emit PTX code (see the example at the end). The idea is to have people experiment with this feature before committing to it (built-in targets are "insta-stable").
- All functions must be "inlined". IOW, the `.rlib` must always contain the LLVM bitcode of all the functions of the crate it was produced from. Otherwise, you end up with "undefined references" in the final PTX code but you won't get any linker error because no linker is involved. IOW, you'll hit a runtime error when loading the PTX into the GPU. The workaround is to use `#[inline]` on non-generic functions and to never use `#[inline(never)]` but this may not always be possible because e.g. you could be relying on third party code.
- Should `--emit=asm` generate a `.ptx` file instead of a `.s` file?

TL;DR Use Xargo to turn a crate into a PTX module (a `.s` file). Then pass that PTX module, as a string, to the GPU and run it.

The full code is in [this repository]. This section gives an overview of how to run Rust code on an NVIDIA GPU.

[this repository]: https://github.com/japaric/cuda

- Create a custom target. Here's the 64-bit NVPTX target (NOTE: the comments are not valid because this is supposed to be a JSON file; remove them before you use this file):

``` js
// nvptx64-nvidia-cuda.json
{
  "arch": "nvptx64",  // matches LLVM
  "cpu": "sm_20",  // "oldest" compute capability supported by LLVM
  "data-layout": "e-i64:64-v16:16-v32:32-n16:32:64",
  "llvm-target": "nvptx64-nvidia-cuda",
  "max-atomic-width": 0,  // LLVM errors with any other value :-(
  "os": "cuda",  // matches LLVM
  "panic-strategy": "abort",
  "target-endian": "little",
  "target-pointer-width": "64",
  "target-vendor": "nvidia"  // matches LLVM -- not required
}
```

(There's a 32-bit target specification in the linked repository)

- Write a kernel

``` rust
extern "platform-intrinsic" {
    fn nvptx_block_dim_x() -> i32;
    fn nvptx_block_idx_x() -> i32;
    fn nvptx_thread_idx_x() -> i32;
}

/// Copies an array of `n` floating point numbers from `src` to `dst`
pub unsafe extern "ptx-kernel" fn memcpy(dst: *mut f32, src: *const f32, n: usize) {
    // Global index of this thread: blockDim.x * blockIdx.x + threadIdx.x
    let i = (nvptx_block_dim_x() as isize)
        .wrapping_mul(nvptx_block_idx_x() as isize)
        .wrapping_add(nvptx_thread_idx_x() as isize);

    // Threads past the end of the array do nothing
    if (i as usize) < n {
        *dst.offset(i) = *src.offset(i);
    }
}
```

- Emit PTX code

```
$ xargo rustc --target nvptx64-nvidia-cuda --release -- --emit=asm
   Compiling core v0.0.0 (file://..)
   (..)
   Compiling nvptx-builtins v0.1.0 (https://github.com/japaric/nvptx-builtins)
   Compiling kernel v0.1.0

$ cat target/nvptx64-nvidia-cuda/release/deps/kernel-*.s
//
// Generated by LLVM NVPTX Back-End
//

.version 3.2
.target sm_20
.address_size 64

    // .globl memcpy

.visible .entry memcpy(
    .param .u64 memcpy_param_0,
    .param .u64 memcpy_param_1,
    .param .u64 memcpy_param_2
)
{
    .reg .pred %p<2>;
    .reg .s32 %r<5>;
    .reg .s64 %rd<12>;

    ld.param.u64 %rd7, [memcpy_param_2];
    mov.u32 %r1, %ntid.x;
    mov.u32 %r2, %ctaid.x;
    mul.wide.s32 %rd8, %r2, %r1;
    mov.u32 %r3, %tid.x;
    cvt.s64.s32 %rd9, %r3;
    add.s64 %rd10, %rd9, %rd8;
    setp.ge.u64 %p1, %rd10, %rd7;
    @%p1 bra LBB0_2;
    ld.param.u64 %rd3, [memcpy_param_0];
    ld.param.u64 %rd4, [memcpy_param_1];
    cvta.to.global.u64 %rd5, %rd4;
    cvta.to.global.u64 %rd6, %rd3;
    shl.b64 %rd11, %rd10, 2;
    add.s64 %rd1, %rd6, %rd11;
    add.s64 %rd2, %rd5, %rd11;
    ld.global.u32 %r4, [%rd2];
    st.global.u32 [%rd1], %r4;
LBB0_2:
    ret;
}
```

- Run it on the GPU

``` rust
// `kernel.ptx` is the `.s` file we got in the previous step
const KERNEL: &'static str = include_str!("kernel.ptx");

driver::initialize()?;

let device = Device(0)?;
let ctx = device.create_context()?;
let module = ctx.load_module(KERNEL)?;
let kernel = module.function("memcpy")?;

let h_a: Vec<f32> = /* create some random data */;
let h_b = vec![0.; N];

let d_a = driver::allocate(bytes)?;
let d_b = driver::allocate(bytes)?;

// Copy from host to GPU
driver::copy(h_a, d_a)?;

// Run `memcpy` on the GPU
kernel.launch(d_b, d_a, N)?;

// Copy from GPU to host
driver::copy(d_b, h_b)?;

// Verify
assert_eq!(h_a, h_b);

// `d_a`, `d_b`, `h_a`, `h_b` are dropped/freed here
```

---

cc @alexcrichton @brson @rkruppe

> What has changed since #34195?

- `core` can now be compiled into PTX, which makes it very easy to turn `no_std` crates into "kernels" with the help of Xargo.
- There's now a way, the `"ptx-kernel"` ABI, to generate "global" functions. The old PR required a manual step (it was a hack) to "convert" "device" functions into "global" functions. (Only "global" functions can be launched by the host)
- Everything is unstable. There are no "insta-stable" built-in targets this time (\*). Users have to use a custom target to experiment with this feature. Also, PTX intrinsics, like `__syncthreads` and `blockIdx.x`, are now implemented as `"platform-intrinsics"` so they no longer live in the `core` crate.

(\*) I'd actually like to have in-tree targets because that makes this target more discoverable, removes the need to lug around .json files, etc. However, bundling a target with the compiler immediately puts it on the path towards stabilization, which gives us just two cycles to find and fix any problem with the target specification. Afterwards, it becomes hard to tweak the specification because that could be a breaking change. A possible solution could be "unstable built-in targets". Basically, to use an unstable target, you'll have to also pass `-Z unstable-options` to the compiler. And unstable targets, being unstable, wouldn't be available on stable.

> Why should this be merged?

- To let people experiment with the feature out of tree. Having easy access to the feature (in every nightly) allows this. I also think that, as it is, it should be possible to start prototyping type-safe single source support using build scripts, macros and/or plugins.
- It's a straightforward implementation. No different than adding support for any other architecture.	2016-12-29 17:26:15 -08:00
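The `memcpy` kernel in the PTX description above gives each GPU thread one element, indexed by `blockDim.x * blockIdx.x + threadIdx.x`, and skips threads whose index is past `n`. A minimal host-side sketch of that indexing scheme follows. It runs on the CPU and involves no PTX; `global_index` and `simulate_memcpy` are hypothetical names introduced only for illustration.

```rust
/// Global element index for one simulated GPU thread, mirroring the kernel's
/// arithmetic: block_dim * block_idx + thread_idx.
/// (Hypothetical helper; not part of the PR.)
fn global_index(block_dim: i32, block_idx: i32, thread_idx: i32) -> isize {
    (block_dim as isize)
        .wrapping_mul(block_idx as isize)
        .wrapping_add(thread_idx as isize)
}

/// Simulates launching `blocks` blocks of `block_dim` threads on the CPU.
/// Each simulated thread copies at most one element and applies the same
/// `i < n` bounds check as the kernel.
fn simulate_memcpy(dst: &mut [f32], src: &[f32], block_dim: i32, blocks: i32) {
    let n = dst.len().min(src.len());
    for block_idx in 0..blocks {
        for thread_idx in 0..block_dim {
            let i = global_index(block_dim, block_idx, thread_idx) as usize;
            if i < n {
                dst[i] = src[i];
            }
        }
    }
}

fn main() {
    let src: Vec<f32> = (0..10).map(|x| x as f32).collect();
    let mut dst = vec![0.0f32; 10];
    // 3 blocks of 4 threads give 12 threads; the last two fail the bounds check.
    simulate_memcpy(&mut dst, &src, 4, 3);
    assert_eq!(dst, src);
}
```

The bounds check matters because the launch grid is usually rounded up to a whole number of blocks, so some threads have no element to process.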
Alex Crichton	4e2e01ec95	Rollup merge of #38517 - frewsxcv:clarify-msys2-dependencies, r=alexcrichton Clarify phrasing of MSYS2 dependencies in README.md. Fixes https://github.com/rust-lang/rust/issues/36115.	2016-12-29 17:26:14 -08:00
Alex Crichton	26f28ec96d	Rollup merge of #38491 - GuillaumeGomez:builder_docs, r=frewsxcv Builder docs r? @frewsxcv	2016-12-29 17:26:13 -08:00
Alex Crichton	2293ea5e9b	Rollup merge of #37149 - edunham:more-cargotest, r=alexcrichton Add some more repos to cargotest From suggestions at https://users.rust-lang.org/t/what-stable-rust-applications-do-you-use-frequently/7618 This adds some applications which use stable Rust and come with their own lockfiles in their respective trees. ripgrep, xsv, and bins have 33 unique dependencies between them. I alphabetized the list by project name because that seems tidier. r? @brson	2016-12-29 17:26:12 -08:00
bors	e7c788af75	Auto merge of #38503 - brson:bootstrap, r=alexcrichton Bump bootstrap compiler r? @alexcrichton I'm not clear on whether cargo will need to additionally be bumped beyond what's in https://github.com/rust-lang/rust/pull/38470	2016-12-29 19:34:23 +00:00
Alex Crichton	a0f3c93d64	Fixes for new cargo test repos * Update to ripgrep HEAD because the previous rev would still change the lock file when `cargo build` was issued. * Remove `bins` as it depends on OpenSSL on Windows, which won't work on our bots * Update rev of tokei to get a rev that doesn't change the lockfile	2016-12-29 08:56:01 -08:00
E. Dunham	99580212b2	Add some more repos to cargotest From suggestions at https://users.rust-lang.org/t/what-stable-rust-applications-do-you-use-frequently/7618 This adds some applications which use stable Rust and come with their own lockfiles in tree. ripgrep, xsv, and bins have 33 unique dependencies between them.	2016-12-29 08:48:47 -08:00
Alex Crichton	03bc2cf35a	Fallout from updating bootstrap Cargo	2016-12-29 08:47:26 -08:00
Brian Anderson	6207e80d2c	Bump bootstrap compiler	2016-12-29 08:47:20 -08:00
bors	3f957ebeff	Auto merge of #38627 - rkruppe:ninja-build, r=alexcrichton Accept ninja-build binary in place of ninja See comment in the diff for rationale. r? @alexcrichton	2016-12-29 13:48:50 +00:00
bors	e571f2d778	Auto merge of #38571 - nrc:emit-metadata-change, r=alexcrichton Change --crate-type metadata to --emit=metadata WIP	2016-12-29 11:01:11 +00:00
bors	ebc293bcd3	Auto merge of #38645 - nikomatsakis:incr-comp-fix-time-depth, r=nrc propagate TIME_DEPTH to the helper threads for -Z time-passes Currently, the timing measurements for LLVM passes and the like don't come out indented, which messes up `perf.rust-lang.org`. r? @nrc	2016-12-29 08:16:58 +00:00
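A thread-local value is not inherited by threads spawned later, which is presumably why the depth has to be propagated explicitly. The sketch below illustrates that general behaviour with the standard library only. `DEPTH`, `depth`, `set_depth` and the two helper functions are hypothetical stand-ins, not rustc's actual code.

```rust
use std::cell::Cell;
use std::thread;

thread_local! {
    // Per-thread nesting depth; a stand-in for a `TIME_DEPTH`-style counter.
    static DEPTH: Cell<usize> = Cell::new(0);
}

fn depth() -> usize {
    DEPTH.with(|d| d.get())
}

fn set_depth(value: usize) {
    DEPTH.with(|d| d.set(value));
}

/// Spawns a helper thread without propagation: it sees the initial value 0.
fn depth_in_plain_thread() -> usize {
    thread::spawn(depth).join().unwrap()
}

/// Captures the parent's depth and installs it in the helper thread first.
fn depth_in_propagating_thread() -> usize {
    let parent = depth();
    thread::spawn(move || {
        set_depth(parent);
        depth()
    })
    .join()
    .unwrap()
}

fn main() {
    set_depth(2);
    assert_eq!(depth_in_plain_thread(), 0);
    assert_eq!(depth_in_propagating_thread(), 2);
}
```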
Nick Cameron	b059a80d4c	Support --emit=foo,metadata	2016-12-29 18:17:07 +13:00
bors	1d9965b5ba	Auto merge of #38619 - alexcrichton:less-android-flaky, r=brson travis: Attempt to fix Android flakiness There's been some flaky runs on Travis where the Android emulator is having problems staying alive... presumably? For example: * https://travis-ci.org/rust-lang/rust/jobs/186736745 This commit spawns the emulator in the same way as buildbot with `nohup` to hope that it goes into the background successfully, followed by a `wait-for-device` command. I'm not actually sure if this'll fix the problems we're seeing, but I figure it can't hurt to test out.	2016-12-29 02:44:28 +00:00
Nick Cameron	9c89166611	Restore --crate-type=metadata as an alias for --crate-type=rlib,--emit=metadata + a warning	2016-12-29 13:24:46 +13:00
Nick Cameron	71f161c887	Add a test for #38273 Closes 38273	2016-12-29 13:24:46 +13:00
Nick Cameron	7720cf02e3	Change --crate-type metadata to --emit=metadata	2016-12-29 13:24:45 +13:00
bors	4ecc85beb3	Auto merge of #38449 - eddyb:lazy-10, r=nikomatsakis

[10/n] Split constants and functions' arguments into disjoint bodies.

_This is part of a series ([prev](https://github.com/rust-lang/rust/pull/38053) \| [next]()) of patches designed to rework rustc into an out-of-order on-demand pipeline model for both better feature support (e.g. [MIR-based](https://github.com/solson/miri) early constant evaluation) and incremental execution of compiler passes (e.g. type-checking), with beneficial consequences to IDE support as well. If any motivation is unclear, please ask for additional PR description clarifications or code comments._

---

Finishes the signature-body split started in #37918, namely:

* `trait` items are separated just like `impl` items were, for uniformity, closing #37712
* `static`s, `const`s (including associated ones), `enum` discriminants and array lengths get bodies
  * even the count in "repeat expressions", i.e. `n` in `[x; n]`, which fixes #24414
* arguments' patterns are moved to the bodies, with the types staying in `FnDecl`
  * `&self` now desugars to `self: &Self` instead of `self: &_` (similarly for other `self` forms)
  * `astconv`'s and metadata's (for rustdoc) informative uses are explicitly ignored for the purposes of the dep graph. this could be fixed in the future by hashing the exact information being extracted about the arguments as opposed to generating a dependency on the whole body	2016-12-28 20:19:39 +00:00
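The `&self` desugaring mentioned above is also visible at the surface-language level: `&self` is shorthand for `self: &Self`, and `&mut self` for `self: &mut Self`. A small sketch follows; the `Counter` type is hypothetical and only shows that the two spellings are interchangeable. It says nothing about the compiler internals the PR changes.

```rust
struct Counter {
    hits: u32,
}

impl Counter {
    // Shorthand receiver.
    fn get(&self) -> u32 {
        self.hits
    }

    // The explicit form that `&self` is shorthand for.
    fn get_explicit(self: &Self) -> u32 {
        self.hits
    }

    // Shorthand mutable receiver.
    fn bump(&mut self) {
        self.hits += 1;
    }

    // Explicit mutable receiver.
    fn bump_explicit(self: &mut Self) {
        self.hits += 1;
    }
}

fn main() {
    let mut c = Counter { hits: 0 };
    c.bump();
    c.bump_explicit();
    assert_eq!(c.get(), 2);
    assert_eq!(c.get_explicit(), 2);
}
```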
bors	02b22ec7bd	Auto merge of #38639 - xen0n:nightly-dist-hotfix, r=brson rustbuild: Hotfix to unbreak nightly Fixes an oversight unnoticed in #38468 that eventually broke nightly packaging. I didn't realize this until some moments ago, when I finally found out the failure is actually deterministic. Many apologies for eating 3 nightlies during the holidays. r? @alexcrichton	2016-12-28 17:27:13 +00:00
bors	371f4d6bf6	Auto merge of #38626 - redox-os:args_fix, r=alexcrichton Fix argument handling on Redox After switching the start code to be handled in libc, we are no longer passing in slices as arguments into the libstd main function. This means that handling had to be rewritten to match the unix way of doing things. Additional commits on this branch are going to be merged in this PR: https://github.com/rust-lang/rust/pull/38577#issuecomment-269138394	2016-12-28 14:33:48 +00:00
bors	469fd779ee	Auto merge of #38616 - pnkfelix:refactor-mir-dataflow-remove-ctxt, r=arielb1 Refactor mir::dataflow: remove Ctxt associated type from BitDenotation trait I no longer remember why I needed this (or thought I did). The way that the `BitDenotation` is passed around in all existing use cases (and planned future ones), the things that were in the `Ctxt` can just be part of `Self` instead. (I think ariel had been pushing me to do this back when I first put in this infrastructure; it took me a while to see how much of a pain the `Ctxt` was causing.)	2016-12-28 09:42:46 +00:00
Eduard-Mihai Burtescu	ee0ea95343	rustdoc: pretty-print nested bodies in inlined constants.	2016-12-28 11:29:21 +02:00
Eduard-Mihai Burtescu	4aae835803	rustc: always print nested nodes where a HIR map is available.	2016-12-28 11:29:20 +02:00
Eduard-Mihai Burtescu	f64e73b6ec	rustc: simplify constant cross-crate loading and rustc_passes::consts.	2016-12-28 11:29:19 +02:00
Eduard-Mihai Burtescu	f89856be6c	rustc: move function arguments into hir::Body.	2016-12-28 11:29:19 +02:00
Eduard-Mihai Burtescu	e64f64a2fc	rustc: separate bodies for static/(associated)const and embedded constants.	2016-12-28 11:27:57 +02:00
Eduard-Mihai Burtescu	864928297d	rustc: separate TraitItem from their parent Item, just like ImplItem.	2016-12-28 11:21:45 +02:00
Eduard-Mihai Burtescu	6ebb6fdbee	hir: lower `ImplicitSelf` to resolved `Self` TyQPath's.	2016-12-28 11:21:45 +02:00
bors	17f1fba353	Auto merge of #38589 - aidanhs:aphs-stage0-rustdoc-test, r=alexcrichton Teach `rustdoc --test` about `--sysroot`, pass it when testing rust This permits rustdoc tests to work in stage0. Logical continuation of #36586. Snippet from https://github.com/rust-lang/rust/issues/38575#issuecomment-269090724: > it should actually be possible to run all the libstd tests immediately after creating std of stage0-out - there's no reason to build librustc at all if you've just made a change to (for example) libcollections, `./x.py test src/libcollections --stage 0 -v --incremental` should just work This PR makes it so (or appears to in my testing). r? @alexcrichton	2016-12-28 07:05:07 +00:00
bors	0807104c8f	Auto merge of #38579 - whitequark:min_atomic_width, r=alexcrichton Add a min_atomic_width target option, like max_atomic_width Rationale: some ISAs, e.g. OR1K, do not have atomic instructions for byte and halfword access, and at the same time do not have a fixed endianness, which makes it unreasonable to implement these through word-sized atomic accesses.	2016-12-28 04:12:11 +00:00
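To see why endianness matters for the rationale above, consider emulating a byte-sized atomic store with a compare-and-swap loop on the containing 32-bit word. The bit position of a given byte depends on the byte order, so the emulation needs to know it. This is a sketch under that assumption: `byte_shift` and `store_byte` are hypothetical helpers, and the byte order is passed in as a parameter purely for illustration.

```rust
use std::sync::atomic::{AtomicU32, Ordering};

/// Bit offset of byte `index` (0..4, in memory order) inside a 32-bit word.
/// The result differs between little- and big-endian layouts.
fn byte_shift(index: usize, big_endian: bool) -> u32 {
    assert!(index < 4);
    let i = index as u32;
    if big_endian {
        (3 - i) * 8
    } else {
        i * 8
    }
}

/// Emulates an atomic byte store with a CAS loop on the containing word.
fn store_byte(word: &AtomicU32, index: usize, value: u8, big_endian: bool) {
    let shift = byte_shift(index, big_endian);
    let mask = 0xffu32 << shift;
    let mut old = word.load(Ordering::Relaxed);
    loop {
        // Replace only the selected byte; keep the other three unchanged.
        let new = (old & !mask) | ((value as u32) << shift);
        match word.compare_exchange_weak(old, new, Ordering::SeqCst, Ordering::Relaxed) {
            Ok(_) => return,
            Err(current) => old = current,
        }
    }
}

fn main() {
    let word = AtomicU32::new(0x1122_3344);
    store_byte(&word, 0, 0xAA, false); // little-endian: byte 0 is the low byte
    assert_eq!(word.load(Ordering::SeqCst), 0x1122_33AA);
    store_byte(&word, 0, 0xBB, true); // big-endian: byte 0 is the high byte
    assert_eq!(word.load(Ordering::SeqCst), 0xBB22_33AA);
}
```

The same byte index touches different bits under the two layouts, so a target that can run in either byte order cannot hard-code one shift.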
Wang Xuerui	cf89453506	rustbuild: fix host-only rules ignoring targets in dist steps `arr` is the actual list of targets participating in steps construction, but due to #38468 the hosts array now consists of only the build triple for the `dist` steps, hence all non-build-triple targets are lost for the host-only rules. Fix this by using the original non-shadowed hosts array in `arr` calculation. This should unbreak the nightly packaging process. Fixes #38637.	2016-12-28 10:54:13 +08:00
Niko Matsakis	ad747c5869	propagate TIME_DEPTH to the helper threads for -Z time-passes	2016-12-27 21:35:34 -05:00
bors	a9ab778815	Auto merge of #38479 - michaelwoerister:extern_mod_ich, r=nikomatsakis ICH: Fix and test foreign mod hashing. r? @nikomatsakis	2016-12-28 01:30:31 +00:00
bors	314c28b729	Auto merge of #38329 - ollie27:rustdoc_stab_em_div, r=steveklabnik rustdoc: Fix invalid HTML in stability notices `em` tags cannot contain `p` tags so use `div`s instead of `em`s as the Markdown will create `p` tags.	2016-12-27 21:10:31 +00:00
Wang Xuerui	3991046d52	rustbuild: clarify comment on target array calculation The comment touched, as originally written, only concerned itself with the `test` steps. However, since #38468 the `arr` variable actually has gained an indirect relationship with the `dist` steps too. The comment failed to convey the extra meaning, contributing to the misunderstanding which eventually led to #38637. Fix that by moving the comment into the right place near the relevant condition, and properly documenting `arr`'s purpose.	2016-12-28 03:41:09 +08:00
Wang Xuerui	9d3616f78d	rustbuild: get an empty slice the straight-forward way	2016-12-28 03:41:05 +08:00
bors	82801b552e	Auto merge of #38600 - arielb1:dead-drop, r=eddyb clear discriminant drop flag at the bottom of a drop ladder Fixes #38437. Beta-nominating because serious I-wrong. r? @eddyb	2016-12-27 18:25:22 +00:00
Jeremy Soller	88df0e3918	Fix arguments on Redox	2016-12-27 10:55:41 -07:00
Robin Kruppe	d44bcbf9ee	Accept ninja-build binary in place of ninja	2016-12-27 16:36:53 +01:00
bors	86896ba0be	Auto merge of #38577 - redox-os:master, r=alexcrichton Add Debug to OpenOptions and DirBuilder This fixes the build on Redox as the platform independent structs now implement Debug.	2016-12-27 14:15:29 +00:00
bors	d849b13267	Auto merge of #38574 - Mark-Simulacrum:box-free-unspecialize, r=eddyb Remove special case for Box<ZST> in trans Remove extra lang item, `exchange_free`; use `box_free` instead. Trans used to insert code equivalent to `box_free` in a wrapper around `exchange_free`, and that code is now removed from trans. Fixes #37710.	2016-12-27 11:32:39 +00:00
bors	9351c2c8e7	Auto merge of #38537 - jseyfried:fix_rustdoc_ice, r=nrc Fix ICE in rustdoc Fixes #38237. r? @nrc	2016-12-27 05:07:34 +00:00
bors	c2ee32ab45	Auto merge of #38507 - alexcrichton:travis-lldb, r=brson travis: Update the OSX image we run tests in The current image is `xcode7.3`, Travis's current default. Unfortunately this has a version of LLDB which doesn't support debuginfo-lldb tests (see #32520), so we're not running LLDB tests on Travis yet. This switches us to the newest image from Travis, `xcode8.2`, which should have a newer version of LLDB we can run tests against.	2016-12-27 02:18:20 +00:00
Jorge Aparicio	aac5ff7664	fix ui test	2016-12-26 21:06:23 -05:00
Jorge Aparicio	18d49288d5	PTX support - `--emit=asm --target=nvptx64-nvidia-cuda` can be used to turn a crate into a PTX module (a `.s` file). - intrinsics like `__syncthreads` and `blockIdx.x` are exposed as `"platform-intrinsics"`. - "cabi" has been implemented for the nvptx and nvptx64 architectures. i.e. `extern "C"` works. - a new ABI, `"ptx-kernel"`. That can be used to generate "global" functions. Example: `extern "ptx-kernel" fn kernel() { .. }`. All other functions are "device" functions.	2016-12-26 21:06:23 -05:00
Mark Simulacrum	ca115dd083	Remove extra lang item, exchange_free; use box_free instead. Trans used to insert code equivalent to box_free in a wrapper around exchange_free, and that code is now removed from trans.	2016-12-26 17:13:51 -07:00
Alex Crichton	dad0076569	travis: Update the OSX image we run tests in The current image is `xcode7.3`, Travis's current default. Unfortunately this has a version of LLDB which doesn't support debuginfo-lldb tests (see #32520), so we're not running LLDB tests on Travis yet. This switches us to the newest image from Travis, `xcode8.2`, which should have a newer version of LLDB we can run tests against.	2016-12-26 15:40:22 -08:00
bors	77f7c7aaf5	Auto merge of #38274 - elahn:windows-readconsole-ctrl-z, r=alexcrichton Ctrl-Z returns from Stdin.read() when reading from the console on Windows Fixes #19914. Fixes read(), read_to_string(), read_to_end(), etc. r? @alexcrichton	2016-12-26 23:33:21 +00:00
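Callers of `Read::read` conventionally treat a return value of `Ok(0)` as end of input, which is what lets `read_to_string()` and `read_to_end()` terminate. The sketch below shows such a loop. It reads from an in-memory slice rather than a real console so that it runs anywhere, and how exactly Ctrl-Z is surfaced by the Windows console code is an assumption not shown here. `read_until_eof` is a hypothetical helper.

```rust
use std::io::{self, Read};

/// Reads from `input` until `read` returns `Ok(0)` and returns all bytes seen.
/// Unlike `read_to_end`, this sketch does not retry on `ErrorKind::Interrupted`.
fn read_until_eof<R: Read>(mut input: R) -> io::Result<Vec<u8>> {
    let mut out = Vec::new();
    let mut buf = [0u8; 4]; // deliberately small to force several reads
    loop {
        let n = input.read(&mut buf)?;
        if n == 0 {
            return Ok(out);
        }
        out.extend_from_slice(&buf[..n]);
    }
}

fn main() -> io::Result<()> {
    let data = read_until_eof(&b"hello, console"[..])?;
    assert_eq!(data, b"hello, console".to_vec());
    Ok(())
}
```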

59971 Commits