mikros/rust - rust - Gitea.pterpstra.com

Author	SHA1	Message	Date
bors	d2c795932b	Auto merge of #38837 - eddyb:issue-38074, r=nikomatsakis Allow projections to be promoted to constants in MIR. This employs the `LvalueContext` additions by @pcwalton to properly extend the MIR promotion of temporaries to allow projections (field accesses, indexing and dereferences) on said temporaries. It's needed both for parity with the old constant qualification logic (for current borrowck) and because it fixes #38074. The former is required for soundness if we accept the RFC for promoting rvalues to `'static` constants. That is, until we get MIR borrowck and the same source of truth will be used for both checks and codegen.	2017-01-08 15:51:49 +00:00
bors	cbf88730e7	Auto merge of #38813 - eddyb:lazy-11, r=nikomatsakis [11/n] Separate ty::Tables into one per each body. _This is part of a series ([prev](https://github.com/rust-lang/rust/pull/38449) \| [next]()) of patches designed to rework rustc into an out-of-order on-demand pipeline model for both better feature support (e.g. [MIR-based](https://github.com/solson/miri) early constant evaluation) and incremental execution of compiler passes (e.g. type-checking), with beneficial consequences to IDE support as well. If any motivation is unclear, please ask for additional PR description clarifications or code comments._ <hr> In order to track the results of type-checking and inference for incremental recompilation, they must be stored separately for each function or constant value, instead of lumped together. These side-`Tables` also have to be tracked by various passes, as they visit through bodies (all of which have `Tables`, even if closures share the ones from their parent functions). This is usually done by switching a `tables` field in an override of `visit_nested_body` before recursing through `visit_body`, to the relevant one and then restoring it - however, in many cases the nesting is unnecessary and creating the visitor for each body in the crate and then visiting that body would be a much cleaner solution. To simplify handling of inlined HIR & its side-tables, their `NodeId` remapping and entries in the HIR map were fully stripped out, which means that `NodeId`s from inlined HIR must not be used where a local `NodeId` is expected. It might be possible to make the nodes (`Expr`, `Block`, `Pat`, etc.) that only show up within a `Body` have IDs that are scoped to that `Body`, which would also allow `Tables` to use `Vec`s. That last part also fixes #38790 which was accidentally introduced in a previous refactor.	2017-01-08 11:36:52 +00:00
bors	7ac9d337dc	Auto merge of #38679 - alexcrichton:always-deny-warnings, r=nrc Remove not(stage0) from deny(warnings) Historically this was done to accommodate bugs in lints, but there hasn't been a bug in a lint since this feature was added which the warnings affected. Let's completely purge warnings from all our stages by denying warnings in all stages. This will also assist in tracking down `stage0` code to be removed whenever we're updating the bootstrap compiler.	2017-01-08 08:22:06 +00:00
bors	0576869085	Auto merge of #38822 - michaelwoerister:collect-fn-once-adapter, r=eddyb trans: Fix missing closure env drop-glue in trans-item collector. FnOnce adapters automatically generated by the compiler introduce a call to drop the closure environment. The collector didn't pick up on that because this drop call does not show up in MIR. That could lead to an assertion being triggered if the drop-glue for the environment wasn't instantiated via something else. Fixes #38810 cc @arielb1 r? @eddyb or @nikomatsakis	2017-01-08 04:18:32 +00:00
bors	47c8d9fdcf	Auto merge of #38798 - jsgf:fix-rpath, r=nikomatsakis rustc: use -Xlinker when specifying an rpath with ',' in it The `-Wl` option splits its parameters on commas, so if rustc specifies `-Wl,-rpath,<path>` when `<path>` contains commas, the path gets split up and the linker gets a partial path and spurious extra parameters. Gcc/clang support the more verbose `-Xlinker` option to pass options to the linker directly, so use it for comma-containing paths. Fixes issue #38795.	2017-01-08 00:10:15 +00:00
Eduard-Mihai Burtescu	cde0a7e7e0	rustc: store ty::Tables separately for each body (except closures').	2017-01-06 22:23:29 +02:00
Eduard-Mihai Burtescu	85a4a192c7	rustc: keep track of tables everywhere as if they were per-body.	2017-01-06 22:23:29 +02:00
Mark Simulacrum	97c008c398	Fix ICE on i686 when calling immediate() on OperandValue::Ref in return	2017-01-05 12:59:50 -07:00
Eduard-Mihai Burtescu	8f84e955e0	Allow projections to be promoted to constants in MIR.	2017-01-05 02:33:09 +02:00
Michael Woerister	8b94267a8c	trans: Make the trans-item collector see through VTableFnPointer.	2017-01-04 16:54:37 -05:00
Mark Simulacrum	b01b6e1d56	Fix errors introduced during rebase	2017-01-04 11:47:43 -07:00
Mark Simulacrum	21f86ba1bc	Simplify handling of dropping structs.	2017-01-04 11:38:11 -07:00
Mark Simulacrum	7dadd14d6c	Pull out downcasting into caller of iter_variant Renames iter_variant to iter_variant_fields to more clearly communicate the purpose of the function.	2017-01-04 11:38:11 -07:00
Mark Simulacrum	d25fc9ec5f	Remove extraneous setting of builder positions.	2017-01-04 11:38:11 -07:00
Mark Simulacrum	ca328e1bb4	Simplify code further	2017-01-04 11:38:11 -07:00
Mark Simulacrum	c3fe2590f5	Inline and remove Builder::entry_block	2017-01-04 11:38:10 -07:00
Mark Simulacrum	ba37c91831	Fix style nit	2017-01-04 11:38:10 -07:00
Mark Simulacrum	901984e1d1	Builder.build_new_block -> Builder.build_sibling_block	2017-01-04 11:38:10 -07:00
Mark Simulacrum	81e8137b0d	Inline trans_switch to simplify code	2017-01-04 11:38:10 -07:00
Mark Simulacrum	426c558c5a	Move trans_field_ptr and struct_field_ptr to mir/lvalue	2017-01-04 11:38:09 -07:00
Mark Simulacrum	982b8f4f49	Move trans_const to mir::constant	2017-01-04 11:37:44 -07:00
Mark Simulacrum	ea0ebe41c7	Change trans_field_ptr to utilize LvalueTy to determine discriminant.	2017-01-04 11:37:42 -07:00
Mark Simulacrum	8038489357	Use LvalueRef instead of MaybeSizedValue	2017-01-04 11:35:33 -07:00
Mark Simulacrum	4c9995a3f9	Simplify block creation in MirContext	2017-01-04 11:34:27 -07:00
Mark Simulacrum	37dd9f6c7b	Add Builder::sess and Builder::tcx methods	2017-01-04 11:34:26 -07:00
Mark Simulacrum	f67e7d6b4a	Add method, new_block, to MirContext for block construction. This makes a slow transition to block construction happening only from MirContext easier.	2017-01-04 11:34:00 -07:00
Mark Simulacrum	937e8da349	Purge FunctionContext	2017-01-04 11:33:59 -07:00
Mark Simulacrum	1be170b01a	Replace BlockAndBuilder with Builder.	2017-01-04 11:33:31 -07:00
Michael Woerister	ab8fff20d2	trans: Collect drop-glue translation item for closure env in fn-once-adapters.	2017-01-04 10:01:27 -05:00
bors	d40d01bd0e	Auto merge of #38670 - dotdash:transmute_align, r=eddyb Fix transmute::<T, U> where T requires a bigger alignment than U For transmute::<T, U> we simply pointercast the destination from a U pointer to a T pointer, without providing any alignment information, thus LLVM assumes that the destination is aligned to hold a value of type T, which is not necessarily true. This can lead to LLVM emitting machine instructions that assume said alignment, and thus cause aborts. To fix this, we need to provide the actual alignment to store_operand() and in turn to store() so they can set the proper alignment information on the stores and LLVM can emit the proper machine instructions. Fixes #32947	2017-01-04 14:26:17 +00:00
Jeremy Fitzhardinge	a8fa2cff28	rustc: use -Xlinker when specifying an rpath with ',' in it The `-Wl` option splits its parameters on commas, so if rustc specifies `-Wl,-rpath,<path>` when `<path>` contains commas, the path gets split up and the linker gets a partial path and spurious extra parameters. Gcc/clang support the more verbose `-Xlinker` option to pass options to the linker directly, so use it for comma-containing paths. Fixes rust issue #38795.	2017-01-03 11:40:48 -08:00
bors	d3a2efa14b	Auto merge of #38543 - philipc:unsized-debuginfo, r=michaelwoerister Fix debuginfo for unsized struct members The member was given the size of a fat pointer, which caused llvm to emit DWARF attributes for a 128-bit bitfield.	2017-01-02 20:17:01 +00:00
Seo Sanghyeon	b14785d3d0	Merge branch 'master' into sparc64	2017-01-01 12:40:10 +09:00
Björn Steinbrink	71a11a0b10	Fix transmute::<T, U> where T requires a bigger alignment than U For transmute::<T, U> we simply pointercast the destination from a U pointer to a T pointer, without providing any alignment information, thus LLVM assumes that the destination is aligned to hold a value of type T, which is not necessarily true. This can lead to LLVM emitting machine instructions that assume said alignment, and thus cause aborts. To fix this, we need to provide the actual alignment to store_operand() and in turn to store() so they can set the proper alignment information on the stores and LLVM can emit the proper machine instructions. Fixes #32947	2016-12-31 13:13:30 +01:00
Simonas Kazlauskas	ee69cd7925	Calculate discriminant bounds within 64 bits Since discriminants do not support i128 yet, let's just calculate the boundaries within the 64 bits that are supported. This also avoids an issue with bootstrapping on 32 bit systems due to #38727.	2016-12-31 04:55:29 +02:00
Simonas Kazlauskas	86ce3a2f7c	Further and hopefully final Windows fixes	2016-12-30 15:19:50 +01:00
Simonas Kazlauskas	208c8f58b2	Fix sign-extension in stage1 compiler	2016-12-30 15:17:30 +01:00
est31	92163f1c5e	Windows x64 ABI requires i128 params to be passed as reference	2016-12-30 15:17:29 +01:00
est31	8bcb021991	Use LLVMRustConstInt128Get on stage1 too llvm::LLVMConstIntGetZExtValue doesn't accept values with more than 64 bits. This fixes an LLVM assertion error when compiling libcore with stage1: src/llvm/include/llvm/ADT/APInt.h:1336: uint64_t llvm::APInt::getZExtValue() const: Assertion `getActiveBits() <= 64 && "Too many bits for uint64_t"' failed.	2016-12-30 15:17:27 +01:00
Simonas Kazlauskas	7a3704c500	Fix rebase fallout This commit includes manual merge conflict resolution changes from a rebase by @est31.	2016-12-30 15:17:27 +01:00
Simonas Kazlauskas	9aad2d551e	Add a way to retrieve constant value in 128 bits Fixes rebase fallout, makes code correct in presence of 128-bit constants. This commit includes manual merge conflict resolution changes from a rebase by @est31.	2016-12-30 15:17:26 +01:00
Simonas Kazlauskas	d9eb756cbf	Wrapping<i128> and attempt at LLVM 3.7 compat This commit includes manual merge conflict resolution changes from a rebase by @est31.	2016-12-30 15:17:26 +01:00
Simonas Kazlauskas	b0e55a83a8	Such large. Very 128. Much bits. This commit introduces 128-bit integers. Stage 2 builds and produces a working compiler which understands and supports 128-bit integers throughout. The general strategy used is to have a rustc_i128 module which provides aliases for iu128, equal to iu64 in stage0 and iu128 later. Since nowhere in rustc do we rely on large numbers being supported, this strategy is good enough to get past the first bootstrap stages to end up with a fully working 128-bit capable compiler. In order for this strategy to work, a number of locations had to be changed to use associated max_value/min_value instead of MAX/MIN constants, and the min_value (or was it max_value?) had to be changed to use xor instead of shift so both 64-bit and 128-bit based consteval works (the former not necessarily producing the right results in stage1). This commit includes manual merge conflict resolution changes from a rebase by @est31.	2016-12-30 15:15:44 +01:00
Alex Crichton	9b0b5b45db	Remove not(stage0) from deny(warnings) Historically this was done to accommodate bugs in lints, but there hasn't been a bug in a lint since this feature was added which the warnings affected. Let's completely purge warnings from all our stages by denying warnings in all stages. This will also assist in tracking down `stage0` code to be removed whenever we're updating the bootstrap compiler.	2016-12-29 21:07:20 -08:00
Jonathan A. Kollasch	011ebda40c	Add cabi_sparc64	2016-12-29 21:30:01 -05:00
Alex Crichton	bcfd504744	Rollup merge of #38559 - japaric:ptx2, r=alexcrichton PTX support, take 2 - You can generate PTX using `--emit=asm` and the right (custom) target, which you can then run on an NVIDIA GPU. - You can compile `core` to PTX. [Xargo] also works and it can compile some other crates like `collections` (but I doubt all of those make sense on a GPU) [Xargo]: https://github.com/japaric/xargo - You can create "global" functions, which can be "called" by the host, using the `"ptx-kernel"` ABI, e.g. `extern "ptx-kernel" fn kernel() { .. }`. Every other function is a "device" function and can only be called by the GPU. - Intrinsics like `__syncthreads()` and `blockIdx.x` are available as `"platform-intrinsics"`. These intrinsics are not in the `core` crate but any Rust user can create "bindings" to them using an `extern "platform-intrinsics"` block. See example at the end. - If you try to emit PTX with `-g` (debuginfo), you get an LLVM error. But I don't think PTX can contain debuginfo anyway so `-g` should be ignored and a warning should be printed ("`-g` doesn't work with this target" or something). - "Single source" support. You can't write a single source file that contains both host and device code. I think it should be possible to implement that outside the compiler using compiler plugins / build scripts. - The equivalent to CUDA `__shared__`, which is used to declare memory that's shared between the threads of the same block. This could be implemented using attributes: `#[shared] static mut SCRATCH_MEMORY: [f32; 64]` but hasn't been implemented yet. - Built-in targets. This PR doesn't add targets to the compiler just yet but one can create custom targets to be able to emit PTX code (see the example at the end). The idea is to have people experiment with this feature before committing to it (built-in targets are "insta-stable") - All functions must be "inlined".
IOW, the `.rlib` must always contain the LLVM bitcode of all the functions of the crate it was produced from. Otherwise, you end up with "undefined references" in the final PTX code but you won't get any linker error because no linker is involved. IOW, you'll hit a runtime error when loading the PTX into the GPU. The workaround is to use `#[inline]` on non-generic functions and to never use `#[inline(never)]` but this may not always be possible because e.g. you could be relying on third party code. - Should `--emit=asm` generate a `.ptx` file instead of a `.s` file? TL;DR Use Xargo to turn a crate into a PTX module (a `.s` file). Then pass that PTX module, as a string, to the GPU and run it. The full code is in [this repository]. This section gives an overview of how to run Rust code on an NVIDIA GPU. [this repository]: https://github.com/japaric/cuda - Create a custom target. Here's the 64-bit NVPTX target (NOTE: the comments are not valid because this is supposed to be a JSON file; remove them before you use this file): ``` js // nvptx64-nvidia-cuda.json { "arch": "nvptx64", // matches LLVM "cpu": "sm_20", // "oldest" compute capability supported by LLVM "data-layout": "e-i64:64-v16:16-v32:32-n16:32:64", "llvm-target": "nvptx64-nvidia-cuda", "max-atomic-width": 0, // LLVM errors with any other value :-( "os": "cuda", // matches LLVM "panic-strategy": "abort", "target-endian": "little", "target-pointer-width": "64", "target-vendor": "nvidia", // matches LLVM -- not required } ``` (There's a 32-bit target specification in the linked repository) - Write a kernel ``` rust extern "platform-intrinsic" { fn nvptx_block_dim_x() -> i32; fn nvptx_block_idx_x() -> i32; fn nvptx_thread_idx_x() -> i32; } /// Copies an array of `n` floating point numbers from `src` to `dst` pub unsafe extern "ptx-kernel" fn memcpy(dst: *mut f32, src: *const f32, n: usize) { let i = (nvptx_block_dim_x() as isize) .wrapping_mul(nvptx_block_idx_x() as isize) .wrapping_add(nvptx_thread_idx_x() as 
isize); if (i as usize) < n { *dst.offset(i) = *src.offset(i); } } ``` - Emit PTX code ``` $ xargo rustc --target nvptx64-nvidia-cuda --release -- --emit=asm Compiling core v0.0.0 (file://..) (..) Compiling nvptx-builtins v0.1.0 (https://github.com/japaric/nvptx-builtins) Compiling kernel v0.1.0 $ cat target/nvptx64-nvidia-cuda/release/deps/kernel-*.s // // Generated by LLVM NVPTX Back-End // .version 3.2 .target sm_20 .address_size 64 // .globl memcpy .visible .entry memcpy( .param .u64 memcpy_param_0, .param .u64 memcpy_param_1, .param .u64 memcpy_param_2 ) { .reg .pred %p<2>; .reg .s32 %r<5>; .reg .s64 %rd<12>; ld.param.u64 %rd7, [memcpy_param_2]; mov.u32 %r1, %ntid.x; mov.u32 %r2, %ctaid.x; mul.wide.s32 %rd8, %r2, %r1; mov.u32 %r3, %tid.x; cvt.s64.s32 %rd9, %r3; add.s64 %rd10, %rd9, %rd8; setp.ge.u64 %p1, %rd10, %rd7; @%p1 bra LBB0_2; ld.param.u64 %rd3, [memcpy_param_0]; ld.param.u64 %rd4, [memcpy_param_1]; cvta.to.global.u64 %rd5, %rd4; cvta.to.global.u64 %rd6, %rd3; shl.b64 %rd11, %rd10, 2; add.s64 %rd1, %rd6, %rd11; add.s64 %rd2, %rd5, %rd11; ld.global.u32 %r4, [%rd2]; st.global.u32 [%rd1], %r4; LBB0_2: ret; } ``` - Run it on the GPU ``` rust // `kernel.ptx` is the `.s` file we got in the previous step const KERNEL: &'static str = include_str!("kernel.ptx"); driver::initialize()?; let device = Device(0)?; let ctx = device.create_context()?; let module = ctx.load_module(KERNEL)?; let kernel = module.function("memcpy")?; let h_a: Vec<f32> = /* create some random data */; let h_b = vec![0.; N]; let d_a = driver::allocate(bytes)?; let d_b = driver::allocate(bytes)?; // Copy from host to GPU driver::copy(h_a, d_a)?; // Run `memcpy` on the GPU kernel.launch(d_b, d_a, N)?; // Copy from GPU to host driver::copy(d_b, h_b)?; // Verify assert_eq!(h_a, h_b); // `d_a`, `d_b`, `h_a`, `h_b` are dropped/freed here ``` --- cc @alexcrichton @brson @rkruppe > What has changed since #34195? - `core` now can be compiled into PTX. 
Which makes it very easy to turn `no_std` crates into "kernels" with the help of Xargo. - There's now a way, the `"ptx-kernel"` ABI, to generate "global" functions. The old PR required a manual step (it was a hack) to "convert" "device" functions into "global" functions. (Only "global" functions can be launched by the host) - Everything is unstable. There are no "insta stable" built-in targets this time (\*). The users have to use a custom target to experiment with this feature. Also, PTX intrinsics, like `__syncthreads` and `blockIdx.x`, are now implemented as `"platform-intrinsics"` so they no longer live in the `core` crate. (\*) I'd actually like to have in-tree targets because that makes this target more discoverable, removes the need to lug around .json files, etc. However, bundling a target with the compiler immediately puts it in the path towards stabilization. Which gives us just two cycles to find and fix any problem with the target specification. Afterwards, it becomes hard to tweak the specification because that could be a breaking change. A possible solution could be "unstable built-in targets". Basically, to use an unstable target, you'll have to also pass `-Z unstable-options` to the compiler. And unstable targets, being unstable, wouldn't be available on stable. > Why should this be merged? - To let people experiment with the feature out of tree. Having easy access to the feature (in every nightly) allows this. I also think that, as it is, it should be possible to start prototyping type-safe single source support using build scripts, macros and/or plugins. - It's a straightforward implementation. No different than adding support for any other architecture.	2016-12-29 17:26:15 -08:00
bors	e571f2d778	Auto merge of #38571 - nrc:emit-metadata-change, r=alexcrichton Change --crate-type metadata to --emit=metadata WIP	2016-12-29 11:01:11 +00:00
bors	ebc293bcd3	Auto merge of #38645 - nikomatsakis:incr-comp-fix-time-depth, r=nrc propagate TIME_DEPTH to the helper threads for -Z time-passes Currently, the timing measurements for LLVM passes and the like don't come out indented, which messes up `perf.rust-lang.org`. r? @nrc	2016-12-29 08:16:58 +00:00
Nick Cameron	b059a80d4c	Support --emit=foo,metadata	2016-12-29 18:17:07 +13:00
Nick Cameron	7720cf02e3	Change --crate-type metadata to --emit=metadata	2016-12-29 13:24:45 +13:00
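
The rpath change described in commits 47c8d9fdcf and a8fa2cff28 can be illustrated with a small sketch. This is not the code from those commits: the function names `wl_split` and `rpath_linker_args` are hypothetical, and the exact argument layout rustc emits may differ. It only models the behaviour the commit message states, namely that `-Wl` splits its parameters on commas while `-Xlinker` passes one argument through untouched.

```rust
/// Hypothetical model of how a gcc/clang driver expands a `-Wl,...` argument:
/// everything after the `-Wl,` prefix is split on commas, and each piece is
/// handed to the linker as a separate argument.
pub fn wl_split(arg: &str) -> Option<Vec<&str>> {
    arg.strip_prefix("-Wl,").map(|rest| rest.split(',').collect())
}

/// Hypothetical helper (not rustc's actual function): builds the driver
/// arguments for one rpath entry. Paths containing a comma are passed with
/// `-Xlinker`, which forwards a single argument to the linker without
/// splitting; all other paths use the compact `-Wl,-rpath,<path>` form.
pub fn rpath_linker_args(rpath: &str) -> Vec<String> {
    if rpath.contains(',') {
        vec![
            "-Xlinker".to_string(),
            "-rpath".to_string(),
            "-Xlinker".to_string(),
            rpath.to_string(),
        ]
    } else {
        vec![format!("-Wl,-rpath,{}", rpath)]
    }
}
```

With this sketch, `wl_split("-Wl,-rpath,/opt/a,b/lib")` yields the three pieces `-rpath`, `/opt/a` and `b/lib`, which is the "partial path and spurious extra parameters" problem from the commit message, whereas `rpath_linker_args("/opt/a,b/lib")` keeps the path in one piece.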


2798 Commits