Sync rustc_codegen_gcc subtree #148481

GuillaumeGomez · 2025-11-04T15:02:34Z

r? ghost

As opposed to passing it around through Result.

…update_cg_gcc_2025-08-26

Update GCC version

…lsewhere A lot of places had special handling just in case they would get an allocator module even though most of these places could never get one or would have a trivial implementation for the allocator module. Moving all handling of the allocator module to a single place simplifies things a fair bit.

Signed-off-by: dvermd <[email protected]>

It is only used within cg_llvm.

It is always false nowadays. ThinLTO summary writing is instead done by llvm_optimize.

Signed-off-by: dvermd <[email protected]>

Misc LTO cleanups

Follow up to rust-lang#145955.

* Remove want_summary argument from `prepare_thin`. Since rust-lang#133250 ThinLTO summary writing is instead done by `llvm_optimize`.
* Two minor cleanups

We need a different attribute than `rustc_align` because unstable attributes are tied to their feature (we can't have two unstable features use the same unstable attribute). Otherwise this uses all of the same infrastructure as `#[rustc_align]`.

…lmann,ralfjung,traviscross Implement `#[rustc_align_static(N)]` on `static`s

Tracking issue: rust-lang#146177

```rust
#![feature(static_align)]

#[rustc_align_static(64)]
static SO_ALIGNED: u64 = 0;
```

We need a different attribute than `rustc_align` because unstable attributes are tied to their feature (we can't have two unstable features use the same unstable attribute). Otherwise this uses all of the same infrastructure as `#[rustc_align]`.

r? `@traviscross`
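For readers who want to see what a 64-byte-aligned static looks like without a nightly toolchain, here is a minimal sketch on stable Rust. It does not use `#[rustc_align_static]`. It swaps in a different technique, a `#[repr(align(64))]` wrapper type, and the names `Aligned64` and `is_aligned_to` are hypothetical, introduced only for this illustration.

```rust
/// Stable stand-in (assumption, not the attribute from this commit):
/// the alignment is attached to a wrapper type rather than to the static itself.
#[repr(align(64))]
pub struct Aligned64(pub u64);

/// A static whose address is a multiple of 64 because of its type's alignment.
pub static SO_ALIGNED: Aligned64 = Aligned64(0);

/// Hypothetical helper: returns true if `value` lives at an address
/// that is a multiple of `align`.
pub fn is_aligned_to<T>(value: &T, align: usize) -> bool {
    (value as *const T as usize) % align == 0
}
```

Calling `is_aligned_to(&SO_ALIGNED, 64)` returns `true`. One difference from the attribute worth noting: the wrapper changes the static's type and pads its size up to 64 bytes, whereas the example in the commit message keeps the static as a plain `u64`.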

…n codegen

…anymore

…9_16 Sync from rust 2025/09/16

…ethercote Add panic=immediate-abort

MCP: rust-lang/compiler-team#909

This adds a new panic strategy, `-Cpanic=immediate-abort`. This panic strategy essentially just codifies use of `-Zbuild-std-features=panic_immediate_abort`. This PR is intended to just set up infrastructure, and while it will change how the compiler is invoked for users of the feature, there should be no other impacts.

In many parts of the compiler, `PanicStrategy::ImmediateAbort` behaves just like `PanicStrategy::Abort`, because actually most parts of the compiler just mean to ask "can this unwind?" so I've added a helper function so we can say `sess.panic_strategy().unwinds()`.

The panic and unwind strategies have some level of compatibility, which mostly means that we can pre-compile the sysroot with unwinding panics then the sysroot can be linked with aborting panics later. The immediate-abort strategy is all-or-nothing, enforced by `compiler/rustc_metadata/src/dependency_format.rs` and this is tested for in `tests/ui/panic-runtime/`. We could _technically_ be more compatible with the other panic strategies, but immediately-aborting panics primarily exist for users who want to eliminate all the code size responsible for the panic runtime. I'm open to other use cases if people want to present them, but not right now. This PR is already large.

`-Cpanic=immediate-abort` sets both `cfg(panic = "immediate-abort")` _and_ `cfg(panic = "abort")`. bjorn3 pointed out that people may be checking for the abort cfg to ask if panics will unwind, and also the sysroot feature this is replacing used to require `-Cpanic=abort` so this seems like a good back-compat step. At least for the moment. Unclear if this is a good idea indefinitely. I can imagine this being confusing.

The changes to the standard library attributes are purely mechanical. Apart from that, I removed an `unsafe` we haven't needed for a while since the `abort` intrinsic became safe, and I've added a helpful diagnostic for people trying to use the old feature.

To test that `-Cpanic=immediate-abort` conflicts with other panic strategies, I've beefed up the core-stubs infrastructure a bit. There is now a separate attribute to set flags on it. I've added a test that this produces the desired codegen, called `tests/run-make-cargo/panic-immediate-abort-codegen/` and also a separate run-make-cargo test that checks that we can build a binary.
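The "can this unwind?" helper and the double `cfg` behaviour described above can be pictured with a small standalone sketch. This is not the compiler's actual code: the enum below is a simplified stand-in, the `Unwind` variant is assumed to complete the picture, and `cfg_values` is a hypothetical helper added only for illustration.

```rust
/// Simplified stand-in for the compiler's panic strategy enum (assumption:
/// the real type has more context and lives inside rustc).
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum PanicStrategy {
    Unwind,
    Abort,
    ImmediateAbort,
}

impl PanicStrategy {
    /// Answers the question most call sites care about: "can this unwind?"
    /// Both aborting strategies answer no.
    pub fn unwinds(self) -> bool {
        matches!(self, PanicStrategy::Unwind)
    }

    /// Hypothetical helper: the `cfg(panic = "...")` values set for a strategy.
    /// Per the description, immediate-abort sets both "immediate-abort" and
    /// "abort" so existing `cfg(panic = "abort")` checks keep working.
    pub fn cfg_values(self) -> &'static [&'static str] {
        match self {
            PanicStrategy::Unwind => &["unwind"],
            PanicStrategy::Abort => &["abort"],
            PanicStrategy::ImmediateAbort => &["immediate-abort", "abort"],
        }
    }
}
```

With this shape, a call site that only needs to know about unwinding writes `strategy.unwinds()` instead of matching on every variant, which is why `ImmediateAbort` and `Abort` behave alike in most places.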

…monomorphization Unify zero-length and oversized SIMD errors

…, r=lcnr,RalfJung Add an attribute to check the number of lanes in a SIMD vector after monomorphization

Allows std::simd to drop the `LaneCount<N>: SupportedLaneCount` trait and maintain good error messages.

Also, extends rust-lang#145967 by including spans in layout errors for all ADTs.

r? `@RalfJung`

cc `@workingjubilee` `@programmerjake`

TypeTree support in autodiff

# TypeTrees for Autodiff

## What are TypeTrees?

Memory layout descriptors for Enzyme. Tell Enzyme exactly how types are structured in memory so it can compute derivatives efficiently.

## Structure

```rust
TypeTree(Vec<Type>)

Type {
    offset: isize,  // byte offset (-1 = everywhere)
    size: usize,    // size in bytes
    kind: Kind,     // Float, Integer, Pointer, etc.
    child: TypeTree // nested structure
}
```

## Example: `fn compute(x: &f32, data: &[f32]) -> f32`

**Input 0: `x: &f32`**

```rust
TypeTree(vec![Type {
    offset: -1, size: 8, kind: Pointer,
    child: TypeTree(vec![Type {
        offset: -1, size: 4, kind: Float,
        child: TypeTree::new()
    }])
}])
```

**Input 1: `data: &[f32]`**

```rust
TypeTree(vec![Type {
    offset: -1, size: 8, kind: Pointer,
    child: TypeTree(vec![Type {
        offset: -1, size: 4, kind: Float, // -1 = all elements
        child: TypeTree::new()
    }])
}])
```

**Output: `f32`**

```rust
TypeTree(vec![Type {
    offset: -1, size: 4, kind: Float,
    child: TypeTree::new()
}])
```

## Why Needed?

- Enzyme can't deduce complex type layouts from LLVM IR
- Prevents slow memory pattern analysis
- Enables correct derivative computation for nested structures
- Tells Enzyme which bytes are differentiable vs metadata

## What Enzyme Does With This Information:

Without TypeTrees (current state):

```llvm
; Enzyme sees generic LLVM IR:
define float @distance(ptr* %p1, ptr* %p2) {
    ; Has to guess what these pointers point to
    ; Slow analysis of all memory operations
    ; May miss optimization opportunities
}
```

With TypeTrees (our implementation):

```llvm
define "enzyme_type"="{[]:Float@float}" float @distance(
    ptr "enzyme_type"="{[]:Pointer}" %p1,
    ptr "enzyme_type"="{[]:Pointer}" %p2
) {
    ; Enzyme knows exact type layout
    ; Can generate efficient derivative code directly
}
```

# TypeTrees - Offset and -1 Explained

## Type Structure

```rust
Type {
    offset: isize,  // WHERE this type starts
    size: usize,    // HOW BIG this type is
    kind: Kind,     // WHAT KIND of data (Float, Int, Pointer)
    child: TypeTree // WHAT'S INSIDE (for pointers/containers)
}
```

## Offset Values

### Regular Offset (0, 4, 8, etc.)

**Specific byte position within a structure**

```rust
struct Point {
    x: f32,  // offset 0, size 4
    y: f32,  // offset 4, size 4
    id: i32, // offset 8, size 4
}
```

TypeTree for `&Point` (internal representation):

```rust
TypeTree(vec![
    Type { offset: 0, size: 4, kind: Float },  // x at byte 0
    Type { offset: 4, size: 4, kind: Float },  // y at byte 4
    Type { offset: 8, size: 4, kind: Integer } // id at byte 8
])
```

Generates LLVM:

```llvm
"enzyme_type"="{[]:Float@float}"
```

### Offset -1 (Special: "Everywhere")

**Means "this pattern repeats for ALL elements"**

#### Example 1: Array `[f32; 100]`

```rust
TypeTree(vec![Type {
    offset: -1,  // ALL positions
    size: 4,     // each f32 is 4 bytes
    kind: Float, // every element is float
}])
```

Instead of listing 100 separate Types with offsets `0,4,8,12...396`

#### Example 2: Slice `&[i32]`

```rust
// Pointer to slice data
TypeTree(vec![Type {
    offset: -1, size: 8, kind: Pointer,
    child: TypeTree(vec![Type {
        offset: -1, // ALL slice elements
        size: 4,    // each i32 is 4 bytes
        kind: Integer
    }])
}])
```

#### Example 3: Mixed Structure

```rust
struct Container {
    header: i64,       // offset 0
    data: [f32; 1000], // offset 8, but elements use -1
}
```

```rust
TypeTree(vec![
    Type { offset: 0, size: 8, kind: Integer }, // header
    Type {
        offset: 8, size: 4000, kind: Pointer,
        child: TypeTree(vec![Type {
            offset: -1, size: 4, kind: Float // ALL array elements
        }])
    }
])
```
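The snippets in the TypeTree description are abbreviated, so here is a minimal, compilable sketch of the same data structure. It is an illustration under stated assumptions, not the implementation in rustc: the `Kind` enum is cut down to the three kinds named above, and `leaf`, `point_tree`, `f32_elements_tree`, and `explicit_offsets` are hypothetical helpers. `explicit_offsets` shows what a single `-1` entry stands for by expanding it into one byte offset per element.

```rust
/// Cut-down kind enum (assumption: the real one has more variants).
#[derive(Clone, Debug, PartialEq, Eq)]
pub enum Kind {
    Float,
    Integer,
    Pointer,
}

/// A list of typed regions, as in the description.
#[derive(Clone, Debug, PartialEq, Eq)]
pub struct TypeTree(pub Vec<Type>);

#[derive(Clone, Debug, PartialEq, Eq)]
pub struct Type {
    pub offset: isize, // byte offset, -1 means "everywhere"
    pub size: usize,   // size in bytes
    pub kind: Kind,
    pub child: TypeTree, // nested structure, empty for scalars
}

impl TypeTree {
    pub fn new() -> Self {
        TypeTree(Vec::new())
    }
}

/// Hypothetical helper: an entry with no nested structure.
pub fn leaf(offset: isize, size: usize, kind: Kind) -> Type {
    Type { offset, size, kind, child: TypeTree::new() }
}

/// Fields of `struct Point { x: f32, y: f32, id: i32 }` at explicit offsets.
pub fn point_tree() -> TypeTree {
    TypeTree(vec![
        leaf(0, 4, Kind::Float),
        leaf(4, 4, Kind::Float),
        leaf(8, 4, Kind::Integer),
    ])
}

/// Elements of an `f32` array: one entry with offset -1 covers all of them.
pub fn f32_elements_tree() -> TypeTree {
    TypeTree(vec![leaf(-1, 4, Kind::Float)])
}

/// Hypothetical helper: list the byte offsets a tree describes, expanding
/// each -1 entry into `len` consecutive elements of that entry's size.
pub fn explicit_offsets(tree: &TypeTree, len: usize) -> Vec<usize> {
    let mut out = Vec::new();
    for ty in &tree.0 {
        if ty.offset == -1 {
            out.extend((0..len).map(|i| i * ty.size));
        } else {
            out.push(ty.offset as usize);
        }
    }
    out
}
```

For example, `explicit_offsets(&f32_elements_tree(), 100)` yields the 100 offsets `0, 4, 8, 12, ..., 396` that the single `-1` entry replaces, while `explicit_offsets(&point_tree(), 1)` yields the three field offsets `0, 4, 8` unchanged.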

GuillaumeGomez · 2025-11-07T10:52:50Z

Sending a PR.

GuillaumeGomez · 2025-11-07T10:58:11Z

Opened rust-lang/ci-mirrors#17.

GuillaumeGomez · 2025-11-07T13:58:36Z

Restarted CI, let's see if it's happier now.

GuillaumeGomez · 2025-11-07T15:37:16Z

Seems not.

@Kobzol when/how are files uploaded to our CI mirrors?

Kobzol · 2025-11-07T19:56:17Z

Immediately when the PR is merged. The file has been uploaded to the mirrors correctly: https://ci-mirrors.rust-lang.org/rustc/gcc/gmp-6.3.0.tar.bz2 and it was downloaded in the latest CI run of this PR. The SHA512 hash doesn't seem to match though:

gmp-6.3.0.tar.bz2: FAILED
  sha512sum: WARNING: 1 computed checksum did NOT match
  error: Cannot verify integrity of possibly corrupted file gmp-6.3.0.tar.bz2

thesamesam · 2025-11-07T20:07:11Z

https://github.com/rust-lang/ci-mirrors/pull/17/files#r2505298478

GuillaumeGomez · 2025-11-08T10:50:14Z

Sent rust-lang/ci-mirrors#18 to fix it.

GuillaumeGomez · 2025-11-12T19:37:54Z

It'd be much more convenient if we knew ahead of time all the missing files... :-/

Kobzol · 2025-11-12T19:39:24Z

The files are listed in the GCC source code, you can check if they are present on the mirrors 😆 But yeah, this is kinda annoying.

GuillaumeGomez · 2025-11-12T19:41:40Z

Gonna take a broader look tomorrow to see all I missed. In the meantime I'll send a PR to add mpc.

rustbot · 2025-11-13T14:23:33Z

⚠️ Warning ⚠️

Some commits in this PR modify submodules.

If this was not intentional, see "I changed a submodule on accident" in the rustc dev guide.

The following commits have merge commits (commits with multiple parents) in your changes. We have a no merge policy so these commits will need to be removed for this pull request to be merged.
- 06a51e4
- 13230c0
- 1c8a353
- 1f6e396
- 2426181
- 28d461d
- 346f79b
- 35cd7f5
- 36d02fc
- 3a8b6a1
- 3f93968
- 44790a9
- 465d04a
- 5c50a42
- 5cb8d34
- 62c1eea
- 655a519
- 68c97a8
- 80340cd
- 825f2bf
- 867a310
- 8e737d6
- 900fcfd
- 93b5b1b
- 97e1942
- 9cc826c
- a949fa1
- b25c2bf
- ca3e2ea
- d2ade4e
- d5e9cc5
- e785c50
- e7b5847
- f870dd2
You can start a rebase with the following commands:
```
$ # rebase
$ git pull --rebase https://github.com/rust-lang/rust.git main
$ git push --force-with-lease
```

GuillaumeGomez · 2025-11-13T15:41:29Z

Finally! \o/

@bors r+ p=1 rollup=never

bors · 2025-11-13T15:41:33Z

📌 Commit 866de5f has been approved by GuillaumeGomez

It is now in the queue for this repository.

bors · 2025-11-13T18:00:07Z

⌛ Testing commit 866de5f with merge 2286e5d...

bors · 2025-11-13T21:14:12Z

☀️ Test successful - checks-actions
Approved by: GuillaumeGomez
Pushing 2286e5d to main...

github-actions · 2025-11-13T21:18:39Z

What is this?

This is an experimental post-merge analysis report that shows differences in test outcomes between the merged PR and its parent PR.

Comparing af5c5b7 (parent) -> 2286e5d (this PR)

Test differences

Stage 2

[ui] tests/ui/lto/lto-global-allocator.rs: pass -> ignore (gcc backend is marked as ignore) (J0)

Job group index

J0: x86_64-gnu-gcc

Test dashboard

Run

cargo run --manifest-path src/ci/citool/Cargo.toml -- \
    test-dashboard 2286e5d224b3413484cf4f398a9f078487e7b49d --output-dir test-dashboard

And then open test-dashboard/index.html in your browser to see an overview of all executed tests.

Job duration changes

x86_64-gnu-llvm-20: 2408.7s -> 3415.5s (+41.8%)
x86_64-gnu-gcc: 3081.6s -> 4038.1s (+31.0%)
dist-aarch64-apple: 5534.5s -> 7029.2s (+27.0%)
dist-apple-various: 4430.9s -> 5508.4s (+24.3%)
pr-check-1: 1303.1s -> 1503.5s (+15.4%)
test-various: 6618.1s -> 5842.4s (-11.7%)
x86_64-rust-for-linux: 2488.7s -> 2750.2s (+10.5%)
x86_64-gnu-llvm-20-3: 6264.1s -> 5643.3s (-9.9%)
tidy: 150.3s -> 164.3s (+9.3%)
i686-gnu-2: 5004.4s -> 5416.2s (+8.2%)

How to interpret the job duration changes?

Job durations can vary a lot, based on the actual runner instance
that executed the job, system noise, invalidated caches, etc. The table above is provided
mostly for t-infra members, for simpler debugging of potential CI slow-downs.

rust-timer · 2025-11-13T23:25:51Z

Finished benchmarking commit (2286e5d): comparison URL.

Overall result: ✅ improvements - no action needed

@rustbot label: -perf-regression

Instruction count

Our most reliable metric. Used to determine the overall result above. However, even this metric can be noisy.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | - | - | 0 |
| Regressions ❌ (secondary) | - | - | 0 |
| Improvements ✅ (primary) | - | - | 0 |
| Improvements ✅ (secondary) | -1.1% | [-1.4%, -0.2%] | 7 |
| All ❌✅ (primary) | - | - | 0 |

Max RSS (memory usage)

Results (secondary 3.2%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | - | - | 0 |
| Regressions ❌ (secondary) | 3.2% | [3.2%, 3.2%] | 1 |
| Improvements ✅ (primary) | - | - | 0 |
| Improvements ✅ (secondary) | - | - | 0 |
| All ❌✅ (primary) | - | - | 0 |

Cycles

Results (secondary 2.3%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | - | - | 0 |
| Regressions ❌ (secondary) | 2.3% | [2.3%, 2.3%] | 1 |
| Improvements ✅ (primary) | - | - | 0 |
| Improvements ✅ (secondary) | - | - | 0 |
| All ❌✅ (primary) | - | - | 0 |

Binary size

This benchmark run did not return any relevant results for this metric.

Bootstrap: 475.694s -> 474.605s (-0.23%)
Artifact size: 388.39 MiB -> 388.41 MiB (0.00%)

bjorn3 and others added 30 commits August 24, 2025 11:20

Directly raise fatal errors inside the codegen backends

0415c11

As opposed to passing it around through Result.

Merge commit 'feb42827f11a7ae241ceecc81e9ae556fb6ba214' into subtree-…

a224101

…update_cg_gcc_2025-08-26

Fix sync conflict

059f00d

fix target-pointer-width in tests

26736f9

Update GCC version

01ef552

Merge pull request rust-lang#755 from rust-lang/update-gcc-2025-08-28

35cd7f5

Update GCC version

cg_gcc: run run-make-cargo tests

fada75d

Rework some build_system/utils return value

03aa02f

Signed-off-by: dvermd <[email protected]>

Remove thin_link_data method from ThinBufferMethods

bd33abf

It is only used within cg_llvm.

Remove want_summary argument from prepare_thin

79955e1

It is always false nowadays. ThinLTO summary writing is instead done by llvm_optimize.

Always pass git options to run_command

bd303ca

Signed-off-by: dvermd <[email protected]>

Rollup merge of rust-lang#146209 - bjorn3:lto_refactors5, r=dianqk

d5e9cc5

Misc LTO cleanups

Follow up to rust-lang#145955.

* Remove want_summary argument from `prepare_thin`. Since rust-lang#133250 ThinLTO summary writing is instead done by `llvm_optimize`.
* Two minor cleanups

erase_regions to erase_and_anonymize_regions

a211d63

Remove unreachable unsized arg handling in store_fn_arg/store_arg i…

b5b58d1

…n codegen

Merge branch 'master' into sync_from_rust_2025_09_16

3f93968

Update to nightly-2025-09-16

dbc3ace

Ignore failing test

b072f38

added typetree support for memcpy

8ecf880

Switch to Ubuntu Plucky repository because Oracular is not supported …

ee849b0

…anymore

Disable funnel shift tests since they fail

7c12d9d

Merge pull request rust-lang#760 from rust-lang/sync_from_rust_2025_0…

5c50a42

…9_16 Sync from rust 2025/09/16

Support ctr and lr as clobber-only registers in PowerPC inline assembly

b9a2e04

Add panic=immediate-abort

8f24436

Add an attribute to check the number of lanes in a SIMD vector after …

994d3e1

…monomorphization Unify zero-length and oversized SIMD errors

GuillaumeGomez mentioned this pull request Nov 7, 2025

Add new gcc.toml entry file for gmp 6.3.0 GuillaumeGomez/ci-mirrors#1

Closed

GuillaumeGomez mentioned this pull request Nov 7, 2025

Add new gcc.toml entry file for gmp 6.3.0 rust-lang/ci-mirrors#17

Merged

antoyo mentioned this pull request Nov 11, 2025

Split LLVM intrinsic abi handling from the rest of the abi handling #148533

Open

bjorn3 mentioned this pull request Nov 11, 2025

Use let...else instead of match foo { ... _ => return }; and if let ... else return #148837

Open

GuillaumeGomez mentioned this pull request Nov 12, 2025

Add mpfr-4.2.2 file rust-lang/ci-mirrors#20

Merged

GuillaumeGomez mentioned this pull request Nov 12, 2025

Add mpc-1.3.1 file rust-lang/ci-mirrors#21

Merged

Ignore tests/ui/lto/lto-global-allocator.rs for GCC backend

866de5f

bors added the S-waiting-on-bors label and removed the S-waiting-on-author label Nov 13, 2025

bors added the merged-by-bors label Nov 13, 2025

bors merged commit 2286e5d into rust-lang:main Nov 13, 2025
12 checks passed

rustbot added this to the 1.93.0 milestone Nov 13, 2025

GuillaumeGomez deleted the subtree-update_cg_gcc_2025-11-04 branch November 14, 2025 10:20

28 participants
