skip codegen for intrinsics with big fallback bodies if backend does not need them #150605

RalfJung · 2026-01-02T17:40:51Z

This hopefully fixes the perf regression from #148478. I only added the intrinsics with big fallback bodies to the list; it doesn't seem worth the effort of going through the entire list.

Fixes #149945
Cc @scottmcm @bjorn3

rustbot · 2026-01-02T17:40:55Z

r? @jieyouxu

rustbot has assigned @jieyouxu.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

RalfJung · 2026-01-02T17:41:52Z

compiler/rustc_session/src/session.rs

+
+    /// The names of intrinsics that the current codegen backend replaces
+    /// with its own implementations.
+    pub replaced_intrinsics: Vec<Symbol>,


It seems there is no way to get the current codegen backend from a tcx. I wasn't sure what the best way is to make this list of symbols available to monomorphization, and went for a new field in Session -- does that make sense?

I don't know enough about how all this should be structured to know what the best option is here.

This seems at least plausible, since at worst it stays empty and that doesn't hurt anything (other than perf).

@bjorn3 do you have any suggestions for how to deal with this?

I am not the biggest fan of another Session field, but don't have any other suggestions either.

RalfJung · 2026-01-02T17:42:07Z

@bors try
@rust-timer queue

skip codegen for intrinsics with big fallback bodies if backend does not need them

rust-bors · 2026-01-02T20:06:03Z

☀️ Try build successful (CI)
Build commit: 4763a83 (4763a83f81ae539aaa6f6e5e773ba1fc73de0a10, parent: 8a24a202aa02f677fc2a3e0e1a69af7545803952)

rust-timer · 2026-01-02T21:13:45Z

Finished benchmarking commit (4763a83): comparison URL.

Overall result: ❌✅ regressions and improvements - please read the text below

Benchmarking this pull request means it may be perf-sensitive – we'll automatically label it not fit for rolling up. You can override this, but we strongly advise not to, due to possible changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please do so in sufficient writing along with @rustbot label: +perf-regression-triaged. If not, please fix the regressions and do another perf run. If its results are neutral or positive, the label will be automatically removed.

@bors rollup=never
@rustbot label: -S-waiting-on-perf +perf-regression

Instruction count

Our most reliable metric. Used to determine the overall result above. However, even this metric can be noisy.

	mean	range	count
Regressions ❌ (primary)	0.6%	[0.6%, 0.6%]	1
Regressions ❌ (secondary)	0.1%	[0.1%, 0.1%]	1
Improvements ✅ (primary)	-1.8%	[-2.8%, -0.8%]	2
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	-1.0%	[-2.8%, 0.6%]	3

Max RSS (memory usage)

Results (primary -1.5%, secondary 3.5%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

	mean	range	count
Regressions ❌ (primary)	1.3%	[0.7%, 1.9%]	2
Regressions ❌ (secondary)	3.5%	[3.5%, 3.5%]	1
Improvements ✅ (primary)	-4.3%	[-7.2%, -1.4%]	2
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	-1.5%	[-7.2%, 1.9%]	4

Cycles

Results (primary -3.9%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

	mean	range	count
Regressions ❌ (primary)	-	-	0
Regressions ❌ (secondary)	-	-	0
Improvements ✅ (primary)	-3.9%	[-3.9%, -3.9%]	1
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	-3.9%	[-3.9%, -3.9%]	1

Binary size

Results (primary 0.2%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

	mean	range	count
Regressions ❌ (primary)	1.4%	[1.4%, 1.4%]	1
Regressions ❌ (secondary)	-	-	0
Improvements ✅ (primary)	-0.0%	[-0.1%, -0.0%]	7
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	0.2%	[-0.1%, 1.4%]	8

Bootstrap: 473.485s -> 474.195s (0.15%)
Artifact size: 390.77 MiB -> 390.79 MiB (0.01%)

…not need them

RalfJung · 2026-01-02T22:14:16Z

@bors try
@rust-timer queue

skip codegen for intrinsics with big fallback bodies if backend does not need them

rust-bors · 2026-01-03T00:43:41Z

☀️ Try build successful (CI)
Build commit: c75310a (c75310a5c412df8835187dd0ef37361b2f00d085, parent: 5497a36a7faf3d2af37beebcff7008e493202902)

rust-timer · 2026-01-03T01:24:45Z

Finished benchmarking commit (c75310a): comparison URL.

Overall result: ❌✅ regressions and improvements - please read the text below

Benchmarking this pull request means it may be perf-sensitive – we'll automatically label it not fit for rolling up. You can override this, but we strongly advise not to, due to possible changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please do so in sufficient writing along with @rustbot label: +perf-regression-triaged. If not, please fix the regressions and do another perf run. If its results are neutral or positive, the label will be automatically removed.

@bors rollup=never
@rustbot label: -S-waiting-on-perf +perf-regression

Instruction count

Our most reliable metric. Used to determine the overall result above. However, even this metric can be noisy.

	mean	range	count
Regressions ❌ (primary)	0.7%	[0.7%, 0.7%]	1
Regressions ❌ (secondary)	0.1%	[0.1%, 0.1%]	1
Improvements ✅ (primary)	-1.8%	[-2.9%, -0.8%]	2
Improvements ✅ (secondary)	-0.4%	[-0.4%, -0.4%]	1
All ❌✅ (primary)	-1.0%	[-2.9%, 0.7%]	3

Max RSS (memory usage)

Results (primary -4.1%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

	mean	range	count
Regressions ❌ (primary)	-	-	0
Regressions ❌ (secondary)	-	-	0
Improvements ✅ (primary)	-4.1%	[-7.3%, -1.7%]	3
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	-4.1%	[-7.3%, -1.7%]	3

Cycles

Results (primary -3.9%, secondary 15.2%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

	mean	range	count
Regressions ❌ (primary)	-	-	0
Regressions ❌ (secondary)	15.2%	[15.0%, 15.4%]	2
Improvements ✅ (primary)	-3.9%	[-3.9%, -3.9%]	1
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	-3.9%	[-3.9%, -3.9%]	1

Binary size

Results (primary 0.2%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

	mean	range	count
Regressions ❌ (primary)	1.4%	[1.4%, 1.4%]	1
Regressions ❌ (secondary)	-	-	0
Improvements ✅ (primary)	-0.0%	[-0.1%, -0.0%]	7
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	0.2%	[-0.1%, 1.4%]	8

Bootstrap: 471.287s -> 473.923s (0.56%)
Artifact size: 390.83 MiB -> 390.83 MiB (-0.00%)

jieyouxu · 2026-01-03T06:17:50Z

@rustbot reroll

RalfJung · 2026-01-31T14:15:38Z

Seems like I had no luck with that reroll.
@rustbot reroll

@scottmcm or could you review this?

mati865 · 2026-01-31T15:48:59Z

Cool idea!

I'll wait a few days to give @scottmcm time to respond respond as the much more knowledgeable person.

Do you know if there is a list of similarly optimised intrinsics somewhere?

RalfJung · 2026-02-02T14:19:21Z

In principle one could go over all the intrinsics that have fallback bodies, and then check whether the LLVM backend has implementations for them.

But most fallback bodies are small so the cost of monomorphizing them is tiny. Not sure if it's worth going through the entire list. I think I got all the ones that have big fallback bodies where we really don't want to pay the monomorphization cost.

rustbot assigned jieyouxu Jan 2, 2026

RalfJung commented Jan 2, 2026

View reviewed changes

This comment has been minimized.

Sign in to view

rust-bors bot added a commit that referenced this pull request Jan 2, 2026

Auto merge of #150605 - RalfJung:fallback-intrinsic-skip, r=<try>

4763a83

skip codegen for intrinsics with big fallback bodies if backend does not need them

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Jan 2, 2026

This comment has been minimized.

Sign in to view

RalfJung force-pushed the fallback-intrinsic-skip branch from 4ca06da to a170604 Compare January 2, 2026 19:29

This comment has been minimized.

Sign in to view

rustbot added perf-regression Performance regression. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels Jan 2, 2026

skip codegen for intrinsics with big fallback bodies if backend does …

57e44f5

…not need them

RalfJung force-pushed the fallback-intrinsic-skip branch from a170604 to 57e44f5 Compare January 2, 2026 22:14

This comment has been minimized.

Sign in to view

rust-bors bot added a commit that referenced this pull request Jan 2, 2026

Auto merge of #150605 - RalfJung:fallback-intrinsic-skip, r=<try>

c75310a

skip codegen for intrinsics with big fallback bodies if backend does not need them

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Jan 2, 2026

This comment has been minimized.

Sign in to view

rustbot removed the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Jan 3, 2026

rustbot assigned SparrowLii and unassigned jieyouxu Jan 3, 2026

rustbot assigned mati865 and unassigned SparrowLii Jan 31, 2026

Uh oh!

skip codegen for intrinsics with big fallback bodies if backend does not need them #150605

Are you sure you want to change the base?

skip codegen for intrinsics with big fallback bodies if backend does not need them #150605

Conversation

RalfJung commented Jan 2, 2026

Uh oh!

rustbot commented Jan 2, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

RalfJung commented Jan 2, 2026

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

rust-bors bot commented Jan 2, 2026

Uh oh!

This comment has been minimized.

rust-timer commented Jan 2, 2026

Overall result: ❌✅ regressions and improvements - please read the text below

Uh oh!

RalfJung commented Jan 2, 2026

Uh oh!

This comment has been minimized.

This comment has been minimized.

rust-bors bot commented Jan 3, 2026

Uh oh!

This comment has been minimized.

rust-timer commented Jan 3, 2026

Overall result: ❌✅ regressions and improvements - please read the text below

Uh oh!

jieyouxu commented Jan 3, 2026

Uh oh!

RalfJung commented Jan 31, 2026

Uh oh!

mati865 commented Jan 31, 2026

Uh oh!

RalfJung commented Feb 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants