const-eval: always do mem-to-mem copies if there might be padding involved #148967

RalfJung · 2025-11-15T09:04:44Z

This is the final piece of the puzzle for #148470: when copying data of a type that has padding, always do a mem-to-mem copy, so that we always preserve the source padding exactly. That prevents rustc implementation choices from leaking into user-visible behavior.

This is technically a breaking change: the example at the top of #148470 no longer compiles with this. However, it seems very unlikely that anyone would have dependent on this. My main concern is not backwards compatibility, it is performance.

Fixes #148470

Actually that seems to be entirely fine, it even helps with some benchmarks! I guess the mem-to-mem codepath is actually faster than the scalar pair codepath for the copy itself. It can slow things down later since now we have to do everything bytewise, but that doesn't show up in our benchmarks and might not be very relevant after all (in particular, it only affects types with padding, so the rather common wide pointers still always use the efficient scalar representation).

So that would be my proposal to for resolving this issue then: to make const-eval behavior consistent, we always copy the padding from the source to the target. IOW, potentially pre-existing provenance in the target always gets overwritten (that part is already in #148259), and potentially existing provenance in padding in the source always gets carried over (that's #148967). If there's provenance elsewhere in the source our existing handling is fine:

If it's in an integer, that's UB during const-eval so we can do whatever.

If it's in a pointer, the the fragments must combine back together to a pointer or else we have UB.

If it's in a union we just carry it over unchanged.

@traviscross we should check that this special const-eval-only UB is properly reflected in the reference. Currently we have this but that only talks about int2ptr, not about invalid pointer fragments at pointer type. I also wonder if this shouldn't rather be part of "invalid values" to make it clear that this applies recursively inside fields as well.
EDIT: Reference PR is up at rust-lang/reference#2091.

Originally posted by @RalfJung in #148470

Worth noting that this does not resolve the concerns @theemathas had about -Zextra-const-ub-checks sometimes causing more code to compile. Specifically, with that flag, the behavior changes to "potentially existing provenance in padding in the source never gets carried over". However, it's a nightly-only flag (used by Miri) so while the behavior is odd, I don't think this is a problem.

Originally posted by @RalfJung in #148470

rustbot · 2025-11-15T09:04:47Z

Some changes occurred to the CTFE machinery

cc @RalfJung, @oli-obk, @lcnr

Some changes occurred to the CTFE / Miri interpreter

cc @rust-lang/miri

rustbot · 2025-11-15T09:04:49Z

r? @JonathanBrouwer

rustbot has assigned @JonathanBrouwer.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

RalfJung · 2025-11-15T09:05:19Z

@bors try
@rust-timer queue

…try> const-eval: always do mem-to-mem copies if there might be padding involved

rust-bors · 2025-11-15T11:23:40Z

☀️ Try build successful (CI)
Build commit: 78c81ee (78c81ee3917a99dcff6e2e6822800f0492c415c3, parent: 733108b6d4acaa93fe26ae281ea305aacd6aac4e)

rust-timer · 2025-11-15T12:42:56Z

Finished benchmarking commit (78c81ee): comparison URL.

Overall result: ❌✅ regressions and improvements - please read the text below

Benchmarking this pull request means it may be perf-sensitive – we'll automatically label it not fit for rolling up. You can override this, but we strongly advise not to, due to possible changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please do so in sufficient writing along with @rustbot label: +perf-regression-triaged. If not, please fix the regressions and do another perf run. If its results are neutral or positive, the label will be automatically removed.

@bors rollup=never
@rustbot label: -S-waiting-on-perf +perf-regression

Instruction count

Our most reliable metric. Used to determine the overall result above. However, even this metric can be noisy.

	mean	range	count
Regressions ❌ (primary)	-	-	0
Regressions ❌ (secondary)	0.2%	[0.0%, 0.3%]	7
Improvements ✅ (primary)	-2.8%	[-2.8%, -2.8%]	1
Improvements ✅ (secondary)	-0.4%	[-0.5%, -0.2%]	12
All ❌✅ (primary)	-2.8%	[-2.8%, -2.8%]	1

Max RSS (memory usage)

Results (primary -3.2%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

	mean	range	count
Regressions ❌ (primary)	-	-	0
Regressions ❌ (secondary)	-	-	0
Improvements ✅ (primary)	-3.2%	[-3.2%, -3.2%]	1
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	-3.2%	[-3.2%, -3.2%]	1

Cycles

Results (primary -2.7%, secondary -9.4%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

	mean	range	count
Regressions ❌ (primary)	-	-	0
Regressions ❌ (secondary)	-	-	0
Improvements ✅ (primary)	-2.7%	[-2.7%, -2.7%]	1
Improvements ✅ (secondary)	-9.4%	[-16.0%, -2.8%]	2
All ❌✅ (primary)	-2.7%	[-2.7%, -2.7%]	1

Binary size

Results (primary -1.1%, secondary 0.0%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

	mean	range	count
Regressions ❌ (primary)	-	-	0
Regressions ❌ (secondary)	0.0%	[0.0%, 0.0%]	1
Improvements ✅ (primary)	-1.1%	[-1.1%, -1.1%]	1
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	-1.1%	[-1.1%, -1.1%]	1

Bootstrap: 472.272s -> 472.014s (-0.05%)
Artifact size: 388.64 MiB -> 388.68 MiB (0.01%)

RalfJung · 2025-11-15T13:07:07Z

Uh okay I guess this is actually good for perf.^^ At least for the benchmarks we have. The copy apparently gets a little cheaper, but we force more things to use the less efficient in-memory representation. The latter just does not seem to matter in our benchmarks. Just to be safe: @craterbot check

craterbot · 2025-11-15T13:07:37Z

👌 Experiment pr-148967 created and queued.
🤖 Automatically detected try build 78c81ee
⚠️ Try build based on commit c4acb77, but latest commit is 01194d7. Did you forget to make a new try build?
🔍 You can check out the queue and this experiment's details.

ℹ️ Crater is a tool to run experiments across parts of the Rust ecosystem. Learn more

rustbot · 2025-11-16T10:30:01Z

This PR was rebased onto a different main commit. Here's a range-diff highlighting what actually changed.

Rebasing is a normal part of keeping PRs up to date, so no action is needed—this note is just to help reviewers.

theemathas · 2025-11-17T04:25:33Z

Most of the performance regressions are from the coercions benchmark. All it does is create an array of a large number of string literals in const. Why did this benchmark's performance regress? There is no padding involved in any of the types.

RalfJung · 2025-12-12T07:43:10Z

That's fair -- I should have gotten up-to-date perf numbers. We did try enabling full validation in const-eval before and it was never close to acceptable, but there's no harm in getting new numbers.

I made a separate PR for that: #149901.

RalfJung · 2025-12-12T12:24:38Z

@traviscross here are the numbers for the "naive" implementation: perf. It's pretty red. ;)

Now, there may be ways to optimize this if we only have to reset padding and provenance, but not do full validation... and there could be a fast-path for scalar types... like, one can pour basically an unbounded amount of time into making this faster. It's unclear how far this would actually get us. Also if we do that guided by our benchmark suite, we'd be overfitting for that very quickly I think. I'm afraid I don't think I have the time, patience, and motivation for that. I'd be happy to mentor/review patches for someone who does want to work on it, though.

But meanwhile, this PR doesn't really make things any worse than they are already, and it avoids accidentally changing behavior when the layout algorithm changes.

traviscross · 2025-12-12T16:22:14Z

Thanks for running those numbers. Does caching/memoizing come into play for optimizing this (and are we doing any now)? (Is that hard here?) As described ("doing a full traversal"), it seems the sort of thing that would be at risk of traversing the same path repeatedly, e.g. as a final value is built up from parts. Without memoizing, it's easy to see how this would go O(N^2).

RalfJung · 2025-12-12T16:37:25Z

There's currently no caching of any sort. Also, adding caching is not trivial... the naive option of "here's some pairs of types and raw byte lists for which we can skip validation in the future" would quickly consume huge amounts of RAM caching values we'll never need again. Representing raw byte lists (including uninit and provenance) is itself non-trivial, and even just checking the cache would be expensive.

Or alternatively we build a system where we remember "the bytes at offset O in allocation A have been validated for type T" -- but we'd have to be really sure we catch all codepaths where memory gets changed so that we can invalidate the cache. Also, Rust moves values around a lot, so this alone wouldn't suffice; we'd also want this information to be preserved when copying things from one place to another. This could be an interesting student project to speed up Miri, but it's not going to happen short-term.

traviscross · 2025-12-12T19:02:46Z

But meanwhile, this PR doesn't really make things any worse than they are already, and it avoids accidentally changing behavior when the layout algorithm changes.

For concreteness, the proposal I'd put before us, then, would be to accept this PR and to document in the Reference:

We say that basically any uninit byte in the result may actually end up being a pointer fragment. This means the compiler could legally fail to compile any time there's an uninit byte in the result -- which of course it won't, but we make no hard stable guarantees for when exactly this will or will not fail. It's kind of a cop-out but also avoids us having to nail down details of the const-eval interpreter that cannot be observed in the AM.

We would be explicit that this remains part of our RFC 1122 underspecified language semantics and that we remain within our rights to change the behavior of typed copies within consteval.

@rfcbot fcp merge lang
@rustbot label +S-waiting-on-documentation

rust-rfcbot · 2025-12-12T19:02:48Z

Team member @traviscross has proposed to merge this. The next step is review by the rest of the tagged team members:

No concerns currently listed.

Once a majority of reviewers approve (and at most 2 approvals are outstanding), this will enter its final comment period. If you spot a major issue that hasn't been raised at any point in this process, please speak up!

cc @rust-lang/lang-advisors: FCP proposed for lang, please feel free to register concerns.
See this document for info about what commands tagged team members can give me.

Nadrieril · 2025-12-15T00:44:10Z

I find these perf results rather encouraging actually, given that this was a naive implementation. Noticeable regressions are on a limited set of tests, all of them stress-tests. I don't mind the currently-proposed lang decision so that we can get this merged, but if anyone with enough const-eval knowledge could try a non-naive version I'd be keen to revisit this.

RalfJung · 2025-12-15T07:20:10Z

Noticeable regressions are on a limited set of tests, all of them stress-tests.

I think this mostly shows that we have very few interesting consts in our benchmarks.

tmandry · 2025-12-15T17:48:16Z

@rfcbot reviewed

Ok, the proposal on the table sounds reasonable. Options I can see for having a more satisfying story in the future:

Uninitialize padding on typed copy (if we can improve the performance enough).
Tracking type information + Ralf's suggestion in const-eval: always do mem-to-mem copies if there might be padding involved #148967 (comment) for a typed "carry alloc to runtime" operation – that sounds like it would be very visible to end-users. I could imagine making optional to users for more precise tracking if this error becomes a problem for them.

The second brings to mind a more general typed allocator API, which also would have to be optional. Then we could have typed/untyped allocations just as we have typed/untyped copies. That could be useful e.g. for heap profiling applications, among other things.

I'm certain I'm not the first to think of this though. I guess the reasons not to go down this path are some combination of the following; are there others?

Not wanting added monomorphization; needing a TypeId mechanism that works for non-'static types to avoid it
Not being usable in 100% of applications
Prior art
Lack of need

theemathas · 2025-12-25T15:17:03Z

Turns out that there is already some code in the wild that depends on how exactly we do typed copies in consteval, according to crater on the PR implementing an alternative to this PR.

Edit: It is indeed just previously-uncaught consteval UB.

RalfJung · 2025-12-26T18:05:27Z

I think that's just code with const-UB that we currently do not detect. We are at liberty to break such code.

nikomatsakis · 2026-01-07T18:15:34Z

@rfcbot reviewed

rust-rfcbot · 2026-01-07T18:15:42Z

🔔 This is now entering its final comment period, as per the review above. 🔔

rust-rfcbot · 2026-01-17T18:16:00Z

The final comment period, with a disposition to merge, as per the review above, is now complete.

As the automated representative of the governance process, I would like to thank the author for their work and everyone else who contributed.

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Nov 15, 2025

rustbot assigned JonathanBrouwer Nov 15, 2025

This comment has been minimized.

Sign in to view

rust-bors bot added a commit that referenced this pull request Nov 15, 2025

Auto merge of #148967 - RalfJung:const-eval-preserve-src-padding, r=<…

78c81ee

…try> const-eval: always do mem-to-mem copies if there might be padding involved

This comment has been minimized.

Sign in to view

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Nov 15, 2025

RalfJung mentioned this pull request Nov 15, 2025

Partial pointers in padding can make const-eval fail #148470

Open

This comment has been minimized.

Sign in to view

RalfJung force-pushed the const-eval-preserve-src-padding branch from c4acb77 to 01194d7 Compare November 15, 2025 10:22

This comment has been minimized.

Sign in to view

traviscross added the I-lang-radar Items that are on lang's radar and will need eventual work or consideration. label Nov 15, 2025

rustbot added perf-regression Performance regression. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels Nov 15, 2025

craterbot added S-waiting-on-crater Status: Waiting on a crater run to be completed. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Nov 15, 2025

RalfJung force-pushed the const-eval-preserve-src-padding branch from 01194d7 to 472364c Compare November 16, 2025 10:29

RalfJung mentioned this pull request Dec 12, 2025

const-eval: always do full typed copies #149901

Closed

traviscross added the I-lang-nominated Nominated for discussion during a lang team meeting. label Dec 12, 2025

This comment was marked as resolved.

Sign in to view

rustbot added the S-waiting-on-documentation Status: Waiting on approved PRs to documentation before merging label Dec 12, 2025

rust-rfcbot added proposed-final-comment-period Proposed to merge/close by relevant subteam, see T-<team> label. Will enter FCP once signed off. disposition-merge This issue / PR is in PFCP or FCP with a disposition to merge it. labels Dec 12, 2025

rust-rfcbot added final-comment-period In the final comment period and will be merged soon unless new substantive objections are raised. and removed proposed-final-comment-period Proposed to merge/close by relevant subteam, see T-<team> label. Will enter FCP once signed off. labels Jan 7, 2026

traviscross removed I-lang-nominated Nominated for discussion during a lang team meeting. P-lang-drag-1 Lang team prioritization drag level 1. https://rust-lang.zulipchat.com/#narrow/channel/410516-t-lang labels Jan 7, 2026

rustbot mentioned this pull request Jan 17, 2026

Tracking issue for release notes of #148967: const-eval: always do mem-to-mem copies if there might be padding involved #151264

Open

Uh oh!

const-eval: always do mem-to-mem copies if there might be padding involved #148967

Are you sure you want to change the base?

const-eval: always do mem-to-mem copies if there might be padding involved #148967

Conversation

RalfJung commented Nov 15, 2025 • edited by rustbot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rustbot commented Nov 15, 2025

Uh oh!

rustbot commented Nov 15, 2025

Uh oh!

RalfJung commented Nov 15, 2025

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

rust-bors bot commented Nov 15, 2025

Uh oh!

This comment has been minimized.

rust-timer commented Nov 15, 2025

Overall result: ❌✅ regressions and improvements - please read the text below

Instruction count

Max RSS (memory usage)

Cycles

Binary size

Uh oh!

RalfJung commented Nov 15, 2025 via email

Uh oh!

craterbot commented Nov 15, 2025

Uh oh!

rustbot commented Nov 16, 2025

Uh oh!

theemathas commented Nov 17, 2025

Uh oh!

RalfJung commented Dec 12, 2025

Uh oh!

RalfJung commented Dec 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

traviscross commented Dec 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RalfJung commented Dec 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as resolved.

This comment was marked as resolved.

This comment was marked as resolved.

traviscross commented Dec 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rust-rfcbot commented Dec 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Nadrieril commented Dec 15, 2025

Uh oh!

RalfJung commented Dec 15, 2025

Uh oh!

tmandry commented Dec 15, 2025

Uh oh!

theemathas commented Dec 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RalfJung commented Dec 26, 2025

Uh oh!

nikomatsakis commented Jan 7, 2026

Uh oh!

rust-rfcbot commented Jan 7, 2026

Uh oh!

rust-rfcbot commented Jan 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

14 participants

RalfJung commented Nov 15, 2025 •

edited by rustbot

Loading

RalfJung commented Dec 12, 2025 •

edited

Loading

traviscross commented Dec 12, 2025 •

edited

Loading

RalfJung commented Dec 12, 2025 •

edited

Loading

traviscross commented Dec 12, 2025 •

edited

Loading

rust-rfcbot commented Dec 12, 2025 •

edited

Loading

theemathas commented Dec 25, 2025 •

edited

Loading