Reduce branches in padding adjustment in FromBase64_ComputeResultLength#115022

rameel · 2025-04-24T22:31:39Z

No description provided.

gfoidl · 2025-04-25T08:58:56Z

src/libraries/System.Private.CoreLib/src/System/Convert.cs

+            ReadOnlySpan<byte> map = [0, 2, 1];
+            padding = map[padding];


This trades branches vs. memory access -- I don't know which approach will be faster and in benchmarks it will be hard to measure, because branch predictor will do a good job, and for the memory access the data will be in the cpu caches, so fast retrieval.

Or put differently: it trades possible branch mispredicts vs. potential cache eviction.

Maybe it's better to write the code as idiomatic as possible and let the compilers optimize it out. Actually with a switch it looks quite good and produces code similar to what native compilers produce.

You're right, writing the code in a more idiomatic style is preferable. However, at the moment, the JIT doesn't seem to generate a value-to-result mapping (like C/C++ does) for simple cases like this. Even for a switch, it just falls back to a jump table, which still involves a memory access.

I agree - it's better to either leave it as-is or rewrite it using a switch, and help the JIT recognize and optimize such patterns. I think I even saw an issue about this a while ago, but couldn't find it right away.

Just for reference, here's what C/C++ produces:

mov edi, edi mov eax, DWORD PTR CSWTCH.1[0+rdi*4] CSWTCH.1: .long 0 .long 2 .long 1

As we can see the C/C++ compiler prefers a memory access here, rather than relying on the branch predictor.

And here's what the JIT produces in my case - basically the same thing as the C++ version:

mov rdi, 0x72BEDC82CB80 ; static handle movzx rax, byte ptr [rax+rdi]

And here's what a typical switch ends up looking like:

lea rdi, [reloc @RWD00] mov edi, dword ptr [rdi+4*rax] lea rcx, G_M26941_IG02 add rdi, rcx jmp rdi mov eax, 1 jmp SHORT G_M26941_IG07 mov eax, 2 jmp SHORT G_M26941_IG07 xor eax, eax G_M26941_IG07: ;; offset=0x0035

I’ve found the issue related to this: #114041

So I think it would be still good to change to the switch-approach here.
Once the JIT got improved, then the benefit here comes for free.

dotnet-policy-service · 2025-04-28T12:59:05Z

Tagging subscribers to this area: @dotnet/area-system-runtime
See info in area-owners.md if you want to be subscribed.

stephentoub · 2025-05-28T21:51:35Z

src/libraries/System.Private.CoreLib/src/System/Convert.cs

+                1 => 2,
+                2 => 1,
+                _ => throw new FormatException(SR.Format_BadBase64Char)
+            };


The title of the PR is "reduce branches", but this doesn't actually reduce branches today, does it?

How about this: "Improve readability of Base64 padding adjustment logic"?

However, the change seems too minor to have a meaningful impact. It might be more worthwhile to focus on removing "unsafe" in FromBase64_ComputeResultLength and related methods. Here's the commit for that: rameel@fe9f28b

Unfortunately due to the current JIT's inability to eliminate unnecessary bounds checks in cases like this (see #115091):

while (chars.Length != 0) { if (chars[^1] is not (' ' or '\n' or '\r' or '\t')) break; chars = chars.Slice(0, chars.Length - 1); // Unnecessary bounds checks }

and a performance regression compared to NET9 (#115090) I decided to postpone creating PR for this until later.

Thanks, anyway

Reduce branches in padding adjustment in FromBase64_ComputeResultLength

7c6832d

ghost added the needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners label Apr 24, 2025

dotnet-policy-service bot added the community-contribution Indicates that the PR has been added by a community member label Apr 24, 2025

gfoidl reviewed Apr 25, 2025

View reviewed changes

rameel closed this Apr 25, 2025

rameel reopened this Apr 28, 2025

rameel added 2 commits April 28, 2025 17:21

Merge branch 'main' into optimize-base64-padding-map

ed1ea95

Use switch instead of array lookup

7ebb256

am11 added area-System.Runtime and removed needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners labels Apr 28, 2025

Add missing semicolon

2e29648

This was referenced Apr 28, 2025

tracing/runtimeeventsource/nativeruntimeeventsource/nativeruntimeeventsource failing in CI #90605

Open

[9.0] Test assert failure in X509Certificates.Tests.RevocationTests.AiaTests.AiaAcceptsCertTypesAndIgnoresNonCertTypes #107364

Open

stephentoub reviewed May 28, 2025

View reviewed changes

rameel closed this Jun 2, 2025

github-actions bot locked and limited conversation to collaborators Jul 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Reduce branches in padding adjustment in FromBase64_ComputeResultLength#115022

Reduce branches in padding adjustment in FromBase64_ComputeResultLength#115022
rameel wants to merge 4 commits intodotnet:mainfrom
rameel:optimize-base64-padding-map

rameel commented Apr 24, 2025

Uh oh!

gfoidl Apr 25, 2025

Uh oh!

rameel Apr 25, 2025

Uh oh!

rameel Apr 25, 2025

Uh oh!

gfoidl Apr 28, 2025

Uh oh!

dotnet-policy-service bot commented Apr 28, 2025

Uh oh!

stephentoub May 28, 2025

Uh oh!

rameel May 30, 2025

Uh oh!

rameel May 30, 2025

Uh oh!

stephentoub Jun 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Comments

Conversation

rameel commented Apr 24, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dotnet-policy-service bot commented Apr 28, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants