gh-138122: Add blocking mode for accurate stack traces in Tachyon #142998

pablogsal · 2025-12-20T01:32:06Z

Non-blocking sampling reads process memory while the target continues
running, which can produce torn stacks when generators or coroutines
rapidly switch between yield points. Blocking mode uses atomic process
suspension (task_suspend on macOS, NtSuspendProcess on Windows,
PTRACE_SEIZE on Linux) to stop the target during each sample, ensuring
consistent snapshots.
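
To make the torn-stack failure concrete, here is a toy, single-threaded simulation. It is only a sketch: `ToyTarget`, `sample`, and the `suspended` flag are hypothetical names invented for illustration, and the real profiler reads remote process memory and suspends the target through the OS facilities listed above, not through a Python flag.

```python
class ToyTarget:
    """Hypothetical stand-in for a profiled process (not a Tachyon API)."""

    # The target alternates between these two call stacks, outermost first.
    STACKS = (
        ["main", "consume_generator", "fibonacci_generator"],
        ["main", "consume_generator"],
    )

    def __init__(self):
        self.tick = 0
        self.suspended = False

    def step(self):
        """Advance execution by one yield/resume, unless suspended."""
        if not self.suspended:
            self.tick += 1

    def depth(self):
        return len(self.STACKS[self.tick % 2])

    def read_frame(self, index):
        stack = self.STACKS[self.tick % 2]
        # Reading past the end models reading memory the target has moved on from.
        return stack[index] if index < len(stack) else "<stale memory>"


def sample(target, blocking):
    """Read the stack one frame at a time, as a remote reader would."""
    if blocking:
        target.suspended = True  # models suspending the whole process
    try:
        frames = []
        for index in range(target.depth()):
            target.step()  # the target keeps running between reads
            frames.append(target.read_frame(index))
        return frames
    finally:
        target.suspended = False  # always resume the target


torn = sample(ToyTarget(), blocking=False)
consistent = sample(ToyTarget(), blocking=True)
print(torn)
print(consistent)
```

In this model the non-blocking sample mixes frames from two different moments and matches neither real stack, while the blocking sample always equals one of them.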

Use blocking mode with longer intervals (1ms+) to avoid impacting the
target too much. The default non-blocking mode remains best for most
cases since it has zero overhead.

📚 Documentation preview 📚: https://cpython-previews--142998.org.readthedocs.build/

Issue: Implement PEP 799 – A dedicated profiling package for organizing Python profiling tools #138122

pablogsal · 2025-12-20T03:00:47Z

@ivonastojanovic can you take a look?

Non-blocking sampling reads process memory while the target continues running, which can produce torn stacks when generators or coroutines rapidly switch between yield points. Blocking mode uses atomic process suspension (task_suspend on macOS, NtSuspendProcess on Windows, PTRACE_SEIZE on Linux) to stop the target during each sample, ensuring consistent snapshots.

Use blocking mode with longer intervals (1ms+) to avoid impacting the target too much. The default non-blocking mode remains best for most cases since it has zero overhead.

Also fix a frame cache bug: the cache was including the last_profiled_frame itself when extending with cached data, but this frame was executing in the previous sample and its line number may have changed. For example, if function A was sampled at line 6, then execution continued to line 10 and called B→C, the next sample would incorrectly report A at line 6 (from cache) instead of line 10. The fix uses start_idx + 1 to only trust frames ABOVE last_profiled_frame: these caller frames are frozen at their call sites and cannot change until their callees return.

Signed-off-by: Pablo Galindo <[email protected]>

pablogsal · 2025-12-20T16:40:46Z

Modules/_remote_debugging/frame_cache.c


-    // Extend frame_info with frames from start_idx onwards
-    PyObject *slice = PyList_GetSlice(entry->frame_list, start_idx, num_frames);
+    // Extend frame_info with frames ABOVE start_idx (not including it).


This fixes a frame cache bug: the cache was including the last_profiled_frame
itself when extending with cached data, but this frame was executing in
the previous sample and its line number may have changed. For example,
if function A was sampled at line 6, then execution continued to line 10
and called B→C, the next sample would incorrectly report A at line 6
(from cache) instead of line 10. The fix uses start_idx + 1 to only trust
frames ABOVE last_profiled_frame: these caller frames are frozen at their
call sites and cannot change until their callees return.
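
The A/B/C example above can be sketched in Python. This is an illustration under assumptions, not the C code in `frame_cache.c`: `merge_with_cache` and the `(function, line)` tuples are hypothetical, stacks are assumed to be ordered innermost frame first, and `live_stack` stands in for what a fresh remote read would return.

```python
def merge_with_cache(live_stack, cached, start_idx, trust_last_profiled):
    """Combine freshly read frames with cached caller frames.

    start_idx is the position of last_profiled_frame in the cached list.
    trust_last_profiled=True models the old behaviour (slice from start_idx);
    False models the fix (slice from start_idx + 1).
    """
    first = start_idx if trust_last_profiled else start_idx + 1
    reused = cached[first:]
    # Everything not taken from the cache is read fresh from the target.
    fresh = live_stack[:len(live_stack) - len(reused)]
    return fresh + reused


# Previous sample: A was executing at line 6, called from main at line 20.
cached = [("A", 6), ("main", 20)]
# Now: A advanced to line 10 and called B, which called C.
live = [("C", 1), ("B", 2), ("A", 10), ("main", 20)]

old = merge_with_cache(live, cached, start_idx=0, trust_last_profiled=True)
new = merge_with_cache(live, cached, start_idx=0, trust_last_profiled=False)
print(old)
print(new)
```

With the old slice, A is reported at its stale cached line; with `start_idx + 1`, A is re-read and only `main`, which cannot move until A returns, comes from the cache.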

ivonastojanovic

LGTM! Just a couple of nits

ivonastojanovic · 2025-12-20T18:08:26Z

Lib/profiling/sampling/sample.py

                        collector.collect_failed_sample()
                        errors += 1
                    except Exception as e:
+                        print(e)


Suggested change

print(e)

ivonastojanovic · 2025-12-20T18:08:45Z

Lib/profiling/sampling/sample.py

                        duration_sec = current_time - start_time
                        break
-                    except (RuntimeError, UnicodeDecodeError, MemoryError, OSError):
+                    except (RuntimeError, UnicodeDecodeError, MemoryError, OSError) as e:


Suggested change

except (RuntimeError, UnicodeDecodeError, MemoryError, OSError) as e:

except (RuntimeError, UnicodeDecodeError, MemoryError, OSError):

ivonastojanovic · 2025-12-20T18:15:25Z

Lib/test/test_profiling/test_sampling_profiler/test_blocking.py

+        # When consume_generator is on the arithmetic lines (temp1, temp2, etc.),
+        # fibonacci_generator should NOT be in the stack at all.
+        # Line numbers are important here - see ARITHMETIC_LINES below.
+        cls.generator_script = '''


nit: Maybe we could use textwrap.dedent here to keep the string nicely formatted. What do you think?
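
A minimal sketch of what that could look like. The function names come from the test comment above, but the bodies here are made up for illustration and are not the actual test script. Since the test depends on line numbers, the backslash after the opening quotes matters: it keeps the first line of code on line 1 of the resulting string.

```python
import textwrap

# The string can be indented to match the surrounding test code;
# dedent() strips the common leading whitespace.
generator_script = textwrap.dedent('''\
    def fibonacci_generator(n):
        a, b = 0, 1
        for _ in range(n):
            yield a
            a, b = b, a + b

    def consume_generator(n):
        return list(fibonacci_generator(n))
''')

namespace = {}
exec(generator_script, namespace)
print(generator_script.splitlines()[0])
print(namespace["consume_generator"](5))
```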

ivonastojanovic · 2025-12-20T18:26:48Z

Doc/library/profiling.sampling.rst


+.. option:: --blocking
+
+   Stop the target process during each sample. This ensures consistent


Not sure if pause might sound better than stop here

Suggested change

Stop the target process during each sample. This ensures consistent

Pause the target process during each sample. This ensures consistent

bedevere-app bot added the awaiting core review label Dec 20, 2025

bedevere-app bot mentioned this pull request Dec 20, 2025

Unfriendly traceback when sampling from an unknown PID #142654

Closed

pablogsal force-pushed the fixes branch from 7344e4c to eb9d3bc Compare December 20, 2025 01:33

pablogsal changed the title ~~gh-142654: Add blocking mode for accurate stack traces in Tachyon~~ gh-138122: Add blocking mode for accurate stack traces in Tachyon Dec 20, 2025

bedevere-app bot mentioned this pull request Dec 20, 2025

Implement PEP 799 – A dedicated profiling package for organizing Python profiling tools #138122

Open

pablogsal force-pushed the fixes branch 4 times, most recently from bb510c0 to 8c21dc9 Compare December 20, 2025 02:57

pablogsal force-pushed the fixes branch 4 times, most recently from de4448b to c270764 Compare December 20, 2025 15:43

pablogsal requested review from AA-Turner, ezio-melotti and hugovk as code owners December 20, 2025 15:43

pablogsal force-pushed the fixes branch from c270764 to 41f8907 Compare December 20, 2025 16:20

Merge branch 'main' into fixes

b16b0c4

pablogsal commented Dec 20, 2025

ivonastojanovic reviewed Dec 20, 2025
