ENG-7948: StateManagerDisk deferred write queue #5883

masenf · 2025-10-14T20:48:40Z

New env var: REFLEX_STATE_MANAGER_DISK_DEBOUNCE_SECONDS (default 2.0)
If the debounce is non-zero, then state manager will queue the disk write
Queued writes will be processed in order of set time after they exceed the debounce timeout
New StateManager.close method standardized in base class
Close app.state_manager when the server is going down
Flush all queued writes when the StateManagerDisk closes
Add memory cache state expiration to StateManagerDisk
Use run_in_thread for the actual disk write and purging of expired states to avoid blocking the event loop while writing to disk
Update test cases to always call state_manager.close()

* New env var: REFLEX_STATE_MANAGER_DISK_DEBOUNCE_SECONDS (default 2.0) * If the debounce is non-zero, then state manager will queue the disk write * Queued writes will be processed in order of set time after they exceed the debounce timeout * New StateManager.close method standardized in base class * Close app.state_manager when the server is going down * Flush all queued writes when the StateManagerDisk closes * Update test cases to always call `state_manager.close()`

linear · 2025-10-14T20:48:42Z

ENG-7948 Maintain sandbox state across hot reloads

greptile-apps

Greptile Overview

Summary

Implements a deferred write queue for StateManagerDisk to reduce disk I/O overhead by batching state writes. The debounce period is configurable via REFLEX_STATE_MANAGER_DISK_DEBOUNCE_SECONDS (default 2.0 seconds). A background task processes queued writes after the debounce timeout and handles state expiration cleanup.

Major changes:

New QueueItem dataclass to track pending writes with timestamps
Background _process_write_queue() task that processes writes older than debounce period
Memory cache expiration tracking via _token_last_touched dictionary
Standardized close() method in base StateManager class to flush pending writes on shutdown
Integration with app lifespan to ensure clean shutdown
Updated tests to always call state_manager.close()

Issues found:

Critical: Line 295 only adds to queue if token not present, causing subsequent updates to be silently dropped
Critical: In-memory cache self.states never updated in set_state(), causing stale data to be served while waiting for debounced writes

Confidence Score: 1/5

This PR contains critical bugs that will cause data loss and stale state issues in production
Two critical logical errors in the core state management flow: (1) subsequent state updates are silently dropped when debouncing is enabled due to conditional queue insertion, and (2) the in-memory cache is never updated during set_state, causing stale data to be served to clients. These bugs will manifest as lost state updates and inconsistent application state.
reflex/istate/manager/disk.py requires immediate attention - the set_state() method has critical bugs at lines 285-305

Important Files Changed

File Analysis

Filename	Score	Overview
reflex/istate/manager/disk.py	1/5	Adds deferred write queue with debouncing and expiration handling; critical bug where subsequent updates to same token are dropped, and memory cache not updated causing stale data
reflex/istate/manager/init.py	5/5	Adds close() method to base StateManager class with empty default implementation
reflex/environment.py	5/5	Adds interpret_float_env() helper and REFLEX_STATE_MANAGER_DISK_DEBOUNCE_SECONDS environment variable with 2.0 default
reflex/app_mixins/lifespan.py	5/5	Calls state_manager.close() during app shutdown to flush pending writes

Sequence Diagram

sequenceDiagram
    participant Client
    participant App
    participant StateManagerDisk
    participant WriteQueue
    participant ProcessTask
    participant Disk

    Note over StateManagerDisk: Initialization
    StateManagerDisk->>StateManagerDisk: __post_init__()
    StateManagerDisk->>Disk: _purge_expired_states()

    Note over Client,Disk: State Update Flow (with debounce)
    Client->>App: User action triggers state change
    App->>StateManagerDisk: modify_state(token)
    StateManagerDisk->>StateManagerDisk: get_state(token)
    StateManagerDisk-->>App: yield state
    App->>App: Modify state
    App->>StateManagerDisk: set_state(token, state)
    StateManagerDisk->>WriteQueue: Add/Update QueueItem(token, state, timestamp)
    StateManagerDisk->>StateManagerDisk: _schedule_process_write_queue()
    StateManagerDisk->>ProcessTask: create_task() if not running
    StateManagerDisk-->>App: Return (state not yet on disk)

    Note over ProcessTask,Disk: Background Write Processing
    loop Every debounce period
        ProcessTask->>ProcessTask: Check queue for items older than debounce
        ProcessTask->>WriteQueue: Pop items ready to write
        ProcessTask->>Disk: set_state_for_substate() via run_in_thread
        ProcessTask->>ProcessTask: Check for expired tokens
        ProcessTask->>Disk: _purge_expired_states() via run_in_thread
        ProcessTask->>ProcessTask: _process_write_queue_delay()
    end

    Note over App,Disk: App Shutdown
    App->>StateManagerDisk: close()
    StateManagerDisk->>ProcessTask: cancel()
    ProcessTask->>WriteQueue: Flush all remaining items
    ProcessTask->>Disk: Write all items to disk
    StateManagerDisk-->>App: Shutdown complete

Additional Comments (1)

reflex/istate/manager/disk.py, line 285-305 (link)

logic: in-memory cache is never updated during set_state, only during get_state. this causes stale data to be served from memory cache while waiting for debounced writes. add memory cache update before queueing the write

_{6 files reviewed, 3 comments}

_{Edit Code Review Agent Settings | Greptile}

reflex/istate/manager/disk.py

codspeed-hq · 2025-10-14T20:52:58Z

CodSpeed Performance Report

Merging #5883 will not alter performance

_{Comparing masenf/disk-state-throttler (d1359cb) with main (ade1254)¹}

Summary

✅ 8 untouched

No successful run was found on main (1e2a8c9) during the generation of this report, so ade1254 was used instead as the comparison base. There might be some changes unrelated to this pull request in this report. ↩

…ottler

… state would expire conserve resources by pausing the _process_write_queue for the amount of time of the oldest known token to expire.

…rDisk

Avoid interference with _schedule_process_write_queue

…ottler

greptile-apps bot reviewed Oct 14, 2025

View reviewed changes

reflex/istate/manager/disk.py Show resolved Hide resolved

reflex/istate/manager/disk.py Outdated Show resolved Hide resolved

masenf added 12 commits October 14, 2025 14:17

FB: never sleep less than zero seconds

53c5cc0

AppHarness: call state_manager.close() for all state managers

bb1d8ec

Do not reschedule write queue after event loop is closed

da66814

Make AppHarness more compatible-er with the new StateManagerDisk

aeb3510

Merge remote-tracking branch 'origin/main' into masenf/disk-state-thr…

1f12113

…ottler

clear StateManagerDisk _write_queue on .close()

806872f

AppHarness.get_state makes sure to drain the backend's _write_queue

51b3f6a

move _flush_write_queue to a separate function

26cb98d

when debounce is disabled, sleep the expiration task until the oldest…

51c8207

… state would expire conserve resources by pausing the _process_write_queue for the amount of time of the oldest known token to expire.

simplify AppHarness song and dance for flushing backend's StateManage…

acc8225

…rDisk

Take _state_manager_lock when closing

03b524d

Avoid interference with _schedule_process_write_queue

Merge remote-tracking branch 'origin/main' into masenf/disk-state-thr…

d1359cb

…ottler

adhami3310 approved these changes Oct 16, 2025

View reviewed changes

adhami3310 merged commit 2cc6884 into main Oct 16, 2025
45 of 47 checks passed

adhami3310 deleted the masenf/disk-state-throttler branch October 16, 2025 00:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ENG-7948: StateManagerDisk deferred write queue #5883

ENG-7948: StateManagerDisk deferred write queue #5883

Uh oh!

masenf commented Oct 14, 2025

Uh oh!

linear bot commented Oct 14, 2025

Uh oh!

greptile-apps bot left a comment •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

codspeed-hq bot commented Oct 14, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ENG-7948: StateManagerDisk deferred write queue #5883

ENG-7948: StateManagerDisk deferred write queue #5883

Uh oh!

Conversation

masenf commented Oct 14, 2025

Uh oh!

linear bot commented Oct 14, 2025

Uh oh!

greptile-apps bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Greptile Overview

Summary

Confidence Score: 1/5

Important Files Changed

Sequence Diagram

Additional Comments (1)

Uh oh!

Uh oh!

Uh oh!

codspeed-hq bot commented Oct 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CodSpeed Performance Report

Merging #5883 will not alter performance

Summary

Footnotes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

greptile-apps bot left a comment •

edited

Loading

codspeed-hq bot commented Oct 14, 2025 •

edited

Loading