Conversation

@simonferquel
Contributor

Summary

Refactors session message handling to improve LLM prompt caching by categorizing system messages based on caching characteristics.

Changes

  • Split system messages into three categories: invariant (cacheable globally), context-specific (cacheable per user/project), and session summaries
  • Added cache control markers at each category boundary, so each cacheable prefix can be reused independently
  • New tests for cache control behavior
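The categorization above can be sketched as follows. This is a minimal illustration, not the PR's actual code: the type and field names (`Category`, `CacheControl`, `markCacheBoundaries`) are hypothetical, and the marker semantics assume an Anthropic-style scheme where a marker on a message lets the provider reuse the entire prompt prefix up to and including that message.

```go
package main

import "fmt"

// Category orders system messages from most to least cacheable.
// Hypothetical names; the PR does not show its actual types.
type Category int

const (
	Invariant       Category = iota // identical for every session → cacheable globally
	ContextSpecific                 // stable per user/project → cacheable per context
	SessionSummary                  // changes every session → not worth a marker
)

type Message struct {
	Category     Category
	Content      string
	CacheControl bool // marks the end of a cacheable prefix
}

// markCacheBoundaries sets a cache marker on the last message of each
// cacheable category, so the prompt prefix up to that point can be reused.
func markCacheBoundaries(msgs []Message) []Message {
	for i := range msgs {
		lastOfCategory := i == len(msgs)-1 || msgs[i+1].Category != msgs[i].Category
		if lastOfCategory && msgs[i].Category != SessionSummary {
			msgs[i].CacheControl = true
		}
	}
	return msgs
}

func main() {
	msgs := markCacheBoundaries([]Message{
		{Category: Invariant, Content: "base system prompt"},
		{Category: ContextSpecific, Content: "project context"},
		{Category: SessionSummary, Content: "previous session summary"},
	})
	for _, m := range msgs {
		fmt.Println(m.Category, m.CacheControl)
	}
}
```

With markers at both boundaries, a request that shares only the invariant prefix still gets a partial cache hit, while a request from the same user/project reuses the longer prefix.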

Benefits

  • Better cache hit rates → lower API costs and faster responses
  • Fully backward compatible

Technical

Replaced the monolithic GetMessages() with three focused functions, one per message category, so callers can place cache markers between categories and reuse cached prompt prefixes at multiple granularities.
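A rough sketch of that split, under stated assumptions: the accessor names and the `Session` layout below are hypothetical (the PR only names the removed `GetMessages()`), but the shape shows why per-category accessors help, since concatenating them in a fixed order keeps the old behavior while exposing the boundaries.

```go
package main

import "fmt"

// Hypothetical types; the PR does not show its actual signatures.
type Message struct{ Role, Content string }

type Session struct {
	invariant []Message // identical across all sessions
	context   []Message // stable per user/project
	summaries []Message // per-session summaries
}

// One accessor per category replaces the old monolithic GetMessages(),
// letting callers insert a cache marker after each cacheable group.
func (s *Session) InvariantMessages() []Message { return s.invariant }
func (s *Session) ContextMessages() []Message   { return s.context }
func (s *Session) SummaryMessages() []Message   { return s.summaries }

// AllMessages reproduces the old behavior: concatenate the three
// categories in cacheability order (most stable first).
func (s *Session) AllMessages() []Message {
	all := make([]Message, 0, len(s.invariant)+len(s.context)+len(s.summaries))
	all = append(all, s.InvariantMessages()...)
	all = append(all, s.ContextMessages()...)
	return append(all, s.SummaryMessages()...)
}

func main() {
	s := &Session{
		invariant: []Message{{"system", "base prompt"}},
		context:   []Message{{"system", "project notes"}},
		summaries: []Message{{"system", "last session summary"}},
	}
	fmt.Println(len(s.AllMessages())) // → 3
}
```

Keeping the concatenation order fixed (most stable messages first) is what makes the split backward compatible while still exposing the category boundaries to the caching layer.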

@simonferquel simonferquel requested a review from a team as a code owner January 16, 2026 15:34
@simonferquel
Contributor Author

/review

@dgageot dgageot merged commit ef02c5c into docker:main Jan 16, 2026
5 checks passed
