openai: handle newer Responses stream event shapes#2976

rumpl · 2026-06-02T23:46:22Z

Newer Responses API models can announce assistant text with response.content_part.added using part.type=output_text. The adapter only recognized text, so those msg_* items fell into a misleading tool-call warning path even though no function call was involved.

Keep response.content_part.added structural. Text is streamed by response.output_text.delta, and output_item.done carries a final snapshot. Emitting both snapshots and deltas duplicates assistant text.

Use output_item.done only as a fallback for missed text. Some streams omit the top-level item_id there while still setting item.id, so resolve that ID before checking whether content was already streamed.

Also tolerate function_call_arguments.delta arriving before the matching output_item.added event by buffering the argument bytes until the tool item is registered. This avoids dropping tool arguments when stream events are delivered in that order.

Add coverage for output_text content parts, output_item.done fallback and early function-call argument deltas.

docker-agent

Assessment: 🟡 NEEDS ATTENTION

This PR correctly handles the newer Responses API stream event shapes. The structural treatment of response.content_part.added, the itemID resolution fallback in output_item.done, and the pendingArgs buffering mechanism are all well-designed. One latent edge case was found in the argument-flush path.

docker-agent · 2026-06-02T23:50:30Z

+			// delta before the output_item.added event. Flush any such buffered
+			// bytes with the first named tool-call delta so the runtime can still
+			// reconstruct the call.
 			if funcName != "" {


[MEDIUM] pendingArgs silently lost when funcName is empty in output_item.added

When a response.output_item.added event arrives for a function_call item, the buffered early-argument bytes (pendingArgs[itemID]) are only flushed inside the if funcName != "" block. If the function name is absent from this event (e.g., the name arrives later via a separate event or delta), the item is registered in itemCallIDMap, but the pre-buffered argument bytes are never deleted or delivered. Any function_call_arguments.delta events that arrived before output_item.added — and were buffered correctly — are then permanently stranded in pendingArgs and silently dropped.

if funcName != "" { args := a.pendingArgs[itemID] delete(a.pendingArgs, itemID) // ← only reached when name is known // ... emit toolcall } // No else: if funcName == "", pendingArgs[itemID] is never freed

The itemCallIDMap registration at line 108 ensures future argument deltas emit directly (correct), but the already-buffered bytes are orphaned. Consider unconditionally flushing pendingArgs[itemID] whenever the item is first registered, regardless of whether the name is available, and handle the name-absent case separately.

function names can't be empty

You're absolutely right — I was wrong about this. Looking at the test TestWSStream_BuffersArgumentsDeltaBeforeOutputItemAdded and the OpenAI Responses API contract, output_item.added events for function_call items always include the function name in item.name. The if funcName != "" check on line 112 is defensive but will always be true for well-formed function calls, so pendingArgs will always be flushed correctly when the item is announced.

I misunderstood the API semantics and flagged correct code as buggy. Thanks for the correction, and sorry for the noise!

Newer Responses API models can announce assistant text with response.content_part.added using part.type=output_text. The adapter only recognized text, so those msg_* items fell into a misleading tool-call warning path even though no function call was involved. Keep response.content_part.added structural. Text is streamed by response.output_text.delta, and output_item.done carries a final snapshot. Emitting both snapshots and deltas duplicates assistant text. Use output_item.done only as a fallback for missed text. Some streams omit the top-level item_id there while still setting item.id, so resolve that ID before checking whether content was already streamed. Also tolerate function_call_arguments.delta arriving before the matching output_item.added event by buffering the argument bytes until the tool item is registered. This avoids dropping tool arguments when stream events are delivered in that order. Add coverage for output_text content parts, output_item.done fallback and early function-call argument deltas. Signed-off-by: Djordje Lukic <djordje.lukic@docker.com>

rumpl requested a review from a team as a code owner June 2, 2026 23:46

docker-agent reviewed Jun 2, 2026

View reviewed changes

dgageot previously approved these changes Jun 3, 2026

View reviewed changes

rumpl dismissed dgageot’s stale review via 49d4e5f June 3, 2026 07:56

rumpl force-pushed the fix/openai-responses-stream-events branch from f85a457 to 49d4e5f Compare June 3, 2026 07:56

dgageot approved these changes Jun 3, 2026

View reviewed changes

dgageot merged commit 6d98c17 into docker:main Jun 3, 2026
7 checks passed

BrewTestBot mentioned this pull request Jun 3, 2026

docker-agent 1.73.0 Homebrew/homebrew-core#286068

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

openai: handle newer Responses stream event shapes#2976

openai: handle newer Responses stream event shapes#2976
dgageot merged 1 commit into
docker:mainfrom
rumpl:fix/openai-responses-stream-events

rumpl commented Jun 2, 2026

Uh oh!

docker-agent left a comment

Uh oh!

docker-agent Jun 2, 2026

Uh oh!

rumpl Jun 3, 2026

Uh oh!

docker-agent Jun 3, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

rumpl commented Jun 2, 2026

Uh oh!

docker-agent left a comment

Choose a reason for hiding this comment

Assessment: 🟡 NEEDS ATTENTION

Uh oh!

docker-agent Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

rumpl Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

docker-agent Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants