.NET: Issue #5350: maximal ChatClientAgent repro + Track A gap tests (A2 / A2b / A3 / A4) by Copilot · Pull Request #5952 · microsoft/agent-framework

Copilot · 2026-05-19T14:26:24Z

Motivation and Context

Issue #5350 reports that ToolApprovalRequestContent.ToolCall loses its concrete FunctionCallContent type after a JsonCheckpointStore round-trip, breaking FunctionInvokingChatClient's pattern match on resume. The OP attributes this to missing [JsonPolymorphic]/[JsonDerivedType] metadata or a serializer-options chain that does not reach AIJsonUtilities.DefaultOptions; a maintainer comment notes both are already in place. We need a reproducible baseline before attempting a fix.

Description

Adds ToolApprovalRequestCheckpointReproTests (11 tests, all passing) layered from the pure serializer up to full agent workflows, plus a written validation plan. The OP's hypothesis does not reproduce at any layer exercised; one test surfaces a separate, real bug worth fixing independently.

- Layered marshaller tests (1–5): direct JsonMarshaller round-trip of ToolApprovalRequestContent, then wrapped in PortableValue, then as ExternalRequest.Data, then 25× deterministic repetition, plus a wire-format capture confirming "$type": "functionCall" is emitted.
- RequestPort checkpoint resume (6): full CheckpointManager.CreateJson(InMemoryJsonStore) → RunStreamingAsync → SuperStep checkpoint → ResumeStreamingAsync with a RequestPort<TARC, TARR> (TARC and TARR abbreviate ToolApprovalRequestContent and its response counterpart; FCC below abbreviates FunctionCallContent).
- Maximal E2E (7): real ChatClientAgent over a MockChatClient with ApprovalRequiredAIFunction wrapping a counting AIFunction, bound directly into a single-agent WorkflowBuilder. Asserts that the resumed RequestInfoEvent.Request.Data.As<TARC>().ToolCall is FunctionCallContent, then sends the approval response and asserts that the wrapped function is invoked exactly once and that the resulting FunctionResultContent flows into the follow-up chat-client call.
- Track A gap tests (8–11), sharing RunReproAsync + ReproHarness:
  - A2 (GroupChatBuilder): FCC type is still preserved after resume. Uncovers a separate bug: approving the resumed request throws ArgumentException ("An item with the same key has already been added. Key: ficc_call-1") from FunctionInvokingChatClient.ExtractAndRemoveApprovalRequestsAndResponses. The test pins this observed behavior with a TODO(#5350-followup)-tagged assertion.
  - A2b (HandoffWorkflowBuilder): same as A2 but uses AgentWorkflowBuilder.CreateHandoffBuilderWith(...) with a no-op peer agent so the handoff graph is valid (the mock chat client never emits a handoff_to_* call, so the workflow stays on the initial approval-tool-equipped agent). FCC type is preserved after resume and the workflow completes cleanly post-approval: tool invoked exactly once, zero errors, zero executor failures. This narrows A2's duplicate-key crash to the group-chat orchestration specifically (it does not occur on the handoff path).
  - A3 (StringRoundTripJsonStore): wraps InMemoryJsonStore and round-trips every JsonElement via GetRawText() + JsonDocument.Parse on both commit and retrieve, emulating the OP's Dapper / SQL nvarchar hop. Identity-preserving; passes.
  - A4 (custom JsonSerializerOptions): JsonSerializerDefaults.Web with no AI resolver, passed as customOptions to CheckpointManager.CreateJson. Confirms that JsonMarshaller's internal WorkflowsJsonUtilities.DefaultOptions chain always wins for known AIContent types; external options cannot displace it.
- docs/working/issue-5350-root-cause-validation-plan.md: records the disproof and scopes the remaining investigation tracks: request the OP's actual pre-/post-SQL JSON bytes (Track B), doc improvements for the option-chain / byte-preservation contract (Track C), and follow up on the A2 duplicate-key crash (now known to be group-chat-specific).
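
The string hop that A3 emulates can be sketched outside the test harness. This is a minimal standalone illustration, not the PR's StringRoundTripJsonStore: the payload and its property names are made up for the example, and only JsonDocument.Parse and JsonElement.GetRawText are taken from the description above.

```csharp
using System;
using System.Text.Json;

// Hypothetical payload; the real checkpoint JSON is not shown in this PR.
const string Payload = "{\"$type\":\"functionCall\",\"callId\":\"call-1\",\"name\":\"Count\"}";

using JsonDocument source = JsonDocument.Parse(Payload);
JsonElement before = source.RootElement;

// JsonElement -> string, as a text column would store it.
string stored = before.GetRawText();

// string -> JsonElement, as a read from that column would rebuild it.
using JsonDocument reparsed = JsonDocument.Parse(stored);
JsonElement after = reparsed.RootElement;

// The raw text is unchanged and the discriminator property is still readable.
bool textPreserved = stored == Payload;
var discriminator = after.GetProperty("$type").GetString();

Console.WriteLine($"text preserved: {textPreserved}");
Console.WriteLine($"discriminator after hop: {discriminator}");
```

If a real persistence layer rewrote the text (for example by re-serializing through a typed model), the comparison against the stored string is where a difference would first show up.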

No production code is modified; this is investigation scaffolding plus a pinned regression test for the bug surfaced by A2.

Contribution Checklist

The code builds clean without any errors or warnings
The PR follows the Contribution Guidelines
All unit tests pass, and I have added new tests where possible
Is this a breaking change? If yes, add "[BREAKING]" prefix to the title of the PR.

lokitoth · 2026-05-19T14:27:33Z

@copilot Add a variant of A2 which uses Handoffs

Copilot · 2026-05-19T14:29:24Z

> @copilot Add a variant of A2 which uses Handoffs

Added in 1011a41 as Repro_5350_A2b_HandoffWorkflowBuilder_.... Uses AgentWorkflowBuilder.CreateHandoffBuilderWith(harness.Agent) with a no-op peer agent so the handoff graph is valid; the mock chat client never emits a handoff_to_* call so the workflow stays on the initial approval-tool-equipped agent. Finding: the handoff path completes cleanly — TARC.ToolCall is FunctionCallContent post-resume, tool invoked exactly once, zero errors/executor failures. The duplicate-key ArgumentException from A2 does not occur here, narrowing that separate bug to the group-chat orchestration specifically.
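
The A2 exception text has the shape of the message that Dictionary<TKey, TValue>.Add produces on a repeated key. As an illustration of that failure mode only, here is a minimal sketch; it is an assumption that the approval bookkeeping is keyed by call id in a dictionary, and the variable names and values are hypothetical rather than taken from FunctionInvokingChatClient.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical bookkeeping keyed by call id; this is not FunctionInvokingChatClient's code.
var pendingByCallId = new Dictionary<string, string>();
const string CallId = "ficc_call-1";

pendingByCallId.Add(CallId, "approval request seen in the live run");

bool threw = false;
string message = "";
try
{
    // Adding the same key a second time (for example, a request replayed after resume) throws.
    pendingByCallId.Add(CallId, "same approval request seen again");
}
catch (ArgumentException ex)
{
    threw = true;
    message = ex.Message;
}

Console.WriteLine($"threw: {threw}");
Console.WriteLine(message);
```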

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Adds investigation scaffolding for issue #5350 by introducing layered repro tests that validate ToolApprovalRequestContent.ToolCall (polymorphic FunctionCallContent) survives JSON checkpoint round-trips, and documents a validation plan. Also pins a separate group-chat-specific duplicate-key failure observed after resume when approving.

Changes:

- Add ToolApprovalRequestCheckpointReproTests with serializer-only and end-to-end checkpoint/resume coverage, including GroupChat/Handoff variants and a store that mimics string-based persistence.
- Add a working validation-plan doc capturing results, next investigative tracks, and the unrelated A2 duplicate-key crash.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

| File | Description |
| --- | --- |
| dotnet/tests/Microsoft.Agents.AI.Workflows.UnitTests/ToolApprovalRequestCheckpointReproTests.cs | New layered repro + E2E tests for #5350 plus Track A variants and store shim |
| docs/working/issue-5350-root-cause-validation-plan.md | New investigation write-up documenting findings + next steps |

+    [Fact]
+    public void Repro_5350_CaptureWireFormat_ForInspection()
+    {
+        JsonMarshaller marshaller = new();
+
+        JsonElement element = marshaller.Marshal(MakeApprovalRequest());
+        string serialized = element.GetRawText();
+
+        // Always-true assertion — purpose of this test is to expose the wire format.
+        serialized.Should().NotBeNullOrEmpty();
+        serialized.Should().Contain(CallId, "the call id should be present in the serialized form");
+    }
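
The PR description says the wire-format capture confirms that the "$type": "functionCall" discriminator is emitted. As a standalone sketch of how System.Text.Json writes and consumes such a discriminator, the example below uses two hypothetical stand-in types (Content and CallContent); they are not the real AIContent hierarchy, and the real types may be configured differently.

```csharp
using System;
using System.Text.Json;
using System.Text.Json.Serialization;

// Serialize through the base type so the type discriminator is written.
Content original = new CallContent { CallId = "call-1", Name = "Count" };
string json = JsonSerializer.Serialize(original);

// Deserialize through the base type; the discriminator selects the derived type.
var restored = JsonSerializer.Deserialize<Content>(json);

bool hasDiscriminator = json.Contains("\"$type\":\"functionCall\"");
bool isDerived = restored is CallContent { CallId: "call-1" };

Console.WriteLine(json);
Console.WriteLine($"discriminator present: {hasDiscriminator}, derived type restored: {isDerived}");

// Hypothetical stand-ins for a polymorphic content hierarchy (requires .NET 7 or later).
[JsonPolymorphic(TypeDiscriminatorPropertyName = "$type")]
[JsonDerivedType(typeof(CallContent), typeDiscriminator: "functionCall")]
public class Content { }

public sealed class CallContent : Content
{
    public string CallId { get; set; } = "";
    public string Name { get; set; } = "";
}
```

An assertion on the discriminator text, in addition to the call id, is what would turn a wire-format capture into a check that the polymorphic metadata was actually applied.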


Copilot AI and others added 4 commits May 19, 2026 13:31

Add issue #5350 repro tests + root-cause validation plan

e28682a

Agent-Logs-Url: https://github.com/microsoft/agent-framework/sessions/79cbae85-b46f-470f-978d-7970d6f9e091 Co-authored-by: lokitoth <6936551+lokitoth@users.noreply.github.com>

Add maximal E2E ChatClientAgent + ApprovalRequiredAIFunction repro for …

8d503c0

…#5350 Agent-Logs-Url: https://github.com/microsoft/agent-framework/sessions/baf2578f-bdc5-479b-8ebc-1b198f26c151 Co-authored-by: lokitoth <6936551+lokitoth@users.noreply.github.com>

Add A2/A3/A4 gap tests; pin GroupChat duplicate-key bug discovered by A2

e53e87b

Agent-Logs-Url: https://github.com/microsoft/agent-framework/sessions/153b10f3-2555-45fe-b264-42a2e9100cf2 Co-authored-by: lokitoth <6936551+lokitoth@users.noreply.github.com>

Add A2b: HandoffWorkflowBuilder variant of A2 — completes cleanly

1011a41

Agent-Logs-Url: https://github.com/microsoft/agent-framework/sessions/e7100d8a-ca6d-48e6-8210-78ba33dbd53c Co-authored-by: lokitoth <6936551+lokitoth@users.noreply.github.com>

Copilot AI review requested due to automatic review settings May 19, 2026 14:26

Copilot AI assigned Copilot and lokitoth May 19, 2026

Copilot created this pull request from a session on behalf of lokitoth May 19, 2026 14:26

Copilot started work on behalf of lokitoth May 19, 2026 14:29

Copilot AI changed the title ~~Issue #5350: maximal ChatClientAgent repro + Track A gap tests~~ Issue #5350: maximal ChatClientAgent repro + Track A gap tests (A2 / A2b / A3 / A4) May 19, 2026

Copilot AI requested a review from lokitoth May 19, 2026 14:30

Merge branch 'main' into copilot/5350-implement-repro-unit-test

035b5b8

Copilot AI review requested due to automatic review settings May 19, 2026 14:34

lokitoth had a problem deploying to integration May 19, 2026 14:40 — with GitHub Actions Failure

moonbox3 added the documentation and .NET labels May 19, 2026

Copilot AI reviewed May 19, 2026

Copilot finished work on behalf of lokitoth May 19, 2026 14:50

Copilot started reviewing on behalf of lokitoth May 19, 2026 14:52

Copilot started work on behalf of lokitoth May 19, 2026 14:54

github-actions Bot changed the title ~~Issue #5350: maximal ChatClientAgent repro + Track A gap tests (A2 / A2b / A3 / A4)~~ .NET: Issue #5350: maximal ChatClientAgent repro + Track A gap tests (A2 / A2b / A3 / A4) May 19, 2026

Copilot finished work on behalf of lokitoth May 19, 2026 15:03

Copilot wants to merge 5 commits into main from copilot/5350-implement-repro-unit-test
