Fix: Claude Code VS Code extension fails to parse responses when every chunk includes usage data#3670
zhaohuiweixiao wants to merge 3 commits into higress-group:main from
Conversation
This change is not very appropriate. When some large model services process streaming requests containing … Streaming response example (https://api.moonshot.cn/v1/chat/completions + moonshot-v1-8k):
…y chunk includes usage data Signed-off-by: zhaohuihui <zhaohuihui_yewu@cmss.chinamobile.com>
Force-pushed from 64dfd17 to 6f1c73c
Updated: a message_stop event is now sent when [DONE] is received.
Could there be scenarios where the server does not return [DONE]? We need to analyze the code further and verify with tests.
Do you mean we should test whether the mainstream implementations listed in that document all return [DONE]?

- HuggingFace TGI: sends `[DONE]\n` (with a trailing newline), which could cause parsing problems, but this has been fixed.
- FastChat: follows the OpenAI spec and supports streaming output.
- Ollama: does not send `[DONE]`; instead, it marks the final JSON chunk of the stream with `"done": true`.
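The three end-of-stream conventions above can be checked with a single helper. This is a minimal sketch for discussion, not the plugin's actual code; `isStreamEnd` and its field coverage are illustrative:

```go
package main

import (
	"encoding/json"
	"strings"
)

// isStreamEnd reports whether an SSE data payload marks the end of a
// streaming completion. It covers the three conventions discussed above:
//   - the OpenAI-style "[DONE]" sentinel (with or without a trailing newline)
//   - the Ollama-style `"done": true` field in the final JSON chunk
//   - a non-empty finish_reason on any choice
func isStreamEnd(payload string) bool {
	trimmed := strings.TrimSpace(payload)
	if trimmed == "[DONE]" {
		return true
	}
	var chunk struct {
		Done    bool `json:"done"`
		Choices []struct {
			FinishReason *string `json:"finish_reason"`
		} `json:"choices"`
	}
	if err := json.Unmarshal([]byte(trimmed), &chunk); err != nil {
		return false
	}
	if chunk.Done {
		return true
	}
	for _, c := range chunk.Choices {
		if c.FinishReason != nil && *c.FinishReason != "" {
			return true
		}
	}
	return false
}
```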
Ⅰ. Describe what this PR did
Fix bug: Claude Code VS Code extension fails to parse responses when every chunk includes usage data.

In the ai-proxy plugin, the logic that converts the OpenAI protocol to the Claude protocol treats any chunk containing usage as the end of the stream. Therefore, when the model attaches usage to every chunk, the plugin prematurely emits a message_stop event, causing parsing failures in the Claude Code VS Code extension.
Model output example:

```
data: {"id":"019d3d99245a5dd32971f17e72e2e4e3","object":"chat.completion.chunk","created":1774854940,"model":"Minimax-M2.5","choices":[{"index":0,"delta":{"role":"assistant","content":""}}],"system_fingerprint":"","usage":{"prompt_tokens":40,"completion_tokens":0,"total_tokens":40,"prompt_tokens_details":{},"completion_tokens_details":{}}}
data: {"id":"019d3d99245a5dd32971f17e72e2e4e3","object":"chat.completion.chunk","created":1774854940,"model":"Minimax-M2.5","choices":[{"index":0,"delta":{"role":"assistant","content":"","reasoning_content":"用户"}}],"system_fingerprint":"","usage":{"prompt_tokens":40,"completion_tokens":1,"total_tokens":41,"prompt_tokens_details":{},"completion_tokens_details":{"reasoning_tokens":1}}}
data: {"id":"019d3d99245a5dd32971f17e72e2e4e3","object":"chat.completion.chunk","created":1774854940,"model":"Minimax-M2.5","choices":[{"index":0,"delta":{"role":"assistant","content":"","reasoning_content":"用"}}],"system_fingerprint":"","usage":{"prompt_tokens":40,"completion_tokens":2,"total_tokens":42,"prompt_tokens_details":{},"completion_tokens_details":{"reasoning_tokens":2}}}
```
ai-proxy output:

Ⅱ. Does this pull request fix one issue?
Yes.
Ⅲ. Why don't you add test cases (unit test/integration test)?
Ⅳ. Describe how to verify it
Verify that a chunk is treated as the end of the stream only when it contains a finish_reason, rather than whenever it contains usage data.
Ⅴ. Special notes for reviews
Ⅵ. AI Coding Tool Usage Checklist (if applicable)
Please check all applicable items:
For new standalone features (e.g., new wasm plugin or golang-filter plugin):
- `design/` directory in the plugin folder

For regular updates/changes (not new plugins):
- AI Coding Prompts (for regular updates)
- AI Coding Summary