Fix/495 ollama thinking#506

Merged
dosco merged 2 commits into ax-llm:main from YizukiAme:fix/495-ollama-thinking
Apr 10, 2026

Conversation

@YizukiAme
Contributor

Closes #495

Changes

New: chatRespProcessor / chatStreamRespProcessor callbacks

Added two optional callbacks to AxAIOpenAIBaseArgs (following the existing chatReqUpdater pattern), allowing providers to post-process responses without subclassing AxAIOpenAIImpl.
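A minimal sketch of what this callback pattern looks like in use. The names AxAIOpenAIBaseArgs, chatRespProcessor, and chatStreamRespProcessor come from the PR description; the response shape and exact signatures below are illustrative assumptions, not the library's actual types.

```typescript
// Assumed minimal response shape for illustration only.
interface ChatResponse {
  content: string;
  thought?: string;
}

// Sketch of the relevant slice of AxAIOpenAIBaseArgs (hypothetical shape).
interface AxAIOpenAIBaseArgsSketch {
  // Post-process a complete (non-streaming) response.
  chatRespProcessor?: (resp: ChatResponse) => ChatResponse;
  // Post-process each streamed chunk; an implementation may keep
  // state across calls to handle tags split between chunks.
  chatStreamRespProcessor?: (chunk: ChatResponse) => ChatResponse;
}

// A provider can inject behavior without subclassing AxAIOpenAIImpl:
const args: AxAIOpenAIBaseArgsSketch = {
  chatRespProcessor: (resp) => ({ ...resp, content: resp.content.trim() }),
};
```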

Ollama think parameter support

  • thinkingTokenBudget: 'none' → sends think: false (disables thinking, reduces latency)
  • Any other budget value → sends think: true
  • hasThinkingBudget and hasShowThoughts are both set to true in the model's feature flags
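The budget-to-flag mapping above reduces to a small helper. This is a sketch: only the 'none' value is taken from the PR; the other budget values and the helper name toOllamaThink are assumptions for illustration.

```typescript
// Assumed set of budget values; only 'none' is confirmed by the PR.
type ThinkingTokenBudget = 'none' | 'minimal' | 'low' | 'medium' | 'high';

// Hypothetical helper: map a thinkingTokenBudget to Ollama's `think` flag.
function toOllamaThink(budget?: ThinkingTokenBudget): boolean | undefined {
  if (budget === undefined) return undefined; // leave provider default
  return budget !== 'none'; // 'none' → think: false; any other budget → think: true
}
```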

<think> tag extraction

When Ollama returns thinking content inline as <think>...</think>, it is now extracted into the thought field and removed from content:

  • Non-streaming: regex extraction after full response
  • Streaming: stateful chunk-by-chunk routing via processThinkStreamChunk()
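The two strategies above can be sketched as follows. The function names extractThinkTags and processThinkStreamChunk appear in the PR, but these bodies are illustrative re-implementations, not the merged code; the streaming sketch assumes each tag arrives whole within one chunk, whereas a real implementation must also buffer tags split across chunk boundaries.

```typescript
// Non-streaming: regex extraction after the full response has arrived.
function extractThinkTags(content: string): { content: string; thought?: string } {
  const m = content.match(/<think>([\s\S]*?)<\/think>/);
  if (!m) return { content }; // no-tag pass-through case
  return {
    thought: m[1].trim(),
    content: content.replace(m[0], '').trim(),
  };
}

// Streaming: route each chunk into `thought` or `content` depending on
// whether we are currently inside a <think> block; state persists across calls.
interface ThinkStreamState {
  inThink: boolean;
}

function processThinkStreamChunk(
  chunk: string,
  state: ThinkStreamState
): { content: string; thought: string } {
  let content = '';
  let thought = '';
  let rest = chunk;
  while (rest.length > 0) {
    if (state.inThink) {
      const end = rest.indexOf('</think>');
      if (end === -1) {
        thought += rest; // still inside the think block
        rest = '';
      } else {
        thought += rest.slice(0, end);
        rest = rest.slice(end + '</think>'.length);
        state.inThink = false;
      }
    } else {
      const start = rest.indexOf('<think>');
      if (start === -1) {
        content += rest; // plain content
        rest = '';
      } else {
        content += rest.slice(0, start);
        rest = rest.slice(start + '<think>'.length);
        state.inThink = true;
      }
    }
  }
  return { content, thought };
}
```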

Testing

  • test:unit: 1854 passed
  • test:type-check: clean
  • test:lint: clean

When think: true is passed to Ollama, thinking models (Qwen3 etc.)
return thought content inline as <think>...</think> in the response
body. This commit:

- Adds chatRespProcessor / chatStreamRespProcessor callbacks to
  AxAIOpenAIBaseArgs so providers can post-process responses without
  subclassing AxAIOpenAIImpl
- Implements extractThinkTags() for non-streaming responses
- Implements processThinkStreamChunk() with stateful streaming support
- Sets hasShowThoughts: true for Ollama
- Adds tests for both tag extraction and the no-tag pass-through case
@dosco merged commit b1fcdf3 into ax-llm:main on Apr 10, 2026
1 check passed

Development

Successfully merging this pull request may close these issues.

Feature: Support think parameter for Ollama thinking models (Qwen 3/3.5, etc.)
