fix: Forward HTTP headers for generate requests by mudit-eng · Pull Request #8795 · triton-inference-server/server

mudit-eng · 2026-05-20T23:00:05Z

What does the PR do?

Forwards matching HTTP headers as inference request parameters in the /generate and /generate_stream HTTP paths by calling the same ForwardHeaders helper used by /infer.

Adds a generate endpoint regression using an ensemble that routes to the Python backend and verifies a forwarded HTTP header is visible through request.parameters().

Checklist

Commit Type:

Related PRs:

None.

Where should the reviewer start?

src/http_server.cc, then qa/L0_http/generate_endpoint_test.py.

Test plan:

git diff --check HEAD~1..HEAD
python3 -m py_compile qa/L0_http/generate_endpoint_test.py qa/python_models/generate_models/mock_llm/1/model.py
qa/L0_http/test.sh generate endpoint section (not run locally: /opt/tritonserver/bin/tritonserver is not present in this environment)
CI Pipeline ID:
N/A

Caveats:

Local end-to-end QA could not be run because this environment does not include the Triton server runtime binary.

Background

Linear issue TGH-116 reported that --http-header-forward-pattern worked for /infer but not /generate when routing from an ensemble to the Python backend.

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

Resolves Linear TGH-116

Linear Issue: TGH-116

Co-authored-by: Mudit Aggarwal <mudita@nvidia.com>

fix: forward HTTP headers for generate requests

db8921f

Co-authored-by: Mudit Aggarwal <mudita@nvidia.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: Forward HTTP headers for generate requests#8795

fix: Forward HTTP headers for generate requests#8795
mudit-eng wants to merge 1 commit into
mainfrom
cursor/fix-generate-header-forwarding-d7c2

mudit-eng commented May 20, 2026 •

edited by cursor Bot

Loading

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

mudit-eng commented May 20, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does the PR do?

Checklist

Commit Type:

Related PRs:

Where should the reviewer start?

Test plan:

Caveats:

Background

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

2 participants

mudit-eng commented May 20, 2026 •

edited by cursor Bot

Loading