Merge upstream→gfx11 #1014

Open
eble-amd wants to merge 183 commits into ROCm:gfx11 from eble-amd:merge-from-upstream

Conversation

@eble-amd

@eble-amd eble-amd commented Jun 22, 2026


Merge from upstream. Most conflicts were automatically resolved by a three-way merge tool. There were just a few instances of both sides adding code in the same place; I kept it all.
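
For reviewers who want to see what "kept it all" means for a hunk where both sides added code at the same spot, here is a minimal sketch of a keep-both-sides resolution over Git conflict markers. It is an illustration only, not the merge tool used for this PR; the function name `keep_both_sides` and the sample imports are hypothetical.

```python
def keep_both_sides(text: str) -> str:
    """Resolve Git conflict hunks by keeping 'ours' followed by 'theirs'.

    Hypothetical helper for illustration. It understands the standard
    markers (<<<<<<<, =======, >>>>>>>) and drops the optional diff3
    base section that starts with |||||||.
    """
    out = []
    state = "normal"  # one of: normal, ours, base, theirs
    for line in text.splitlines(keepends=True):
        bare = line.rstrip("\r\n")
        if state == "normal" and bare.startswith("<<<<<<< "):
            state = "ours"      # start of our side; drop the marker line
        elif state == "ours" and bare.startswith("||||||| "):
            state = "base"      # diff3 common-ancestor section; skipped
        elif state in ("ours", "base") and bare == "=======":
            state = "theirs"    # start of their side; drop the marker line
        elif state == "theirs" and bare.startswith(">>>>>>> "):
            state = "normal"    # end of the hunk; drop the marker line
        elif state != "base":
            out.append(line)    # ordinary line, or content from either side
    return "".join(out)


conflicted = (
    "import a\n"
    "<<<<<<< HEAD\n"
    "import rocm_thing\n"
    "=======\n"
    "import upstream_thing\n"
    ">>>>>>> upstream/main\n"
    "import z\n"
)
print(keep_both_sides(conflicted))
```

Keeping both sides is only safe when the two additions are independent, so the result still needs a build or a test run to confirm that it works.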

dependabot Bot and others added 30 commits June 2, 2026 11:26
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
…mask (vllm-project#44082)

Signed-off-by: Yifan Qiao <yifanqiao@inferact.ai>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Claude <noreply@anthropic.com>
Signed-off-by: Nick Hill <nickhill123@gmail.com>
…llm-project#42027)

Signed-off-by: lesj0610 <lesj0610@users.noreply.github.com>
Co-authored-by: lesj0610 <lesj0610@users.noreply.github.com>
Signed-off-by: Siddharth Bedekar <bedeksid@gmail.com>
Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>
… supports check (vllm-project#43332)

Signed-off-by: Junhao Shen <junshen@nvidia.com>
Co-authored-by: Vadim Gimpelson <156319763+vadiklyutiy@users.noreply.github.com>
…t references it (vllm-project#44128)

Signed-off-by: Daoyuan Li <94409450+DaoyuanLi2816@users.noreply.github.com>
Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>
…on Top-p path. (vllm-project#42191)

Signed-off-by: js_park <cakeng@naver.com>
Co-authored-by: Nick Hill <nickhill123@gmail.com>
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
Signed-off-by: Rotem Shavitt <rshavitt@gmail.com>
…oject#44236)

Signed-off-by: Willow Lopez <100782273+Oxygen56@users.noreply.github.com>
)

Signed-off-by: William-Rom <william.rom@intility.no>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
…t-template-kwargs` for client-rendered datasets (vllm-project#44244)

Signed-off-by: Albert Cheng <albertching0112@gmail.com>
…-bake script (vllm-project#36949)

Signed-off-by: Andreas Karatzas <akaratza@amd.com>
vllm-project#44251)

Signed-off-by: Majid Taheri Andani <tahemaji@amazon.com>
Co-authored-by: Majid Taheri Andani <tahemaji@amazon.com>
Co-authored-by: tomeras91 <57313761+tomeras91@users.noreply.github.com>
…t#44287)

Signed-off-by: varun sundar rabindranath <vsundarr@redhat.com>
Co-authored-by: varun sundar rabindranath <vsundarr@redhat.com>
Signed-off-by: NolanHo <kujyo.eia.serias@gmail.com>
Co-authored-by: OpenAI Codex <codex@openai.com>
Co-authored-by: Bugen Zhao <i@bugenzhao.com>
Signed-off-by: xunzhuo <xunzhuo@vllm-semantic-router.ai>
Co-authored-by: Bugen Zhao <i@bugenzhao.com>
yma11 and others added 13 commits June 8, 2026 13:58
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
… rare OOMs (vllm-project#44761)

Signed-off-by: Andreas Karatzas <akaratza@amd.com>
Signed-off-by: zengxian <xiangdong.zeng@intel.com>
…llm-project#44828)

Signed-off-by: Sungjae Lee <33976427+llsj14@users.noreply.github.com>
Signed-off-by: Sungjae Lee <sung-jae.lee@navercorp.com>
…ct#44499)

Signed-off-by: Sahil Singh <sahiilsiingh37@gmail.com>
Co-authored-by: Bugen Zhao <i@bugenzhao.com>
…rence failures (vllm-project#44470)

Signed-off-by: Chaojun Zhang <chaojun.zhang@intel.com>
…eloaded (vllm-project#44419)

Signed-off-by: jmamou <jonathan.mamou@intel.com>
Signed-off-by: Jonathan Mamou <jonathan.mamou@intel.com>
Co-authored-by: Li, Jiang <bigpyj64@gmail.com>
Co-authored-by: Li, Jiang <jiang1.li@intel.com>
Signed-off-by: NickLucche <nlucches@redhat.com>
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: walterbm <walter.beller.morales@gmail.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
@eble-amd eble-amd force-pushed the merge-from-upstream branch from fc7be7f to 8be8c01 Compare June 22, 2026 17:13
@eble-amd eble-amd requested a review from mgehre-amd June 22, 2026 18:50
@eble-amd eble-amd marked this pull request as ready for review June 22, 2026 18:50
@mgehre-amd


Looks good! Did you do some smoke test by running some model before/after the change?

@eble-amd

Author

> Looks good! Did you do some smoke test by running some model before/after the change?

No, I thought our GitHub workflows were testing that.

@eble-amd

Author

Hm... I didn't notice till now that there are conflicts preventing merging. I'll convert this back to a draft to investigate.

@eble-amd eble-amd marked this pull request as draft June 22, 2026 19:54
Merge from upstream. Most conflicts were automatically resolved by a
three-way merge tool. There were just a few instances of both sides
adding code in the same place; I kept it all.

Signed-off-by: Dan Eble <Dan.Eble@amd.com>
@eble-amd eble-amd force-pushed the merge-from-upstream branch from 8be8c01 to a3e798b Compare June 22, 2026 20:24
@eble-amd

Author

Benchmark of model Qwen3-4B_128:

| Metric | This PR (my assigned test system) | Nightly regression suite |
| --- | --- | --- |
| Prefill (tokens/s) | 2566.35 | 2410.4 |
| TTFT (ms) | 50 | 53.1 |
| Decode (tokens/s) | 73.6 | 73.4 |
| TPOT (ms) | 13.59 | 13.6 |
| End-to-end latency, median (ms) | 1776 | 1783 |
| Average CPU utilization | 6% | 6% |
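
To make the comparison easier to read, the relative differences can be computed from the figures above. This is a small sketch; the helper names `percent_change` and `tpot_ms` and the dictionary keys are made up for illustration. The two runs come from different setups, so small differences may be noise rather than an effect of the merge.

```python
def percent_change(new: float, old: float) -> float:
    """Relative change of `new` versus `old`, in percent."""
    return (new - old) / old * 100.0


def tpot_ms(decode_tokens_per_s: float) -> float:
    """Time per output token in ms implied by a decode rate in tokens/s."""
    return 1000.0 / decode_tokens_per_s


# Figures copied from the benchmark comment (hypothetical key names).
this_pr = {
    "prefill_tok_s": 2566.35,
    "ttft_ms": 50.0,
    "decode_tok_s": 73.6,
    "e2e_latency_ms": 1776.0,
}
nightly = {
    "prefill_tok_s": 2410.4,
    "ttft_ms": 53.1,
    "decode_tok_s": 73.4,
    "e2e_latency_ms": 1783.0,
}

for name, value in this_pr.items():
    delta = percent_change(value, nightly[name])
    print(f"{name}: {delta:+.2f}% vs nightly")

# Sanity check: the reported TPOT should match 1000 / decode rate.
print(f"TPOT from decode rate (this PR): {tpot_ms(this_pr['decode_tok_s']):.2f} ms")
print(f"TPOT from decode rate (nightly): {tpot_ms(nightly['decode_tok_s']):.2f} ms")
```

Higher is better for the throughput rows, and lower is better for TTFT and latency.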

@eble-amd eble-amd marked this pull request as ready for review June 23, 2026 18:23
@mgehre-amd


I merged @marcusr-amd's PR first, and that seems to have caused conflicts here. Can you please resolve them?
