-
Notifications
You must be signed in to change notification settings - Fork 243
Pull requests: jd-opensource/xllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: add RainFusion sparse attention for wan2.2.
#1867
opened Jul 1, 2026 by
ethan686
Collaborator
Loading…
feat: add ViT encoder ACL Graph capture for Qwen3-VL.
#1866
opened Jul 1, 2026 by
sunbaosong
Collaborator
Loading…
feat: support multi machine for offline inference.
#1863
opened Jul 1, 2026 by
weizhehuang0827
Collaborator
Loading…
9 of 12 tasks
feat: add prefix-cache-affinity dp rank routing.
#1860
opened Jul 1, 2026 by
shifengmin
Collaborator
Loading…
7 of 15 tasks
feat: add VAE parallel for wan22 and QwenImageEdit.
#1858
opened Jul 1, 2026 by
ethan686
Collaborator
Loading…
feat: update CMake configuration for xllm_ops vendor directory and ma…
#1856
opened Jun 30, 2026 by
msmilezz
Collaborator
Loading…
17 tasks
feat: use grouped gemm in aiter lib.
#1853
opened Jun 30, 2026 by
cding-nv
Contributor
Loading…
8 of 17 tasks
feat: enable SwiGluQuant fusion for Qwen3 MLP
#1847
opened Jun 29, 2026 by
nie-linfeng
Contributor
Loading…
feat: add linear-state block type and sequence slot accessors (3/n).
#1846
opened Jun 29, 2026 by
yingxudeng
Collaborator
Loading…
17 tasks
bugfix: propagate is_hybrid_linear_attention for Qwen3.5 VLM graph capture
#1844
opened Jun 29, 2026 by
maojunx99
Contributor
Loading…
3 of 11 tasks
bugfix: fix num_used_block collections in pd prefill.
#1841
opened Jun 29, 2026 by
phantomlei3
Collaborator
Loading…
8 of 17 tasks
[WIP]feat: support qwen3.5 linear-state prefix cache.
#1839
opened Jun 27, 2026 by
yingxudeng
Collaborator
•
Draft
17 tasks
bugfix: synchronize MTP compute stream after draft/validate forward.
#1838
opened Jun 27, 2026 by
yingxudeng
Collaborator
Loading…
17 tasks
feat: add beam search torch support to mlu.
#1835
opened Jun 27, 2026 by
phantomlei3
Collaborator
•
Draft
17 tasks
feat: support batch embedding requests.
#1828
opened Jun 26, 2026 by
DongheJin
Collaborator
Loading…
17 tasks
bugfix: fix MTP DP concurrent inference failures.
#1817
opened Jun 24, 2026 by
DongheJin
Collaborator
Loading…
17 tasks
feat: enable bf16 fallback for o_proj in Qwen3-VL W8A8 quantization
#1809
opened Jun 23, 2026 by
nie-linfeng
Contributor
Loading…
feat: add scheduler-side linear-state prefix cache and sequence slot (3/n).
#1806
opened Jun 23, 2026 by
yingxudeng
Collaborator
Loading…
17 tasks
bugfix: fix DeepSeek V4 schedule_overlap + mtp.
#1782
opened Jun 18, 2026 by
JC-ut0
Contributor
Loading…
17 tasks
bugfix: resolve cross-NUMA spawn worker isolation issues
#1776
opened Jun 18, 2026 by
asr-sheep1
Collaborator
Loading…
8 of 17 tasks
feat: support eagle3 for vlm models.
#1773
opened Jun 17, 2026 by
shan-chen-feng
Collaborator
Loading…
17 tasks
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.