Pull requests: jd-opensource/xllm

Pull requests list

feat: add RainFusion sparse attention for wan2.2.
#1867 opened Jul 1, 2026 by ethan686 Collaborator
feat: add ViT encoder ACL Graph capture for Qwen3-VL.
#1866 opened Jul 1, 2026 by sunbaosong Collaborator
docs: remove docs dir.
#1865 opened Jul 1, 2026 by XuZhang99 Collaborator
7 of 17 tasks
feat: support multi machine for offline inference.
#1863 opened Jul 1, 2026 by weizhehuang0827 Collaborator
9 of 12 tasks
Ref blockmgr4
#1861 opened Jul 1, 2026 by Kang-Meng Collaborator Draft
17 tasks
feat: add prefix-cache-affinity dp rank routing.
#1860 opened Jul 1, 2026 by shifengmin Collaborator
7 of 15 tasks
feat: add VAE parallel for wan22 and QwenImageEdit.
#1858 opened Jul 1, 2026 by ethan686 Collaborator
feat: update CMake configuration for xllm_ops vendor directory and ma…
#1856 opened Jun 30, 2026 by msmilezz Collaborator
17 tasks
feat: use grouped gemm in aiter lib.
#1853 opened Jun 30, 2026 by cding-nv Contributor
8 of 17 tasks
feat: enable SwiGluQuant fusion for Qwen3 MLP
#1847 opened Jun 29, 2026 by nie-linfeng Contributor
feat: add linear-state block type and sequence slot accessors (3/n).
#1846 opened Jun 29, 2026 by yingxudeng Collaborator
17 tasks
feat: add maca support for xllm.
#1845 opened Jun 29, 2026 by xicui0927
3 of 17 tasks
bugfix: propagate is_hybrid_linear_attention for Qwen3.5 VLM graph capture
#1844 opened Jun 29, 2026 by maojunx99 Contributor
3 of 11 tasks
bugfix: fix num_used_block collections in pd prefill.
#1841 opened Jun 29, 2026 by phantomlei3 Collaborator
8 of 17 tasks
[WIP]feat: support qwen3.5 linear-state prefix cache.
#1839 opened Jun 27, 2026 by yingxudeng Collaborator Draft
17 tasks
bugfix: synchronize MTP compute stream after draft/validate forward.
#1838 opened Jun 27, 2026 by yingxudeng Collaborator
17 tasks
feat: add beam search torch support to mlu.
#1835 opened Jun 27, 2026 by phantomlei3 Collaborator Draft
17 tasks
feat: support batch embedding requests.
#1828 opened Jun 26, 2026 by DongheJin Collaborator
17 tasks
feat: support GLM DSA sharing top-k and MTP export.
#1823 opened Jun 25, 2026 by sanlio36 Collaborator Draft
17 tasks
bugfix: fix MTP DP concurrent inference failures.
#1817 opened Jun 24, 2026 by DongheJin Collaborator
17 tasks
feat: enable bf16 fallback for o_proj in Qwen3-VL W8A8 quantization
#1809 opened Jun 23, 2026 by nie-linfeng Contributor
feat: add scheduler-side linear-state prefix cache and sequence slot (3/n).
#1806 opened Jun 23, 2026 by yingxudeng Collaborator
17 tasks
bugfix: fix DeepSeek V4 schedule_overlap + mtp.
#1782 opened Jun 18, 2026 by JC-ut0 Contributor
17 tasks
bugfix: resolve cross-NUMA spawn worker isolation issues
#1776 opened Jun 18, 2026 by asr-sheep1 Collaborator
8 of 17 tasks
feat: support eagle3 for vlm models.
#1773 opened Jun 17, 2026 by shan-chen-feng Collaborator
17 tasks