-
Notifications
You must be signed in to change notification settings - Fork 2.3k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[None][fix] Add bounded timeout to gen-side KV transfer in C++ CacheTransceiver
Community want to contribute
PRs initiated from Community
[https://nvbugs/6025177][fix] Fix KV cache issue (cherry-pick to release/1.3.0rc5.post2)
#12819
opened Apr 7, 2026 by
thorjohnsen
Loading…
3 tasks done
[https://nvbugs/5448464][fix] Partially fix LoRA overallocation for Nemotron NAS
#12817
opened Apr 7, 2026 by
brb-nv
Loading…
1 task done
[TRTLLM-10939][feat] Enable block reuse with overlap scheduler
#12816
opened Apr 7, 2026 by
chienchunhung
Loading…
1 task done
[https://nvbugs/5658258][fix] Fix OOM with large number of LoRA adapters
#12815
opened Apr 7, 2026 by
brb-nv
Loading…
1 task done
[None][feat] dual-pool KV cache with SWA block eviction for gemma4
#12813
opened Apr 7, 2026 by
suyoggupta
Loading…
6 tasks done
[None][feat] Upgrade xgrammar and lock pillow
#12812
opened Apr 7, 2026 by
yuanjingx87
Loading…
1 task done
[None][feat] AutoDeploy: Gemma4 vision support
#12810
opened Apr 7, 2026 by
bmarimuthu-nv
•
Draft
1 task
[TRTLLM-11804][feat] Mechanical refactoring VisualGen API
VisualGen
#12807
opened Apr 7, 2026 by
zhenhuaw-me
Loading…
1 task done
[None][infra] use public torch index as CI backup (#12261)
#12804
opened Apr 7, 2026 by
niukuo
Loading…
1 task done
[https://nvbugs/6018647][test] Add unit test for Lifecycle Race Condition error in disagg sever
#12803
opened Apr 7, 2026 by
yingguo-trt
Loading…
1 task done
[None][perf] Add GreenContext SM-partitioned overlap for MoE DenseGEMM FC1+Router
#12802
opened Apr 7, 2026 by
JacobHu-NV
•
Draft
[None][feat] Add llm.encode() fast path for encoder-only models
Community want to contribute
PRs initiated from Community
[None][test] Remove RTX-6000 OOM test cases
#12800
opened Apr 7, 2026 by
yufeiwu-nv
Loading…
1 task done
[TRTLLM-11797][feat] Add cutedsl moe backend supporting for qwen3.5.
#12799
opened Apr 7, 2026 by
nv-guomingz
Loading…
1 task done
[None][feat]: Add test_moe_semantics.py to help agent understand the …
#12797
opened Apr 7, 2026 by
WeiHaocheng
Loading…
1 task
[None][test] add unit test and e2e test for gpt_oss_20b MHA kernel
#12796
opened Apr 7, 2026 by
ruodil
Loading…
1 task done
[https://nvbugs/5945047][fix] Fix Eagle3 one-model hang on SM120 via extend_ctx
#12795
opened Apr 7, 2026 by
ziyixiong-nv
Loading…
1 task done
[TRTLLM-11228][feat] Support DFlash in one-model spec dec
#12794
opened Apr 7, 2026 by
ziyixiong-nv
•
Draft
1 task
[https://nvbugs/5921674][fix] unwaive TestNemotronNanoV3 fp8 tests
#12792
opened Apr 7, 2026 by
tcherckez-nvidia
Loading…
1 task done
[None][feat] optimize GDN prefill with indexed in-kernel state updates
#12791
opened Apr 7, 2026 by
nv-guomingz
Loading…
1 task done
[None][feat] Upgrade xgrammar from 0.1.25 to 0.1.32
#12790
opened Apr 7, 2026 by
niukuo
Loading…
1 task done
[https://nvbugs/5910749][https://nvbugs/5995486][test] Fix Qwen3 skip softmax attention CI tests
#12789
opened Apr 7, 2026 by
bobboli
Loading…
3 tasks done
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.