Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[TRTLLM-10939][feat] Enable block reuse with overlap scheduler
#12816 opened Apr 7, 2026 by chienchunhung Loading…
1 task done
[None][chore] Add failed cases into waives.txt
#12814 opened Apr 7, 2026 by xinhe-nv Loading…
[None][feat] dual-pool KV cache with SWA block eviction for gemma4
#12813 opened Apr 7, 2026 by suyoggupta Loading…
6 tasks done
[None][feat] Upgrade xgrammar and lock pillow
#12812 opened Apr 7, 2026 by yuanjingx87 Loading…
1 task done
[None][infra] Bump xgrammar
#12811 opened Apr 7, 2026 by yuanjingx87 Loading…
1 task done
[None][infra] use public torch index as CI backup (#12261)
#12804 opened Apr 7, 2026 by niukuo Loading…
1 task done
[None][test] Remove RTX-6000 OOM test cases
#12800 opened Apr 7, 2026 by yufeiwu-nv Loading…
1 task done
[None][test] add unit test and e2e test for gpt_oss_20b MHA kernel
#12796 opened Apr 7, 2026 by ruodil Loading…
1 task done
[None][feat] Upgrade xgrammar from 0.1.25 to 0.1.32
#12790 opened Apr 7, 2026 by niukuo Loading…
1 task done
ProTip! Exclude everything labeled bug with -label:bug.