Pull requests: XPU-Forces/mojo_opset
Pull requests list
[ilu/ttx] opt swa_paged_prefill: remove boundary_check in nomask path…
#373
opened Jun 22, 2026 by
sky-fun
Collaborator
[KMCompiler][ttx] Optimize NPU ResidualAddRMSNorm forward performance
#367
opened Jun 15, 2026 by
YangLong114514
[KMCompiler][ttx] Optimize Layernorm performance
#366
opened Jun 15, 2026 by
YangLong114514
[KMCompiler][ttx] Optimize silu with rowwise nomask kernels
#365
opened Jun 15, 2026 by
YangLong114514
[KMCompiler][ttx] Optimize gelu with rowwise nomask kernels
#364
opened Jun 15, 2026 by
YangLong114514
[KMCompiler][ttx] Optimize rms_norm for small cols
#363
opened Jun 15, 2026 by
YangLong114514
[ilu/ttx] optimize int8-KV paged prefill: dequant + reuse bf16 FA2
#361
opened Jun 12, 2026 by
AbeFei
Collaborator
MojoFusedNormRoPESageQuantStore: fused RoPE + KV-Quant with Key per-token Quant operator
#358
opened Jun 12, 2026 by
NASA1473
Collaborator
[ilu/ttx] opt swa: decode add fast path when windows cover the sequence;quant prefill use dequant + bf16 swa paged prefill
#354
opened Jun 11, 2026 by
sky-fun
Collaborator
add deepep and test, modify test_moe_quant and test_attention_quant
#349
opened Jun 5, 2026 by
qiushi13
Collaborator
[uc] add activation, norm, pos emb, quant, sdpa operators for uc backend.
#342
opened Jun 3, 2026 by
shengw-bd
[ttx/npu] Optimize top_k_sampling by replacing argsort with iterative extraction
#318
opened May 21, 2026 by
lyujheng
[ttx/npu] Optimize lightning_indexer_kernel by deferring k_scale to post-dot
#317
opened May 21, 2026 by
lyujheng
feat: add MojoPaddedWindowAttention and MojoConv1d.
#274
opened May 6, 2026 by
wwens7
Collaborator