-
Notifications
You must be signed in to change notification settings - Fork 153
Pull requests: pytorch/helion
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Lean kernel-artifact telemetry for cost-model dataset
CLA Signed
This label is managed by the Meta Open Source bot.
#2770
opened Jun 11, 2026 by
IshanAryendu
Contributor
Loading…
[pallas] size emit_pipeline scratch from reshape-merged block-size products
CLA Signed
This label is managed by the Meta Open Source bot.
#2769
opened Jun 11, 2026 by
choijon5
Contributor
Loading…
[cute] Persist autotune winner from memory instead of recompiling
CLA Signed
This label is managed by the Meta Open Source bot.
#2768
opened Jun 11, 2026 by
fulvius31
Collaborator
Loading…
[examples] Add a simpler concat implementation
CLA Signed
This label is managed by the Meta Open Source bot.
#2766
opened Jun 11, 2026 by
hinriksnaer
Collaborator
Loading…
[autotuner] Triton reduction seed heuristic (generalizable core)
CLA Signed
This label is managed by the Meta Open Source bot.
#2762
opened Jun 11, 2026 by
calebmkim
Contributor
Loading…
[autotuner] Reduction fact layer: ReductionFact + AccumulatorFact + enriched MemoryOpFact
CLA Signed
This label is managed by the Meta Open Source bot.
#2761
opened Jun 11, 2026 by
calebmkim
Contributor
Loading…
[examples] Edits to existing reduction example kernels for the seed-heuristic curriculum
CLA Signed
This label is managed by the Meta Open Source bot.
#2760
opened Jun 11, 2026 by
calebmkim
Contributor
Loading…
Add a SymInt-free tensor specialization key for exact torch.Tensor args
CLA Signed
This label is managed by the Meta Open Source bot.
#2759
opened Jun 11, 2026 by
yushangdi
Contributor
Loading…
[cute] Record measured-good B200 config for the fp8 scaled_mm example
CLA Signed
This label is managed by the Meta Open Source bot.
[cute] Unified rolled TMA producer + hoisted K-loop predicates for tcgen05
CLA Signed
This label is managed by the Meta Open Source bot.
[cute] Unified rolled TMA producer + hoisted K-loop predicates for tcgen05
CLA Signed
This label is managed by the Meta Open Source bot.
Skip the measure("Kernel.bind") context manager when measurement is off
CLA Signed
This label is managed by the Meta Open Source bot.
Move measure("Kernel.bind") off the cache-hit dispatch path
CLA Signed
This label is managed by the Meta Open Source bot.
Collect kernel artifacts: device-IR node-link dump (.ir.jsonl)
CLA Signed
This label is managed by the Meta Open Source bot.
#2750
opened Jun 11, 2026 by
IshanAryendu
Contributor
•
Draft
Install a per-spec fast launcher that bypasses Triton's JITFunction.run
CLA Signed
This label is managed by the Meta Open Source bot.
#2749
opened Jun 10, 2026 by
yushangdi
Contributor
Loading…
[Pallas] Add pallas_loop_type = 'outer_pipeline'
CLA Signed
This label is managed by the Meta Open Source bot.
[Pallas] Fix attention example VMEM regression by making LSE 3D
CLA Signed
This label is managed by the Meta Open Source bot.
#2743
opened Jun 10, 2026 by
norx1991
Contributor
Loading…
[cute] Fused-scale epilogue: scalar colvec read for per-row scale
CLA Signed
This label is managed by the Meta Open Source bot.
#2742
opened Jun 10, 2026 by
yushangdi
Contributor
Loading…
[cute] Deep AB staging for fp8 to close the compute-bound gap
CLA Signed
This label is managed by the Meta Open Source bot.
#2741
opened Jun 10, 2026 by
yushangdi
Contributor
Loading…
[cute] Support column-major (K-major) B on the tcgen05 fp8 TMA path
CLA Signed
This label is managed by the Meta Open Source bot.
#2740
opened Jun 10, 2026 by
yushangdi
Contributor
Loading…
Collect kernel artifacts and append-mode autotune telemetry with run_id
CLA Signed
This label is managed by the Meta Open Source bot.
#2737
opened Jun 10, 2026 by
IshanAryendu
Contributor
•
Draft
[Pallas] Rewrites of jagged reduction kernels in Pallas friendly ways.
CLA Signed
This label is managed by the Meta Open Source bot.
#2731
opened Jun 9, 2026 by
thcmbs
Collaborator
Loading…
[Pallas] Test jagged carry with dynamic row counts
CLA Signed
This label is managed by the Meta Open Source bot.
[Autotuner] Support autotuing with non-dense mutated input
CLA Signed
This label is managed by the Meta Open Source bot.
#2721
opened Jun 8, 2026 by
xiaohongchen1991
Contributor
Loading…
[Pallas] Ordered carry store for jagged row tiles
CLA Signed
This label is managed by the Meta Open Source bot.
#2719
opened Jun 8, 2026 by
thcmbs
Collaborator
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.