Skip to content

Pull requests: pytorch/helion

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Lean kernel-artifact telemetry for cost-model dataset CLA Signed This label is managed by the Meta Open Source bot.
#2770 opened Jun 11, 2026 by IshanAryendu Contributor Loading…
[pallas] size emit_pipeline scratch from reshape-merged block-size products CLA Signed This label is managed by the Meta Open Source bot.
#2769 opened Jun 11, 2026 by choijon5 Contributor Loading…
[cute] Persist autotune winner from memory instead of recompiling CLA Signed This label is managed by the Meta Open Source bot.
#2768 opened Jun 11, 2026 by fulvius31 Collaborator Loading…
[examples] Add a simpler concat implementation CLA Signed This label is managed by the Meta Open Source bot.
#2766 opened Jun 11, 2026 by hinriksnaer Collaborator Loading…
[autotuner] Triton reduction seed heuristic (generalizable core) CLA Signed This label is managed by the Meta Open Source bot.
#2762 opened Jun 11, 2026 by calebmkim Contributor Loading…
[autotuner] Reduction fact layer: ReductionFact + AccumulatorFact + enriched MemoryOpFact CLA Signed This label is managed by the Meta Open Source bot.
#2761 opened Jun 11, 2026 by calebmkim Contributor Loading…
[examples] Edits to existing reduction example kernels for the seed-heuristic curriculum CLA Signed This label is managed by the Meta Open Source bot.
#2760 opened Jun 11, 2026 by calebmkim Contributor Loading…
Add a SymInt-free tensor specialization key for exact torch.Tensor args CLA Signed This label is managed by the Meta Open Source bot.
#2759 opened Jun 11, 2026 by yushangdi Contributor Loading…
[cute] Record measured-good B200 config for the fp8 scaled_mm example CLA Signed This label is managed by the Meta Open Source bot.
#2756 opened Jun 11, 2026 by yushangdi Contributor Draft
[cute] Unified rolled TMA producer + hoisted K-loop predicates for tcgen05 CLA Signed This label is managed by the Meta Open Source bot.
#2755 opened Jun 11, 2026 by yushangdi Contributor Draft
[cute] Unified rolled TMA producer + hoisted K-loop predicates for tcgen05 CLA Signed This label is managed by the Meta Open Source bot.
#2754 opened Jun 11, 2026 by yushangdi Contributor Draft
Skip the measure("Kernel.bind") context manager when measurement is off CLA Signed This label is managed by the Meta Open Source bot.
#2752 opened Jun 11, 2026 by yushangdi Contributor Draft
Move measure("Kernel.bind") off the cache-hit dispatch path CLA Signed This label is managed by the Meta Open Source bot.
#2751 opened Jun 11, 2026 by yushangdi Contributor Draft
Collect kernel artifacts: device-IR node-link dump (.ir.jsonl) CLA Signed This label is managed by the Meta Open Source bot.
#2750 opened Jun 11, 2026 by IshanAryendu Contributor Draft
Install a per-spec fast launcher that bypasses Triton's JITFunction.run CLA Signed This label is managed by the Meta Open Source bot.
#2749 opened Jun 10, 2026 by yushangdi Contributor Loading…
[Pallas] Add pallas_loop_type = 'outer_pipeline' CLA Signed This label is managed by the Meta Open Source bot.
#2744 opened Jun 10, 2026 by ethche Contributor Draft
[Pallas] Fix attention example VMEM regression by making LSE 3D CLA Signed This label is managed by the Meta Open Source bot.
#2743 opened Jun 10, 2026 by norx1991 Contributor Loading…
[cute] Fused-scale epilogue: scalar colvec read for per-row scale CLA Signed This label is managed by the Meta Open Source bot.
#2742 opened Jun 10, 2026 by yushangdi Contributor Loading…
[cute] Deep AB staging for fp8 to close the compute-bound gap CLA Signed This label is managed by the Meta Open Source bot.
#2741 opened Jun 10, 2026 by yushangdi Contributor Loading…
[cute] Support column-major (K-major) B on the tcgen05 fp8 TMA path CLA Signed This label is managed by the Meta Open Source bot.
#2740 opened Jun 10, 2026 by yushangdi Contributor Loading…
Collect kernel artifacts and append-mode autotune telemetry with run_id CLA Signed This label is managed by the Meta Open Source bot.
#2737 opened Jun 10, 2026 by IshanAryendu Contributor Draft
[Pallas] Rewrites of jagged reduction kernels in Pallas friendly ways. CLA Signed This label is managed by the Meta Open Source bot.
#2731 opened Jun 9, 2026 by thcmbs Collaborator Loading…
[Pallas] Test jagged carry with dynamic row counts CLA Signed This label is managed by the Meta Open Source bot.
#2722 opened Jun 8, 2026 by thcmbs Collaborator Draft
[Autotuner] Support autotuing with non-dense mutated input CLA Signed This label is managed by the Meta Open Source bot.
#2721 opened Jun 8, 2026 by xiaohongchen1991 Contributor Loading…
[Pallas] Ordered carry store for jagged row tiles CLA Signed This label is managed by the Meta Open Source bot.
#2719 opened Jun 8, 2026 by thcmbs Collaborator Loading…
ProTip! Add no:assignee to see everything that’s not assigned.