switch to triton ascend 3.2.1 #281
…agate_nan=tl.PropagateNan.ALL;
This reverts commit d705131.
Code Review
This pull request migrates NPU Triton kernels to support Triton Ascend 3.2.1, primarily by updating slicing, math, and hint operations to the tl.extra.cann namespace and enabling NaN propagation in reduction operations. Feedback identifies several critical issues, including potential integer wrap-around in quantization due to the removal of saturation modes, the accidental deletion of the byted-triton-x dependency in pyproject.toml, and the presence of dead code. Additionally, the reviewer recommends implementing safe division patterns in the SDPA kernel and suggests making the hardcoded normalization casting mode configurable.
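Two of the issues flagged above, integer wrap-around after saturation is dropped and unguarded division in the SDPA kernel, can be illustrated with a small sketch. This is plain Python rather than Triton, and every name in it (`wrap_to_int8`, `saturate_to_int8`, `quantize`, `softmax_row_safe`) is hypothetical and does not come from the PR. It assumes the non-saturating cast behaves like two's-complement wrap-around; what the Ascend backend actually does on an out-of-range cast is not stated in this conversation.

```python
import math


def wrap_to_int8(value: int) -> int:
    """Model a non-saturating narrowing: two's-complement wrap into [-128, 127]."""
    return (value + 128) % 256 - 128


def saturate_to_int8(value: int) -> int:
    """Clamp into [-128, 127], which is what a saturating cast would do."""
    return max(-128, min(127, value))


def quantize(x: float, scale: float, saturate: bool) -> int:
    """Quantize x to int8 with the given scale (hypothetical helper).

    Python's round() rounds halves to even; a real kernel may round differently.
    """
    q = round(x / scale)
    return saturate_to_int8(q) if saturate else wrap_to_int8(q)


def softmax_row_safe(scores, keep):
    """Softmax over one attention row with a guarded denominator.

    `keep[i]` is False for masked positions. A fully masked row has a zero
    denominator; replacing it with 1.0 yields zeros instead of NaN or an error.
    """
    kept = [s for s, k in zip(scores, keep) if k]
    row_max = max(kept) if kept else 0.0
    weights = [math.exp(s - row_max) if k else 0.0 for s, k in zip(scores, keep)]
    denom = sum(weights)
    safe_denom = denom if denom > 0.0 else 1.0  # the "safe division" guard
    return [w / safe_denom for w in weights]


if __name__ == "__main__":
    print(quantize(130.0, 1.0, saturate=False))  # wraps to a negative value
    print(quantize(130.0, 1.0, saturate=True))   # clamps to the int8 maximum
    print(softmax_row_safe([1.0, 2.0], [False, False]))
```

Under these assumptions, an out-of-range value silently changes sign when it wraps, which is why an explicit clamp before the cast is the usual replacement when a saturating mode is no longer available.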
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Claude Code Review
Verdict: Request changes -- Migration to vendor-namespaced Triton extensions looks mechanical and correct, but a few items (a perf-killing …
Summary
This PR migrates kernels from generic …
Must fix
Suggestions (5)
Nits (5)
Notes
Claude Code Review
Verdict: Request changes -- Several concerning changes: a default RMSNorm casting mode flip, a removed …
Summary
This PR adapts kernels to a new …
Must fix
Suggestions (5)
Nits (5)
Notes
No description provided.