fix layernorm_fwd_kernel#347
Open
Todobe wants to merge 1 commit into
Open
Conversation
Contributor
There was a problem hiding this comment.
Code Review
This pull request modifies the Triton layernorm forward kernel in mojo_opset/backends/ttx/kernels/npu/layernorm.py to filter out invalid columns by applying a block_mask to x_centered before calculating the variance accumulation. There are no review comments, and I have no additional feedback to provide.
Important
The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This operator has an issue with tail block processing. It needs to calculate
sum((x - mean(x))^2). When the dimension of x is not an integer multiple of the block size, the last block of x after partitioning becomes[normal value, normal value, normal value, 0]. Then,(x - mean(x)) ^2becomes[normal value, normal value, normal value, mean(x)^2]. When performing the sum, this finalmean(x)^2is also included, causing an accuracy error.Moreover, since the mean of x in the test cases is basically 0, this issue was not detected.
This issue can be resolved by adding a
tl.wherestatement after the subtraction.To reproduce this error, you need to modify the test file as follows(to make the mean of x not equal to 0)):
Use
pytest tests/accuracy/operators/test_normalization.py::test_layernormto test the accuracy.Origin op test result:
Fixed op test result: