add torch_npu swa by old-steel · Pull Request #263 · XPU-Forces/mojo_opset

old-steel · 2026-04-27T11:25:08Z

No description provided.

gemini-code-assist

Code Review

This pull request introduces the TorchNpuSWAFunction to support sliding window attention on NPU backends using torch_npu.npu_fusion_attention. The review identifies several critical issues, including misspelled keyword arguments and a mismatch in the number of return values in the backward method, both of which would cause runtime errors. Additionally, the feedback suggests avoiding hardcoded mask dimensions to support longer sequences, eliminating performance bottlenecks caused by host-device synchronization, and reducing code duplication by importing utility functions from the core module.

add torch_npu swa

daa7a57

gemini-code-assist Bot reviewed Apr 27, 2026

View reviewed changes

add torch_npu swa

f080ba6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add torch_npu swa#263

add torch_npu swa#263
old-steel wants to merge 2 commits into
XPU-Forces:masterfrom
old-steel:swa_npu

old-steel commented Apr 27, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

old-steel commented Apr 27, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant