Skip to content

Fix inf grad_norm on Qwen3.5 at seq_len > 65536 without flash-attn

d79628e
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Open

Fix inf grad_norm on Qwen3.5 at seq_len > 65536 without flash-attn #582

Fix inf grad_norm on Qwen3.5 at seq_len > 65536 without flash-attn
d79628e
Select commit
Loading
Failed to load commit list.

Annotations

1 warning
Analyze (python)
succeeded Apr 9, 2026 in 1m 3s