[ET Device Support] CUDA-native Qwen 3.5 MoE inference with device tensor pipeline #1548

Job Run time
31s
13m 26s
1m 1s
44m 16s
43m 36s
9m 39s
49m 34s
13m 6s
10m 42s
11m 25s
11m 18s
11m 27s
10m 38s
17m 2s
16m 0s
11m 30s
11m 37s
11m 5s
12m 17s
10m 36s
11m 4s
11m 54s
12m 52s
10m 34s
Total: 6h 7m 10s