Add VibeVoice-1.5B perf benchmark and wire to nightly by saiarthiraguram · Pull Request #5343 · tenstorrent/tt-xla

saiarthiraguram · 2026-06-23T08:53:39Z

Ticket

Problem description

VibeVoice-1.5B (microsoft/VibeVoice-1.5B) was brought up single-device on n150, but it has no perf benchmark and is not part of the nightly performance pipeline, so its throughput/PCC is not tracked over time.

What's changed

Adds a perf benchmark for VibeVoice-1.5B and wires it into the nightly perf matrix.

tests/benchmark/test_encoders.py::test_vibevoice — runs the model through the generic single-forward + PCC encoder harness. VibeVoice's bringup forward reduces to the Qwen2.5 LM backbone producing logits (speech_tensors=None; the semantic connector is exercised but unused), and the loader wraps the model to return a bare logits tensor, so it fits the existing harness without changes. Config: bf16, batch 1, seq len 32, loop count 32, optimization level 1, trace disabled.
.github/workflows/perf-bench-matrix.json — adds a vibevoice entry pinned to runs-on: n150-perf (the verified bringup arch). This matrix is filtered and executed by the nightly pipeline (schedule-nightly.yml → perf-benchmark → call-filtered-perf-tests.yml).

Impact: VibeVoice-1.5B throughput and PCC are now tracked in the nightly benchmark report on n150.

Dependency: the benchmark imports the VibeVoice loader from third_party.tt_forge_models.vibevoice. It only runs green once the tt-forge-models loader PR (branch sai_arthi_raguram/vibe_voice) lands and the submodule is uplifted in tt-xla. Land + uplift that first.

Checklist

New/Existing tests provide coverage for changes — tests/benchmark/test_encoders.py::test_vibevoice verified on n150: PCC=0.992910, 1 passed in 197s.

Logs

benchmark_vibevoice_n150.log
bringup_steps.txt

Add a single-forward + PCC benchmark for VibeVoice-1.5B (microsoft/VibeVoice-1.5B) using the generic encoder benchmark harness. The model's bringup forward reduces to the Qwen2.5 LM backbone producing logits (speech_tensors=None; semantic connector exercised but unused), and the loader wraps it to return a bare logits tensor, so it runs cleanly through the existing harness. Wire it into the nightly perf pipeline via perf-bench-matrix.json, pinned to n150-perf (the verified bringup arch). Trace disabled, optimization level 1. Verified on n150: PCC=0.992910, 1 passed. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add VibeVoice-1.5B perf benchmark and wire to nightly#5343

Add VibeVoice-1.5B perf benchmark and wire to nightly#5343
saiarthiraguram wants to merge 1 commit into
mainfrom
sai_arthi_raguram/vibevoice_nightly

saiarthiraguram commented Jun 23, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

saiarthiraguram commented Jun 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Ticket

Problem description

What's changed

Checklist

Logs

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

saiarthiraguram commented Jun 23, 2026 •

edited

Loading