fix Parakeet split retry for long recordings #1166

Closed
ferologics wants to merge 1 commit into cjpais:main from ferologics:fix/parakeet-split-retry

ferologics (Contributor) commented Mar 27, 2026

Summary

  • restore Parakeet recovery for long recordings by moving chunked retry into a dedicated recovery helper
  • retry only the current recoverable ONNX broadcast-shape inference failure
  • preserve structured transcription results through recovery and merge chunk outputs at the seam instead of flattening to text immediately
  • cover the retry driver with scenario tests for recovery, fail-fast behavior, and bounded retry exhaustion
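
The retry behavior summarized above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `Segment`, `TranscribeError`, `transcribe_with_recovery`, and the `max_splits` bound are hypothetical names, and the engine is stood in for by a closure. The sketch halves the audio on a recoverable failure, fails fast on anything else, and gives up once the split budget is exhausted. It only shifts timestamps and concatenates the segments; the heuristic seam merging mentioned in the summary is not shown.

```rust
/// Hypothetical structured result: one timed piece of transcript.
#[derive(Debug, Clone, PartialEq)]
pub struct Segment {
    pub start: f32, // seconds from the start of the audio that was passed in
    pub end: f32,
    pub text: String,
}

/// Hypothetical error type; classification is assumed to happen upstream.
#[derive(Debug, Clone, PartialEq)]
pub enum TranscribeError {
    /// May succeed on shorter input (e.g. the broadcast-shape failure).
    Recoverable(String),
    /// Anything else; never retried.
    Fatal(String),
}

/// Try the whole buffer first. On a recoverable failure, split it in half
/// and retry each half, at most `max_splits` levels deep.
pub fn transcribe_with_recovery<F>(
    samples: &[f32],
    sample_rate: u32,
    max_splits: u32,
    transcribe: &mut F,
) -> Result<Vec<Segment>, TranscribeError>
where
    F: FnMut(&[f32]) -> Result<Vec<Segment>, TranscribeError>,
{
    match transcribe(samples) {
        Ok(segments) => Ok(segments),
        Err(TranscribeError::Recoverable(_)) if max_splits > 0 && samples.len() >= 2 => {
            let mid = samples.len() / 2;
            let (left, right) = samples.split_at(mid);
            let mut merged =
                transcribe_with_recovery(left, sample_rate, max_splits - 1, transcribe)?;
            let right_segments =
                transcribe_with_recovery(right, sample_rate, max_splits - 1, transcribe)?;
            // Shift the right half so its timestamps are relative to `samples`.
            let offset = mid as f32 / sample_rate as f32;
            merged.extend(right_segments.into_iter().map(|mut s| {
                s.start += offset;
                s.end += offset;
                s
            }));
            Ok(merged)
        }
        // Fatal errors, or a recoverable error with no split budget left.
        Err(e) => Err(e),
    }
}
```

Keeping `Vec<Segment>` as the return type at every level is what lets the caller merge at the seam later instead of working on already-flattened text.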

Why

Long Parakeet recordings can still fail with a recoverable inference error like:

BroadcastIterator::Init ... Attempting to broadcast an axis by a dimension other than 1

That currently bubbles up as a hard failure instead of retrying on smaller chunks.
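
For illustration, one way to recognize that specific failure is to match on the error text quoted above. This is only a sketch under the assumption that the error reaches the app as a string; the function name `is_recoverable_inference_error` is hypothetical, and the PR does not show how the actual check is implemented.

```rust
/// Marker substrings copied from the error message quoted above.
const BROADCAST_ORIGIN: &str = "BroadcastIterator::Init";
const BROADCAST_REASON: &str = "Attempting to broadcast an axis by a dimension other than 1";

/// Returns true only for the broadcast-shape inference failure.
/// Every other error message is treated as non-recoverable.
pub fn is_recoverable_inference_error(message: &str) -> bool {
    message.contains(BROADCAST_ORIGIN) && message.contains(BROADCAST_REASON)
}
```

Matching on both substrings keeps the check narrow, so unrelated inference errors still fail fast rather than triggering a chunked retry.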

Validation

  • cd src-tauri && cargo check
  • cd src-tauri && cargo test
  • validated through Handy's normal History re-transcribe flow on ~/Library/Application Support/com.pais.handy/recordings/handy-1774637386.wav

Follow-up notes

  • seam merging is still heuristic and would be stronger with word-level timestamps during recovery
  • once this stabilizes, the recovery helper should move down into transcribe-rs

ferologics force-pushed the fix/parakeet-split-retry branch 2 times, most recently from b936310 to 9f1f11f on March 27, 2026 19:58
ferologics marked this pull request as draft on March 27, 2026 19:59
ferologics force-pushed the fix/parakeet-split-retry branch from 9f1f11f to fe1be85 on March 27, 2026 20:28
ferologics force-pushed the fix/parakeet-split-retry branch from fe1be85 to 2d38f81 on March 27, 2026 20:39
cjpais (Owner) commented Mar 27, 2026

Look at my PR for chunking

cjpais closed this Mar 27, 2026
ferologics (Contributor, Author) commented Mar 28, 2026

I tested #1173's chunked Parakeet path against the original failing long recording (handy-1774637386.wav), and it completed successfully.

Given that, I think #1173 is the better direction to land, so I'm closing this in favor of #1173 rather than trying to merge two competing fixes for the same issue.

The fallback/recovery idea from this PR may still be worth revisiting later, ideally lower in transcribe-rs, if chunked paths still prove brittle.
