Fix for small segments by Pranjalya · Pull Request #57 · shashikg/WhisperS2T

Pranjalya · 2024-04-05T11:36:20Z

Patch

Fix for small segments, when the audio duration is less than max_seg_len
Fallback for generate_segment_batched in case the seq_len and seq_metadata is not provided

Add tensorrt backend

Create updates

BBC-Esq · 2024-05-25T02:38:31Z

I like it!

Sembiance · 2024-06-12T16:03:32Z

Great fix, without it WhisperS2T is useless for small duration audio.

HIGHLY recommend merging this pull request :)

shashikg · 2024-07-06T05:38:00Z

Hi @Pranjalya @Sembiance !
Can you describe here or link an issue related to small duration audio?

Pranjalya · 2024-09-03T01:15:20Z

Hey @shashikg, the issue was in the loop where we segment audio into parts and the case where the original audio's duration is < 1s. Using the range function and setting the end timestamp as int(audio_duration) will lead it to it being 0, which when used on range returns an empty list. Using a math.ceil function ensures that it is rounded up to the next ceiling integer and the audio segment timestamp is logged.
This bug is potentially dangerous as well if someone is using indexing to map the audio segments, as it leads to missing of the parts.

LostnD · 2024-11-18T16:15:44Z

what will "max_seg_len" do?

Pranjalya added 4 commits December 28, 2023 06:35

🐰 fix breaking code

514b1cd

Merge pull request #2 from shashikg/main

7fbc846

Add tensorrt backend

Merge pull request #3 from shashikg/main

22d5cbd

Create updates

🐶 patch for small segment file

05c26eb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix for small segments#57

Fix for small segments#57
Pranjalya wants to merge 4 commits intoshashikg:mainfrom
Pranjalya:small-segment-fix

Pranjalya commented Apr 5, 2024

Uh oh!

BBC-Esq commented May 25, 2024

Uh oh!

Sembiance commented Jun 12, 2024

Uh oh!

shashikg commented Jul 6, 2024

Uh oh!

Pranjalya commented Sep 3, 2024

Uh oh!

LostnD commented Nov 18, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

Pranjalya commented Apr 5, 2024

Uh oh!

BBC-Esq commented May 25, 2024

Uh oh!

Sembiance commented Jun 12, 2024

Uh oh!

shashikg commented Jul 6, 2024

Uh oh!

Pranjalya commented Sep 3, 2024

Uh oh!

LostnD commented Nov 18, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants