[FEAT] Voxtral Support#3036
Conversation
|
@maximizemaxwell Thank you for your earlier work! Here is the working code if you want to try it. |
|
@greenrazer Could you review this feature implementations? |
|
Hey, thank you for the implementation. I had a rough one myself but was missing the tekken part among some other flaws. I went through the code and I think I might have some pointers for improvement. I am new to audio transformers, however, so please have some leniency with any mistakes :) For the example:
For the main code:
|
greenrazer
left a comment
There was a problem hiding this comment.
Thank you for the contribution, great job!
Just fix the clippy errors and we should be good:
cargo clippy --workspace --tests --examples --fix -- -D warnings
|
Thank you @greenrazer and @benedikt-schaber for the suggestions. I have made the requested changes. I have also tested whisper's and snac's examples to make sure they run correctly after moving the |
* feat: implement some configs in voxtral * fix: fixed imports, implement more func * feat: implemented full version, need fixes * fix: fixed some compile errors * feat: add initial examples * fix: fixed voxtral.rs * fix: fixed compile errors in examples * fix: fixed compile errors * fix: update model integration * First working example * Remove unused melfilters code * Remove unused code * Reuse whisper's pcm_decode * Simplify generation function * Remove unnecessary post-process fun * Reuse snac's resample * Apply clippy suggestions * Remove unused filters * Improve example * Update tekken-rs * Clippy fixes --------- Co-authored-by: Max <naturale@hufs.ac.kr>
mistralai/Voxtral-Mini-3B-2507.mistralai/Voxtral-Small-24B-2507, but this is not tested.These models only have support for tekken tokenizer files right now, so I created and added an optional tekken tokenizer crate to the examples workspace. (tekken-rs). This gets enabled with the
tekkenfeature.fixes #3028
Run example with