Language request: Catalan (ca)

Hi team, first of all Chatterbox is genuinely impressive, the blind test results speak for themselves.

I'd love to see Catalan added to the multilingual model. A few reasons this might be easier than other language requests:

The training data problem is already solved. The [Projecte AINA](https://projecteaina.cat/en/) from the Barcelona Supercomputing Center has published high-quality open Catalan speech datasets specifically designed for TTS training, including [LaFresCat](https://huggingface.co/projecte-aina) (studio quality, multi-accent, multiple speakers) and large CommonVoice Catalan subsets. All freely available on Hugging Face.

Catalan has around 10 million speakers and is currently underserved by every major TTS provider. ElevenLabs supports it but no quality open-source alternative does. This would be a meaningful gap to fill.

Would love to know if this is on the roadmap, or if a community fine-tune contribution on top of the existing multilingual model would be a useful path forward.

Thanks for the great work.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Language request: Catalan (ca) #517

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Language request: Catalan (ca) #517

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions