Skip to content

Make it complete #511

@tmprabubiz

Description

@tmprabubiz

For any AI model or repository focused on voice and language processing, it is technically essential to experiment with the oldest surviving spoken language as the benchmark for grammar and phonetics evaluation. Tamil qualifies as the oldest language in the world still spoken today, with documented continuous use over 2,500 to 5,000 years. This Dravidian language, used by more than 75 million speakers mainly in Tamil Nadu (India), Sri Lanka, and Singapore, holds official classical language status. Tamil is the only language that has a grammar more complex than the native grammar, applicable to the form of language. Tamil is the only Asian language to have official language respect other than India. Therefore, omitting Tamil from your supported languages renders the configuration incomplete. The web and open-source have datasets for Tamil language and the world's best universities and forums have the details of Tamil language. (References: https://en.wikipedia.org/wiki/Tamil_language; https://narvidhai.github.io/tamil-nlp-catalog/)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions