Add wildcard ~ support for Japanese grammar patterns #2363
RadiantSol wants to merge 2 commits into yomidevs:master
…midevs#2336) Add a wildcard text preprocessor that generates variants of scanned text with middle portions replaced by ~, enabling grammar dictionaries to use entries like "いくら~でも" that match text like "いくら騒いでも". Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
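The variant generation described in the commit message can be sketched roughly as follows. This is only an illustration, not the PR's actual code: the function name `wildcardVariants` is hypothetical, and the sketch assumes that at least one character is kept on each side of the `~` and that at least one character is replaced.

```javascript
// Hypothetical helper for illustration; not the preprocessor added by this PR.
// Returns every variant of `text` in which one contiguous middle run
// is replaced by a single wildcard character.
function wildcardVariants(text, wildcard = '~') {
    // Split by code point so surrogate pairs are never cut in half.
    const chars = [...text];
    const variants = [];
    // `start` is the prefix length (at least 1);
    // `end` is the index where the kept suffix begins (at least 1 char kept).
    for (let start = 1; start < chars.length - 1; start++) {
        for (let end = start + 1; end < chars.length; end++) {
            const prefix = chars.slice(0, start).join('');
            const suffix = chars.slice(end).join('');
            variants.push(prefix + wildcard + suffix);
        }
    }
    return variants;
}

// The scanned text from the commit message yields the dictionary-style pattern
// among its variants.
console.log(wildcardVariants('いくら騒いでも').includes('いくら~でも')); // true
```

Under these assumptions, each variant would then be looked up like any other candidate string, which is where the extra lookups come from.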
Merging this PR will degrade performance by 62.53%
maybe we could avoid the performance hit by doing this in the deinflection layer instead? the only issue is we'd have to start maintaining a hardcoded list of known patterns, or create a new system for dynamically creating rules from imported dictionaries. are you using a dictionary that already has these wildcard entries? i don't see any in jitendex
The only drawbacks to this implementation are the potentially large number of additional lookups and the inability to scan for X~Y~Z entries.
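To give a rough sense of the lookup growth, here is a back-of-the-envelope count. It assumes a single wildcard, at least one character kept on each side, and no filtering or length cap; the actual preprocessor may limit this, and `countSingleWildcardVariants` is a hypothetical name used only for this estimate.

```javascript
// Hypothetical estimate; not taken from the PR.
// For a text of n characters, a single-wildcard variant is fixed by choosing
// a prefix length p >= 1 and a suffix length s >= 1 with p + s <= n - 1,
// which gives (n - 1)(n - 2) / 2 combinations.
function countSingleWildcardVariants(n) {
    if (n < 3) {
        return 0; // nothing left to replace between a prefix and a suffix
    }
    return ((n - 1) * (n - 2)) / 2;
}

console.log(countSingleWildcardVariants(7));  // 15
console.log(countSingleWildcardVariants(20)); // 171
```

The count grows quadratically with the scan length, and supporting X~Y~Z entries with two wildcards would add another factor on top of that.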
AI Summary
insertWildcard text processor for Japanese that generates wildcard variants of grammar patterns