Improvements
- The rule-based algorithm for the recognition of Japanese texts has been improved. Texts including both Japanese and Chinese characters are now classified more often correctly as Japanese instead of Chinese.
Bug Fixes
- Text spans created by
LanguageDetector.detect_multiple_languages_of()
sometimes skipped characters in the last span. This has been fixed. (#247)
Please note: All improvements and bug fixes will also be part of the next Rust-based Python extension release 2.1.0.