pemistahl/lingua-py v1.3.5 on GitHub

The language models are now stored in dictionaries instead of NumPy arrays. This change leads to significantly improved runtime performance at the cost of higher memory consumption (up to 3 GB for all models). As the runtime performance was much too slow with the former approach, this change makes sense because adding more memory is quite cheap.
The language model files are now compressed with the Brotli algorithm which reduces the file size by 15 %, on average.
The characters Щщ are now correctly identified as possible indicators for the Ukrainian language, leading to slightly higher accuracy when identifying Ukrainian texts.

pemistahl/lingua-py v1.3.5 Lingua 1.3.5 on GitHub