github pemistahl/lingua-py v1.3.5
Lingua 1.3.5

6 months ago

Improvements

  • The language models are now stored in dictionaries instead of NumPy arrays. This change leads to significantly improved runtime performance at the cost of higher memory consumption (up to 3 GB for all models). As the runtime performance was much too slow with the former approach, this change makes sense because adding more memory is quite cheap.

  • The language model files are now compressed with the Brotli algorithm which reduces the file size by 15 %, on average.

  • The characters Щщ are now correctly identified as possible indicators for the Ukrainian language, leading to slightly higher accuracy when identifying Ukrainian texts.

Miscellaneous

  • All dependencies have been updated to their latest versions.

Don't miss a new lingua-py release

NewReleases is sending notifications on new releases.