pypi charset-normalizer 1.4.0
Version 1.4.0

3 years ago

Changes:

Thanks to @potiuk, whose tests and ideas helped us improve the quality of this project.

  • Dependency: ➖ Using the standard logging module instead of the loguru package.
  • Dependency: ➖ Dropping the nose test framework in favor of the maintained pytest.
  • Dependency: ➖ No longer using the dragonmapper package to help with gibberish Chinese/CJK text.
  • Dependency: 🔧 ➖ cached_property is now required only on Python 3.5, due to a constraint. It is dropped for every other interpreter version.
  • Bugfix: 🐛 The BOM marker in a CharsetNormalizerMatch instance could be False in rare cases even when a BOM was obviously present, due to the sub-match factoring process.
  • Improvement: 🎇 Return ASCII if the given sequence fits, given reasonable confidence.
  • Performance: ⚡ Huge improvement on the largest payloads.
  • Change: 🔥 Stop supporting UTF-7 content that does not contain a SIG. (Contributions are welcome to improve this point.)
  • Feature: 🎇 The CLI now produces JSON-consumable output.
  • Dependency: Dropping PrettyTable in favor of pure JSON output.
  • Bugfix: 🐛 The BOM was not searched for properly when trying the utf32/16 parent codec.
  • Other: ⚡ Reducing the final package size by compressing frequencies.json.
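
To make the BOM and ASCII items above more concrete, here is a minimal sketch using only the Python standard library. It is not charset-normalizer's actual implementation; the names `BOMS`, `sniff_bom`, and `fits_ascii` are hypothetical and exist only for illustration. It shows why the order of BOM checks matters for the utf32/16 family and what "the sequence fits ASCII" can mean in its simplest form.

```python
import codecs

# Hypothetical lookup table (not part of charset-normalizer).
# Order matters: the UTF-32 LE BOM (FF FE 00 00) begins with the
# UTF-16 LE BOM (FF FE), so the longer marks are tested first.
BOMS = [
    ("utf_32", codecs.BOM_UTF32_LE),
    ("utf_32", codecs.BOM_UTF32_BE),
    ("utf_8", codecs.BOM_UTF8),
    ("utf_16", codecs.BOM_UTF16_LE),
    ("utf_16", codecs.BOM_UTF16_BE),
]


def sniff_bom(payload):
    """Return (parent_codec, bom_bytes) if payload starts with a known BOM,
    otherwise (None, b"")."""
    for name, mark in BOMS:
        if payload.startswith(mark):
            return name, mark
    return None, b""


def fits_ascii(payload):
    """Return True if every byte of payload decodes as ASCII."""
    try:
        payload.decode("ascii")
    except UnicodeDecodeError:
        return False
    return True
```

Checking the UTF-16 marks before the UTF-32 ones would report a UTF-32 LE payload as UTF-16, which is the kind of mistake a BOM search has to avoid. The ASCII check here is a strict decode; the release note mentions a confidence threshold that this sketch does not model.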

This project no longer requires any dependency, except on Python 3.5. Python 3.5 is still supported even though it has passed EOL.
Version 2.x will require Python 3.6+.
