github huggingface/tokenizers python-v0.10.1
Python v0.10.1

latest releases: v0.21.0rc0, v0.20.4, v0.20.4rc0...
3 years ago

Fixed

  • [#616]: Fix SentencePiece tokenizers conversion
  • [#617]: Fix offsets produced by Precompiled Normalizer (used by tokenizers converted from SPM)
  • [#618]: Fix Normalizer.normalize with PyNormalizedStringRefMut
  • [#620]: Fix serialization/deserialization for overlapping models
  • [#621]: Fix ByteLevel instantiation from a previously saved state (using __getstate__())

Don't miss a new tokenizers release

NewReleases is sending notifications on new releases.