Changed
- skip pandas
pd.NA
similar toNone
- add
score_multiplier
argument toprocess.cdist
which allows multiplying the end result scores
with a constant factor. - drop support for Python 3.7
Performance
- improve performance of simd implementation for
LCS
/Indel
/Jaro
/JaroWinkler
- improve performance of Jaro and Jaro Winkler for long sequences
- implement
process.extract
withlimit=1
usingprocess.extractOne
which can be faster
Fixed
- the preprocessing function was always called through Python due to a broken C-API version check
- fix wraparound issue in simd implementation of Jaro and Jaro Winkler