✨ New features and improvements
- Alpha support for Russian via pymorphy2.
- Improve language data for Danish, Italian and Dutch.
- Add `offsets_from_biluo_tags` helper to convert BILUO notation to entity offsets.
- Use `POS` instead of `TAG` by default in displaCy, to prevent visualisation issues in languages with long combined tags (e.g. Italian or Dutch).
- Drop support for EOL Python 2.6 and 3.3.
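
To illustrate what converting BILUO notation to entity offsets involves, here is a minimal pure-Python sketch. It does not call spaCy and is not the library's implementation: the function name `biluo_to_offsets` and the `token_spans` input (a list of per-token character ranges) are hypothetical, chosen only to keep the example self-contained.

```python
def biluo_to_offsets(token_spans, tags):
    """Convert per-token BILUO tags into (start_char, end_char, label) triples.

    token_spans: list of (start_char, end_char) pairs, one per token
                 (hypothetical input shape, assumed for this sketch).
    tags:        list of BILUO tags such as "B-GPE", "I-GPE", "L-GPE",
                 "U-GPE" or "O", aligned with token_spans.
    """
    offsets = []
    start = None  # character start of the multi-token entity being built
    for (tok_start, tok_end), tag in zip(token_spans, tags):
        if tag == "O":
            continue  # token is outside any entity
        prefix, label = tag.split("-", 1)
        if prefix == "U":
            # Unit: a single-token entity
            offsets.append((tok_start, tok_end, label))
        elif prefix == "B":
            # Begin: remember where the entity starts
            start = tok_start
        elif prefix == "L" and start is not None:
            # Last: close the entity at the end of this token
            offsets.append((start, tok_end, label))
            start = None
        # "I" (inside) tokens need no action; the entity simply continues
    return offsets


text = "I like London and New York City."
token_spans = [(0, 1), (2, 6), (7, 13), (14, 17),
               (18, 21), (22, 26), (27, 31), (31, 32)]
tags = ["O", "O", "U-GPE", "O", "B-GPE", "I-GPE", "L-GPE", "O"]
entities = biluo_to_offsets(token_spans, tags)
```

Each resulting triple can be sliced back out of the text, e.g. `text[start:end]`, to recover the entity string.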
🔴 Bug fixes
- Fix issue #1207: Fix `Span.noun_chunks`.
- Fix issue #1494: Handle sequential infixes in tokenizer rules.
- Fix issue #1587: Add note on attribute extension default arguments in docs.
- Fix issue #1599: Fix typo in documentation.
- Fix issue #1612: Ensure that `Span.orth_ == Span.text`.
- Fix issue #1617: Make `entity_relations.py` example Python 2 compatible and fix French test.
- Fix issue #1654: Fix off-by-one error in `nlp.add_pipe` when using `after`.
- Fix issue #1674: Set correct requirement string in `spacy package`.
- Fix issue with `StringStore` cleanup.
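
As a rough illustration of the kind of off-by-one error that can occur when inserting a component "after" another one, the sketch below models a pipeline as a plain list of `(name, component)` pairs. This is an assumption made for the example, not spaCy's source code, and the `add_pipe` function here is a hypothetical stand-in for the real method.

```python
def add_pipe(pipeline, name, component, after=None):
    """Insert (name, component) into a list of (name, component) pairs.

    Hypothetical stand-in for illustration only. With after=None the
    component is appended; otherwise it is placed directly after the
    component called `after`.
    """
    if after is None:
        pipeline.append((name, component))
        return
    names = [n for n, _ in pipeline]
    if after not in names:
        raise ValueError("Unknown component: {}".format(after))
    # list.insert(i, x) places x *before* index i, so inserting after
    # the target needs index + 1. Using the bare index would put the
    # new component before the target instead.
    pipeline.insert(names.index(after) + 1, (name, component))


pipeline = [("tagger", None), ("parser", None), ("ner", None)]
add_pipe(pipeline, "custom", None, after="parser")
names = [n for n, _ in pipeline]
```

After the call, `custom` sits between `parser` and `ner`.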
📖 Documentation and examples
- Update resources page with new spaCy extensions.
- Add "Unknown locale" error to troubleshooting guide.
- Always use `python -m spacy` for CLI commands again to prevent issues on Windows etc.
- Fix various typos and inconsistencies.
👥 Contributors
Thanks to @ligser, @pavillet, @yuukos, @GreenRiverRUS, @MartinoMensio, @raphael0202, @tokestermw, @fsonntag, @cclauss, @bdewilde, @markulrich, @sorenlind, @hugovk, @atomobianco, @twerkmeister, @mkdynamic and @jimregan for the pull requests and contributions.