📊 Help us improve spaCy and take the User Survey 2018!
✨ New features and improvements
- NEW: Lexical attribute
IS_CURRENCY
viaToken.is_currency
for currency symbols. - Add
noun_chunks
syntax iterator for Norwegian. - Add
get_beam_parse
method inArcEager
. - Revert changes to the
Matcher
in favour of the new and improved API (#1971) coming in v2.1.0.
🔴 Bug fixes
- Fix issue #1706: Ensure files opened in
from_disk
are closed. - Fix issue #1733: Make model loading from package compatible with Python 3.4.
- Fix issue #1832, #1928: Fix vector handling in
init_model
command. - Fix issue #1915: Pass in hyperparameters correctly during
begin_training
. - Fix issue #1924: Require
html5lib
insetup.py
to preventsix
error. - Fix issue #1929: Correctly handle NER with pre-set sentence boundaries.
- Fix issue #1941: Improve documentation around model symlink on Windows.
- Fix issue #1949: Correct
Matcher
docs to only includeORTH
andLOWER
. - Fix issue #1950: Fix bug in regex
Matcher
example. - Fix issue #1959: Execute custom pipeline component when using
Language.pipe
. - Fix issue #1964: Correct typo in glossary.
- Fix issue #1974: Don't set
random.seed
globally in CLI commands. - Fix issue #1989: Correct documentation of
match_id
and improve example.
📖 Documentation and examples
- Fix various typos and inconsistencies.
👥 Contributors
Thanks to @ohenrik, @tokestermw, @azarezade, @piratos, @mhaddy, @pktippa, @mdcclv, @oxinabox, @SThomasP, @DuyguA, @emulbreh, @ursachec and @enerrio for the pull requests and contributions.