✨ Major features and improvements
- NEW: Alpha tokenization for Hebrew.
- NEW: Experimental `train` and `package` commands to train a model and convert it to a Python package.
- Enable experimental support for L1-regularized regression loss in dependency parser and named entity recognizer. Should improve fine-tuning of existing models.
- Fix high memory usage in `download` command.
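
For readers unfamiliar with the term, the general idea behind an L1-regularized regression loss can be sketched as follows. This is a generic illustration only, not spaCy's implementation: the function name `l1_regularized_loss`, the squared-error data term and the `l1_strength` parameter are all assumptions made for the example.

```python
def l1_regularized_loss(predictions, targets, weights, l1_strength=0.1):
    """Mean squared error plus an L1 penalty on the weights.

    Hypothetical helper for illustration; not part of spaCy.
    """
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    # Data term: average squared difference between predictions and targets.
    squared_errors = [(p - t) ** 2 for p, t in zip(predictions, targets)]
    data_loss = sum(squared_errors) / len(squared_errors)
    # L1 term: sum of absolute weight values, scaled by the penalty strength.
    penalty = l1_strength * sum(abs(w) for w in weights)
    return data_loss + penalty


loss = l1_regularized_loss([1.0, 2.0], [1.0, 1.0], [0.5, -1.0, 0.0])
print(loss)
```

The L1 term penalizes the absolute size of each weight, which tends to push unneeded weights toward zero.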
🔴 Bug fixes
- Fix issue #903, #912: Base forms are now correctly protected from lemmatization.
- Fix issue #909, #925: Use `mklink` to create symlinks in Python 2 on Windows.
- Fix issue #910: Update config when adding label to pre-trained model.
- Fix issue #911: Delete old training scripts.
- Fix issue #918: Use `--no-cache-dir` when downloading models via pip.
- Fixed infinite recursion in `spacy.info`.
- Fix initialisation of languages when no model is available.
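
As a rough illustration of the `--no-cache-dir` fix above, the sketch below builds a pip command that skips pip's local cache. It is not spaCy's actual download code: the helper `build_pip_install_command` and the URL in `MODEL_URL` are hypothetical placeholders, and only the pip flag itself comes from the release notes.

```python
import sys


def build_pip_install_command(package_url, use_cache=False):
    """Return the argument list for installing a package with pip.

    Hypothetical helper for illustration; not part of spaCy.
    """
    # Call pip through the current interpreter so the package is
    # installed into the same environment.
    command = [sys.executable, "-m", "pip", "install"]
    if not use_cache:
        # Tell pip not to read from or write to its local cache.
        command.append("--no-cache-dir")
    command.append(package_url)
    return command


# Placeholder URL (assumption), not a real model location.
MODEL_URL = "https://example.com/models/en_model-1.0.0.tar.gz"
command = build_pip_install_command(MODEL_URL)
print(" ".join(command))
```

The command is only printed here. To run it, the list could be passed to `subprocess.check_call(command)`.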
📖 Documentation and examples
- Troubleshooting guide for most common issues and usage problems.
- Documentation for new `package` and `train` commands.
- Documentation for spaCy's JSON format for training data.
- Fix various typos and inconsistencies.
👥 Contributors
Thanks to @raphael0202, @pavlin99th, @iddoberger and @solresol for the pull requests!