📊 Help us improve spaCy and take the User Survey 2018!
✨ New features and improvements
- Improve language data for Turkish and Croatian.
- Add built-in factories for
merge_entities
andmerge_noun_chunks
to allow models to specify those components as part of their pipeline.
merge_entities = nlp.create_pipe('merge_entities')
nlp.add_pipe(merge_entities, after='ner')
🔴 Bug fixes
- Fix issue #2012: Fix Spanish
noun_chunks
failure caused by typo. - Fix issue #2040: Make sure
Token.lemma
always returns a hash value. - Fix issue #2063: Correct typo in English lookup lemmatization table.
- Fix issue #2103: Correct typo in documentation.
- Fix pickling of
Vectors
class.
📖 Documentation and examples
- Add example for visualizing spaCy vectors with the TensorBoard Embedding Projector.
- Fix various typos and inconsistencies.
👥 Contributors
Thanks to @thomasopsomer, @alldefector, @DuyguA, @dejanmarich, @justindujardin, @calumcalder, @SebastinSanty, @iann0036, @doug-descombaz and @willismonroe for the pull requests and contributions.