Details: https://spacy.io/models/xx#xx_ent_wiki_sm
File checksum:
f686c0ba1c1144d7601e0c037247802bc98249ae6ee65b6c237016ba7cad7f74
Multi-lingual CNN trained on Nothman et al. (2010) Wikipedia corpus. Assigns named entities. Supports identification of PER, LOC, ORG and MISC entities for Dutch, English, French, German, French, Italian, Polish, Portuguese, Russian and Spanish.
Feature | Description |
---|---|
Name | xx_ent_wiki_sm
|
Version | 2.3.0
|
spaCy | >=2.3.0,<2.4.0
|
Model size | 3 MB |
Pipeline | ner
|
Vectors | 0 keys, 0 unique vectors (0 dimensions) |
Sources | WikiNER |
License | MIT
|
Author | Explosion |
Label Scheme
Component | Labels |
---|---|
ner
| LOC , MISC , ORG , PER
|
Accuracy
Type | Score |
---|---|
ENTS_F
| 82.67 |
ENTS_P
| 83.10 |
ENTS_R
| 82.24 |
Because the model is trained on Wikipedia, it may perform inconsistently on many genres, such as social media text.
Installation
pip install spacy
python -m spacy download xx_ent_wiki_sm