Checksum .tar.gz: a6b57757baaf214ba9e7e788abbb6f25a03997a4874772aec038605014eb7735
Checksum .whl: 992db03e9c49c4320fbded1ad15d1ae8913b3cf6dc059592377a2b26b130469a
Details: https://spacy.io/models/mk#mk_core_news_sm
Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.
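The two checksums listed at the top of this card can be compared against a downloaded archive before installing it. The sketch below assumes the 64-character hex strings are SHA-256 digests (the card does not name the algorithm); the helper names `sha256_of` and `matches` are illustrative, not part of spaCy.

```python
import hashlib

# Values copied from the checksum lines of this card.
# Assumption: they are SHA-256 digests (64 hex characters each).
EXPECTED_TAR_GZ = "a6b57757baaf214ba9e7e788abbb6f25a03997a4874772aec038605014eb7735"
EXPECTED_WHL = "992db03e9c49c4320fbded1ad15d1ae8913b3cf6dc059592377a2b26b130469a"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected_hex):
    """True if the file's digest equals the expected checksum (case-insensitive)."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For a locally downloaded wheel, `matches(path_to_wheel, EXPECTED_WHL)` should return `True` if the file is intact and the SHA-256 assumption holds.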
Feature | Description |
---|---|
Name | mk_core_news_sm |
Version | 3.2.0 |
spaCy | >=3.2.0,<3.3.0 |
Default Pipeline | morphologizer, parser, attribute_ruler, lemmatizer, ner |
Components | morphologizer, parser, senter, attribute_ruler, lemmatizer, ner |
Vectors | 0 keys, 0 unique vectors (0 dimensions) |
Sources | Macedonian Corpus (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska); spaCy lookups data (Explosion) |
License | CC BY-SA 4.0 |
Author | Explosion |
Model size | 18 MB |
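The spaCy row in the table above is a version range, so this pipeline is expected to load only with spaCy 3.2.x. A minimal sketch of that range check follows; `parse_version` and `is_compatible` are hypothetical helpers written for illustration, and they handle only plain numeric versions such as `3.2.4`, not pre-release tags.

```python
def parse_version(text):
    """Turn '3.2.1' into (3, 2, 1), padding short versions like '3.2' with zeros."""
    parts = tuple(int(part) for part in text.split("."))
    return parts + (0,) * (3 - len(parts))


def is_compatible(spacy_version, lower="3.2.0", upper="3.3.0"):
    """Check lower <= version < upper, mirroring the range '>=3.2.0,<3.3.0'."""
    return parse_version(lower) <= parse_version(spacy_version) < parse_version(upper)


print(is_compatible("3.2.4"))
print(is_compatible("3.3.0"))
```

With spaCy installed, `is_compatible(spacy.__version__)` gives a quick check. For pre-release or otherwise non-numeric version strings, the `SpecifierSet` class from the third-party `packaging` library is likely a more robust choice.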
Label Scheme
55 labels for 4 components
Component | Labels |
---|---|
morphologizer | POS=PROPN, POS=AUX, POS=ADJ, POS=NOUN, POS=ADP, POS=PUNCT, POS=CONJ, POS=NUM, POS=VERB, POS=PRON, POS=ADV, POS=SCONJ, POS=PART, POS=SYM, POS=X, _, POS=INTJ |
parser | ROOT, advmod, att, aux, cc, dep, det, dobj, iobj, neg, nsubj, pobj, poss, pozm, pozv, prep, punct, relcl |
senter | I, S |
ner | CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART |
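The label scheme above can also be read from an installed pipeline instead of this card. The sketch below assumes spaCy and `mk_core_news_sm` are installed; it uses `Language.get_pipe` and the `labels` property of each trainable component, while `label_counts` is a hypothetical helper name.

```python
def label_counts(nlp, component_names=("morphologizer", "parser", "senter", "ner")):
    """Map each component name to the number of labels it reports."""
    return {name: len(nlp.get_pipe(name).labels) for name in component_names}


if __name__ == "__main__":
    try:
        import spacy

        nlp = spacy.load("mk_core_news_sm")
    except (ImportError, OSError) as exc:
        # spacy.load raises OSError when the package is not installed.
        print(f"spaCy or the pipeline is unavailable: {exc}")
    else:
        counts = label_counts(nlp)
        print(counts, "total:", sum(counts.values()))
```

The tuples returned by `labels` can be printed directly when the full label names are needed rather than the counts.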
Accuracy
Type | Score |
---|---|
TOKEN_ACC | 100.00 |
TOKEN_P | 100.00 |
TOKEN_R | 100.00 |
TOKEN_F | 100.00 |
SENTS_P | 67.11 |
SENTS_R | 66.23 |
SENTS_F | 66.67 |
ENTS_P | 73.00 |
ENTS_R | 70.64 |
ENTS_F | 71.80 |
POS_ACC | 91.64 |
DEP_UAS | 63.08 |
DEP_LAS | 47.60 |
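The `_F` rows in the table above are consistent with the usual F1 definition, the harmonic mean of the matching `_P` (precision) and `_R` (recall) rows. The sketch below recomputes them from the rounded table values as a sanity check; the card itself does not state the formula, so treat that as an assumption.

```python
def f_score(precision, recall):
    """Harmonic mean of precision and recall (F1); 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F) copied from the Accuracy table.
REPORTED = {
    "TOKEN": (100.00, 100.00, 100.00),
    "SENTS": (67.11, 66.23, 66.67),
    "ENTS": (73.00, 70.64, 71.80),
}

for name, (p, r, reported_f) in REPORTED.items():
    print(f"{name}_F: computed {f_score(p, r):.2f}, reported {reported_f:.2f}")
```

Because the inputs are already rounded to two decimals, agreement is only expected at that precision.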
Installation
pip install spacy
python -m spacy download mk_core_news_sm
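After the two commands above, the pipeline can be loaded by name. The following is a minimal usage sketch, assuming the install succeeded; the Macedonian sample sentence ("Skopje is the capital of Macedonia.") and the helper name `analyze` are illustrative and do not come from this card.

```python
def analyze(nlp, text):
    """Run a loaded pipeline on text and collect a few of its annotations."""
    doc = nlp(text)
    return {
        "tokens": [(t.text, t.pos_, t.dep_, t.lemma_) for t in doc],
        "entities": [(ent.text, ent.label_) for ent in doc.ents],
        "sentences": [sent.text for sent in doc.sents],
    }


if __name__ == "__main__":
    try:
        import spacy

        nlp = spacy.load("mk_core_news_sm")
    except (ImportError, OSError) as exc:
        # spacy.load raises OSError when the package is not installed.
        print(f"spaCy or the pipeline is unavailable: {exc}")
    else:
        result = analyze(nlp, "Скопје е главен град на Македонија.")
        for row in result["tokens"]:
            print(row)
        print(result["entities"])
        print(result["sentences"])
```

In the default pipeline the sentence boundaries come from the parser, so `doc.sents` is available without enabling `senter`.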