Checksum .tar.gz:
66908d86c61fd6b361ebb0f655d6a5a03f0a3668e6c8e7c59faf019440362922
Checksum .whl:dc6e02a035ec2de9877c0a2e43b04687917cc564bc51beb31fdf822c6bde96f9
Details: https://spacy.io/models/mk#mk_core_news_lg
Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.
Feature | Description |
---|---|
Name | mk_core_news_lg
|
Version | 3.1.0
|
spaCy | >=3.1.0,<3.2.0
|
Default Pipeline | morphologizer , parser , attribute_ruler , lemmatizer , ner
|
Components | morphologizer , parser , senter , attribute_ruler , lemmatizer , ner
|
Vectors | 274587 keys, 274587 unique vectors (300 dimensions) |
Sources | Macedonian Corpus (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska) Macedonian Corpus (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska) Macedonian Corpus (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska) spaCy lookups data (Explosion) Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia) (Explosion) |
License | CC BY-SA 4.0
|
Author | Explosion |
Model size | 311 MB |
Label Scheme
View label scheme (55 labels for 4 components)
Component | Labels |
---|---|
morphologizer
| POS=PROPN , POS=AUX , POS=ADJ , POS=NOUN , POS=ADP , POS=PUNCT , POS=CONJ , POS=NUM , POS=VERB , POS=PRON , POS=ADV , POS=SCONJ , POS=PART , POS=SYM , POS=X , _ , POS=INTJ
|
parser
| ROOT , advmod , att , aux , cc , dep , det , dobj , iobj , neg , nsubj , pobj , poss , pozm , pozv , prep , punct , relcl
|
senter
| I , S
|
ner
| CARDINAL , DATE , EVENT , FAC , GPE , LANGUAGE , LAW , LOC , MONEY , NORP , ORDINAL , ORG , PERCENT , PERSON , PRODUCT , QUANTITY , TIME , WORK_OF_ART
|
Accuracy
Type | Score |
---|---|
TOKEN_ACC
| 100.00 |
POS_ACC
| 93.28 |
SENTS_P
| 70.42 |
SENTS_R
| 64.94 |
SENTS_F
| 67.57 |
DEP_UAS
| 65.62 |
DEP_LAS
| 51.08 |
ENTS_P
| 75.50 |
ENTS_R
| 74.47 |
ENTS_F
| 74.98 |
Installation
pip install spacy
python -m spacy download mk_core_news_lg