github explosion/spacy-models mk_core_news_lg-3.2.0

Downloads: source archive (.tar.gz), wheel (.whl)

Checksum .tar.gz: 3de2f5d510a8f1442d636cbf162bb4da8b67de2c5ecd8d097b2b1645799c7839
Checksum .whl: 349c98aac5dd38fe7b55151472ff9d3a00dd87ad8a1e2623ae371417b9366383
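
The checksums above are 64 hexadecimal characters long, which is consistent with SHA-256; that algorithm is an assumption here, since the page does not name it. Under that assumption, a downloaded file could be checked roughly as follows. The helper names `sha256_of` and `matches_checksum` are illustrative and not part of spaCy.

```python
import hashlib
from pathlib import Path

# Checksum of the wheel as published on this page (assumed to be SHA-256).
EXPECTED_WHL = "349c98aac5dd38fe7b55151472ff9d3a00dd87ad8a1e2623ae371417b9366383"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        # Read fixed-size chunks so a ~300 MB archive is never fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's digest with a published checksum, ignoring case."""
    return sha256_of(path) == expected.strip().lower()
```

To use it, pass the local path of the downloaded wheel (wherever it was saved) together with `EXPECTED_WHL`; a `False` result suggests a corrupted or different file.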

Details: https://spacy.io/models/mk#mk_core_news_lg

Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.

Feature           Description
Name              mk_core_news_lg
Version           3.2.0
spaCy             >=3.2.0,<3.3.0
Default Pipeline  morphologizer, parser, attribute_ruler, lemmatizer, ner
Components        morphologizer, parser, senter, attribute_ruler, lemmatizer, ner
Vectors           274587 keys, 274587 unique vectors (300 dimensions)
Sources           Macedonian Corpus (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska)
                  Macedonian Corpus (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska)
                  Macedonian Corpus (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska)
                  spaCy lookups data (Explosion)
                  Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia) (Explosion)
License           CC BY-SA 4.0
Author            Explosion
Model size        312 MB
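
The spaCy constraint `>=3.2.0,<3.3.0` means the model targets the 3.2.x series only. A minimal sketch of that range check, using plain tuple comparison, is shown below. The function names are hypothetical, and pre-release suffixes are handled only crudely; the third-party `packaging` library is the more robust route for real version specifiers.

```python
import re


def parse_version(version):
    """Return the leading numeric (major, minor, patch) parts of a version string."""
    parts = []
    for piece in version.split(".")[:3]:
        match = re.match(r"\d+", piece)
        # Non-numeric or missing pieces count as 0 in this simplified sketch.
        parts.append(int(match.group()) if match else 0)
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def is_compatible(spacy_version, lower=(3, 2, 0), upper=(3, 3, 0)):
    """Check lower <= version < upper, mirroring '>=3.2.0,<3.3.0'."""
    return lower <= parse_version(spacy_version) < upper


for candidate in ("3.1.4", "3.2.0", "3.2.4", "3.3.0"):
    print(candidate, is_compatible(candidate))
```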

Label Scheme

55 labels for 4 components

Component      Labels
morphologizer  POS=PROPN, POS=AUX, POS=ADJ, POS=NOUN, POS=ADP, POS=PUNCT, POS=CONJ, POS=NUM, POS=VERB, POS=PRON, POS=ADV, POS=SCONJ, POS=PART, POS=SYM, POS=X, _, POS=INTJ
parser         ROOT, advmod, att, aux, cc, dep, det, dobj, iobj, neg, nsubj, pobj, poss, pozm, pozv, prep, punct, relcl
senter         I, S
ner            CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type       Score
TOKEN_ACC  100.00
TOKEN_P    100.00
TOKEN_R    100.00
TOKEN_F    100.00
SENTS_P    64.86
SENTS_R    62.34
SENTS_F    63.58
DEP_UAS    68.76
DEP_LAS    53.67
POS_ACC    93.39
ENTS_P     74.83
ENTS_R     74.38
ENTS_F     74.61
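
The `_F` rows are F-scores, that is, the harmonic mean of the matching `_P` (precision) and `_R` (recall) rows. The sketch below recomputes them from the rounded values in the table; because the inputs are already rounded to two decimals, the result can differ from the published figure in the last digit. The function name `f_score` is illustrative.

```python
def f_score(precision, recall):
    """Harmonic mean of precision and recall (both on the same 0-100 scale)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values copied from the accuracy table.
sents_f = f_score(64.86, 62.34)
ents_f = f_score(74.83, 74.38)

print(f"SENTS_F recomputed: {sents_f:.2f} (published 63.58)")
print(f"ENTS_F recomputed:  {ents_f:.2f} (published 74.61)")
```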

Installation

pip install spacy
python -m spacy download mk_core_news_lg
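
Once installed, the pipeline can be loaded by name. The following is a minimal sketch, not an official example: the helper `load_pipeline` and the sample sentence are illustrative, and the script falls back to a message if spaCy or the model is not available.

```python
def load_pipeline(name="mk_core_news_lg"):
    """Return the loaded pipeline, or None if spaCy or the model is missing."""
    try:
        import spacy  # imported lazily so the sketch degrades gracefully
        return spacy.load(name)
    except (ImportError, OSError):
        # spacy.load raises OSError when the named package cannot be found.
        return None


nlp = load_pipeline()
if nlp is None:
    print("spaCy or mk_core_news_lg is not installed; run the commands above first.")
else:
    print(nlp.pipe_names)
    doc = nlp("Скопје е главен град на Македонија.")
    # Part-of-speech tags come from the morphologizer, relations from the parser.
    for token in doc:
        print(token.text, token.pos_, token.dep_)
    # Named entities use the ner label set listed in the label scheme.
    for ent in doc.ents:
        print(ent.text, ent.label_)
```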
