github explosion/spacy-models mk_core_news_lg-3.3.0

Checksum .tar.gz: f465b184c5d7d04c0165c099772c0b72f65f03b7a52d2d4b1be7fb6dd116069f
Checksum .whl: 9217e0187410318d2566120ed0de095420e19ad0c15d6a299fec3c1b3d33d248
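
The checksums above can be used to confirm that a downloaded archive is intact. The following is a minimal sketch, assuming the checksums are SHA-256 digests (they are 64 hexadecimal characters long, which is consistent with SHA-256, but the page does not name the algorithm). The helper names `sha256_of` and `matches_published` are illustrative and not part of spaCy.

```python
import hashlib
from pathlib import Path

# Checksums as published on this page, keyed by file suffix.
# Assumption: they are SHA-256 digests (64 hex characters each).
PUBLISHED = {
    ".tar.gz": "f465b184c5d7d04c0165c099772c0b72f65f03b7a52d2d4b1be7fb6dd116069f",
    ".whl": "9217e0187410318d2566120ed0de095420e19ad0c15d6a299fec3c1b3d33d248",
}


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path):
    """Compare a downloaded file's digest with the published checksum for its suffix."""
    name = Path(path).name
    for suffix, expected in PUBLISHED.items():
        if name.endswith(suffix):
            return sha256_of(path) == expected
    raise ValueError(f"no published checksum for {name!r}")
```

Calling `matches_published` with the path of the downloaded `.whl` or `.tar.gz` file returns `True` only if the file's digest equals the published value.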

Details: https://spacy.io/models/mk#mk_core_news_lg

Macedonian pipeline optimized for CPU. Components: morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.

Name: mk_core_news_lg
Version: 3.3.0
spaCy: >=3.3.0.dev0,<3.4.0
Default Pipeline: morphologizer, parser, attribute_ruler, lemmatizer, ner
Components: morphologizer, parser, senter, attribute_ruler, lemmatizer, ner
Vectors: 274587 keys, 274587 unique vectors (300 dimensions)
Sources:
  Macedonian Corpus (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska)
  spaCy lookups data (Explosion)
  Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia) (Explosion)
License: CC BY-SA 4.0
Author: Explosion
Model size: 310 MB

Label Scheme

53 labels for 3 components:

morphologizer: POS=PROPN, POS=AUX, POS=ADJ, POS=NOUN, POS=ADP, POS=PUNCT, POS=CONJ, POS=NUM, POS=VERB, POS=PRON, POS=ADV, POS=SCONJ, POS=PART, POS=SYM, POS=X, _, POS=INTJ
parser: ROOT, advmod, att, aux, cc, dep, det, dobj, iobj, neg, nsubj, pobj, poss, pozm, pozv, prep, punct, relcl
ner: CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type Score
TOKEN_ACC 100.00
TOKEN_P 100.00
TOKEN_R 100.00
TOKEN_F 100.00
SENTS_P 72.22
SENTS_R 67.53
SENTS_F 69.80
DEP_UAS 67.25
DEP_LAS 51.37
ENTS_P 75.39
ENTS_R 74.81
ENTS_F 75.10
POS_ACC 93.47
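
The F-scores in the list above are the harmonic mean of the corresponding precision and recall. As a quick consistency check, the sketch below recomputes SENTS_F and ENTS_F from the rounded P and R values listed here. The function name `f_score` is illustrative. The published scores are presumably computed from raw counts rather than from rounded percentages, so small rounding differences would be possible in general.

```python
def f_score(precision, recall):
    """Harmonic mean of precision and recall (F1), on the same scale as the inputs."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall as listed above, in percent.
sents_f = f_score(72.22, 67.53)
ents_f = f_score(75.39, 74.81)

print(f"SENTS_F {sents_f:.2f}")  # prints "SENTS_F 69.80"
print(f"ENTS_F {ents_f:.2f}")    # prints "ENTS_F 75.10"
```

Both recomputed values agree with the listed SENTS_F and ENTS_F to two decimal places.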

Installation

pip install spacy
python -m spacy download mk_core_news_lg
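
Once the two commands above have completed, the pipeline can be loaded by name. The following is a minimal sketch of typical usage, assuming spaCy and the `mk_core_news_lg` package are installed in the active environment. The function name `summarize` and the shape of its return value are illustrative choices, not part of spaCy.

```python
def summarize(text, model_name="mk_core_news_lg"):
    """Run the pipeline on `text` and collect sentences, token annotations, and entities."""
    # Imported inside the function so the definition itself does not require spaCy.
    import spacy

    nlp = spacy.load(model_name)  # raises OSError if the model package is not installed
    doc = nlp(text)
    return {
        # Sentence boundaries are set by the parser in the default pipeline.
        "sentences": [sent.text for sent in doc.sents],
        # Coarse part of speech, dependency label, and lemma for each token.
        "tokens": [(tok.text, tok.pos_, tok.dep_, tok.lemma_) for tok in doc],
        # Named entities with labels from the ner label scheme.
        "entities": [(ent.text, ent.label_) for ent in doc.ents],
    }
```

For example, `summarize("Скопје е главен град на Македонија.")` returns a dictionary with the three lists. The exact annotations depend on the model, so no output is shown here.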
