explosion/spacy-models (GitHub): mk_core_news_lg-3.1.0

Checksum .tar.gz: 66908d86c61fd6b361ebb0f655d6a5a03f0a3668e6c8e7c59faf019440362922
Checksum .whl: dc6e02a035ec2de9877c0a2e43b04687917cc564bc51beb31fdf822c6bde96f9
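
A downloaded archive can be checked against the digests above before installing it. The following is a minimal sketch, assuming the checksums are SHA-256 (they are 64 hex characters long, which is consistent with that, but the release notes do not name the algorithm). The helper names `sha256_of` and `matches_release` are illustrative and not part of spaCy.

```python
import hashlib
from pathlib import Path

# Digests copied from the checksum lines above, keyed by file suffix.
# Assumption: these are SHA-256 digests.
EXPECTED_SHA256 = {
    ".tar.gz": "66908d86c61fd6b361ebb0f655d6a5a03f0a3668e6c8e7c59faf019440362922",
    ".whl": "dc6e02a035ec2de9877c0a2e43b04687917cc564bc51beb31fdf822c6bde96f9",
}


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_release(path):
    """Compare a downloaded file against the digest listed for its suffix."""
    name = Path(path).name
    for suffix, expected in EXPECTED_SHA256.items():
        if name.endswith(suffix):
            return sha256_of(path) == expected
    raise ValueError(f"no checksum listed for {name!r}")
```

For example, `matches_release("mk_core_news_lg-3.1.0.tar.gz")` would return `True` only if the local file (the file name here is an assumed download location) hashes to the listed `.tar.gz` digest.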

Details: https://spacy.io/models/mk#mk_core_news_lg

Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.

| Feature | Description |
| --- | --- |
| Name | mk_core_news_lg |
| Version | 3.1.0 |
| spaCy | >=3.1.0,<3.2.0 |
| Default Pipeline | morphologizer, parser, attribute_ruler, lemmatizer, ner |
| Components | morphologizer, parser, senter, attribute_ruler, lemmatizer, ner |
| Vectors | 274587 keys, 274587 unique vectors (300 dimensions) |
| Sources | Macedonian Corpus (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska); spaCy lookups data (Explosion); Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia) (Explosion) |
| License | CC BY-SA 4.0 |
| Author | Explosion |
| Model size | 311 MB |
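
The spaCy requirement `>=3.1.0,<3.2.0` means the pipeline is meant for spaCy 3.1.x only. As a rough illustration of how that range reads, the sketch below checks a plain `X.Y.Z` version string against it. The function names are hypothetical, and the parsing deliberately ignores pre-release or post-release suffixes.

```python
def parse_version(text):
    """Turn a plain 'X.Y.Z' version string into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def is_compatible(spacy_version, lower="3.1.0", upper="3.2.0"):
    """True when lower <= spacy_version < upper, mirroring '>=3.1.0,<3.2.0'."""
    return parse_version(lower) <= parse_version(spacy_version) < parse_version(upper)


print(is_compatible("3.1.4"))  # inside the range
print(is_compatible("3.2.0"))  # upper bound is exclusive
```

For full PEP 440 handling, the `SpecifierSet` class in the `packaging` library is the more robust choice; the tuple comparison here only covers simple numeric versions.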

Label Scheme

55 labels for 4 components

| Component | Labels |
| --- | --- |
| morphologizer | POS=PROPN, POS=AUX, POS=ADJ, POS=NOUN, POS=ADP, POS=PUNCT, POS=CONJ, POS=NUM, POS=VERB, POS=PRON, POS=ADV, POS=SCONJ, POS=PART, POS=SYM, POS=X, _, POS=INTJ |
| parser | ROOT, advmod, att, aux, cc, dep, det, dobj, iobj, neg, nsubj, pobj, poss, pozm, pozv, prep, punct, relcl |
| senter | I, S |
| ner | CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART |

Accuracy

| Type | Score |
| --- | --- |
| TOKEN_ACC | 100.00 |
| POS_ACC | 93.28 |
| SENTS_P | 70.42 |
| SENTS_R | 64.94 |
| SENTS_F | 67.57 |
| DEP_UAS | 65.62 |
| DEP_LAS | 51.08 |
| ENTS_P | 75.50 |
| ENTS_R | 74.47 |
| ENTS_F | 74.98 |
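
The `_P`, `_R`, and `_F` rows are related: assuming `_F` is the usual F1 score (the harmonic mean of precision and recall), the F values can be recomputed from the other two. The short sketch below does that for the sentence and entity scores; `f_score` is an illustrative helper, not a spaCy function.

```python
def f_score(precision, recall):
    """Harmonic mean of precision and recall (F1), on the same scale as the inputs."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall values taken from the accuracy table.
sents_f = f_score(70.42, 64.94)
ents_f = f_score(75.50, 74.47)

print(round(sents_f, 2))  # 67.57, matching SENTS_F
print(round(ents_f, 2))   # 74.98, matching ENTS_F
```

Both recomputed values agree with the reported ones to two decimal places, which supports reading the `_F` rows as F1.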

Installation

pip install spacy
python -m spacy download mk_core_news_lg
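
Once both commands have run, the pipeline can be loaded by name. The following is a minimal sketch, assuming spaCy 3.1.x and the model are installed in the current environment. The `analyze` helper and its return format are illustrative and not part of the release.

```python
MODEL_NAME = "mk_core_news_lg"


def analyze(text, model_name=MODEL_NAME):
    """Run the pipeline on text and collect a few token and entity annotations."""
    # Imported lazily so this module can be loaded before spaCy is installed.
    import spacy

    nlp = spacy.load(model_name)
    doc = nlp(text)
    return {
        # Names of the components that are active in the loaded pipeline.
        "pipeline": list(nlp.pipe_names),
        # Surface form, coarse POS tag, dependency label, and lemma per token.
        "tokens": [(token.text, token.pos_, token.dep_, token.lemma_) for token in doc],
        # Named entities with labels from the ner label scheme.
        "entities": [(ent.text, ent.label_) for ent in doc.ents],
    }
```

Calling `analyze("Скопје е главен град на Северна Македонија.")` (a made-up sample sentence) returns the active component names together with per-token tags and any recognized entities.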
