github explosion/spacy-models ca_core_news_trf-3.8.0

Downloads Downloads (wheel)

Checksum .tar.gz: 5f47057a0ac0a2c8be5b894aa7583dd22e1db3a8594c17316f160a958a830448
Checksum .whl: f0bf7b6d1c88640a488e4445f0a5454694a1e78ea8f63792a877b6a899820f01

Details: https://spacy.io/models/ca#ca_core_news_trf

Catalan transformer pipeline (Transformer(name='projecte-aina/roberta-base-ca-v2', piece_encoder='byte-bpe', stride=112, type='roberta', width=768, window=144, vocab_size=50262)). Components: transformer, morphologizer, parser, ner, attribute_ruler, lemmatizer.

Feature Description
Name ca_core_news_trf
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline transformer, morphologizer, parser, attribute_ruler, lemmatizer, ner
Components transformer, morphologizer, parser, attribute_ruler, lemmatizer, ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources UD Catalan AnCora v2.8 (Martínez Alonso, Héctor; Pascual, Elena; Zeman, Daniel)
UD Catalan AnCora v2.8 + NER v3.2.9 (Carlos Rodríguez-Penagos and Carme Armentano-Oller)
Catalan Lemmatizer (Text Mining Unit, Barcelona Supercomputing Center)
projecte-aina/roberta-base-ca-v2 (Text Mining Unit (TeMU) at the Barcelona Supercomputing Center)
License GNU GPL 3.0
Author Explosion
Model size 435 MB

Label Scheme

View label scheme (316 labels for 3 components)
Component Labels
morphologizer Definite=Def|Gender=Masc|Number=Sing|POS=DET|PronType=Art, POS=PROPN, POS=PUNCT|PunctSide=Ini|PunctType=Brck, POS=PUNCT|PunctSide=Fin|PunctType=Brck, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Gender=Masc|Number=Sing|POS=VERB|Tense=Past|VerbForm=Part, Definite=Def|Gender=Fem|Number=Sing|POS=DET|PronType=Art, Gender=Fem|Number=Sing|POS=NOUN, POS=ADP, NumType=Card|Number=Plur|POS=NUM, Gender=Masc|Number=Plur|POS=NOUN, Number=Sing|POS=ADJ, POS=CCONJ, Gender=Fem|Number=Sing|POS=DET|PronType=Ind, NumForm=Digit|NumType=Card|POS=NUM, NumForm=Digit|POS=NOUN, Gender=Masc|Number=Plur|POS=ADJ, POS=PUNCT|PunctType=Comm, POS=AUX|VerbForm=Inf, Case=Acc,Dat|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes, Definite=Def|Gender=Masc|Number=Plur|POS=DET|PronType=Art, POS=PRON|PronType=Rel, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin, Gender=Fem|Number=Sing|POS=DET|PronType=Art, Gender=Fem|Number=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs, Definite=Def|Gender=Fem|Number=Plur|POS=DET|PronType=Art, Gender=Fem|Number=Plur|POS=NOUN, Gender=Fem|Number=Plur|POS=ADJ, POS=VERB|VerbForm=Inf, Case=Acc,Dat|Number=Plur|POS=PRON|Person=3|PronType=Prs, Number=Plur|POS=ADJ, POS=PUNCT|PunctType=Peri, Number=Sing|POS=PRON|PronType=Rel, Gender=Masc|Number=Sing|POS=NOUN, Mood=Imp|Number=Sing|POS=VERB|Person=2|VerbForm=Fin, Gender=Masc|Number=Plur|POS=ADJ|VerbForm=Part, POS=SCONJ, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Gender=Masc|Number=Plur|POS=VERB|Tense=Past|VerbForm=Part, Definite=Def|Number=Sing|POS=DET|PronType=Art, Gender=Masc|Number=Sing|POS=DET|PronType=Ind, Gender=Fem|Number=Plur|POS=ADJ|VerbForm=Part, Gender=Masc|Number=Sing|POS=DET|PronType=Dem, POS=VERB|VerbForm=Ger, POS=NOUN, Gender=Fem|NumType=Card|Number=Sing|POS=NUM, Gender=Fem|Number=Sing|POS=ADJ|VerbForm=Part, Gender=Fem|NumType=Ord|Number=Plur|POS=ADJ, POS=SYM, Gender=Masc|Number=Sing|POS=ADJ, Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Gender=Fem|Number=Sing|POS=DET|PronType=Dem, POS=ADV|Polarity=Neg, POS=ADV, Number=Sing|POS=PRON|PronType=Dem, Number=Sing|POS=NOUN, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Number=Plur|POS=NOUN, Mood=Sub|Number=Plur|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin, Gender=Fem|Number=Sing|POS=ADJ, Mood=Sub|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Gender=Masc|Number=Sing|POS=PRON|PronType=Tot, Case=Loc|POS=PRON|Person=3|PronType=Prs, Gender=Fem|NumType=Ord|Number=Sing|POS=ADJ, Degree=Cmp|POS=ADV, Gender=Fem|Number=Plur|POS=DET|PronType=Art, Gender=Fem|Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Fut|VerbForm=Fin, NumType=Card|POS=NUM, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Number=Sing|POS=PRON|PronType=Ind, Gender=Masc|Number=Sing|POS=DET|PronType=Art, Number=Plur|POS=DET|PronType=Ind, Mood=Sub|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Gender=Masc|Number=Plur|POS=DET|PronType=Dem, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Fut|VerbForm=Fin, Gender=Masc|NumType=Card|Number=Sing|POS=NUM, Mood=Sub|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Case=Acc|Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Prs, Number=Sing|POS=DET|PronType=Ind, POS=PUNCT, Number=Sing|POS=DET|PronType=Rel, Case=Gen|POS=PRON|Person=3|PronType=Prs, Gender=Fem|NumType=Card|Number=Plur|POS=NUM, Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, POS=DET|PronType=Ind, POS=AUX, Case=Acc|Gender=Neut|Number=Sing|POS=PRON|Person=3|PronType=Prs, Case=Acc,Dat|Number=Plur|POS=PRON|Person=1|PronType=Prs, Degree=Cmp|Number=Sing|POS=ADJ, Number=Sing|POS=VERB, Gender=Masc|Number=Plur|POS=PRON|PronType=Ind, Gender=Fem|Number=Plur|POS=DET|PronType=Dem, Gender=Masc|Number=Plur|POS=DET|PronType=Art, Gender=Masc|Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs, Case=Acc|Gender=Fem,Masc|Number=Sing|POS=PRON|Person=3|PronType=Prs, Gender=Fem|Number=Sing|POS=VERB|Tense=Past|VerbForm=Part, Gender=Masc|Number=Sing|POS=PRON|PronType=Ind, Gender=Fem|Number=Plur|POS=PRON|PronType=Ind, Mood=Sub|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Number=Plur|POS=PRON|PronType=Rel, Gender=Masc|Number=Plur|POS=DET|PronType=Int, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Imp|VerbForm=Fin, AdvType=Tim|POS=NOUN, Gender=Masc|Number=Plur|POS=DET|PronType=Ind, Gender=Fem|Number=Plur|POS=DET|PronType=Ind, Gender=Masc|Number=Sing|POS=DET|PronType=Int, Mood=Cnd|Number=Sing|POS=AUX|Person=3|VerbForm=Fin, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin, Number=Sing|POS=DET|PronType=Art, Gender=Masc|Number=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs, Case=Acc|Gender=Masc|Number=Sing|POS=PRON|Person=3|PronType=Prs, Gender=Masc|Number=Sing|POS=PRON|PronType=Int, POS=PUNCT|PunctType=Semi, Mood=Cnd|Number=Plur|POS=AUX|Person=3|VerbForm=Fin, Case=Dat|Number=Sing|POS=PRON|Person=3|PronType=Prs, Gender=Masc|NumType=Card|Number=Plur|POS=NUM, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Imp|VerbForm=Fin, Gender=Fem|Number=Sing|POS=PRON|PronType=Ind, Mood=Sub|Number=Sing|POS=AUX|Person=3|Tense=Imp|VerbForm=Fin, NumForm=Digit|POS=SYM, Gender=Masc|Number=Sing|POS=AUX|Tense=Past|VerbForm=Part, Gender=Fem|Number=Sing|POS=PRON|PronType=Int, Gender=Fem|Number=Sing|POS=DET|PronType=Int, POS=PRON|PronType=Int, Gender=Fem|Number=Plur|POS=DET|PronType=Int, Mood=Cnd|Number=Sing|POS=VERB|Person=3|VerbForm=Fin, Mood=Cnd|Number=Plur|POS=VERB|Person=3|VerbForm=Fin, POS=PART, Gender=Fem|Number=Sing|POS=PRON|PronType=Dem, Gender=Masc|Number=Sing|POS=DET|PronType=Tot, Gender=Masc|Number=Plur|POS=PRON|PronType=Dem, POS=ADJ, Gender=Masc|Number=Plur|POS=PRON|Person=3|PronType=Prs, Degree=Cmp|Number=Plur|POS=ADJ, POS=PUNCT|PunctType=Dash, Mood=Sub|Number=Sing|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Case=Acc|Gender=Fem|Number=Plur|POS=PRON|Person=3|PronType=Prs, Mood=Sub|Number=Sing|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin, Gender=Fem|Number=Plur|POS=VERB|Tense=Past|VerbForm=Part, Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Prs, Gender=Masc|POS=NOUN, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Past|VerbForm=Fin, Gender=Fem|Number=Plur|POS=PRON|PronType=Int, Gender=Masc|NumType=Ord|Number=Plur|POS=ADJ, Mood=Ind|Number=Plur|POS=AUX|Person=1|Tense=Fut|VerbForm=Fin, POS=PUNCT|PunctType=Colo, Gender=Masc|NumType=Card|POS=NUM, Gender=Masc|Number=Sing|POS=PRON|Person=3|PronType=Prs, Number=Sing|POS=PRON|PronType=Int, POS=PUNCT|PunctType=Quot, Mood=Imp|Number=Sing|POS=VERB|Person=3|VerbForm=Fin, Gender=Fem|Number=Sing|Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs, Gender=Masc|Number=Sing|Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs, Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Fut|VerbForm=Fin, POS=AUX|VerbForm=Ger, Gender=Fem|Number=Plur|POS=PRON|Person=3|PronType=Prs, Mood=Imp|Number=Sing|POS=AUX|Person=3|VerbForm=Fin, Number=Plur|POS=PRON|PronType=Ind, Gender=Masc|Number=Sing|POS=PRON|PronType=Dem, Case=Acc,Dat|Number=Sing|POS=PRON|Person=2|Polite=Infm|PrepCase=Npr|PronType=Prs, Gender=Masc|Number=Plur|POS=PRON|PronType=Int, Mood=Ind|Number=Plur|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin, NumForm=Digit|NumType=Frac|POS=NUM, POS=VERB, Gender=Fem|Number=Plur|POS=PRON|PronType=Dem, Gender=Fem|POS=NOUN, Case=Acc,Dat|Number=Sing|POS=PRON|Person=1|PrepCase=Npr|PronType=Prs, Mood=Sub|Number=Plur|POS=VERB|Person=2|Tense=Pres|VerbForm=Fin, Mood=Ind|Number=Plur|POS=AUX|Person=2|Tense=Fut|VerbForm=Fin, Mood=Sub|Number=Plur|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin, Mood=Sub|Number=Plur|POS=AUX|Person=3|Tense=Imp|VerbForm=Fin, Number=Plur|POS=PRON|Person=1|PronType=Prs, Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Case=Nom|Number=Sing|POS=PRON|Person=2|Polite=Infm|PronType=Prs, POS=X, Mood=Cnd|Number=Plur|POS=AUX|Person=1|VerbForm=Fin, Number=Sing|POS=DET|PronType=Dem, POS=DET, Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Fut|VerbForm=Fin, Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin, POS=DET|PronType=Art, Gender=Masc|Number=Sing|POS=PRON|Person=3|Poss=Yes|PronType=Prs, NumType=Ord|Number=Sing|POS=ADJ, Gender=Fem|Number=Sing|POS=AUX|Tense=Past|VerbForm=Part, Number=Plur|Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs, Gender=Fem|Number=Plur|POS=AUX|Tense=Past|VerbForm=Part, Gender=Masc|Number=Plur|POS=AUX|Tense=Past|VerbForm=Part, Number=Plur|POS=PRON|PronType=Dem, Mood=Imp|Number=Plur|POS=VERB|Person=1|VerbForm=Fin, POS=PRON|PronType=Ind, Mood=Ind|Number=Sing|POS=VERB|Person=2|Tense=Pres|VerbForm=Fin, Mood=Imp|Number=Plur|POS=VERB|Person=3|VerbForm=Fin, Case=Nom|Number=Sing|POS=PRON|Person=1|PronType=Prs, Case=Acc|Number=Sing|POS=PRON|Person=1|PrepCase=Pre|PronType=Prs, Mood=Ind|Number=Sing|POS=AUX|Person=2|Tense=Pres|VerbForm=Fin, Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin, POS=PUNCT|PunctSide=Fin|PunctType=Qest, NumForm=Digit|NumType=Ord|POS=ADJ, Case=Acc|POS=PRON|Person=3|PrepCase=Pre|PronType=Prs|Reflex=Yes, NumForm=Digit|NumType=Frac|POS=SYM, Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Pres|VerbForm=Fin, Gender=Masc|Number=Sing|Number[psor]=Sing|POS=DET|Person=2|Poss=Yes|PronType=Prs, Gender=Masc|Number=Plur|POS=PRON|Person=3|Poss=Yes|PronType=Prs, Mood=Sub|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, POS=PUNCT|PunctSide=Ini|PunctType=Qest, NumType=Card|Number=Sing|POS=NUM, Foreign=Yes|POS=PRON|PronType=Int, Foreign=Yes|Mood=Ind|POS=VERB|VerbForm=Fin, Foreign=Yes|POS=ADP, Gender=Masc|Number=Sing|POS=PROPN, POS=PUNCT|PunctSide=Ini|PunctType=Excl, POS=PUNCT|PunctSide=Fin|PunctType=Excl, Mood=Cnd|Number=Sing|POS=AUX|Person=1|VerbForm=Fin, Number=Plur|POS=PRON|Person=2|Polite=Form|PronType=Prs, Mood=Sub|POS=AUX|Person=1|Tense=Imp|VerbForm=Fin, POS=PUNCT|PunctSide=Ini|PunctType=Comm, POS=PUNCT|PunctSide=Fin|PunctType=Comm, Number=Plur|POS=PRON|Person=2|PronType=Prs, Mood=Ind|Number=Plur|POS=AUX|Person=2|Tense=Pres|VerbForm=Fin, Case=Acc,Dat|Number=Plur|POS=PRON|Person=2|PronType=Prs, Mood=Cnd|Number=Sing|POS=VERB|Person=1|VerbForm=Fin, Mood=Cnd|Number=Plur|POS=VERB|Person=1|VerbForm=Fin, Mood=Ind|Number=Plur|POS=AUX|Person=1|Tense=Imp|VerbForm=Fin, Gender=Masc|Number=Plur|Number[psor]=Sing|POS=DET|Person=1|Poss=Yes|PronType=Prs, Definite=Ind|Gender=Masc|Number=Sing|POS=DET|PronType=Art, Number=Sing|POS=PRON|Person=2|Polite=Form|PronType=Prs, Gender=Masc|Number=Sing|Number[psor]=Sing|POS=DET|Person=1|Poss=Yes|PronType=Prs, Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin, POS=VERB|Tense=Past|VerbForm=Part, Mood=Imp|Number=Plur|POS=AUX|Person=3|VerbForm=Fin, Case=Nom|POS=PRON|Person=3|PronType=Prs, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Past|VerbForm=Fin, Gender=Fem|Number=Sing|POS=PRON|Person=3|Poss=Yes|PronType=Prs, Gender=Masc|Number=Sing|POS=PRON|PronType=Rel, Definite=Ind|Number=Sing|POS=DET|PronType=Art, Gender=Masc|Number=Sing|Number[psor]=Plur|POS=PRON|Person=1|Poss=Yes|PronType=Prs, Number=Plur|Number[psor]=Plur|POS=PRON|Person=1|Poss=Yes|PronType=Prs, POS=AUX|Tense=Past|VerbForm=Part, Gender=Fem|NumType=Card|POS=NUM, Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Imp|VerbForm=Fin, Mood=Sub|Number=Sing|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin, Gender=Fem|Number=Plur|POS=PRON|Person=3|Poss=Yes|PronType=Prs, Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Fut|VerbForm=Fin, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Past|VerbForm=Fin, AdvType=Tim|Degree=Cmp|POS=ADV, Case=Acc|Number=Sing|POS=PRON|Person=2|Polite=Infm|PrepCase=Pre|PronType=Prs, POS=DET|PronType=Rel, Definite=Ind|Gender=Fem|Number=Plur|POS=DET|PronType=Art, Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Fut|VerbForm=Fin, POS=INTJ, Mood=Sub|Number=Sing|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin, POS=VERB|VerbForm=Fin, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Past|VerbForm=Fin, Definite=Ind|Gender=Fem|Number=Sing|POS=DET|PronType=Art, Mood=Sub|Number=Plur|POS=AUX|Person=1|Tense=Imp|VerbForm=Fin, Gender=Fem|Number=Sing|Number[psor]=Sing|POS=PRON|Person=3|Poss=Yes|PronType=Prs, Mood=Sub|Number=Sing|POS=VERB|Person=2|Tense=Pres|VerbForm=Fin, Case=Acc|POS=PRON|Person=3|PronType=Prs|Reflex=Yes, Foreign=Yes|POS=NOUN, Foreign=Yes|Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Foreign=Yes|Gender=Masc|Number=Sing|POS=PRON|Person=3|PronType=Prs, Foreign=Yes|POS=SCONJ, Foreign=Yes|Gender=Fem|Number=Sing|POS=DET|PronType=Art, Gender=Masc|POS=SYM, Gender=Fem|Number=Sing|Number[psor]=Sing|POS=DET|Person=2|Poss=Yes|PronType=Prs, Number=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs, Gender=Masc|Number=Plur|Number[psor]=Sing|POS=DET|Person=2|Poss=Yes|PronType=Prs, Gender=Fem|Number=Sing|POS=PROPN, Mood=Sub|Number=Plur|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin, Definite=Def|Foreign=Yes|Gender=Masc|Number=Sing|POS=DET|PronType=Art, Foreign=Yes|POS=VERB, Foreign=Yes|POS=ADJ, Foreign=Yes|POS=DET, Foreign=Yes|POS=ADV, POS=PUNCT|PunctSide=Fin|Punta d'aignctType=Brck, Degree=Cmp|POS=ADJ, AdvType=Tim|POS=SYM, Number=Plur|POS=DET|PronType=Dem, Mood=Ind|Number=Sing|POS=VERB|Person=2|Tense=Fut|VerbForm=Fin
parser ROOT, acl, advcl, advmod, amod, appos, aux, case, cc, ccomp, compound, conj, cop, csubj, dep, det, expl:pass, fixed, flat, iobj, mark, nmod, nsubj, nummod, obj, obl, parataxis, punct, xcomp
ner LOC, MISC, ORG, PER

Accuracy

Type Score
TOKEN_ACC 99.93
TOKEN_P 99.78
TOKEN_R 99.79
TOKEN_F 99.79
POS_ACC 99.19
MORPH_ACC 98.96
MORPH_MICRO_P 99.73
MORPH_MICRO_R 99.46
MORPH_MICRO_F 99.60
SENTS_P 99.65
SENTS_R 99.47
SENTS_F 99.56
DEP_UAS 95.23
DEP_LAS 93.65
TAG_ACC 99.19
LEMMA_ACC 98.19
ENTS_P 91.95
ENTS_R 91.23
ENTS_F 91.59

Installation

pip install spacy
python -m spacy download ca_core_news_trf

Don't miss a new spacy-models release

NewReleases is sending notifications on new releases.