Checksum .tar.gz:
f6aa73ec329936e93d3d4924bf734ff1092d389a6c8ac1e9c095eb8ee3390283
Checksum .whl:5d18d1c1735492768a03f6d89744ae02e02a6ee7bb760ba38cacd353ccb1e654
Details: https://spacy.io/models/ca#ca_core_news_lg
Catalan pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.
Feature | Description |
---|---|
Name | ca_core_news_lg
|
Version | 3.2.0
|
spaCy | >=3.2.0,<3.3.0
|
Default Pipeline | tok2vec , morphologizer , parser , attribute_ruler , lemmatizer , ner
|
Components | tok2vec , morphologizer , parser , senter , attribute_ruler , lemmatizer , ner
|
Vectors | 500000 keys, 500000 unique vectors (300 dimensions) |
Sources | UD Catalan AnCora v2.8 (Martínez Alonso, Héctor; Pascual, Elena; Zeman, Daniel) UD Catalan AnCora v2.8 + NER v3.2.8 (Carlos Rodríguez-Penagos and Carme Armentano-Oller) Catalan Lemmatizer (Text Mining Unit, Barcelona Supercomputing Center) Catalan Word Embeddings in FastText (Version 1.0) (Gutiérrez-Fandiño, Asier, Armengol-Estapé, Jordi, Gonzalez-Agirre, Aitor, Carrino, Casimiro Pio, de Gibert, Ona, & Villegas, Marta) |
License | GNU GPL 3.0
|
Author | Explosion |
Model size | 548 MB |
Label Scheme
View label scheme (318 labels for 4 components)
Component | Labels |
---|---|
morphologizer
| Definite=Def|Gender=Masc|Number=Sing|POS=DET|PronType=Art , POS=PROPN , POS=PUNCT|PunctSide=Ini|PunctType=Brck , POS=PUNCT|PunctSide=Fin|PunctType=Brck , Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin , Gender=Masc|Number=Sing|POS=VERB|Tense=Past|VerbForm=Part , Definite=Def|Gender=Fem|Number=Sing|POS=DET|PronType=Art , Gender=Fem|Number=Sing|POS=NOUN , POS=ADP , NumType=Card|Number=Plur|POS=NUM , Gender=Masc|Number=Plur|POS=NOUN , Number=Sing|POS=ADJ , POS=CCONJ , Gender=Fem|Number=Sing|POS=DET|PronType=Ind , NumForm=Digit|NumType=Card|POS=NUM , NumForm=Digit|POS=NOUN , Gender=Masc|Number=Plur|POS=ADJ , POS=PUNCT|PunctType=Comm , POS=AUX|VerbForm=Inf , Case=Acc,Dat|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes , Definite=Def|Gender=Masc|Number=Plur|POS=DET|PronType=Art , POS=PRON|PronType=Rel , Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin , Gender=Fem|Number=Sing|POS=DET|PronType=Art , Gender=Fem|Number=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs , Definite=Def|Gender=Fem|Number=Plur|POS=DET|PronType=Art , Gender=Fem|Number=Plur|POS=NOUN , Gender=Fem|Number=Plur|POS=ADJ , POS=VERB|VerbForm=Inf , Case=Acc,Dat|Number=Plur|POS=PRON|Person=3|PronType=Prs , Number=Plur|POS=ADJ , POS=PUNCT|PunctType=Peri , Number=Sing|POS=PRON|PronType=Rel , Gender=Masc|Number=Sing|POS=NOUN , Mood=Imp|Number=Sing|POS=VERB|Person=2|VerbForm=Fin , Gender=Masc|Number=Plur|POS=ADJ|VerbForm=Part , POS=SCONJ , Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin , Gender=Masc|Number=Plur|POS=VERB|Tense=Past|VerbForm=Part , Definite=Def|Number=Sing|POS=DET|PronType=Art , Gender=Masc|Number=Sing|POS=DET|PronType=Ind , Gender=Fem|Number=Plur|POS=ADJ|VerbForm=Part , Gender=Masc|Number=Sing|POS=DET|PronType=Dem , POS=VERB|VerbForm=Ger , POS=NOUN , Gender=Fem|NumType=Card|Number=Sing|POS=NUM , Gender=Fem|Number=Sing|POS=ADJ|VerbForm=Part , Gender=Fem|NumType=Ord|Number=Plur|POS=ADJ , POS=SYM , Gender=Masc|Number=Sing|POS=ADJ , Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part , Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin , Gender=Fem|Number=Sing|POS=DET|PronType=Dem , POS=ADV|Polarity=Neg , POS=ADV , Number=Sing|POS=PRON|PronType=Dem , Number=Sing|POS=NOUN , Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin , Number=Plur|POS=NOUN , Mood=Sub|Number=Plur|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin , Gender=Fem|Number=Sing|POS=ADJ , Mood=Sub|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin , Gender=Masc|Number=Sing|POS=PRON|PronType=Tot , Case=Loc|POS=PRON|Person=3|PronType=Prs , Gender=Fem|NumType=Ord|Number=Sing|POS=ADJ , Degree=Cmp|POS=ADV , Gender=Fem|Number=Plur|POS=DET|PronType=Art , Gender=Fem|Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs , Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin , Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ , Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Fut|VerbForm=Fin , NumType=Card|POS=NUM , Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin , Number=Sing|POS=PRON|PronType=Ind , Gender=Masc|Number=Sing|POS=DET|PronType=Art , Number=Plur|POS=DET|PronType=Ind , Mood=Sub|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin , Gender=Masc|Number=Plur|POS=DET|PronType=Dem , Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Fut|VerbForm=Fin , Gender=Masc|NumType=Card|Number=Sing|POS=NUM , Mood=Sub|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin , Case=Acc|Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Prs , Number=Sing|POS=DET|PronType=Ind , POS=PUNCT , Number=Sing|POS=DET|PronType=Rel , Case=Gen|POS=PRON|Person=3|PronType=Prs , Gender=Fem|NumType=Card|Number=Plur|POS=NUM , Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin , POS=DET|PronType=Ind , POS=AUX , Case=Acc|Gender=Neut|Number=Sing|POS=PRON|Person=3|PronType=Prs , Case=Acc,Dat|Number=Plur|POS=PRON|Person=1|PronType=Prs , Degree=Cmp|Number=Sing|POS=ADJ , Number=Sing|POS=VERB , Gender=Masc|Number=Plur|POS=PRON|PronType=Ind , Gender=Fem|Number=Plur|POS=DET|PronType=Dem , Gender=Masc|Number=Plur|POS=DET|PronType=Art , Gender=Masc|Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs , Case=Acc|Gender=Fem,Masc|Number=Sing|POS=PRON|Person=3|PronType=Prs , Gender=Fem|Number=Sing|POS=VERB|Tense=Past|VerbForm=Part , Gender=Masc|Number=Sing|POS=PRON|PronType=Ind , Gender=Fem|Number=Plur|POS=PRON|PronType=Ind , Mood=Sub|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin , Number=Plur|POS=PRON|PronType=Rel , Gender=Masc|Number=Plur|POS=DET|PronType=Int , Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Imp|VerbForm=Fin , AdvType=Tim|POS=NOUN , Gender=Masc|Number=Plur|POS=DET|PronType=Ind , Gender=Fem|Number=Plur|POS=DET|PronType=Ind , Gender=Masc|Number=Sing|POS=DET|PronType=Int , Mood=Cnd|Number=Sing|POS=AUX|Person=3|VerbForm=Fin , Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin , Number=Sing|POS=DET|PronType=Art , Gender=Masc|Number=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs , Case=Acc|Gender=Masc|Number=Sing|POS=PRON|Person=3|PronType=Prs , Gender=Masc|Number=Sing|POS=PRON|PronType=Int , POS=PUNCT|PunctType=Semi , Mood=Cnd|Number=Plur|POS=AUX|Person=3|VerbForm=Fin , Case=Dat|Number=Sing|POS=PRON|Person=3|PronType=Prs , Gender=Masc|NumType=Card|Number=Plur|POS=NUM , Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Imp|VerbForm=Fin , Gender=Fem|Number=Sing|POS=PRON|PronType=Ind , Mood=Sub|Number=Sing|POS=AUX|Person=3|Tense=Imp|VerbForm=Fin , NumForm=Digit|POS=SYM , Gender=Masc|Number=Sing|POS=AUX|Tense=Past|VerbForm=Part , Gender=Fem|Number=Sing|POS=PRON|PronType=Int , Gender=Fem|Number=Sing|POS=DET|PronType=Int , POS=PRON|PronType=Int , Gender=Fem|Number=Plur|POS=DET|PronType=Int , Mood=Cnd|Number=Sing|POS=VERB|Person=3|VerbForm=Fin , Mood=Cnd|Number=Plur|POS=VERB|Person=3|VerbForm=Fin , POS=PART , Gender=Fem|Number=Sing|POS=PRON|PronType=Dem , Gender=Masc|Number=Sing|POS=DET|PronType=Tot , Gender=Masc|Number=Plur|POS=PRON|PronType=Dem , POS=ADJ , Gender=Masc|Number=Plur|POS=PRON|Person=3|PronType=Prs , Degree=Cmp|Number=Plur|POS=ADJ , POS=PUNCT|PunctType=Dash , Mood=Sub|Number=Sing|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin , Case=Acc|Gender=Fem|Number=Plur|POS=PRON|Person=3|PronType=Prs , Mood=Sub|Number=Sing|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin , Gender=Fem|Number=Plur|POS=VERB|Tense=Past|VerbForm=Part , Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Prs , Gender=Masc|POS=NOUN , Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Past|VerbForm=Fin , Gender=Fem|Number=Plur|POS=PRON|PronType=Int , Gender=Masc|NumType=Ord|Number=Plur|POS=ADJ , Mood=Ind|Number=Plur|POS=AUX|Person=1|Tense=Fut|VerbForm=Fin , POS=PUNCT|PunctType=Colo , Gender=Masc|NumType=Card|POS=NUM , Gender=Masc|Number=Sing|POS=PRON|Person=3|PronType=Prs , Number=Sing|POS=PRON|PronType=Int , POS=PUNCT|PunctType=Quot , Mood=Imp|Number=Sing|POS=VERB|Person=3|VerbForm=Fin , Gender=Fem|Number=Sing|Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs , Gender=Masc|Number=Sing|Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs , Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Fut|VerbForm=Fin , POS=AUX|VerbForm=Ger , Gender=Fem|Number=Plur|POS=PRON|Person=3|PronType=Prs , Mood=Imp|Number=Sing|POS=AUX|Person=3|VerbForm=Fin , Number=Plur|POS=PRON|PronType=Ind , Gender=Masc|Number=Sing|POS=PRON|PronType=Dem , Case=Acc,Dat|Number=Sing|POS=PRON|Person=2|Polite=Infm|PrepCase=Npr|PronType=Prs , Gender=Masc|Number=Plur|POS=PRON|PronType=Int , Mood=Ind|Number=Plur|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin , NumForm=Digit|NumType=Frac|POS=NUM , POS=VERB , Gender=Fem|Number=Plur|POS=PRON|PronType=Dem , Gender=Fem|POS=NOUN , Case=Acc,Dat|Number=Sing|POS=PRON|Person=1|PrepCase=Npr|PronType=Prs , Mood=Sub|Number=Plur|POS=VERB|Person=2|Tense=Pres|VerbForm=Fin , Mood=Ind|Number=Plur|POS=AUX|Person=2|Tense=Fut|VerbForm=Fin , Mood=Sub|Number=Plur|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin , Mood=Sub|Number=Plur|POS=AUX|Person=3|Tense=Imp|VerbForm=Fin , Number=Plur|POS=PRON|Person=1|PronType=Prs , Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin , Case=Nom|Number=Sing|POS=PRON|Person=2|Polite=Infm|PronType=Prs , POS=X , Mood=Cnd|Number=Plur|POS=AUX|Person=1|VerbForm=Fin , Number=Sing|POS=DET|PronType=Dem , POS=DET , Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Fut|VerbForm=Fin , Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin , POS=DET|PronType=Art , Gender=Masc|Number=Sing|POS=PRON|Person=3|Poss=Yes|PronType=Prs , NumType=Ord|Number=Sing|POS=ADJ , Gender=Fem|Number=Sing|POS=AUX|Tense=Past|VerbForm=Part , Number=Plur|Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs , Gender=Fem|Number=Plur|POS=AUX|Tense=Past|VerbForm=Part , Gender=Masc|Number=Plur|POS=AUX|Tense=Past|VerbForm=Part , Number=Plur|POS=PRON|PronType=Dem , Mood=Imp|Number=Plur|POS=VERB|Person=1|VerbForm=Fin , POS=PRON|PronType=Ind , Mood=Ind|Number=Sing|POS=VERB|Person=2|Tense=Pres|VerbForm=Fin , Mood=Imp|Number=Plur|POS=VERB|Person=3|VerbForm=Fin , Case=Nom|Number=Sing|POS=PRON|Person=1|PronType=Prs , Case=Acc|Number=Sing|POS=PRON|Person=1|PrepCase=Pre|PronType=Prs , Mood=Ind|Number=Sing|POS=AUX|Person=2|Tense=Pres|VerbForm=Fin , Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin , POS=PUNCT|PunctSide=Fin|PunctType=Qest , NumForm=Digit|NumType=Ord|POS=ADJ , Case=Acc|POS=PRON|Person=3|PrepCase=Pre|PronType=Prs|Reflex=Yes , NumForm=Digit|NumType=Frac|POS=SYM , Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Pres|VerbForm=Fin , Gender=Masc|Number=Sing|Number[psor]=Sing|POS=DET|Person=2|Poss=Yes|PronType=Prs , Gender=Masc|Number=Plur|POS=PRON|Person=3|Poss=Yes|PronType=Prs , Mood=Sub|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin , POS=PUNCT|PunctSide=Ini|PunctType=Qest , NumType=Card|Number=Sing|POS=NUM , Foreign=Yes|POS=PRON|PronType=Int , Foreign=Yes|Mood=Ind|POS=VERB|VerbForm=Fin , Foreign=Yes|POS=ADP , Gender=Masc|Number=Sing|POS=PROPN , POS=PUNCT|PunctSide=Ini|PunctType=Excl , POS=PUNCT|PunctSide=Fin|PunctType=Excl , Mood=Cnd|Number=Sing|POS=AUX|Person=1|VerbForm=Fin , Number=Plur|POS=PRON|Person=2|Polite=Form|PronType=Prs , Mood=Sub|POS=AUX|Person=1|Tense=Imp|VerbForm=Fin , POS=PUNCT|PunctSide=Ini|PunctType=Comm , POS=PUNCT|PunctSide=Fin|PunctType=Comm , Number=Plur|POS=PRON|Person=2|PronType=Prs , Mood=Ind|Number=Plur|POS=AUX|Person=2|Tense=Pres|VerbForm=Fin , Case=Acc,Dat|Number=Plur|POS=PRON|Person=2|PronType=Prs , Mood=Cnd|Number=Sing|POS=VERB|Person=1|VerbForm=Fin , Mood=Cnd|Number=Plur|POS=VERB|Person=1|VerbForm=Fin , Mood=Ind|Number=Plur|POS=AUX|Person=1|Tense=Imp|VerbForm=Fin , Gender=Masc|Number=Plur|Number[psor]=Sing|POS=DET|Person=1|Poss=Yes|PronType=Prs , Definite=Ind|Gender=Masc|Number=Sing|POS=DET|PronType=Art , Number=Sing|POS=PRON|Person=2|Polite=Form|PronType=Prs , Gender=Masc|Number=Sing|Number[psor]=Sing|POS=DET|Person=1|Poss=Yes|PronType=Prs , Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin , POS=VERB|Tense=Past|VerbForm=Part , Mood=Imp|Number=Plur|POS=AUX|Person=3|VerbForm=Fin , Case=Nom|POS=PRON|Person=3|PronType=Prs , Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Past|VerbForm=Fin , Gender=Fem|Number=Sing|POS=PRON|Person=3|Poss=Yes|PronType=Prs , Gender=Masc|Number=Sing|POS=PRON|PronType=Rel , Definite=Ind|Number=Sing|POS=DET|PronType=Art , Gender=Masc|Number=Sing|Number[psor]=Plur|POS=PRON|Person=1|Poss=Yes|PronType=Prs , Number=Plur|Number[psor]=Plur|POS=PRON|Person=1|Poss=Yes|PronType=Prs , POS=AUX|Tense=Past|VerbForm=Part , Gender=Fem|NumType=Card|POS=NUM , Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Imp|VerbForm=Fin , Mood=Sub|Number=Sing|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin , Gender=Fem|Number=Plur|POS=PRON|Person=3|Poss=Yes|PronType=Prs , Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Fut|VerbForm=Fin , Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Past|VerbForm=Fin , AdvType=Tim|Degree=Cmp|POS=ADV , Case=Acc|Number=Sing|POS=PRON|Person=2|Polite=Infm|PrepCase=Pre|PronType=Prs , POS=DET|PronType=Rel , Definite=Ind|Gender=Fem|Number=Plur|POS=DET|PronType=Art , Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Fut|VerbForm=Fin , POS=INTJ , Mood=Sub|Number=Sing|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin , POS=VERB|VerbForm=Fin , Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Past|VerbForm=Fin , Definite=Ind|Gender=Fem|Number=Sing|POS=DET|PronType=Art , Mood=Sub|Number=Plur|POS=AUX|Person=1|Tense=Imp|VerbForm=Fin , Gender=Fem|Number=Sing|Number[psor]=Sing|POS=PRON|Person=3|Poss=Yes|PronType=Prs , Mood=Sub|Number=Sing|POS=VERB|Person=2|Tense=Pres|VerbForm=Fin , Case=Acc|POS=PRON|Person=3|PronType=Prs|Reflex=Yes , Foreign=Yes|POS=NOUN , Foreign=Yes|Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin , Foreign=Yes|Gender=Masc|Number=Sing|POS=PRON|Person=3|PronType=Prs , Foreign=Yes|POS=SCONJ , Foreign=Yes|Gender=Fem|Number=Sing|POS=DET|PronType=Art , Gender=Masc|POS=SYM , Gender=Fem|Number=Sing|Number[psor]=Sing|POS=DET|Person=2|Poss=Yes|PronType=Prs , Number=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs , Gender=Masc|Number=Plur|Number[psor]=Sing|POS=DET|Person=2|Poss=Yes|PronType=Prs , Gender=Fem|Number=Sing|POS=PROPN , Mood=Sub|Number=Plur|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin , Definite=Def|Foreign=Yes|Gender=Masc|Number=Sing|POS=DET|PronType=Art , Foreign=Yes|POS=VERB , Foreign=Yes|POS=ADJ , Foreign=Yes|POS=DET , Foreign=Yes|POS=ADV , POS=PUNCT|PunctSide=Fin|Punta d'aignctType=Brck , Degree=Cmp|POS=ADJ , AdvType=Tim|POS=SYM , Number=Plur|POS=DET|PronType=Dem , Mood=Ind|Number=Sing|POS=VERB|Person=2|Tense=Fut|VerbForm=Fin
|
parser
| ROOT , acl , advcl , advmod , amod , appos , aux , case , cc , ccomp , compound , conj , cop , csubj , dep , det , expl:pass , fixed , flat , iobj , mark , nmod , nsubj , nummod , obj , obl , parataxis , punct , xcomp
|
senter
| I , S
|
ner
| LOC , MISC , ORG , PER
|
Accuracy
Type | Score |
---|---|
TOKEN_ACC
| 99.97 |
TOKEN_P
| 99.78 |
TOKEN_R
| 99.79 |
TOKEN_F
| 99.79 |
POS_ACC
| 98.53 |
MORPH_ACC
| 98.21 |
MORPH_MICRO_P
| 99.57 |
MORPH_MICRO_R
| 99.07 |
MORPH_MICRO_F
| 99.32 |
SENTS_P
| 99.18 |
SENTS_R
| 99.06 |
SENTS_F
| 99.12 |
DEP_UAS
| 91.70 |
DEP_LAS
| 88.74 |
TAG_ACC
| 98.53 |
LEMMA_ACC
| 97.59 |
ENTS_P
| 85.38 |
ENTS_R
| 84.44 |
ENTS_F
| 84.91 |
Installation
pip install spacy
python -m spacy download ca_core_news_lg