Details: https://spacy.io/models/fr#fr_core_news_md
File checksum:
ab81567673eba7cd5a332ceaea9f5e152fcb9afeb51dd9d25d83293d4510d311
French multi-task CNN trained on UD French Sequoia and WikiNER. Assigns word vectors, POS tags, dependency parse and named entities. Word vectors trained using FastText CBOW on Wikipedia and OSCAR (Common Crawl).
Feature | Description |
---|---|
Name | fr_core_news_md
|
Version | 2.3.0
|
spaCy | >=2.3.0,<2.4.0
|
Model size | 43 MB |
Pipeline | tagger , parser , ner
|
Vectors | 500000 keys, 20000 unique vectors (300 dimensions) |
Sources | UD French Sequoia v2.5 (Candito, Marie; Seddah, Djamé; Perrier, Guy; Guillaume, Bruno) WikiNER OSCAR (Common Crawl) Wikipedia (20200301) |
License | LGPL
|
Author | Explosion |
Label Scheme
Component | Labels |
---|---|
tagger
| ADJ , ADJ__Gender=Fem|Number=Plur , ADJ__Gender=Fem|Number=Plur|NumType=Ord , ADJ__Gender=Fem|Number=Sing , ADJ__Gender=Fem|Number=Sing|NumType=Ord , ADJ__Gender=Masc , ADJ__Gender=Masc|Number=Plur , ADJ__Gender=Masc|Number=Plur|NumType=Ord , ADJ__Gender=Masc|Number=Sing , ADJ__Gender=Masc|Number=Sing|NumType=Ord , ADJ__NumType=Ord , ADJ__Number=Plur , ADJ__Number=Sing , ADJ__Number=Sing|NumType=Ord , ADP , ADP_DET__Definite=Def|Gender=Masc|Number=Sing|PronType=Art , ADP_DET__Definite=Def|Number=Plur|PronType=Art , ADP_PRON__Gender=Fem|Number=Plur , ADP_PRON__Gender=Masc|Number=Plur , ADP_PRON__Gender=Masc|Number=Sing , ADV , ADV__Gender=Fem , ADV__Polarity=Neg , ADV__PronType=Int , AUX__Gender=Masc|Number=Sing|Tense=Past|VerbForm=Part , AUX__Mood=Cnd|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin , AUX__Mood=Cnd|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin , AUX__Mood=Cnd|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin , AUX__Mood=Ind|Number=Plur|Person=1|Tense=Fut|VerbForm=Fin , AUX__Mood=Ind|Number=Plur|Person=1|Tense=Imp|VerbForm=Fin , AUX__Mood=Ind|Number=Plur|Person=1|Tense=Pres|VerbForm=Fin , AUX__Mood=Ind|Number=Plur|Person=2|Tense=Pres|VerbForm=Fin , AUX__Mood=Ind|Number=Plur|Person=3|Tense=Fut|VerbForm=Fin , AUX__Mood=Ind|Number=Plur|Person=3|Tense=Imp|VerbForm=Fin , AUX__Mood=Ind|Number=Plur|Person=3|Tense=Past|VerbForm=Fin , AUX__Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin , AUX__Mood=Ind|Number=Sing|Person=1|Tense=Imp|VerbForm=Fin , AUX__Mood=Ind|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin , AUX__Mood=Ind|Number=Sing|Person=2|Tense=Imp|VerbForm=Fin , AUX__Mood=Ind|Number=Sing|Person=3|Tense=Fut|VerbForm=Fin , AUX__Mood=Ind|Number=Sing|Person=3|Tense=Imp|VerbForm=Fin , AUX__Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin , AUX__Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin , AUX__Mood=Sub|Number=Plur|Person=1|Tense=Pres|VerbForm=Fin , AUX__Mood=Sub|Number=Plur|Person=2|Tense=Pres|VerbForm=Fin , AUX__Mood=Sub|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin , AUX__Mood=Sub|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin , AUX__Mood=Sub|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin , AUX__Tense=Past|VerbForm=Part , AUX__Tense=Pres|VerbForm=Part , AUX__VerbForm=Inf , CCONJ , DET , DET__Definite=Def|Gender=Fem|Number=Sing|PronType=Art , DET__Definite=Def|Gender=Masc|Number=Sing|PronType=Art , DET__Definite=Def|Number=Plur|PronType=Art , DET__Definite=Def|Number=Sing|PronType=Art , DET__Definite=Ind|Gender=Fem|Number=Plur|PronType=Art , DET__Definite=Ind|Gender=Fem|Number=Sing|PronType=Art , DET__Definite=Ind|Gender=Masc|Number=Plur|PronType=Art , DET__Definite=Ind|Gender=Masc|Number=Sing|PronType=Art , DET__Definite=Ind|Number=Plur|PronType=Art , DET__Definite=Ind|Number=Sing|PronType=Art , DET__Gender=Fem|Number=Plur , DET__Gender=Fem|Number=Plur|PronType=Int , DET__Gender=Fem|Number=Sing , DET__Gender=Fem|Number=Sing|Poss=Yes , DET__Gender=Fem|Number=Sing|PronType=Dem , DET__Gender=Fem|Number=Sing|PronType=Int , DET__Gender=Masc|Number=Plur , DET__Gender=Masc|Number=Sing , DET__Gender=Masc|Number=Sing|PronType=Dem , DET__Gender=Masc|Number=Sing|PronType=Int , DET__Number=Plur , DET__Number=Plur|Poss=Yes , DET__Number=Plur|PronType=Dem , DET__Number=Sing , DET__Number=Sing|Poss=Yes , INTJ , NOUN , NOUN__Gender=Fem , NOUN__Gender=Fem|Number=Plur , NOUN__Gender=Fem|Number=Sing , NOUN__Gender=Masc , NOUN__Gender=Masc|Number=Plur , NOUN__Gender=Masc|Number=Plur|NumType=Card , NOUN__Gender=Masc|Number=Sing , NOUN__Gender=Masc|Number=Sing|NumType=Card , NOUN__NumType=Card , NOUN__Number=Plur , NOUN__Number=Sing , NUM , NUM__Gender=Masc|NumType=Card , NUM__NumType=Card , PART , PRON , PRON__Gender=Fem , PRON__Gender=Fem|Number=Plur , PRON__Gender=Fem|Number=Plur|Person=3 , PRON__Gender=Fem|Number=Plur|Person=3|PronType=Prs , PRON__Gender=Fem|Number=Plur|PronType=Dem , PRON__Gender=Fem|Number=Plur|PronType=Rel , PRON__Gender=Fem|Number=Sing , PRON__Gender=Fem|Number=Sing|Person=3 , PRON__Gender=Fem|Number=Sing|Person=3|PronType=Prs , PRON__Gender=Fem|Number=Sing|PronType=Dem , PRON__Gender=Fem|Number=Sing|PronType=Rel , PRON__Gender=Masc , PRON__Gender=Masc|Number=Plur , PRON__Gender=Masc|Number=Plur|Person=3 , PRON__Gender=Masc|Number=Plur|Person=3|PronType=Prs , PRON__Gender=Masc|Number=Plur|PronType=Dem , PRON__Gender=Masc|Number=Plur|PronType=Rel , PRON__Gender=Masc|Number=Sing , PRON__Gender=Masc|Number=Sing|Person=3 , PRON__Gender=Masc|Number=Sing|Person=3|PronType=Dem , PRON__Gender=Masc|Number=Sing|Person=3|PronType=Prs , PRON__Gender=Masc|Number=Sing|PronType=Dem , PRON__Gender=Masc|Number=Sing|PronType=Rel , PRON__NumType=Card , PRON__Number=Plur , PRON__Number=Plur|Person=1 , PRON__Number=Plur|Person=1|PronType=Prs , PRON__Number=Plur|Person=1|Reflex=Yes , PRON__Number=Plur|Person=2 , PRON__Number=Plur|Person=2|PronType=Prs , PRON__Number=Plur|Person=2|Reflex=Yes , PRON__Number=Plur|Person=3 , PRON__Number=Sing , PRON__Number=Sing|Person=1 , PRON__Number=Sing|Person=1|PronType=Prs , PRON__Number=Sing|Person=1|Reflex=Yes , PRON__Number=Sing|Person=2|PronType=Prs , PRON__Number=Sing|Person=3 , PRON__Number=Sing|PronType=Dem , PRON__Person=3 , PRON__Person=3|Reflex=Yes , PRON__PronType=Int , PRON__PronType=Rel , PROPN , PROPN__Gender=Fem|Number=Plur , PROPN__Gender=Fem|Number=Sing , PROPN__Gender=Masc , PROPN__Gender=Masc|Number=Plur , PROPN__Gender=Masc|Number=Sing , PROPN__Number=Plur , PROPN__Number=Sing , PUNCT , SCONJ , SYM , VERB__Gender=Fem|Number=Plur|Tense=Past|VerbForm=Part , VERB__Gender=Fem|Number=Plur|Tense=Past|VerbForm=Part|Voice=Pass , VERB__Gender=Fem|Number=Sing|Tense=Past|VerbForm=Part , VERB__Gender=Fem|Number=Sing|Tense=Past|VerbForm=Part|Voice=Pass , VERB__Gender=Masc|Number=Plur|Tense=Past|VerbForm=Part , VERB__Gender=Masc|Number=Plur|Tense=Past|VerbForm=Part|Voice=Pass , VERB__Gender=Masc|Number=Sing|Tense=Past|VerbForm=Part , VERB__Gender=Masc|Number=Sing|Tense=Past|VerbForm=Part|Voice=Pass , VERB__Gender=Masc|Tense=Past|VerbForm=Part , VERB__Gender=Masc|Tense=Past|VerbForm=Part|Voice=Pass , VERB__Mood=Cnd|Number=Plur|Person=1|Tense=Pres|VerbForm=Fin , VERB__Mood=Cnd|Number=Plur|Person=2|Tense=Pres|VerbForm=Fin , VERB__Mood=Cnd|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin , VERB__Mood=Cnd|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin , VERB__Mood=Cnd|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin , VERB__Mood=Imp|Number=Plur|Person=1|Tense=Pres|VerbForm=Fin , VERB__Mood=Imp|Number=Plur|Person=2|Tense=Pres|VerbForm=Fin , VERB__Mood=Imp|Tense=Pres|VerbForm=Fin , VERB__Mood=Ind|Number=Plur|Person=1|Tense=Fut|VerbForm=Fin , VERB__Mood=Ind|Number=Plur|Person=1|Tense=Imp|VerbForm=Fin , VERB__Mood=Ind|Number=Plur|Person=1|Tense=Pres|VerbForm=Fin , VERB__Mood=Ind|Number=Plur|Person=2|Tense=Fut|VerbForm=Fin , VERB__Mood=Ind|Number=Plur|Person=2|Tense=Imp|VerbForm=Fin , VERB__Mood=Ind|Number=Plur|Person=2|Tense=Pres|VerbForm=Fin , VERB__Mood=Ind|Number=Plur|Person=3|Tense=Fut|VerbForm=Fin , VERB__Mood=Ind|Number=Plur|Person=3|Tense=Imp|VerbForm=Fin , VERB__Mood=Ind|Number=Plur|Person=3|Tense=Past|VerbForm=Fin , VERB__Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin , VERB__Mood=Ind|Number=Sing|Person=1|Tense=Fut|VerbForm=Fin , VERB__Mood=Ind|Number=Sing|Person=1|Tense=Imp|VerbForm=Fin , VERB__Mood=Ind|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin , VERB__Mood=Ind|Number=Sing|Person=3|Tense=Fut|VerbForm=Fin , VERB__Mood=Ind|Number=Sing|Person=3|Tense=Imp|VerbForm=Fin , VERB__Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin , VERB__Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin , VERB__Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin , VERB__Mood=Ind|Person=3|VerbForm=Fin , VERB__Mood=Ind|VerbForm=Fin , VERB__Mood=Sub|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin , VERB__Mood=Sub|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin , VERB__Mood=Sub|Number=Sing|Person=3|Tense=Past|VerbForm=Fin , VERB__Mood=Sub|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin , VERB__Number=Plur|Tense=Past|VerbForm=Part , VERB__Number=Plur|Tense=Past|VerbForm=Part|Voice=Pass , VERB__Number=Sing|Tense=Past|VerbForm=Part , VERB__Number=Sing|Tense=Past|VerbForm=Part|Voice=Pass , VERB__Tense=Past|VerbForm=Part , VERB__Tense=Past|VerbForm=Part|Voice=Pass , VERB__Tense=Pres|VerbForm=Part , VERB__VerbForm=Inf , X , _SP
|
parser
| ROOT , acl , acl:relcl , advcl , advmod , amod , appos , aux:pass , aux:tense , case , cc , ccomp , conj , cop , dep , det , expl:comp , expl:pass , expl:subj , fixed , flat:foreign , flat:name , iobj , mark , nmod , nsubj , nsubj:pass , nummod , obj , obl:agent , obl:arg , obl:mod , parataxis , punct , vocative , xcomp
|
ner
| LOC , MISC , ORG , PER
|
Accuracy
Type | Score |
---|---|
LAS
| 85.44 |
UAS
| 88.90 |
TOKEN_ACC
| 98.52 |
TAGS_ACC
| 95.72 |
ENTS_F
| 84.67 |
ENTS_P
| 84.92 |
ENTS_R
| 84.43 |
Because the model is trained on Wikipedia, it may perform inconsistently on many genres, such as social media text. The NER accuracy refers to the "silver standard" annotations in the WikiNER corpus. Accuracy on these annotations tends to be higher than correct human annotations.
Installation
pip install spacy
python -m spacy download fr_core_news_md