github explosion/spacy-models en_core_web_md-2.3.1

Downloads

Details: https://spacy.io/models/en#en_core_web_md

File checksum: 904c7b8ac18b045898d34195c9dc8aee3b94ca019d25cd20eab3a94d8aa2e515

English multi-task CNN trained on OntoNotes, with GloVe vectors trained on Common Crawl. Assigns word vectors, POS tags, dependency parse and named entities.

Feature Description
Name en_core_web_md
Version 2.3.1
spaCy >=2.3.0,<2.4.0
Model size 48 MB
Pipeline  tagger, parser, ner
Vectors 684830 keys, 20000 unique vectors (300 dimensions)
Sources OntoNotes 5
GloVe Common Crawl (Jeffrey Pennington, Richard Socher, and Christopher D. Manning)
License MIT
Author Explosion

Label Scheme

Component Labels
tagger  $, '', ,, -LRB-, -RRB-, ., :, ADD, AFX, CC, CD, DT, EX, FW, HYPH, IN, JJ, JJR, JJS, LS, MD, NFP, NN, NNP, NNPS, NNS, PDT, POS, PRP, PRP$, RB, RBR, RBS, RP, SYM, TO, UH, VB, VBD, VBG, VBN, VBP, VBZ, WDT, WP, WP$, WRB, XX, _SP, ````
parser  ROOT, acl, acomp, advcl, advmod, agent, amod, appos, attr, aux, auxpass, case, cc, ccomp, compound, conj, csubj, csubjpass, dative, dep, det, dobj, expl, intj, mark, meta, neg, nmod, npadvmod, nsubj, nsubjpass, nummod, oprd, parataxis, pcomp, pobj, poss, preconj, predet, prep, prt, punct, quantmod, relcl, xcomp
ner  CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type Score
LAS  90.06
UAS  91.88
TOKEN_ACC  99.76
TAGS_ACC  97.21
ENTS_F  86.20
ENTS_P  86.27
ENTS_R  86.13

Installation

pip install spacy
python -m spacy download en_core_web_md

Don't miss a new spacy-models release

NewReleases is sending notifications on new releases.