github explosion/spacy-models en_core_web_md-2.3.0

Downloads

Details: https://spacy.io/models/en#en_core_web_md

File checksum: 70a957e8cf71f3d98b387d3ddb91a19b1ec016c26b2e580daee1aa0827d648fb
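
A downloaded archive can be checked against the checksum above. The sketch below assumes the value is a SHA-256 digest of the release archive (its 64 hex characters are consistent with that, but the release notes do not name the algorithm). The helper names `sha256_of_file` and `matches_checksum` are illustrative only.

```python
import hashlib

# Checksum copied from the "File checksum" line above.
# Assumption: it is the SHA-256 digest of the downloaded archive.
EXPECTED_SHA256 = "70a957e8cf71f3d98b387d3ddb91a19b1ec016c26b2e580daee1aa0827d648fb"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected=EXPECTED_SHA256):
    """True if the file's SHA-256 digest equals the expected hex string."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_checksum("en_core_web_md-2.3.0.tar.gz")` would compare a local copy of the archive against the published value. The file name here is an assumed local path.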

English multi-task CNN trained on OntoNotes, with GloVe vectors trained on Common Crawl. Assigns word vectors, POS tags, dependency parse and named entities.

Feature Description
Name en_core_web_md
Version 2.3.0
spaCy >=2.3.0,<2.4.0
Model size 48 MB
Pipeline  tagger, parser, ner
Vectors 684830 keys, 20000 unique vectors (300 dimensions)
Sources OntoNotes 5; GloVe Common Crawl (Jeffrey Pennington, Richard Socher, and Christopher D. Manning)
License MIT
Author Explosion
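
The spaCy constraint in the table (>=2.3.0,<2.4.0) means the model is meant for the 2.3.x series only. As a rough illustration, the range can be checked with plain tuple comparison. The function names below are hypothetical, and the parser only handles plain numeric versions such as "2.3.1", not pre-release or dev suffixes.

```python
def parse_version(text):
    """Turn '2.3.1' into (2, 3, 1). Only plain numeric versions are handled."""
    return tuple(int(part) for part in text.split("."))


def is_compatible(spacy_version, lower="2.3.0", upper="2.4.0"):
    """Check lower <= spacy_version < upper, mirroring '>=2.3.0,<2.4.0'."""
    return parse_version(lower) <= parse_version(spacy_version) < parse_version(upper)
```

With spaCy installed, `is_compatible(spacy.__version__)` would indicate whether the installed release falls inside the range this model declares.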

Label Scheme

Component Labels
tagger  $, '', ,, -LRB-, -RRB-, ., :, ADD, AFX, CC, CD, DT, EX, FW, HYPH, IN, JJ, JJR, JJS, LS, MD, NFP, NN, NNP, NNPS, NNS, PDT, POS, PRP, PRP$, RB, RBR, RBS, RP, SYM, TO, UH, VB, VBD, VBG, VBN, VBP, VBZ, WDT, WP, WP$, WRB, XX, _SP, ``
parser  ROOT, acl, acomp, advcl, advmod, agent, amod, appos, attr, aux, auxpass, case, cc, ccomp, compound, conj, csubj, csubjpass, dative, dep, det, dobj, expl, intj, mark, meta, neg, nmod, npadvmod, nsubj, nsubjpass, nummod, oprd, parataxis, pcomp, pobj, poss, preconj, predet, prep, prt, punct, quantmod, relcl, xcomp
ner  CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type Score
LAS  90.07
UAS  91.89
TOKEN_ACC  99.76
TAGS_ACC  97.19
ENTS_F  85.92
ENTS_P  85.92
ENTS_R  85.92
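
The three entity scores are related: assuming ENTS_F is the usual F1 score, it is the harmonic mean of ENTS_P (precision) and ENTS_R (recall), so equal precision and recall give the same F value. A minimal sketch of that arithmetic, with a hypothetical helper name:

```python
def f_score(precision, recall):
    """Harmonic mean of precision and recall (F1). Returns 0.0 if both are 0."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```

Passing the table's precision and recall (85.92 each) to `f_score` reproduces the reported ENTS_F, up to floating-point rounding.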

Installation

pip install spacy
python -m spacy download en_core_web_md
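
Once installed, the model can be loaded by name. The following is a minimal sketch rather than an official example: the helper names are made up for illustration, and `import spacy` sits inside `annotate` so that the two formatting helpers can be used without spaCy present.

```python
def annotate(text, model_name="en_core_web_md"):
    """Load the model by name and return the processed Doc for `text`."""
    import spacy  # requires a spaCy release within the range the model declares

    nlp = spacy.load(model_name)
    return nlp(text)


def token_rows(doc):
    """(text, fine-grained tag, dependency label, head text) for each token."""
    return [(tok.text, tok.tag_, tok.dep_, tok.head.text) for tok in doc]


def entity_rows(doc):
    """(text, entity label) for each named entity found by the ner component."""
    return [(ent.text, ent.label_) for ent in doc.ents]
```

Calling `annotate` on a sentence and passing the result to `token_rows` and `entity_rows` lists the tagger, parser, and ner output using the labels from the Label Scheme section. Each token's `vector` attribute holds its 300-dimensional word vector.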
