github explosion/spacy-models en_core_web_lg-2.3.1

Downloads

Details: https://spacy.io/models/en#en_core_web_lg

File checksum: 978be907fecfb9c1ab60a80a70fbc51a4ad62db91ebd672c3165f89640f05b4e

English multi-task CNN trained on OntoNotes, with GloVe vectors trained on Common Crawl. Assigns word vectors, POS tags, dependency parses and named entities.

Feature Description
Name en_core_web_lg
Version 2.3.1
spaCy >=2.3.0,<2.4.0
Model size 746 MB
Pipeline  tagger, parser, ner
Vectors 684830 keys, 684831 unique vectors (300 dimensions)
Sources OntoNotes 5
GloVe Common Crawl (Jeffrey Pennington, Richard Socher, and Christopher D. Manning)
License MIT
Author Explosion

Label Scheme

Component Labels
tagger  $, '', ,, -LRB-, -RRB-, ., :, ADD, AFX, CC, CD, DT, EX, FW, HYPH, IN, JJ, JJR, JJS, LS, MD, NFP, NN, NNP, NNPS, NNS, PDT, POS, PRP, PRP$, RB, RBR, RBS, RP, SYM, TO, UH, VB, VBD, VBG, VBN, VBP, VBZ, WDT, WP, WP$, WRB, XX, _SP, ````
parser  ROOT, acl, acomp, advcl, advmod, agent, amod, appos, attr, aux, auxpass, case, cc, ccomp, compound, conj, csubj, csubjpass, dative, dep, det, dobj, expl, intj, mark, meta, neg, nmod, npadvmod, nsubj, nsubjpass, nummod, oprd, parataxis, pcomp, pobj, poss, preconj, predet, prep, prt, punct, quantmod, relcl, xcomp
ner  CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type Score
LAS  90.28
UAS  92.09
TOKEN_ACC  99.76
TAGS_ACC  97.22
ENTS_F  86.40
ENTS_P  86.43
ENTS_R  86.37

Installation

pip install spacy
python -m spacy download en_core_web_lg

Don't miss a new spacy-models release

NewReleases is sending notifications on new releases.