github explosion/spacy-models en_core_web_lg-2.3.0

Downloads

Details: https://spacy.io/models/en#en_core_web_lg

File checksum: c2c532aad85495bd5708ec077ceb3700794cd64b88480f654942f9303e2fca55
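
A downloaded archive can be checked against the checksum above before installing it. The sketch below is a minimal illustration, not part of the release: it assumes the checksum is SHA-256 (the 64 hexadecimal digits suggest this, but the release notes do not name the algorithm), and the helper names `sha256_of` and `matches_checksum` are hypothetical.

```python
import hashlib

# Checksum as published in the release notes (assumed to be SHA-256).
EXPECTED_CHECKSUM = "c2c532aad85495bd5708ec077ceb3700794cd64b88480f654942f9303e2fca55"


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks to limit memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected=EXPECTED_CHECKSUM):
    """Return True if the file's digest equals the expected checksum."""
    return sha256_of(path) == expected.lower()
```

Calling `matches_checksum` with the local path of the downloaded archive returns `True` only when the file is byte-for-byte identical to the one that was hashed.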

English multi-task CNN trained on OntoNotes, with GloVe vectors trained on Common Crawl. Assigns word vectors, POS tags, dependency parses and named entities.

Feature     Description
Name        en_core_web_lg
Version     2.3.0
spaCy       >=2.3.0,<2.4.0
Model size  746 MB
Pipeline    tagger, parser, ner
Vectors     684830 keys, 684831 unique vectors (300 dimensions)
Sources     OntoNotes 5
            GloVe Common Crawl (Jeffrey Pennington, Richard Socher, and Christopher D. Manning)
License     MIT
Author      Explosion
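
The spaCy requirement `>=2.3.0,<2.4.0` means the model is intended for the 2.3.x series only. As a rough illustration of how that range reads, the sketch below compares plain numeric version strings; the helper names are hypothetical, and it deliberately ignores pre-release or development suffixes, which a real version parser would handle.

```python
def parse_version(text):
    """Turn a plain 'X.Y.Z' string into a tuple of integers for comparison."""
    return tuple(int(part) for part in text.split("."))


def is_compatible(spacy_version, lower="2.3.0", upper="2.4.0"):
    """Return True if lower <= spacy_version < upper (numeric versions only)."""
    return parse_version(lower) <= parse_version(spacy_version) < parse_version(upper)


for candidate in ("2.2.4", "2.3.0", "2.3.5", "2.4.0"):
    print(candidate, is_compatible(candidate))
```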

Label Scheme

Component Labels
tagger  $, '', ,, -LRB-, -RRB-, ., :, ADD, AFX, CC, CD, DT, EX, FW, HYPH, IN, JJ, JJR, JJS, LS, MD, NFP, NN, NNP, NNPS, NNS, PDT, POS, PRP, PRP$, RB, RBR, RBS, RP, SYM, TO, UH, VB, VBD, VBG, VBN, VBP, VBZ, WDT, WP, WP$, WRB, XX, _SP, ``
parser  ROOT, acl, acomp, advcl, advmod, agent, amod, appos, attr, aux, auxpass, case, cc, ccomp, compound, conj, csubj, csubjpass, dative, dep, det, dobj, expl, intj, mark, meta, neg, nmod, npadvmod, nsubj, nsubjpass, nummod, oprd, parataxis, pcomp, pobj, poss, preconj, predet, prep, prt, punct, quantmod, relcl, xcomp
ner  CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type       Score
LAS        90.27
UAS        92.07
TOKEN_ACC  99.76
TAGS_ACC   97.24
ENTS_F     86.52
ENTS_P     86.48
ENTS_R     86.56
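
The three entity scores are related: assuming ENTS_F is the standard F1 measure (the harmonic mean of precision and recall), it can be recomputed from ENTS_P and ENTS_R. The short check below is a sketch under that assumption; the function name `f1_score` is illustrative.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# ENTS_P and ENTS_R from the accuracy table, in percent.
ents_f = f1_score(86.48, 86.56)
print(round(ents_f, 2))  # 86.52, matching the reported ENTS_F
```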

Installation

pip install spacy
python -m spacy download en_core_web_lg
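
Once both commands have completed, the model can be loaded by name. The following is a minimal usage sketch, assuming spaCy 2.3.x and the model package are installed in the current environment; the function name `analyze` and the sample sentence are illustrative only, and the call is skipped when either package is missing.

```python
import importlib.util


def analyze(text, model_name="en_core_web_lg"):
    """Load the model and return (tokens, entities) for the given text."""
    import spacy  # imported lazily so this file can be read without spaCy installed

    nlp = spacy.load(model_name)
    doc = nlp(text)
    # Fine-grained POS tag, dependency label, and vector availability per token.
    tokens = [(tok.text, tok.tag_, tok.dep_, tok.has_vector) for tok in doc]
    # Named entities with their labels from the ner component.
    entities = [(ent.text, ent.label_) for ent in doc.ents]
    return tokens, entities


# Only run the demo when spaCy and the model package are actually available.
if importlib.util.find_spec("spacy") and importlib.util.find_spec("en_core_web_lg"):
    tokens, entities = analyze("Apple is looking at buying a U.K. startup.")
    for row in tokens:
        print(row)
    print(entities)
```

Loading a model of this size takes noticeable time and memory, so in practice the `nlp` object is usually created once and reused across texts.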
