Skip to content

Releases: explosion/spacy-models

fi_core_news_lg-3.8.0

30 Sep 10:00
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: c32f5b26d64b589d998f4b03eb4ca3b7fe189deccee441c154ef3a75a0a2011b
Checksum .whl: 203c224d2684788ec13e5540b59d6e56684e9a349d7181eb2f9898704b026deb

Details: https://spacy.io/models/fi#fi_core_news_lg

Finnish pipeline optimized for CPU. Components: tok2vec, tagger, morphologizer, parser, lemmatizer (trainable_lemmatizer), senter, ner.

Feature Description
Name fi_core_news_lg
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline tok2vec, tagger, morphologizer, parser, lemmatizer, attribute_ruler, ner
Components tok2vec, tagger, morphologizer, parser, lemmatizer, senter, attribute_ruler, ner
Vectors floret (200000, 300)
Sources UD Finnish TDT v2.8 (Ginter, Filip; Kanerva, Jenna; Laippala, Veronika; Miekka, Niko; Missilä, Anna; Ojala, Stina; Pyysalo, Sampo)
TurkuONE (ffe2040e) (Jouni Luoma, Li-Hsin Chang, Filip Ginter, Sampo Pyysalo)
Explosion Vectors (OSCAR 2109 + Wikipedia + OpenSubtitles + WMT News Crawl) (Explosion)
License CC BY-SA 4.0
Author Explosion
Model size 220 MB

Label Scheme

View label scheme (2145 labels for 4 components)
Component Labels
tagger A, Adj, Adp, Adv, Adv_V, C, C_V, Foreign, Interj, N, Num, Pron, Punct, Symb, V, V_Pron, _SP
morphologizer Case=Nom|Number=Sing|POS=NOUN, NumType=Ord|POS=ADJ, Case=Ade|Number=Sing|POS=NOUN, Case=Nom|Derivation=U|Number=Sing|POS=NOUN, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act, POS=ADV, Case=Par|Degree=Pos|Number=Plur|POS=ADJ, POS=CCONJ, Case=Par|Degree=Pos|Derivation=Inen|Number=Plur|POS=ADJ, Case=Par|Number=Plur|POS=NOUN, Case=Ill|Number=Sing|POS=NOUN, POS=PUNCT, Case=Nom|Degree=Pos|Derivation=Lainen|Number=Sing|POS=ADJ, POS=SCONJ, Case=Nom|Number=Sing|Number[psor]=Plur|POS=NOUN|Person[psor]=1, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Past|VerbForm=Fin|Voice=Act, Case=Acc|Number=Plur|POS=PRON|Person=1|PronType=Prs, Case=Gen|Number=Sing|POS=NOUN, Case=Abl|Degree=Pos|Derivation=Lainen|Number=Sing|POS=ADJ, Clitic=Kaan|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Past|VerbForm=Fin|Voice=Act, Mood=Ind|Number=Sing|POS=VERB|Person=0|Tense=Past|VerbForm=Fin|Voice=Act, Case=Nom|Derivation=Lainen|Number=Sing|POS=ADJ, Case=Nom|Number=Sing|POS=PROPN, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act, Case=Nom|Number=Sing|POS=PRON|PronType=Dem, Clitic=Kin|POS=ADV, Case=Gen|Number=Plur|POS=PROPN, Case=Ess|Number=Sing|POS=NOUN, Case=Ill|Number=Sing|POS=PRON|Person=1|PronType=Prs, Case=Gen|Degree=Pos|Number=Sing|POS=ADJ, Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act, Case=Gen|Number=Sing|POS=PRON|PronType=Dem, Case=Ela|Derivation=Llinen,Vs|Number=Sing|POS=NOUN, POS=ADJ, Case=Gen|Number=Plur|POS=NOUN, Case=Par|Number=Sing|POS=PRON|PronType=Dem, Number=Sing|POS=AUX|Person=3|Polarity=Neg|VerbForm=Fin|Voice=Act, Case=Ine|Number=Sing|POS=PRON|PronType=Ind, Case=Ine|Number=Sing|POS=NOUN, Case=Nom|Degree=Pos|Number=Sing|POS=VERB|PartForm=Past|VerbForm=Part|Voice=Pass, Case=Ade|Number=Sing|POS=PRON|PronType=Ind, Case=Ins|Number=Plur|POS=NOUN, Case=Gen|Number=Sing|POS=PROPN, Case=Par|Number=Sing|POS=NOUN, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Past|VerbForm=Fin|Voice=Act, Case=Nom|Degree=Pos|Number=Sing|POS=ADJ, Case=Nom|Number=Plur|POS=NOUN, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Past|VerbForm=Fin|Voice=Act, Case=All|Number=Sing|POS=PRON|PronType=Dem, Case=Ill|InfForm=3|Number=Sing|POS=VERB|VerbForm=Inf|Voice=Act, Case=Nom|Clitic=Kin|Number=Plur|POS=PRON|Person=1|PronType=Prs, Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Past|VerbForm=Fin|Voice=Act, Case=Gen|Number=Sing|POS=NOUN|Style=Coll, Case=All|Derivation=U|Number=Sing|POS=NOUN, AdpType=Post|POS=ADP, Case=Nom|Degree=Pos|Derivation=Llinen|Number=Sing|POS=ADJ, Case=Gen|Number=Sing|POS=PRON|PronType=Rcp, Case=Abl|Number=Sing|POS=NOUN, Case=All|Number=Sing|POS=PRON|PronType=Rcp, Case=Ine|InfForm=3|Number=Sing|POS=VERB|VerbForm=Inf|Voice=Act, Case=Par|Number=Plur|POS=PRON|PronType=Ind, Case=Par|Derivation=Ja|Number=Plur|POS=NOUN, Case=Gen|Derivation=Vs|Number=Sing|POS=NOUN, Case=Par|Number=Sing|POS=PRON|PronType=Ind, Case=Par|Derivation=Ja|Number=Sing|POS=NOUN, Case=Nom|Degree=Pos|Derivation=Inen|Number=Sing|POS=ADJ, Case=Tra|Number=Sing|POS=NOUN, Case=Ela|Number=Sing|POS=NOUN, Case=Nom|Degree=Pos|Number=Sing|POS=VERB|PartForm=Past|VerbForm=Part|Voice=Act, Case=Par|Degree=Pos|Number=Sing|POS=ADJ, Case=Par|Clitic=Kin|Number=Sing|POS=NOUN, InfForm=1|Number=Sing|POS=VERB|VerbForm=Inf|Voice=Act, Case=Nom|Derivation=Ja|Number=Sing|POS=NOUN, Case=Ela|Number=Sing|Number[psor]=Sing|POS=NOUN|Person[psor]=1, Case=Ine|Number=Sing|POS=NOUN|Person[psor]=3, InfForm=1|Number=Sing|POS=AUX|VerbForm=Inf|Voice=Act, Derivation=Sti|POS=ADV, Mood=Cnd|Number=Sing|POS=AUX|Person=3|VerbForm=Fin|Voice=Act, Case=Ill|Number=Sing|POS=PRON|PronType=Int, Mood=Ind|Number=Sing|POS=VERB|Person=0|Tense=Pres|VerbForm=Fin|Voice=Act, Case=Ill|Number=Plur|POS=NOUN, Case=Par|Degree=Pos|Number=Plur|POS=VERB|PartForm=Pres|VerbForm=Part|Voice=Act, Case=Nom|Degree=Pos|Number=Sing|POS=VERB|PartForm=Agt|VerbForm=Part|Voice=Act, Case=Nom|Number=Plur|POS=NOUN|Person[psor]=3, Case=Par|Number=Sing|POS=PRON|PronType=Rel, Case=Ine|Clitic=Kin|Number=Plur|POS=NOUN, Mood=Ind|POS=VERB|Tense=Pres|VerbForm=Fin|Voice=Pass, Case=Gen|Number=Sing|POS=PRON|PronType=Ind, Case=Gen|NumType=Card|Number=Sing|POS=NUM, Case=All|Number=Sing|POS=NOUN, Case=Nom|Number=Sing|POS=PRON|PronType=Ind, Case=Nom|Number=Sing|POS=PRON|PronType=Rel, Case=Ill|Number=Sing|POS=NOUN|Person[psor]=3, Case=Par|Degree=Pos|Derivation=Inen|Number=Sing|POS=ADJ, Case=Gen|Degree=Pos|Derivation=Lainen|Number=Sing|POS=ADJ, Case=Gen|Derivation=Inen|NumType=Ord|Number=Sing|POS=ADJ, Case=Nom|Degree=Pos|Number=Sing|POS=VERB|PartForm=Pres|VerbForm=Part|Voice=Act, Case=Gen|Degree=Pos|Number=Sing|POS=AUX|PartForm=Pres|VerbForm=Part|Voice=Act, Case=Nom|Derivation=Ja|Number=Plur|POS=NOUN|Typo=Yes, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act, Case=Par|Number=Sing|POS=PRON|Person[psor]=3|Reflex=Yes, Case=All|Degree=Pos|Derivation=Inen|Number=Plur|POS=ADJ, Case=All|Degree=Pos|Number=Plur|POS=ADJ, Case=All|Number=Plur|POS=NOUN, Case=Ela|Derivation=U|Number=Plur|POS=NOUN, Case=Nom|Degree=Pos|Number=Sing|POS=VERB|PartForm=Pres|VerbForm=Part|Voice=Pass, Case=Nom|Degree=Pos|Number=Sing|POS=VERB|PartForm=Past|Typo=Yes|VerbForm=Part|Voice=Act, Case=Nom|Clitic=Kaan|Number=Sing|POS=NOUN, Foreign=Yes|POS=X, Clitic=Ka|Number=Sing|POS=AUX|Person=3|Polarity=Neg|VerbForm=Fin|Voice=Act, Case=Ela|Degree=Pos|Number=Sing|POS=ADJ, Connegative=Yes|Mood=Ind|POS=VERB|Tense=Pres|VerbForm=Fin, Case=Tra|Degree=Pos|Derivation=Inen|Number=Sing|POS=ADJ, Mood=Cnd|Number=Sing|POS=AUX|Person=0|VerbForm=Fin|Voice=Act, Case=Nom|Degree=Cmp|Number=Sing|POS=ADJ, Case=Nom|Number=Sing|POS=PRON|Person=1|PronType=Prs, Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act, Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Past|VerbForm=Fin|Voice=Act, Case=Ade|Number=Sing|POS=PRON|PronType=Rel, Mood=Ind|POS=VERB|Tense=Past|VerbForm=Fin|Voice=Pass, Case=All|Number=Sing|POS=PRON|PronType=Ind, Case=All|Number=Plur|Number[psor]=Sing|POS=NOUN|Person[psor]=1, Case=Nom|Number=Plur|POS=PRON|PronType=Ind, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Past|VerbForm=Fin|Voice=Act, Case=Nom|Number=Plur|POS=PRON|Person=3|PronType=Prs, Clitic=Kin|Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Past|VerbForm=Fin|Voice=Act, Case=Nom|Degree=Pos|Number=Plur|POS=VERB|PartForm=Past|VerbForm=Part|Voice=Act, Case=Par|Derivation=Vs|Number=Sing|POS=NOUN, Case=Gen|Number=Sing|Number[psor]=Sing|POS=NOUN|Person[psor]=1, Case=Gen|Degree=Pos|Number=Sing|POS=VERB|PartForm=Pres|VerbForm=Part|Voice=Act, Case=Nom|Number=Sing|Number[psor]=Sing|POS=NOUN|Person[psor]=1, Case=Ill|Derivation=Ja|Number=Sing|Number[psor]=Sing|POS=NOUN|Person[psor]=1, Mood=Cnd|Number=Plur|POS=AUX|Person=3|VerbForm=Fin|Voice=Act, Case=Ine|Number=Sing|POS=PRON|PronType=Dem, Case=Ine|Number=Sing|POS=PROPN, Mood=Ind|Number=Sing|POS=AUX|Person=0|Tense=Pres|VerbForm=Fin|Voice=Act, Case=Nom|Number=Sing|POS=PRON, Case=Nom|Derivation=Inen|NumType=Ord|Number=Sing|POS=ADJ, `Case=Nom|Number=Sing|POS=PRON|...
Read more

es_dep_news_trf-3.8.0

30 Sep 10:00
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: e344d0c64dea4b7eae0c20b5ebf8367c184ff9ccdb92c92bef598641ea8acb75
Checksum .whl: afe76019ca75827db18f14f282f4678b1544277b29dcba8011b400837b19dcb4

Details: https://spacy.io/models/es#es_dep_news_trf

Spanish transformer pipeline (Transformer(name='dccuchile/bert-base-spanish-wwm-cased', piece_encoder='bert-wordpiece', stride=112, type='bert', width=768, window=158, vocab_size=31002)). Components: transformer, morphologizer, parser, attribute_ruler, lemmatizer.

Feature Description
Name es_dep_news_trf
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline transformer, morphologizer, parser, attribute_ruler, lemmatizer
Components transformer, morphologizer, parser, attribute_ruler, lemmatizer
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources UD Spanish AnCora v2.8 (Martínez Alonso, Héctor; Zeman, Daniel)
spaCy lookups data (Explosion)
dccuchile/bert-base-spanish-wwm-cased (Cañete, José and Chaperon, Gabriel and Fuentes, Rodrigo and Ho, Jou-Hui and Kang, Hojin and Pérez, Jorge)
License GNU GPL 3.0
Author Explosion
Model size 388 MB

Label Scheme

View label scheme (463 labels for 2 components)
Component Labels
morphologizer Definite=Def|Gender=Masc|Number=Sing|POS=DET|PronType=Art, Gender=Masc|Number=Sing|POS=NOUN, Definite=Def|Gender=Masc|Number=Sing|POS=ADP|PronType=Art, Gender=Masc|Number=Sing|POS=ADJ, POS=ADP, Definite=Def|Gender=Fem|Number=Plur|POS=DET|PronType=Art, POS=PROPN, Case=Acc|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Past|VerbForm=Fin, POS=VERB|VerbForm=Inf, Gender=Fem|Number=Sing|POS=DET|PronType=Dem, Gender=Fem|Number=Sing|POS=NOUN, Gender=Fem|Number=Plur|POS=NOUN, Gender=Fem|Number=Plur|POS=DET|PronType=Ind, POS=PRON|PronType=Int,Rel, Mood=Sub|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Definite=Def|Gender=Fem|Number=Sing|POS=DET|PronType=Art, POS=SCONJ, POS=NOUN, Definite=Def|Gender=Masc|Number=Plur|POS=DET|PronType=Art, Number=Plur|POS=NOUN, Gender=Masc|Number=Plur|POS=DET|PronType=Ind, Gender=Masc|Number=Plur|POS=NOUN, POS=PUNCT|PunctType=Peri, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, POS=PUNCT|PunctType=Comm, Case=Acc|Gender=Fem|Number=Sing|POS=VERB|Person=3|PrepCase=Npr|PronType=Prs|VerbForm=Inf, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Gender=Masc|Number=Sing|POS=VERB|Tense=Past|VerbForm=Part, Number=Plur|POS=ADJ, POS=CCONJ, Gender=Masc|Number=Plur|POS=PRON|PronType=Ind, POS=ADV, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Gender=Masc|NumType=Card|Number=Plur|POS=DET|PronType=Dem, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Number=Sing|POS=ADJ, Gender=Masc|Number=Plur|POS=ADJ|VerbForm=Part, Gender=Masc|Number=Plur|POS=PRON|PronType=Tot, POS=PRON|PronType=Ind, POS=ADV|Polarity=Neg, Case=Acc|Gender=Masc|Number=Sing|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Gender=Fem|Number=Sing|POS=ADJ, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Past|VerbForm=Fin, Number=Plur|POS=PRON|PronType=Int,Rel, POS=PUNCT|PunctType=Quot, POS=PUNCT, Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part, POS=PUNCT|PunctSide=Ini|PunctType=Brck, POS=PUNCT|PunctSide=Fin|PunctType=Brck, NumForm=Digit|NumType=Card|POS=NUM, NumType=Card|POS=NUM, POS=VERB|VerbForm=Ger, Definite=Ind|Gender=Masc|Number=Sing|POS=DET|PronType=Art, Gender=Masc|Number=Sing|POS=DET|PronType=Dem, Gender=Fem|NumType=Ord|Number=Plur|POS=ADJ, Number=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs, Number=Sing|POS=NOUN, Gender=Masc|Number=Plur|POS=ADJ, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Fut|VerbForm=Fin, Gender=Fem|Number=Sing|POS=ADJ|VerbForm=Part, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Degree=Cmp|POS=ADV, POS=AUX|VerbForm=Inf, Number=Plur|POS=DET|PronType=Ind, Number=Plur|POS=DET|PronType=Dem, POS=PRON|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes, Degree=Cmp|Number=Sing|POS=ADJ, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Acc|POS=VERB|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes|VerbForm=Inf, Degree=Sup|Gender=Masc|Number=Plur|POS=ADJ, Definite=Ind|Gender=Fem|Number=Sing|POS=DET|PronType=Art, AdvType=Tim|POS=NOUN, Gender=Masc|Number=Sing|POS=PRON|PronType=Ind, NumType=Card|Number=Plur|POS=NUM, Case=Acc|Gender=Masc|Number=Sing|POS=VERB|Person=3|PrepCase=Npr|PronType=Prs|VerbForm=Inf, NumForm=Digit|POS=NOUN, Number=Sing|POS=PRON|PronType=Dem, Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs, Gender=Fem|Number=Plur|POS=ADJ, Gender=Fem|Number=Plur|POS=PRON|PronType=Ind, Gender=Masc|Number=Plur|POS=DET|PronType=Tot, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Past|VerbForm=Fin, Gender=Masc|Number=Plur|POS=VERB|Tense=Past|VerbForm=Part, Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ, Gender=Masc|NumType=Ord|Number=Plur|POS=ADJ, Gender=Masc|Number=Plur|POS=DET|PronType=Dem, Gender=Masc|Number=Sing|POS=AUX|Tense=Past|VerbForm=Part, Number=Sing|POS=DET|PronType=Tot, Gender=Fem|Number=Sing|POS=PRON|PronType=Ind, Case=Dat|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes, Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin, Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Degree=Cmp|Number=Plur|POS=ADJ, POS=AUX|VerbForm=Ger, Gender=Fem|POS=NOUN, Gender=Fem|NumType=Ord|Number=Sing|POS=ADJ, AdvType=Tim|POS=ADJ, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Past|VerbForm=Fin, Gender=Fem|Number=Sing|POS=VERB|Tense=Past|VerbForm=Part, Case=Acc|Gender=Fem|Number=Sing|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin, Gender=Fem|Number=Plur|POS=ADJ|VerbForm=Part, Gender=Fem|Number=Plur|POS=DET|PronType=Dem, Gender=Masc|Number=Sing|POS=PRON|Poss=Yes|PronType=Int,Rel, Number=Sing|POS=PRON|PronType=Int,Rel, POS=ADJ, Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Imp|VerbForm=Fin, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Imp|VerbForm=Fin, Mood=Sub|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Gender=Fem|Number=Plur|POS=DET|PronType=Tot, Case=Acc,Nom|Gender=Masc|Number=Sing|POS=PRON|Person=3|PronType=Prs, Mood=Sub|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Definite=Ind|Gender=Fem|Number=Plur|POS=DET|PronType=Art, Case=Acc,Nom|Gender=Fem|Number=Plur|POS=PRON|Person=3|PronType=Prs, Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Case=Acc|Definite=Def|Gender=Masc|Number=Sing|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Gender=Fem|Number=Sing|POS=PRON|PronType=Dem, Mood=Cnd|Number=Sing|POS=VERB|Person=1|VerbForm=Fin, Gender=Masc|Number=Sing|POS=DET|PronType=Tot, Number=Plur|POS=PRON|PronType=Ind, Gender=Masc|Number=Sing|POS=DET|PronType=Ind, Case=Dat|Number=Sing|POS=PRON|Person=3|PronType=Prs, POS=PART, Gender=Fem|Number=Sing|POS=DET|PronType=Ind, Number=Sing|POS=DET|PronType=Ind, Gender=Masc|NumType=Card|Number=Plur|POS=DET|PronType=Ind, Mood=Cnd|Number=Plur|POS=AUX|Person=3|VerbForm=Fin, NumForm=Digit|POS=SYM, Mood=Imp|Number=Sing|POS=VERB|Person=2|VerbForm=Fin, Case=Dat|Number=Sing|POS=VERB|Person=3|PronType=Prs|VerbForm=Inf, Gender=Fem|Number=Plur|POS=PRON|PronType=Dem, Mood=Cnd|Number=Sing|POS=AUX|Person=1|VerbForm=Fin, NumForm=Digit|NumType=Frac|POS=NUM, Gender=Fem|Number=Sing|POS=PRON|Poss=Yes|PronType=Int,Rel, Mood=Sub|Number=Sing|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin, Mood=Sub|Number=Sing|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin, Gender=Fem|Number=Sing|Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs, Case=Dat|Number=Plur|POS=PRON|Person=1|PrepCase=Npr|PronType=Prs, Definite=Ind|Gender=Masc|Number=Plur|POS=DET|PronType=Art, POS=PUNCT|PunctType=Colo, Mood=Sub|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Mood=Imp|Number=Plur|POS=VERB|Person=3|VerbForm=Fin, Gender=Fem|Number=Sing|POS=DET|PronType=Neg, Gender=Masc|Number=Sing|POS=PRON|PronType=Dem, Case=Acc|Gender=Masc|Number=Plur|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Case=Acc|Gender=Fem|Number=Plur|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Gender=Fem|Number=Plur|POS=VERB|Tense=Past|VerbForm=Part, `Case=Acc|Gender=Fem|Number=Sing|POS=AUX|Person=3|PrepCa...
Read more

es_core_news_sm-3.8.0

30 Sep 10:00
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: f13c93ee56937f2024ce2836f3726bf56bd85589237e9894d97b650bb0546bd9
Checksum .whl: e451a83d6df79b87e9eed0cb553f03e99e36a3bab18a7b79f0dcfd1fdf875e12

Details: https://spacy.io/models/es#es_core_news_sm

Spanish pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.

Feature Description
Name es_core_news_sm
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline tok2vec, morphologizer, parser, attribute_ruler, lemmatizer, ner
Components tok2vec, morphologizer, parser, senter, attribute_ruler, lemmatizer, ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources UD Spanish AnCora v2.8 (Martínez Alonso, Héctor; Zeman, Daniel)
WikiNER (Joel Nothman, Nicky Ringland, Will Radford, Tara Murphy, James R Curran)
spaCy lookups data (Explosion)
License GNU GPL 3.0
Author Explosion
Model size 12 MB

Label Scheme

View label scheme (468 labels for 3 components)
Component Labels
morphologizer Definite=Def|Gender=Masc|Number=Sing|POS=DET|PronType=Art, Gender=Masc|Number=Sing|POS=NOUN, Definite=Def|Gender=Masc|Number=Sing|POS=ADP|PronType=Art, Gender=Masc|Number=Sing|POS=ADJ, POS=ADP, Definite=Def|Gender=Fem|Number=Plur|POS=DET|PronType=Art, POS=PROPN, Case=Acc|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Past|VerbForm=Fin, POS=VERB|VerbForm=Inf, Gender=Fem|Number=Sing|POS=DET|PronType=Dem, Gender=Fem|Number=Sing|POS=NOUN, Gender=Fem|Number=Plur|POS=NOUN, Gender=Fem|Number=Plur|POS=DET|PronType=Ind, POS=PRON|PronType=Int,Rel, Mood=Sub|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Definite=Def|Gender=Fem|Number=Sing|POS=DET|PronType=Art, POS=SCONJ, POS=NOUN, Definite=Def|Gender=Masc|Number=Plur|POS=DET|PronType=Art, Number=Plur|POS=NOUN, Gender=Masc|Number=Plur|POS=DET|PronType=Ind, Gender=Masc|Number=Plur|POS=NOUN, POS=PUNCT|PunctType=Peri, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, POS=PUNCT|PunctType=Comm, Case=Acc|Gender=Fem|Number=Sing|POS=VERB|Person=3|PrepCase=Npr|PronType=Prs|VerbForm=Inf, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Gender=Masc|Number=Sing|POS=VERB|Tense=Past|VerbForm=Part, Number=Plur|POS=ADJ, POS=CCONJ, Gender=Masc|Number=Plur|POS=PRON|PronType=Ind, POS=ADV, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Gender=Masc|NumType=Card|Number=Plur|POS=DET|PronType=Dem, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Number=Sing|POS=ADJ, Gender=Masc|Number=Plur|POS=ADJ|VerbForm=Part, Gender=Masc|Number=Plur|POS=PRON|PronType=Tot, POS=PRON|PronType=Ind, POS=ADV|Polarity=Neg, Case=Acc|Gender=Masc|Number=Sing|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Gender=Fem|Number=Sing|POS=ADJ, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Past|VerbForm=Fin, Number=Plur|POS=PRON|PronType=Int,Rel, POS=PUNCT|PunctType=Quot, POS=PUNCT, Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part, POS=PUNCT|PunctSide=Ini|PunctType=Brck, POS=PUNCT|PunctSide=Fin|PunctType=Brck, NumForm=Digit|NumType=Card|POS=NUM, NumType=Card|POS=NUM, POS=VERB|VerbForm=Ger, Definite=Ind|Gender=Masc|Number=Sing|POS=DET|PronType=Art, Gender=Masc|Number=Sing|POS=DET|PronType=Dem, Gender=Fem|NumType=Ord|Number=Plur|POS=ADJ, Number=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs, Number=Sing|POS=NOUN, Gender=Masc|Number=Plur|POS=ADJ, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Fut|VerbForm=Fin, Gender=Fem|Number=Sing|POS=ADJ|VerbForm=Part, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Degree=Cmp|POS=ADV, POS=AUX|VerbForm=Inf, Number=Plur|POS=DET|PronType=Ind, Number=Plur|POS=DET|PronType=Dem, POS=PRON|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes, Degree=Cmp|Number=Sing|POS=ADJ, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Acc|POS=VERB|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes|VerbForm=Inf, Degree=Sup|Gender=Masc|Number=Plur|POS=ADJ, Definite=Ind|Gender=Fem|Number=Sing|POS=DET|PronType=Art, AdvType=Tim|POS=NOUN, Gender=Masc|Number=Sing|POS=PRON|PronType=Ind, NumType=Card|Number=Plur|POS=NUM, Case=Acc|Gender=Masc|Number=Sing|POS=VERB|Person=3|PrepCase=Npr|PronType=Prs|VerbForm=Inf, NumForm=Digit|POS=NOUN, Number=Sing|POS=PRON|PronType=Dem, Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs, Gender=Fem|Number=Plur|POS=ADJ, Gender=Fem|Number=Plur|POS=PRON|PronType=Ind, Gender=Masc|Number=Plur|POS=DET|PronType=Tot, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Past|VerbForm=Fin, Gender=Masc|Number=Plur|POS=VERB|Tense=Past|VerbForm=Part, Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ, Gender=Masc|NumType=Ord|Number=Plur|POS=ADJ, Gender=Masc|Number=Plur|POS=DET|PronType=Dem, Gender=Masc|Number=Sing|POS=AUX|Tense=Past|VerbForm=Part, Number=Sing|POS=DET|PronType=Tot, Gender=Fem|Number=Sing|POS=PRON|PronType=Ind, Case=Dat|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes, Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin, Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Degree=Cmp|Number=Plur|POS=ADJ, POS=AUX|VerbForm=Ger, Gender=Fem|POS=NOUN, Gender=Fem|NumType=Ord|Number=Sing|POS=ADJ, AdvType=Tim|POS=ADJ, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Past|VerbForm=Fin, Gender=Fem|Number=Sing|POS=VERB|Tense=Past|VerbForm=Part, Case=Acc|Gender=Fem|Number=Sing|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin, Gender=Fem|Number=Plur|POS=ADJ|VerbForm=Part, Gender=Fem|Number=Plur|POS=DET|PronType=Dem, Gender=Masc|Number=Sing|POS=PRON|Poss=Yes|PronType=Int,Rel, Number=Sing|POS=PRON|PronType=Int,Rel, POS=ADJ, Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Imp|VerbForm=Fin, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Imp|VerbForm=Fin, Mood=Sub|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Gender=Fem|Number=Plur|POS=DET|PronType=Tot, Case=Acc,Nom|Gender=Masc|Number=Sing|POS=PRON|Person=3|PronType=Prs, Mood=Sub|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Definite=Ind|Gender=Fem|Number=Plur|POS=DET|PronType=Art, Case=Acc,Nom|Gender=Fem|Number=Plur|POS=PRON|Person=3|PronType=Prs, Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Case=Acc|Definite=Def|Gender=Masc|Number=Sing|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, POS=SPACE, Gender=Fem|Number=Sing|POS=PRON|PronType=Dem, Mood=Cnd|Number=Sing|POS=VERB|Person=1|VerbForm=Fin, Gender=Masc|Number=Sing|POS=DET|PronType=Tot, Number=Plur|POS=PRON|PronType=Ind, Gender=Masc|Number=Sing|POS=DET|PronType=Ind, Case=Dat|Number=Sing|POS=PRON|Person=3|PronType=Prs, POS=PART, Gender=Fem|Number=Sing|POS=DET|PronType=Ind, Number=Sing|POS=DET|PronType=Ind, Gender=Masc|NumType=Card|Number=Plur|POS=DET|PronType=Ind, Mood=Cnd|Number=Plur|POS=AUX|Person=3|VerbForm=Fin, NumForm=Digit|POS=SYM, Mood=Imp|Number=Sing|POS=VERB|Person=2|VerbForm=Fin, Case=Dat|Number=Sing|POS=VERB|Person=3|PronType=Prs|VerbForm=Inf, Gender=Fem|Number=Plur|POS=PRON|PronType=Dem, Mood=Cnd|Number=Sing|POS=AUX|Person=1|VerbForm=Fin, NumForm=Digit|NumType=Frac|POS=NUM, Gender=Fem|Number=Sing|POS=PRON|Poss=Yes|PronType=Int,Rel, Mood=Sub|Number=Sing|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin, Mood=Sub|Number=Sing|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin, Gender=Fem|Number=Sing|Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs, Case=Dat|Number=Plur|POS=PRON|Person=1|PrepCase=Npr|PronType=Prs, Definite=Ind|Gender=Masc|Number=Plur|POS=DET|PronType=Art, POS=PUNCT|PunctType=Colo, Mood=Sub|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Mood=Imp|Number=Plur|POS=VERB|Person=3|VerbForm=Fin, Gender=Fem|Number=Sing|POS=DET|PronType=Neg, Gender=Masc|Number=Sing|POS=PRON|PronType=Dem, Case=Acc|Gender=Masc|Number=Plur|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Case=Acc|Gender=Fem|Number=Plur|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Gender=Fem|Number=Plur|POS=VERB|Tense=Past|VerbForm=Part, Case=Acc|Gender=Fem|Number=Sing|POS=AUX|Person=3|PrepCase=Npr|PronType=Prs|VerbForm=Inf, Number=Sing|POS=PRON|PronType=Neg, POS=PUNCT|PunctType=Semi, `Case=Dat|Number=Plur|POS=PRON|...
Read more

es_core_news_md-3.8.0

30 Sep 10:00
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: 79b8cadfdae5ace729bf1fd1220de7ca834bd4a7a905e0520491e51443cbee55
Checksum .whl: 478b8bb3f3e8eb149192f7d80e7cd64f990b7e9bccbc5329df41a57e86326be2

Details: https://spacy.io/models/es#es_core_news_md

Spanish pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.

Feature Description
Name es_core_news_md
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline tok2vec, morphologizer, parser, attribute_ruler, lemmatizer, ner
Components tok2vec, morphologizer, parser, senter, attribute_ruler, lemmatizer, ner
Vectors 500000 keys, 20000 unique vectors (300 dimensions)
Sources UD Spanish AnCora v2.8 (Martínez Alonso, Héctor; Zeman, Daniel)
WikiNER (Joel Nothman, Nicky Ringland, Will Radford, Tara Murphy, James R Curran)
spaCy lookups data (Explosion)
Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia) (Explosion)
License GNU GPL 3.0
Author Explosion
Model size 40 MB

Label Scheme

View label scheme (468 labels for 3 components)
Component Labels
morphologizer Definite=Def|Gender=Masc|Number=Sing|POS=DET|PronType=Art, Gender=Masc|Number=Sing|POS=NOUN, Definite=Def|Gender=Masc|Number=Sing|POS=ADP|PronType=Art, Gender=Masc|Number=Sing|POS=ADJ, POS=ADP, Definite=Def|Gender=Fem|Number=Plur|POS=DET|PronType=Art, POS=PROPN, Case=Acc|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Past|VerbForm=Fin, POS=VERB|VerbForm=Inf, Gender=Fem|Number=Sing|POS=DET|PronType=Dem, Gender=Fem|Number=Sing|POS=NOUN, Gender=Fem|Number=Plur|POS=NOUN, Gender=Fem|Number=Plur|POS=DET|PronType=Ind, POS=PRON|PronType=Int,Rel, Mood=Sub|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Definite=Def|Gender=Fem|Number=Sing|POS=DET|PronType=Art, POS=SCONJ, POS=NOUN, Definite=Def|Gender=Masc|Number=Plur|POS=DET|PronType=Art, Number=Plur|POS=NOUN, Gender=Masc|Number=Plur|POS=DET|PronType=Ind, Gender=Masc|Number=Plur|POS=NOUN, POS=PUNCT|PunctType=Peri, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, POS=PUNCT|PunctType=Comm, Case=Acc|Gender=Fem|Number=Sing|POS=VERB|Person=3|PrepCase=Npr|PronType=Prs|VerbForm=Inf, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Gender=Masc|Number=Sing|POS=VERB|Tense=Past|VerbForm=Part, Number=Plur|POS=ADJ, POS=CCONJ, Gender=Masc|Number=Plur|POS=PRON|PronType=Ind, POS=ADV, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Gender=Masc|NumType=Card|Number=Plur|POS=DET|PronType=Dem, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Number=Sing|POS=ADJ, Gender=Masc|Number=Plur|POS=ADJ|VerbForm=Part, Gender=Masc|Number=Plur|POS=PRON|PronType=Tot, POS=PRON|PronType=Ind, POS=ADV|Polarity=Neg, Case=Acc|Gender=Masc|Number=Sing|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Gender=Fem|Number=Sing|POS=ADJ, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Past|VerbForm=Fin, Number=Plur|POS=PRON|PronType=Int,Rel, POS=PUNCT|PunctType=Quot, POS=PUNCT, Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part, POS=PUNCT|PunctSide=Ini|PunctType=Brck, POS=PUNCT|PunctSide=Fin|PunctType=Brck, NumForm=Digit|NumType=Card|POS=NUM, NumType=Card|POS=NUM, POS=VERB|VerbForm=Ger, Definite=Ind|Gender=Masc|Number=Sing|POS=DET|PronType=Art, Gender=Masc|Number=Sing|POS=DET|PronType=Dem, Gender=Fem|NumType=Ord|Number=Plur|POS=ADJ, Number=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs, Number=Sing|POS=NOUN, Gender=Masc|Number=Plur|POS=ADJ, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Fut|VerbForm=Fin, Gender=Fem|Number=Sing|POS=ADJ|VerbForm=Part, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Degree=Cmp|POS=ADV, POS=AUX|VerbForm=Inf, Number=Plur|POS=DET|PronType=Ind, Number=Plur|POS=DET|PronType=Dem, POS=PRON|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes, Degree=Cmp|Number=Sing|POS=ADJ, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Acc|POS=VERB|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes|VerbForm=Inf, Degree=Sup|Gender=Masc|Number=Plur|POS=ADJ, Definite=Ind|Gender=Fem|Number=Sing|POS=DET|PronType=Art, AdvType=Tim|POS=NOUN, Gender=Masc|Number=Sing|POS=PRON|PronType=Ind, NumType=Card|Number=Plur|POS=NUM, Case=Acc|Gender=Masc|Number=Sing|POS=VERB|Person=3|PrepCase=Npr|PronType=Prs|VerbForm=Inf, NumForm=Digit|POS=NOUN, Number=Sing|POS=PRON|PronType=Dem, Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs, Gender=Fem|Number=Plur|POS=ADJ, Gender=Fem|Number=Plur|POS=PRON|PronType=Ind, Gender=Masc|Number=Plur|POS=DET|PronType=Tot, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Past|VerbForm=Fin, Gender=Masc|Number=Plur|POS=VERB|Tense=Past|VerbForm=Part, Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ, Gender=Masc|NumType=Ord|Number=Plur|POS=ADJ, Gender=Masc|Number=Plur|POS=DET|PronType=Dem, Gender=Masc|Number=Sing|POS=AUX|Tense=Past|VerbForm=Part, Number=Sing|POS=DET|PronType=Tot, Gender=Fem|Number=Sing|POS=PRON|PronType=Ind, Case=Dat|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes, Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin, Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Degree=Cmp|Number=Plur|POS=ADJ, POS=AUX|VerbForm=Ger, Gender=Fem|POS=NOUN, Gender=Fem|NumType=Ord|Number=Sing|POS=ADJ, AdvType=Tim|POS=ADJ, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Past|VerbForm=Fin, Gender=Fem|Number=Sing|POS=VERB|Tense=Past|VerbForm=Part, Case=Acc|Gender=Fem|Number=Sing|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin, Gender=Fem|Number=Plur|POS=ADJ|VerbForm=Part, Gender=Fem|Number=Plur|POS=DET|PronType=Dem, Gender=Masc|Number=Sing|POS=PRON|Poss=Yes|PronType=Int,Rel, Number=Sing|POS=PRON|PronType=Int,Rel, POS=ADJ, Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Imp|VerbForm=Fin, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Imp|VerbForm=Fin, Mood=Sub|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Gender=Fem|Number=Plur|POS=DET|PronType=Tot, Case=Acc,Nom|Gender=Masc|Number=Sing|POS=PRON|Person=3|PronType=Prs, Mood=Sub|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Definite=Ind|Gender=Fem|Number=Plur|POS=DET|PronType=Art, Case=Acc,Nom|Gender=Fem|Number=Plur|POS=PRON|Person=3|PronType=Prs, Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Case=Acc|Definite=Def|Gender=Masc|Number=Sing|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, POS=SPACE, Gender=Fem|Number=Sing|POS=PRON|PronType=Dem, Mood=Cnd|Number=Sing|POS=VERB|Person=1|VerbForm=Fin, Gender=Masc|Number=Sing|POS=DET|PronType=Tot, Number=Plur|POS=PRON|PronType=Ind, Gender=Masc|Number=Sing|POS=DET|PronType=Ind, Case=Dat|Number=Sing|POS=PRON|Person=3|PronType=Prs, POS=PART, Gender=Fem|Number=Sing|POS=DET|PronType=Ind, Number=Sing|POS=DET|PronType=Ind, Gender=Masc|NumType=Card|Number=Plur|POS=DET|PronType=Ind, Mood=Cnd|Number=Plur|POS=AUX|Person=3|VerbForm=Fin, NumForm=Digit|POS=SYM, Mood=Imp|Number=Sing|POS=VERB|Person=2|VerbForm=Fin, Case=Dat|Number=Sing|POS=VERB|Person=3|PronType=Prs|VerbForm=Inf, Gender=Fem|Number=Plur|POS=PRON|PronType=Dem, Mood=Cnd|Number=Sing|POS=AUX|Person=1|VerbForm=Fin, NumForm=Digit|NumType=Frac|POS=NUM, Gender=Fem|Number=Sing|POS=PRON|Poss=Yes|PronType=Int,Rel, Mood=Sub|Number=Sing|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin, Mood=Sub|Number=Sing|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin, Gender=Fem|Number=Sing|Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs, Case=Dat|Number=Plur|POS=PRON|Person=1|PrepCase=Npr|PronType=Prs, Definite=Ind|Gender=Masc|Number=Plur|POS=DET|PronType=Art, POS=PUNCT|PunctType=Colo, Mood=Sub|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Mood=Imp|Number=Plur|POS=VERB|Person=3|VerbForm=Fin, Gender=Fem|Number=Sing|POS=DET|PronType=Neg, Gender=Masc|Number=Sing|POS=PRON|PronType=Dem, Case=Acc|Gender=Masc|Number=Plur|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Case=Acc|Gender=Fem|Number=Plur|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Gender=Fem|Number=Plur|POS=VERB|Tense=Past|VerbForm=Part, `Case=Acc|Gender=Fem|Number=Sing|POS=AUX|Person=3|PrepCase=Npr|PronType=Prs|Ver...
Read more

es_core_news_lg-3.8.0

30 Sep 10:00
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: 2ab38480b96b2c104a99f88b9e5541c7995557c0a2f06c108ae6e7e59af7e221
Checksum .whl: 7c6c212715a12f31aacde3361754436945ff7376fb24cde57d0c277c9c9b050b

Details: https://spacy.io/models/es#es_core_news_lg

Spanish pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.

Feature Description
Name es_core_news_lg
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline tok2vec, morphologizer, parser, attribute_ruler, lemmatizer, ner
Components tok2vec, morphologizer, parser, senter, attribute_ruler, lemmatizer, ner
Vectors 500000 keys, 500000 unique vectors (300 dimensions)
Sources UD Spanish AnCora v2.8 (Martínez Alonso, Héctor; Zeman, Daniel)
WikiNER (Joel Nothman, Nicky Ringland, Will Radford, Tara Murphy, James R Curran)
spaCy lookups data (Explosion)
Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia) (Explosion)
License GNU GPL 3.0
Author Explosion
Model size 541 MB

Label Scheme

View label scheme (468 labels for 3 components)
Component Labels
morphologizer Definite=Def|Gender=Masc|Number=Sing|POS=DET|PronType=Art, Gender=Masc|Number=Sing|POS=NOUN, Definite=Def|Gender=Masc|Number=Sing|POS=ADP|PronType=Art, Gender=Masc|Number=Sing|POS=ADJ, POS=ADP, Definite=Def|Gender=Fem|Number=Plur|POS=DET|PronType=Art, POS=PROPN, Case=Acc|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Past|VerbForm=Fin, POS=VERB|VerbForm=Inf, Gender=Fem|Number=Sing|POS=DET|PronType=Dem, Gender=Fem|Number=Sing|POS=NOUN, Gender=Fem|Number=Plur|POS=NOUN, Gender=Fem|Number=Plur|POS=DET|PronType=Ind, POS=PRON|PronType=Int,Rel, Mood=Sub|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Definite=Def|Gender=Fem|Number=Sing|POS=DET|PronType=Art, POS=SCONJ, POS=NOUN, Definite=Def|Gender=Masc|Number=Plur|POS=DET|PronType=Art, Number=Plur|POS=NOUN, Gender=Masc|Number=Plur|POS=DET|PronType=Ind, Gender=Masc|Number=Plur|POS=NOUN, POS=PUNCT|PunctType=Peri, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, POS=PUNCT|PunctType=Comm, Case=Acc|Gender=Fem|Number=Sing|POS=VERB|Person=3|PrepCase=Npr|PronType=Prs|VerbForm=Inf, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Gender=Masc|Number=Sing|POS=VERB|Tense=Past|VerbForm=Part, Number=Plur|POS=ADJ, POS=CCONJ, Gender=Masc|Number=Plur|POS=PRON|PronType=Ind, POS=ADV, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Gender=Masc|NumType=Card|Number=Plur|POS=DET|PronType=Dem, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Number=Sing|POS=ADJ, Gender=Masc|Number=Plur|POS=ADJ|VerbForm=Part, Gender=Masc|Number=Plur|POS=PRON|PronType=Tot, POS=PRON|PronType=Ind, POS=ADV|Polarity=Neg, Case=Acc|Gender=Masc|Number=Sing|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Gender=Fem|Number=Sing|POS=ADJ, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Past|VerbForm=Fin, Number=Plur|POS=PRON|PronType=Int,Rel, POS=PUNCT|PunctType=Quot, POS=PUNCT, Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part, POS=PUNCT|PunctSide=Ini|PunctType=Brck, POS=PUNCT|PunctSide=Fin|PunctType=Brck, NumForm=Digit|NumType=Card|POS=NUM, NumType=Card|POS=NUM, POS=VERB|VerbForm=Ger, Definite=Ind|Gender=Masc|Number=Sing|POS=DET|PronType=Art, Gender=Masc|Number=Sing|POS=DET|PronType=Dem, Gender=Fem|NumType=Ord|Number=Plur|POS=ADJ, Number=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs, Number=Sing|POS=NOUN, Gender=Masc|Number=Plur|POS=ADJ, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Fut|VerbForm=Fin, Gender=Fem|Number=Sing|POS=ADJ|VerbForm=Part, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Degree=Cmp|POS=ADV, POS=AUX|VerbForm=Inf, Number=Plur|POS=DET|PronType=Ind, Number=Plur|POS=DET|PronType=Dem, POS=PRON|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes, Degree=Cmp|Number=Sing|POS=ADJ, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Acc|POS=VERB|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes|VerbForm=Inf, Degree=Sup|Gender=Masc|Number=Plur|POS=ADJ, Definite=Ind|Gender=Fem|Number=Sing|POS=DET|PronType=Art, AdvType=Tim|POS=NOUN, Gender=Masc|Number=Sing|POS=PRON|PronType=Ind, NumType=Card|Number=Plur|POS=NUM, Case=Acc|Gender=Masc|Number=Sing|POS=VERB|Person=3|PrepCase=Npr|PronType=Prs|VerbForm=Inf, NumForm=Digit|POS=NOUN, Number=Sing|POS=PRON|PronType=Dem, Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs, Gender=Fem|Number=Plur|POS=ADJ, Gender=Fem|Number=Plur|POS=PRON|PronType=Ind, Gender=Masc|Number=Plur|POS=DET|PronType=Tot, Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Past|VerbForm=Fin, Gender=Masc|Number=Plur|POS=VERB|Tense=Past|VerbForm=Part, Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ, Gender=Masc|NumType=Ord|Number=Plur|POS=ADJ, Gender=Masc|Number=Plur|POS=DET|PronType=Dem, Gender=Masc|Number=Sing|POS=AUX|Tense=Past|VerbForm=Part, Number=Sing|POS=DET|PronType=Tot, Gender=Fem|Number=Sing|POS=PRON|PronType=Ind, Case=Dat|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes, Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin, Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Degree=Cmp|Number=Plur|POS=ADJ, POS=AUX|VerbForm=Ger, Gender=Fem|POS=NOUN, Gender=Fem|NumType=Ord|Number=Sing|POS=ADJ, AdvType=Tim|POS=ADJ, Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Past|VerbForm=Fin, Gender=Fem|Number=Sing|POS=VERB|Tense=Past|VerbForm=Part, Case=Acc|Gender=Fem|Number=Sing|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin, Gender=Fem|Number=Plur|POS=ADJ|VerbForm=Part, Gender=Fem|Number=Plur|POS=DET|PronType=Dem, Gender=Masc|Number=Sing|POS=PRON|Poss=Yes|PronType=Int,Rel, Number=Sing|POS=PRON|PronType=Int,Rel, POS=ADJ, Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Imp|VerbForm=Fin, Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin, Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Imp|VerbForm=Fin, Mood=Sub|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Gender=Fem|Number=Plur|POS=DET|PronType=Tot, Case=Acc,Nom|Gender=Masc|Number=Sing|POS=PRON|Person=3|PronType=Prs, Mood=Sub|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Definite=Ind|Gender=Fem|Number=Plur|POS=DET|PronType=Art, Case=Acc,Nom|Gender=Fem|Number=Plur|POS=PRON|Person=3|PronType=Prs, Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Case=Acc|Definite=Def|Gender=Masc|Number=Sing|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, POS=SPACE, Gender=Fem|Number=Sing|POS=PRON|PronType=Dem, Mood=Cnd|Number=Sing|POS=VERB|Person=1|VerbForm=Fin, Gender=Masc|Number=Sing|POS=DET|PronType=Tot, Number=Plur|POS=PRON|PronType=Ind, Gender=Masc|Number=Sing|POS=DET|PronType=Ind, Case=Dat|Number=Sing|POS=PRON|Person=3|PronType=Prs, POS=PART, Gender=Fem|Number=Sing|POS=DET|PronType=Ind, Number=Sing|POS=DET|PronType=Ind, Gender=Masc|NumType=Card|Number=Plur|POS=DET|PronType=Ind, Mood=Cnd|Number=Plur|POS=AUX|Person=3|VerbForm=Fin, NumForm=Digit|POS=SYM, Mood=Imp|Number=Sing|POS=VERB|Person=2|VerbForm=Fin, Case=Dat|Number=Sing|POS=VERB|Person=3|PronType=Prs|VerbForm=Inf, Gender=Fem|Number=Plur|POS=PRON|PronType=Dem, Mood=Cnd|Number=Sing|POS=AUX|Person=1|VerbForm=Fin, NumForm=Digit|NumType=Frac|POS=NUM, Gender=Fem|Number=Sing|POS=PRON|Poss=Yes|PronType=Int,Rel, Mood=Sub|Number=Sing|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin, Mood=Sub|Number=Sing|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin, Gender=Fem|Number=Sing|Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs, Case=Dat|Number=Plur|POS=PRON|Person=1|PrepCase=Npr|PronType=Prs, Definite=Ind|Gender=Masc|Number=Plur|POS=DET|PronType=Art, POS=PUNCT|PunctType=Colo, Mood=Sub|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin, Mood=Imp|Number=Plur|POS=VERB|Person=3|VerbForm=Fin, Gender=Fem|Number=Sing|POS=DET|PronType=Neg, Gender=Masc|Number=Sing|POS=PRON|PronType=Dem, Case=Acc|Gender=Masc|Number=Plur|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Case=Acc|Gender=Fem|Number=Plur|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs, Gender=Fem|Number=Plur|POS=VERB|Tense=Past|VerbForm=Part, `Case=Acc|Gender=Fem|Number=Sing|POS=AUX|Person=3|PrepCase=Npr|PronType=Prs|V...
Read more

en_core_web_trf-3.8.0

30 Sep 10:00
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: eaed18a77cbd6fad1b0605d2535c2091010d9aaf87b6264c2080042dd394621f
Checksum .whl: 272a31e9d8530d1e075351d30a462d7e80e31da23574f1b274e200f3fff35bf5

Details: https://spacy.io/models/en#en_core_web_trf

English transformer pipeline (Transformer(name='roberta-base', piece_encoder='byte-bpe', stride=104, type='roberta', width=768, window=144, vocab_size=50265)). Components: transformer, tagger, parser, ner, attribute_ruler, lemmatizer.

Feature Description
Name en_core_web_trf
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline transformer, tagger, parser, attribute_ruler, lemmatizer, ner
Components transformer, tagger, parser, attribute_ruler, lemmatizer, ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources OntoNotes 5 (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)
ClearNLP Constituent-to-Dependency Conversion (Emory University)
WordNet 3.0 (Princeton University)
roberta-base (Yinhan Liu and Myle Ott and Naman Goyal and Jingfei Du and Mandar Joshi and Danqi Chen and Omer Levy and Mike Lewis and Luke Zettlemoyer and Veselin Stoyanov)
License MIT
Author Explosion
Model size 436 MB

Label Scheme

View label scheme (112 labels for 3 components)
Component Labels
tagger $, '', ,, -LRB-, -RRB-, ., :, ADD, AFX, CC, CD, DT, EX, FW, HYPH, IN, JJ, JJR, JJS, LS, MD, NFP, NN, NNP, NNPS, NNS, PDT, POS, PRP, PRP$, RB, RBR, RBS, RP, SYM, TO, UH, VB, VBD, VBG, VBN, VBP, VBZ, WDT, WP, WP$, WRB, XX, ````
parser ROOT, acl, acomp, advcl, advmod, agent, amod, appos, attr, aux, auxpass, case, cc, ccomp, compound, conj, csubj, csubjpass, dative, dep, det, dobj, expl, intj, mark, meta, neg, nmod, npadvmod, nsubj, nsubjpass, nummod, oprd, parataxis, pcomp, pobj, poss, preconj, predet, prep, prt, punct, quantmod, relcl, xcomp
ner CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type Score
TOKEN_ACC 99.86
TOKEN_P 99.57
TOKEN_R 99.58
TOKEN_F 99.57
TAG_ACC 98.12
SENTS_P 94.11
SENTS_R 83.63
SENTS_F 88.56
DEP_UAS 95.19
DEP_LAS 93.84
ENTS_P 89.73
ENTS_R 90.13
ENTS_F 89.93

Installation

pip install spacy
python -m spacy download en_core_web_trf

en_core_web_sm-3.8.0

30 Sep 10:00
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: 14a2f31bc476af87019819ea8c9948fabdfd473a442edd6b1cba62bf0c2c0f55
Checksum .whl: 1932429db727d4bff3deed6b34cfc05df17794f4a52eeb26cf8928f7c1a0fb85

Details: https://spacy.io/models/en#en_core_web_sm

English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.

Feature Description
Name en_core_web_sm
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline tok2vec, tagger, parser, attribute_ruler, lemmatizer, ner
Components tok2vec, tagger, parser, senter, attribute_ruler, lemmatizer, ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources OntoNotes 5 (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)
ClearNLP Constituent-to-Dependency Conversion (Emory University)
WordNet 3.0 (Princeton University)
License MIT
Author Explosion
Model size 12 MB

Label Scheme

View label scheme (113 labels for 3 components)
Component Labels
tagger $, '', ,, -LRB-, -RRB-, ., :, ADD, AFX, CC, CD, DT, EX, FW, HYPH, IN, JJ, JJR, JJS, LS, MD, NFP, NN, NNP, NNPS, NNS, PDT, POS, PRP, PRP$, RB, RBR, RBS, RP, SYM, TO, UH, VB, VBD, VBG, VBN, VBP, VBZ, WDT, WP, WP$, WRB, XX, _SP, ````
parser ROOT, acl, acomp, advcl, advmod, agent, amod, appos, attr, aux, auxpass, case, cc, ccomp, compound, conj, csubj, csubjpass, dative, dep, det, dobj, expl, intj, mark, meta, neg, nmod, npadvmod, nsubj, nsubjpass, nummod, oprd, parataxis, pcomp, pobj, poss, preconj, predet, prep, prt, punct, quantmod, relcl, xcomp
ner CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type Score
TOKEN_ACC 99.86
TOKEN_P 99.57
TOKEN_R 99.58
TOKEN_F 99.57
TAG_ACC 97.29
SENTS_P 92.01
SENTS_R 89.39
SENTS_F 90.68
DEP_UAS 91.77
DEP_LAS 89.92
ENTS_P 84.30
ENTS_R 84.36
ENTS_F 84.33

Installation

pip install spacy
python -m spacy download en_core_web_sm

en_core_web_md-3.8.0

30 Sep 10:00
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: 504fe70715dca9464bf11e385fbe80e92ae127394df3f9c45242926082425551
Checksum .whl: 5e6329fe3fecedb1d1a02c3ea2172ee0fede6cea6e4aefb6a02d832dba78a310

Details: https://spacy.io/models/en#en_core_web_md

English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.

Feature Description
Name en_core_web_md
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline tok2vec, tagger, parser, attribute_ruler, lemmatizer, ner
Components tok2vec, tagger, parser, senter, attribute_ruler, lemmatizer, ner
Vectors 684830 keys, 20000 unique vectors (300 dimensions)
Sources OntoNotes 5 (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)
ClearNLP Constituent-to-Dependency Conversion (Emory University)
WordNet 3.0 (Princeton University)
Explosion Vectors (OSCAR 2109 + Wikipedia + OpenSubtitles + WMT News Crawl) (Explosion)
License MIT
Author Explosion
Model size 31 MB

Label Scheme

View label scheme (113 labels for 3 components)
Component Labels
tagger $, '', ,, -LRB-, -RRB-, ., :, ADD, AFX, CC, CD, DT, EX, FW, HYPH, IN, JJ, JJR, JJS, LS, MD, NFP, NN, NNP, NNPS, NNS, PDT, POS, PRP, PRP$, RB, RBR, RBS, RP, SYM, TO, UH, VB, VBD, VBG, VBN, VBP, VBZ, WDT, WP, WP$, WRB, XX, _SP, ````
parser ROOT, acl, acomp, advcl, advmod, agent, amod, appos, attr, aux, auxpass, case, cc, ccomp, compound, conj, csubj, csubjpass, dative, dep, det, dobj, expl, intj, mark, meta, neg, nmod, npadvmod, nsubj, nsubjpass, nummod, oprd, parataxis, pcomp, pobj, poss, preconj, predet, prep, prt, punct, quantmod, relcl, xcomp
ner CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type Score
TOKEN_ACC 99.86
TOKEN_P 99.57
TOKEN_R 99.58
TOKEN_F 99.57
TAG_ACC 97.31
SENTS_P 92.14
SENTS_R 89.17
SENTS_F 90.63
DEP_UAS 91.84
DEP_LAS 89.98
ENTS_P 84.40
ENTS_R 85.07
ENTS_F 84.73

Installation

pip install spacy
python -m spacy download en_core_web_md

en_core_web_lg-3.8.0

30 Sep 10:00
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: 7a3c89f3243950000a102c5f124277bbbde2dec467d548eee28b23bd1938ed62
Checksum .whl: 293e9547a655b25499198ab15a525b05b9407a75f10255e405e8c3854329ab63

Details: https://spacy.io/models/en#en_core_web_lg

English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.

Feature Description
Name en_core_web_lg
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline tok2vec, tagger, parser, attribute_ruler, lemmatizer, ner
Components tok2vec, tagger, parser, senter, attribute_ruler, lemmatizer, ner
Vectors 684830 keys, 342918 unique vectors (300 dimensions)
Sources OntoNotes 5 (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)
ClearNLP Constituent-to-Dependency Conversion (Emory University)
WordNet 3.0 (Princeton University)
Explosion Vectors (OSCAR 2109 + Wikipedia + OpenSubtitles + WMT News Crawl) (Explosion)
License MIT
Author Explosion
Model size 382 MB

Label Scheme

View label scheme (113 labels for 3 components)
Component Labels
tagger $, '', ,, -LRB-, -RRB-, ., :, ADD, AFX, CC, CD, DT, EX, FW, HYPH, IN, JJ, JJR, JJS, LS, MD, NFP, NN, NNP, NNPS, NNS, PDT, POS, PRP, PRP$, RB, RBR, RBS, RP, SYM, TO, UH, VB, VBD, VBG, VBN, VBP, VBZ, WDT, WP, WP$, WRB, XX, _SP, ````
parser ROOT, acl, acomp, advcl, advmod, agent, amod, appos, attr, aux, auxpass, case, cc, ccomp, compound, conj, csubj, csubjpass, dative, dep, det, dobj, expl, intj, mark, meta, neg, nmod, npadvmod, nsubj, nsubjpass, nummod, oprd, parataxis, pcomp, pobj, poss, preconj, predet, prep, prt, punct, quantmod, relcl, xcomp
ner CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type Score
TOKEN_ACC 99.86
TOKEN_P 99.57
TOKEN_R 99.58
TOKEN_F 99.57
TAG_ACC 97.38
SENTS_P 92.03
SENTS_R 89.24
SENTS_F 90.62
DEP_UAS 92.04
DEP_LAS 90.21
ENTS_P 85.21
ENTS_R 85.87
ENTS_F 85.54

Installation

pip install spacy
python -m spacy download en_core_web_lg

el_core_news_sm-3.8.0

30 Sep 10:00
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: f40e544a9feefe8f9715138c14a17361e87e6ab7c20fa813620a376ae2fc0b32
Checksum .whl: 18df59b7f099a20d6f7cc1f964a57408a4c1663b73b5110932c1ea24f66e3027

Details: https://spacy.io/models/el#el_core_news_sm

Greek pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, lemmatizer (trainable_lemmatizer), senter, ner, attribute_ruler.

Feature Description
Name el_core_news_sm
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline tok2vec, morphologizer, parser, lemmatizer, attribute_ruler, ner
Components tok2vec, morphologizer, parser, lemmatizer, senter, attribute_ruler, ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources UD Greek GDT v2.8 (Prokopidis, Prokopis)
Greek NER Corpus (Google Summer of Code 2018) (Giannis Daras)
License CC BY-NC-SA 3.0
Author Explosion
Model size 12 MB

Label Scheme

View label scheme (395 labels for 3 components)
Component Labels
morphologizer Case=Nom|Definite=Def|Gender=Fem|Number=Sing|POS=DET|PronType=Art, Foreign=Yes|POS=X, Aspect=Perf|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Past|VerbForm=Fin|Voice=Pass, POS=ADP, Case=Acc|Definite=Def|Gender=Fem|Number=Sing|POS=DET|PronType=Art, NumType=Card|POS=NUM, POS=NOUN, POS=ADV, POS=PUNCT, Case=Acc|Gender=Neut|Number=Plur|POS=NOUN, Case=Acc|Gender=Neut|Number=Plur|POS=ADP, Case=Gen|Definite=Def|Gender=Fem|Number=Sing|POS=DET|PronType=Art, Case=Gen|Gender=Fem|Number=Sing|POS=NOUN, Case=Gen|Definite=Def|Gender=Fem|Number=Plur|POS=DET|PronType=Art, Case=Gen|Definite=Def|Gender=Neut|Number=Sing|POS=DET|PronType=Art, Case=Acc|Definite=Def|Gender=Neut|Number=Sing|POS=DET|PronType=Art, Case=Gen|Definite=Def|Gender=Masc|Number=Sing|POS=DET|PronType=Art, Case=Gen|Gender=Masc|Number=Sing|POS=NOUN, Aspect=Perf|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Past|VerbForm=Fin|Voice=Act, Case=Nom|Definite=Def|Gender=Masc|Number=Sing|POS=DET|PronType=Art, Case=Acc|Gender=Neut|Number=Sing|POS=ADP, Case=Acc|Gender=Neut|NumType=Ord|Number=Sing|POS=NUM, Case=Acc|Gender=Neut|Number=Sing|POS=NOUN, POS=CCONJ, Case=Nom|Definite=Def|Gender=Masc|Number=Plur|POS=DET|PronType=Art, Case=Nom|Gender=Masc|Number=Plur|POS=ADJ, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Past|VerbForm=Fin|Voice=Act, Case=Acc|Definite=Def|Gender=Masc|Number=Sing|POS=DET|PronType=Art, Case=Acc|Gender=Neut|NumType=Card|Number=Plur|POS=NUM, Case=Acc|Definite=Def|Gender=Masc|Number=Plur|POS=DET|PronType=Art, Case=Nom|Gender=Masc|NumType=Card|Number=Plur|POS=NUM, POS=AUX, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=3|VerbForm=Fin|Voice=Pass, Case=Acc|Gender=Fem|Number=Plur|POS=ADP, Case=Acc|Gender=Masc|Number=Plur|POS=PROPN, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin|Voice=Pass, Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Case=Acc|Gender=Masc|Number=Plur|POS=ADJ, Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, Case=Gen|Gender=Neut|NumType=Card|Number=Plur|POS=NUM, Case=Gen|Gender=Neut|Number=Plur|POS=NOUN, Case=Acc|Gender=Masc|Number=Sing|POS=ADP, Case=Acc|Gender=Masc|Number=Sing|POS=NOUN, Case=Acc|Gender=Neut|Number=Sing|POS=PROPN, Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Case=Nom|Gender=Masc|Number=Sing|POS=PROPN, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Past|VerbForm=Fin|Voice=Act, Case=Gen|Gender=Masc|Number=Sing|POS=PRON|Person=3|PronType=Rel, Case=Acc|Definite=Def|Gender=Fem|Number=Plur|POS=DET|PronType=Art, Case=Acc|Gender=Fem|Number=Plur|POS=NOUN, Case=Gen|Gender=Masc|Number=Sing|POS=PRON|Person=3|Poss=Yes|PronType=Prs, Case=Acc|Gender=Fem|Number=Sing|POS=ADP, Case=Acc|Gender=Fem|Number=Sing|POS=PROPN, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act, Case=Acc|Definite=Def|Gender=Neut|Number=Plur|POS=DET|PronType=Art, Case=Acc|Gender=Neut|Number=Plur|POS=ADJ, Case=Gen|Gender=Neut|Number=Plur|POS=PROPN, Case=Acc|Gender=Neut|Number=Sing|POS=ADJ, Aspect=Imp|POS=VERB|VerbForm=Conv|Voice=Act, Case=Nom|Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Rel, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin|Voice=Pass, Case=Acc|Gender=Masc|Number=Plur|POS=ADP, Case=Gen|Gender=Fem|Number=Sing|POS=ADJ, Case=Gen|Gender=Fem|Number=Sing|POS=PROPN, Case=Acc|Definite=Ind|Gender=Neut|Number=Sing|POS=DET|PronType=Art, Case=Gen|Definite=Def|Gender=Neut|Number=Plur|POS=DET|PronType=Art, Case=Gen|Gender=Neut|Number=Sing|POS=PROPN, Case=Nom|Definite=Def|Gender=Neut|Number=Sing|POS=DET|PronType=Art, Case=Nom|Gender=Neut|Number=Sing|POS=PRON|Person=3|PronType=Rel, Case=Nom|Gender=Neut|Number=Plur|POS=NOUN, Case=Gen|Gender=Masc|Number=Sing|POS=PROPN, Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Case=Nom|Gender=Fem|Number=Plur|POS=ADJ, Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Case=Nom|Gender=Fem|Number=Sing|POS=NOUN, Case=Acc|Gender=Fem|Number=Plur|POS=PRON|Person=3|PronType=Ind, Case=Nom|Gender=Neut|Number=Sing|POS=NOUN, Case=Nom|Gender=Neut|Number=Plur|POS=ADJ, Aspect=Imp|Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Past|VerbForm=Fin|Voice=Pass, Case=Nom|Gender=Fem|Number=Sing|POS=ADJ, Case=Acc|Gender=Fem|Number=Sing|POS=ADJ, Case=Nom|Degree=Cmp|Gender=Masc|Number=Sing|POS=ADJ, Case=Gen|Definite=Def|Gender=Masc|Number=Plur|POS=DET|PronType=Art, Case=Gen|Gender=Masc|Number=Plur|POS=NOUN, Case=Gen|Gender=Fem|Number=Plur|POS=NOUN, Case=Nom|Gender=Neut|NumType=Card|Number=Plur|POS=NUM, Case=Gen|Gender=Masc|Number=Plur|POS=PRON|Person=3|Poss=Yes|PronType=Prs, POS=SCONJ, Case=Nom|Gender=Neut|Number=Plur|POS=PRON|Person=3|PronType=Ind, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Past|VerbForm=Fin|Voice=Pass, Case=Nom|Definite=Def|Gender=Fem|Number=Plur|POS=DET|PronType=Art, Case=Nom|Gender=Masc|Number=Plur|POS=PRON|Person=3|PronType=Rel, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=3|VerbForm=Fin|Voice=Act, Case=Nom|Gender=Masc|Number=Sing|POS=PRON|Person=3|PronType=Rel, Case=Nom|Gender=Fem|Number=Sing|POS=PROPN, Case=Gen|Gender=Masc|Number=Sing|POS=ADJ, Case=Acc|Gender=Fem|NumType=Ord|Number=Sing|POS=NUM, Case=Gen|Gender=Fem|Number=Plur|POS=PRON|Person=3|PronType=Prs, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Past|VerbForm=Fin|Voice=Pass, Aspect=Imp|Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin|Voice=Pass, Case=Acc|Definite=Ind|Gender=Fem|Number=Sing|POS=DET|PronType=Art, Case=Nom|Gender=Neut|Number=Sing|POS=ADJ, Aspect=Imp|Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act, Aspect=Perf|POS=VERB|VerbForm=Inf|Voice=Pass, Case=Acc|Gender=Fem|Number=Plur|POS=ADJ, Case=Nom|Definite=Def|Gender=Neut|Number=Plur|POS=DET|PronType=Art, Case=Nom|Gender=Masc|Number=Sing|POS=ADJ, Case=Gen|Gender=Neut|Number=Plur|POS=ADJ, Aspect=Imp|Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act, Aspect=Perf|POS=VERB|VerbForm=Inf|Voice=Act, Case=Acc|Gender=Fem|Number=Plur|POS=PRON|Person=3|PronType=Rel, Case=Nom|Gender=Masc|Number=Plur|POS=PROPN, Aspect=Perf|Case=Acc|Gender=Fem|Number=Sing|POS=VERB|VerbForm=Part|Voice=Pass, Case=Gen|Gender=Masc|Number=Plur|POS=PROPN, POS=PART, Case=Nom|Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Ind, Case=Nom|Degree=Cmp|Gender=Neut|Number=Sing|POS=ADJ, Case=Acc|Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Dem, Case=Acc|Degree=Cmp|Gender=Fem|Number=Sing|POS=ADJ, Case=Nom|Gender=Neut|Number=Plur|POS=PRON|Person=3|PronType=Dem, Aspect=Imp|Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin|Voice=Pass, Case=Gen|Gender=Fem|NumType=Ord|Number=Sing|POS=NUM, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Past|VerbForm=Fin|Voice=Act, Aspect=Perf|Mood=Ind|Number=Sing|POS=VERB|Person=3|VerbForm=Fin|Voice=Pass, Case=Gen|Gender=Neut|Number=Sing|POS=NOUN, Case=Gen|Gender=Fem|Number=Sing|POS=PRON|Person=3|Poss=Yes|PronType=Prs, Abbr=Yes|POS=NOUN, Case=Acc|Gender=Neut|Number=Plur|POS=PRON|Person=3|PronType=Ind, Case=Nom|Gender=Fem|Number=Plur|POS=PRON|Person=3|PronType=Rel, Aspect=Perf|Mood=Ind|Number=Sing|POS=VERB|Person=3|VerbForm=Fin|Voice=Act, Case=Acc|Gender=Masc|Number=Sing|POS=PROPN, Case=Gen|Gender=Fem|Number=Plur|POS=ADJ, Case=Acc|Gender=Masc|Number=Sing|POS=ADJ, Case=Voc|Gender=Fem|Number=Sing|POS=NOUN, `Aspect=Imp|Mood=Ind|Number...
Read more