Trouble with labels.json Format for Dependency Parser Training in spaCy v3 #13065

The labels files aren't intended to be generated by hand, and each pipeline component has its own internal format. You can generate the labels files by using spacy init labels. If you have a very large or streamed/infinite corpus, you can generate the labels based on a representative subset of your training data instead of the whole training corpus.
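As a rough illustration of why the file isn't something to write by hand, here is a minimal sketch of how a parser ends up with its labels when it is initialized from examples. The one-sentence "corpus" and the `min_action_freq=1` setting are assumptions made only so the toy example keeps its labels (to my understanding the default threshold would fold such rare labels into a backoff label); real data would come from your training corpus.

```python
import spacy
from spacy.training import Example

nlp = spacy.blank("en")
# Assumption for this toy example: keep labels that occur only once.
parser = nlp.add_pipe("parser", config={"min_action_freq": 1})

# Hypothetical one-sentence "corpus"; heads are token indices, deps are labels.
doc = nlp.make_doc("She eats pizza")
example = Example.from_dict(
    doc,
    {"heads": [1, 1, 1], "deps": ["nsubj", "ROOT", "dobj"]},
)

# Initialization inspects the examples and derives the parser's labels.
nlp.initialize(lambda: [example])

# Plain dependency labels seen by the parser.
print(parser.labels)

# Component-internal label data; its structure is specific to the parser
# and is not meant to be authored manually.
label_data = parser.label_data
print(type(label_data))
```

The point of the sketch is only that the labels are derived from annotated examples during initialization, so letting spaCy produce them is safer than guessing the component's internal structure.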

But be aware that in most cases, the labels file isn't required. spacy train can determine the labels from the provided training corpus in the initialization step. If you have a large training corpus and you're repeatedly training on the same data, the main benefit is that you save some time at the beginning of spacy train by providing the lab…

Answer selected by adrianeboyd
Labels
training Training and updating models