Skip to content

How to annotate a corpus to train a SpanFinder #13100

Discussion options

You must be logged in to vote

Be sure that the score_weights include the right scores for the span_finder.

In practice, especially if you have longer texts, I've found that it doesn't work that well to train span_finder+spancat together from scratch. The span_finder needs to be trained for long enough to start giving reasonable suggestions before it makes sense to add spancat on top. And spancat can run out of memory if the untrained span_finder starts trying to suggest every single possible span in the text.

What you can do instead is train span_finder on its own until its performance is reasonable and then continue training span_finder+spancat by sourcing the span_finder in the combined config. You don't need to fre…

Replies: 6 comments 7 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@rmitsch
Comment options

Comment options

You must be logged in to vote
5 replies
@adrianeboyd
Comment options

Answer selected by ddenz
@ddenz
Comment options

@adrianeboyd
Comment options

@ddenz
Comment options

@adrianeboyd
Comment options

Comment options

You must be logged in to vote
1 reply
@adrianeboyd
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
training Training and updating models feat / spanfinder Feature: Span Finder
3 participants