-
Notifications
You must be signed in to change notification settings - Fork 539
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
too long too train new model use mitie #166
Comments
As far as I know, it will depend on the size of your datasets. Also, check the memory usage (maybe you are out of memory and the system starts to use swap) |
You should also make sure that your labels are consistent. Datasets that
are harder to label take longer to train. So if, for example, you have a
huge number of labeling mistakes training will take a long time.
|
@davisking @grafael could you please elaborate or give a little more examples |
It just means you are labeling your data with incorrect labels. Like maybe
sometimes you label references to the city of Boston as "city" and other
times as "place". Or maybe other times you don't label it at all.
|
Thanks @davisking for your quick reply. this time i made sure that my annotated training data have consistent labels. but still i don't see any progress in performance. from log what i found, In mitie there are two trainings going on..
here is some log .. Part II: train segment classifier test on train: overall accuracy: 0.987509 total time took in hour : 5 not sure what's going on !! Thanks in Advance. |
Sometimes it takes a while. Be patient.
What's happening is MITIE is repeatedly training a classifier and doing
hyper parameter selection to find the best one. So MITIE training is
always going to take longer than other systems since it does a whole lot of
internal validation and retraining so that you never have to fiddle with
any parameters.
|
Thanks @davisking . could you please help me to understand why large value of "C" takes more time compare to small value of "C" ? where Accuracy and F1/F score are mostly same for different values of "C". from my understanding "C" just a regularisation parameter which helps to reduce/avoid mis-classification. so it doesn't have any effect on Accuracy and F/F1 score. if my understanding is correct , can i use small value of "C" !!. if so, then what is the max minimum value of "C" i can use ? (i mean what is the minimum threshold value of "C" i can use ?) and specially in this problem ? for above problem please find log.. =============================================== C=300 test on train: overall accuracy: 0.987509 overall accuracy: 0.98662 overall accuracy: 0.985335 overall accuracy: 0.987629 ============================================== C=500 overall accuracy: 0.985272 ============================================== from above log : why best C is coming nearer value of given "C" value ? no matter what C value i choose. you can see above log. my point is what is minimum best C or any threshold value of C which can be used for starting point ? what is "num features" and why is it always 271 ? correct me whether my interpretation is wrong !! --> "number of samples" is ( sum of number of labels in each sentence ). e.g : 2 sentence each has 3 label then number of samples is 6. right !! Thanks in Advance. @grafael @lopuhin @baali @davisking @autopost-get @autopost- |
how can i improve this situation
give me some suggestion ,please
The text was updated successfully, but these errors were encountered: