Choosing the best model #479

cjacky475 · 2023-02-28T10:16:46Z


Hello,

I am trying to detect anomalous temperature points. I have several manually labeled datasets. My task is to find the model that gives the highest recall, since missing an anomalous temperature point is critical. Would the following procedure be fair?

1. Train various models on each dataset.
2. Run predict() on the test set and compare the results with the true labels.
3. Select the model that has the highest recall.
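The three steps above can be sketched roughly as follows. This is only an illustration under stated assumptions: `ZScoreDetector` is a hypothetical stand-in for whatever detectors are actually being compared, the temperature values are made up, and `predict()` is assumed to return 1 for an anomaly and 0 otherwise. Precision is reported next to recall because a model that flags every point reaches a recall of 1.0 without being useful.

```python
from statistics import mean, pstdev


class ZScoreDetector:
    """Hypothetical stand-in for a real detector with fit() and predict()."""

    def __init__(self, cutoff):
        self.cutoff = cutoff

    def fit(self, values):
        self.mu = mean(values)
        self.sigma = pstdev(values) or 1.0  # fall back to 1.0 for constant data
        return self

    def decision_function(self, values):
        # Assumed convention: a higher score means "more anomalous".
        return [abs(v - self.mu) / self.sigma for v in values]

    def predict(self, values):
        # Assumed convention: 1 = anomaly, 0 = normal.
        return [int(s > self.cutoff) for s in self.decision_function(values)]


def recall_and_precision(y_true, y_pred):
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    recall = tp / (tp + fn) if tp + fn else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    return recall, precision


# Made-up temperature readings; labels mark the known anomalies in the test set.
train = [20.1, 20.4, 19.8, 20.0, 20.3, 19.9, 20.2, 20.1]
test = [20.0, 20.2, 35.0, 19.9, 21.5, 20.1]
labels = [0, 0, 1, 0, 1, 0]

candidates = {
    "loose": ZScoreDetector(0.5),
    "medium": ZScoreDetector(3.0),
    "strict": ZScoreDetector(10.0),
}

results = {}
for name, model in candidates.items():
    y_pred = model.fit(train).predict(test)
    results[name] = recall_and_precision(labels, y_pred)

# Rank by recall first, then use precision to break ties.
best = max(results, key=lambda name: results[name])
```

On this toy data, "loose" and "medium" both catch every labeled anomaly, but "loose" does so by flagging almost everything, which is why the tie is broken on precision.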

The problem I see is that if I used decision_function() instead and, let's say, used a threshold of 0.3 to mark points as anomalous, a different model might come out best than the one I would pick based on predict(). Also, there are thousands of datasets, and I would not know the correct threshold for each one.
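One possible way around the fixed-threshold problem is to compare models on the ranking of their decision_function() scores rather than on a single cutoff. The sketch below computes recall when only the top fraction of points by score is flagged in each dataset, then averages over datasets. It rests on assumptions: the scores and labels are made up, `recall_at_budget` and `mean_recall` are hypothetical helper names, and a higher score is assumed to mean "more anomalous" (negate the scores first if the library in use follows the opposite convention).

```python
import math


def recall_at_budget(scores, labels, budget):
    """Recall when the top `budget` fraction of points by score is flagged."""
    n_flag = max(1, math.ceil(budget * len(scores)))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    flagged = set(ranked[:n_flag])
    positives = [i for i, y in enumerate(labels) if y == 1]
    if not positives:
        return None  # recall is undefined without labeled anomalies
    return sum(1 for i in positives if i in flagged) / len(positives)


def mean_recall(per_dataset, budget):
    """Average recall over the datasets that contain labeled anomalies."""
    values = [recall_at_budget(s, y, budget) for s, y in per_dataset]
    values = [v for v in values if v is not None]
    return sum(values) / len(values)


# Made-up scores from two hypothetical models on two labeled datasets.
# The score scales differ on purpose, so one fixed cutoff would not be comparable.
labels_a = [0, 0, 1, 0, 0, 1, 0, 0, 0, 0]
labels_b = [0, 1, 0, 0, 0, 0, 0, 0, 0, 0]

model_x = [
    ([0.1, 0.2, 0.9, 0.1, 0.3, 0.8, 0.2, 0.1, 0.2, 0.1], labels_a),
    ([0.2, 0.7, 0.1, 0.3, 0.1, 0.2, 0.1, 0.2, 0.1, 0.1], labels_b),
]
model_y = [
    ([12.0, 15.0, 40.0, 11.0, 45.0, 13.0, 14.0, 12.0, 50.0, 11.0], labels_a),
    ([10.0, 30.0, 12.0, 35.0, 11.0, 10.0, 12.0, 11.0, 10.0, 12.0], labels_b),
]

budget = 0.2  # flag the top 20% of points in every dataset
summary = {
    "model_x": mean_recall(model_x, budget),
    "model_y": mean_recall(model_y, budget),
}
```

Because the cutoff is derived from each dataset's own score ranking, no per-dataset threshold has to be known in advance. The budget itself is still a choice: it reflects how many alerts can be reviewed, and a larger budget trades more false alarms for higher recall.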

So what would be the best way to select the model?
