I am trying to make binary classification and My dataset is imbalanced with a 1:7 ratio. I have 1000 "1" labels and 6990 "0" labels.
Predicting "1" Labels is more important than "0" but still, It should also detect "0" labels correctly as much as possible.
I used sampling techniques and used different models like XGBClassifier, LightGBM, SVM, KNN and I got different confusion matrixes. In some of them, detecting the "1" label is very good but detecting the "O" is not very good. And others, both "1" and "O" detecting are average.
I know accuracy is not a good metric to evaluate an imbalanced dataset, so I used the recall, f2 score, and AUC score. But still, I confused about which model is best.
According to these results, which model is best?
question from:
https://stackoverflow.com/questions/65908341/how-to-interpret-column-matrix-to-find-best-model-for-imbalanced-dataset 与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…