I'm running optimization on an imbalanced dataset and need to define my optimization metric. I'm working on disease detection so maximizing AUC might not be the best solution, as the certainty of the prediction is important.
I've defined my objective score as AUC-log_loss
, and try to maximize this.
Can someone explain to me if this makes sense and please cite some sources where I can read about optimization using AUC vs log_loss in binary classification?
Thanks,
Edit: I've also read this topic already, and although it is very informative, I'm still not clear as it makes sense to optimize for both statistics.