My team is developing a credit scoring model for a situation in which...
- The positive class accounts for 10% of the training data
- FNs (predicting no default for actual default) costs us ~\$10-15K
- FPs (predicting default for no actual default) carries an opportunity cost of ~$2.5K.
Historically, we have used ROC AUC to evaluate our models, but due to the class imbalance, we are exploring other options. Currently, we are using ROC AUC with class weights.
- Does anyone have advice on the best eval metric for our situation? Several that we have considered are ROC AUC, PR AUC, and F1; however, we are open to other options and want to make the determination objectively (e.g., not just the one that conveniently happens to favor our individual models). We have also considered cost-sensitive classification, but are hesitant about its potential to hardcode our costs.
- Someone on the team has suggested tuning the class weights. Is there a situation in which that would make sense?