I am trying to find a metrics to evaluate performance of one regression based ML method. Currently we are observing difference in RMSE is very less to conclude anything. So I thought of one AUC based method 1. I will divide my test observed and predicted values into 2 labels based on different threshold(let say 100 different different thresholds). 2. I will compute TPR/FPR at each of above threshold values and take average of these TPR/FPR.
I want to know if there is any statistical mistake in this metrics