So on wikipedia here is the definition for area under the curve of the receiver operating characteristic.
Can someone give a simple example of calculating this given some predictive results in a binary classification problem, with the labels? I'm having a hard time imaging what this looks like for something like TPR = 0.6 and FPR = 0.2. When I think about it as area under that point seems infinitely small.