I'm using scikit package with RandomForestClassifier, trying to predict binary or multi-lable classifications.
I'm looking for a way to estimate the reliability of the model but really can't figure whether to use the Brier score or a Log Loss scorer.
I understand that both can estimate the reliability of the probabilities that the model outputs.
Can anyway clarify what are the pros and cons of each method, and why/when should I choose one over another?