Questions tagged [calibration]

Calibration can refer to adjustment of measurements to agree with value of some standard; to transform classifier scores into class membership probabilities; etc. Do not use for predicting an explanatory variable from an observation of the dependent variable, for that use the tag inverse-prediction.

In regression, calibration can refer to predicting an explanatory variable from an observation of the dependent variable or to adjustment of measurements to agree with value of some standard.

In machine learning it may refer to transforming classifier scores into class membership probabilities.

It can also refer to performing inference on model parameters.

247 questions

votes

2 answers

Scikit correct way to calibrate classifiers with CalibratedClassifierCV

Scikit has CalibratedClassifierCV, which allows us to calibrate our models on a particular X, y pair. It also states clearly that data for fitting the classifier and for calibrating it must be disjoint. If they must be disjoint, is it legitimate to…

asked Feb 22 '17 at 12:02

sapo_cosmico

votes

3 answers

Visualizing the calibration of predicted probability of a model

Suppose I have a predictive model that produces, for each instance, a probability for each class. Now I recognize that there are many ways to evaluate such a model if I want to use those probabilities for classification (precision, recall, etc.). …

data-visualization classification predictive-models binary-data calibration

asked Mar 29 '12 at 14:52

Michael McGowan

4,561
3
31
46

votes

2 answers

Why use Platt's scaling?

In order to calibrate a confidence level to a probability in supervised learning (say to map the confidence from an SVM or a decision tree using oversampled data) one method is to use Platt's Scaling (e.g., Obtaining Calibrated Probabilities from…

logistic cross-validation calibration

asked Dec 07 '10 at 01:31

B_Miner

7,560
20
81
144

votes

1 answer

Pooling calibration plots after multiple imputation

I would like advice on pooling the calibration plots/statistics after multiple imputation. In the setting of developing statistical models in order to predict a future event (e.g. using data from hospital records to predict post hospital discharge…

data-visualization data-imputation multiple-imputation pooling calibration

asked Mar 07 '17 at 15:30

IWS

2,554
13
30

votes

2 answers

How to choose optimal bin width while calibrating probability models?

Background: There are some great questions/answers here on how to calibrate models which predict probabilities of an outcome happening. For example Brier score, and its decomposition into resolution, uncertainty and reliability. Calibration plots…

probability predictive-models binary-data calibration scoring-rules

asked Nov 03 '15 at 23:26

Alex

3,728
3
25
46

votes

3 answers

How do I choose the best metric to measure my calibration?

I program and do test-driven development. After I made a change in my code I run my tests. Sometimes they succeed and sometimes they fail. Before I run a test I write down a number from 0.01 to 0.99 for my credence that the test will succeed. I…

forecasting decision-theory calibration scoring-rules

asked Dec 27 '16 at 22:36

Christian

votes

5 answers

What is calibration?

What does it mean to calibrate survey weights? Also, what are other definitions of calibration in statistics? I have heard it used in several contexts, particularly risk prediction (referring to whether the total number of predicted events in a…

terminology survey definition calibration survey-weights

asked Jul 27 '15 at 17:25

AdamO

52,330
5
104
209

votes

2 answers

Why is logistic regression well calibrated, and how to ruin its calibration?

In the scikit learn documents on probability calibration they compare logistic regression with other methods and remark that random forest is less well calibrated than logistic regression. Why is logistic regression well calibrated? How could one…

regression logistic calibration

asked Feb 02 '19 at 18:41

sjw

5,091
1
21
45

votes

3 answers

Probability calibration from LightGBM model with class imbalance

I've made a binary classification model using LightGBM. The dataset was fairly imbalnced but I'm happy enough with the output of it but am unsure how to properly calibrate the output probabilities. The baseline score of the model from…

python scikit-learn boosting propensity-scores calibration

asked Oct 17 '18 at 12:29

Auren Ferguson

votes

1 answer

How to estimate a calibration curve with bootstrap (R)

Question: I have fitted a probabilistic model (bayesian network) for modeling a binary outcome variable. I would like to create a high-resolution calibration plot (e.g. spline) corrected for overfitting with bootstrapping. Is there a standard…

bootstrap splines regression-strategies calibration

asked May 17 '18 at 16:13

Gino_JrDataScientist

votes

4 answers

Tuning an exponential moving average to a moving window mean?

The alpha parameter of an exponential moving average defines the smoothing that the average applies to a time series. In a similar way, the window size of a moving window mean also defines the smoothing. Is there some way to tune the alpha parameter…

exponential-smoothing moving-average calibration moving-window

asked Jul 12 '16 at 01:11

naught101

4,973
1
51
85

votes

1 answer

How to make calibration plot for survival data without binning data?

To make a calibration plot for survival probabilities estimated from a Cox model, one can divide the estimated risk into groups, calculate the average risk within a group, and then compare this to the Kaplan-Meier estimate. What are alternative…

survival regression-strategies calibration

asked Apr 08 '16 at 18:43

julieth

2,252
1
15
20

votes

2 answers

Calibration of Cox regression survival analysis

To perform calibration of a Cox regression model (i.e. assessing for the agreement between the predicted and the observed outcome), what is the best method to present the accuracy of the model in predicting the actual event? As far as I…

probability survival cox-model calibration rms

asked Nov 08 '13 at 12:36

user32454

votes

1 answer

Signatures of underfitting and overfitting in logistic regression calibration curves

My confusion stems from reading the following paper http://www.bmj.com/content/351/bmj.h3868 It states in its abstract (and they later show an empirical study that conforms to the claim) - "Overfitted models tend to underestimate the probability of…

regression logistic overfitting calibration

asked Aug 18 '17 at 16:56

Ayush Saxena

votes

0 answers

Calibration for random forests

I want to evaluate the calibration of the random forest using val.prob (rms package, R). I have no problems using it and getting an output, but I feel the results may not be accurate because I don't believe that the class membership probabilities…

probability random-forest calibration

asked Dec 11 '12 at 00:55

user4673

1,461
4
17
26

2 3

…

16 17 Next