Calibration | Learn AI Together | Page 1

heavy leaf I am not ale to understand the concept of calibration of a ML model. Read an art...

This may help a bit more: https://machinelearningmastery.com/probability-calibration-for-imbalanced-classification/

But basically, it's not saying that the model itself is bad. You don't "retrain" a model or anything. What's really being calibrated is not the model itself, but the output of model.

While models output a probability-like output, they don't necessarily mean actual probability, or the "given X, likelihood of Y". Most models are not probabilistic, they don't learn a prior.

Say that based on probability, there should be 5% of true class, in other words, 5% are fraud. We know this by analyzing the past data distribution. It's a classic imbalanced dataset. You train your model, and you likely balanced the dataset so that your model can learn the minority class, this is commonly done in model training.

But now solely based on the output of your model, your model is predicting around 20% of the samples are fraud. It's a combination of threshold selection and such, but basically if you only apply threshold on your model output, it's not aligned with actual probability of fraud.

Therefore, you apply some scaling or transformation on top of model output, the logits, so that the calibrated output/logits actually are more aligned with actual probability, say 4-7%, rather than 20% which is way off.

It's just to ensure that your model is not too off from the actual probability overall

MachineLearningMastery.com

Jason Brownlee

How to Calibrate Probabilities for Imbalanced Classification - Mach...

Many machine learning models are capable of predicting a probability or probability-like scores for class membership. Probabilities provide a required level of granularity for evaluating and comparing models, especially on imbalanced classification problems where tools like ROC Curves are used to interpret predictions and the ROC AUC metric is u...

#Calibration