I know SHAP (or shapley) values are the contribution of each input variable to the model prediction. Adding the base values to the sum of all SHAP values gives you the model prediction for any data point. I dont really understand what the base-value here actually imply.
Imagine a binary classification model predicts absence (0) or presence (1) of a crop, and I train the model with equal data points of absence and presence, then the base-prediction is 0.5. And suppose the model inputs are various environmental variables. If, at a pixel, the SHAP values of all environmental variables sum to zero, the model prediction is 0.5, or there is 50% chance that a crop is present. My question is what contributes to that 50% chance if all variables eventually contributed zero?