My production data is unlabeled.
I can’t calculate accuracy or precision anymore.
Still, I need to know if the model is degrading.
What can I realistically monitor?
Lost your password? Please enter your email address. You will receive a link and will create a new password via email.
Please briefly explain why you feel this question should be reported.
Please briefly explain why you feel this answer should be reported.
Please briefly explain why you feel this user should be reported.
You can detect data drift without labels by monitoring input distributions.
Track statistical properties of each feature and compare them to training baselines. Significant changes in distributions, category frequencies, or missing rates are often early indicators of performance degradation.
Use metrics like population stability index (PSI), KL divergence, or simple threshold-based alerts for numerical features. For categorical features, monitor new or disappearing categories.
This won’t tell you exact accuracy, but it provides a strong signal that retraining or investigation is needed.The key takeaway is that unlabeled drift detection is still actionable and essential in production ML