Introducing SUDO, a framework for evaluating AI systems without ground-truth annotations, to improve reliability and assess algorithmic bias in clinical settings.