ALOHa leverages large language models to reliably detect and localize object hallucinations in image captions, outperforming prior methods.