Evaluating the Uncertainty Estimation Capabilities of Large Language and Vision-Language Models
Large language models (LLMs) and vision-language models (VLMs) estimate their own uncertainty poorly and are often overconfident in their outputs across a range of natural language processing and image recognition tasks.
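To make the notion of overconfidence concrete, the following is a minimal sketch of how it could be quantified from a model's stated confidences and the correctness of its answers. The text above does not specify a metric, so the two quantities used here (the gap between mean confidence and accuracy, and a binned expected calibration error) are assumptions chosen for illustration, as are the function name `calibration_summary` and the `n_bins` parameter.

```python
from typing import Sequence, Tuple


def calibration_summary(
    confidences: Sequence[float],
    correct: Sequence[int],
    n_bins: int = 10,
) -> Tuple[float, float]:
    """Return (overconfidence gap, expected calibration error).

    confidences: model-reported confidence per answer, each in [0, 1].
    correct:     1 if the corresponding answer was right, else 0.
    Both names are hypothetical; they are not taken from the source text.
    """
    if len(confidences) != len(correct) or len(confidences) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    n = len(confidences)
    mean_confidence = sum(confidences) / n
    accuracy = sum(correct) / n

    # Per-bin running sums of confidence and correctness.
    conf_sum = [0.0] * n_bins
    acc_sum = [0.0] * n_bins
    count = [0] * n_bins
    for c, y in zip(confidences, correct):
        # Equal-width bins over [0, 1]; confidence 1.0 goes in the last bin.
        idx = min(int(c * n_bins), n_bins - 1)
        conf_sum[idx] += c
        acc_sum[idx] += y
        count[idx] += 1

    # ECE = sum over bins of (bin size / n) * |bin accuracy - bin confidence|,
    # which simplifies to |sum of correct - sum of confidence| / n per bin.
    ece = sum(
        abs(acc_sum[i] - conf_sum[i]) for i in range(n_bins) if count[i] > 0
    ) / n

    # A positive gap means the model is, on average, overconfident.
    return mean_confidence - accuracy, ece


if __name__ == "__main__":
    # Illustrative toy inputs, not results from any experiment.
    gap, ece = calibration_summary([0.9, 0.9, 0.9, 0.9], [1, 0, 0, 1])
    print(f"overconfidence gap: {gap:.2f}, ECE: {ece:.2f}")
```

Under these assumptions, a model that reports 90% confidence on every answer but is right only half the time shows a positive gap, which is the pattern the term overconfidence refers to.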