The author presents the Prejudice-Caprice Framework, which comprehensively measures discrimination in large language models (LLMs) by accounting for both persistent prejudice and variation in preferences across diverse contexts.
LLMs exhibit biases that the Prejudice-Caprice Framework can quantify, which gives insight into their discrimination risks.
The Prejudice-Caprice Framework (PCF) comprehensively measures discrimination in LLMs by considering both persistent prejudice and variation in preferences across diverse contexts.