Systematic Analysis and Induction of Character Hallucination in Role-Playing Language Models
Character hallucination, in which a language model deviates from its predefined character role and generates responses inconsistent with that role, is a persistent issue in role-playing systems. The RoleBreak framework provides a way to analyze and induce this phenomenon systematically, identifying query sparsity and role-query conflict as the two key drivers of hallucination.