Character hallucination, where language models deviate from predefined character roles and generate inconsistent responses, is a persistent issue in role-playing systems. This phenomenon can be systematically analyzed and induced through the RoleBreak framework, which identifies query sparsity and role-query conflict as the key drivers of hallucination.


coremsg

systematic-analysis-and-induction-of-character-hallucination-in-role-playing-language-models


Systematic Analysis and Induction of Character Hallucination in Role-Playing Language Models