Investigating and Improving Language Models' Ability to Resist Requests for Misinformation
Language models tend to prioritize helpfulness over logical reasoning, making them vulnerable to generating misinformation when presented with illogical requests. Prompt-based and parameter-based approaches can improve the detection of logic flaws in requests and prevent the dissemination of medical misinformation.