Belangrijkste concepten
This research paper introduces Misinformation with Legal Consequences (MisLC), a new task leveraging large language models to detect misinformation that could potentially violate existing laws.
Statistieken
Only 13.1% (93 samples) of the dataset, which was pre-filtered for checkworthiness, were labeled as potentially having legal violations.
The nominal Krippendorff’s Alpha for the legal annotators is 0.441, indicating a degree of subjectivity in legal task annotations.
GPT-3.5-turbo achieved a 12-point increase in F1 score over random guessing in the binary classification setting.
Llama3-70b, the best-performing open-source model, achieved a 14.4-point increase in F1 score over random guessing in the binary classification setting.
GPT-4o showed the most significant improvement with retrieval, increasing its F1 score by 9 points.
Citaten
"Misinformation, defined as false or inaccurate information, can result in significant societal harm when it is spread with malicious or even innocuous intent."
"Unlike previous work that has focused on factual accuracy or checkworthiness as potential controversy of a topic, we ground our definition in legal literature and social consequence."
"We introduce a new task: Misinformation with Legal Consequence (MisLC), which leverages definitions from a wide range of legal domains covering 4 broader legal topics and 11 fine-grained legal issues, including hate speech, election laws, and privacy regulations."
"After thorough empirical study, we find the existing LLMs perform reasonably well at the task, achieving non-random performance without external resources."
"However, LLMs are still far from matching human expert performance."