核心概念
Generative AI agents can uncover noteworthy insights from datasets that may inspire further investigative data reporting.
摘要
This paper introduces a system that uses three specialized generative AI agents - an analyst, a reporter, and an editor - to collaboratively generate and refine tips from datasets for investigative data reporting.
The key steps in the pipeline are:
- Question Generation: The reporter agent generates a set of questions that can be addressed using the provided dataset.
- Analytical Planning: For each question, the analyst drafts an analytical plan detailing how the dataset can be used to answer the question. The editor provides feedback to bulletproof the plan.
- Execution and Interpretation: The analyst executes the analytical plan and summarizes the insights. The reporter assesses the newsworthiness of the findings, and the editor provides additional feedback to ensure journalistic integrity.
- Compilation and Presentation: The most significant insights are compiled into a tip sheet for the user.
The authors validate this agent-based system using real-world investigative data reporting stories and compare it to a baseline model without agents. The results show that the agent-based system generally generates more newsworthy and accurate insights, although some variability was observed across different stories.
The findings highlight the potential of generative AI to provide leads for investigative data reporting by uncovering noteworthy patterns and anomalies in datasets that may inspire further journalistic exploration.
统计
Los Angeles experiences significant racial and ethnic disparities in homelessness acuity scores, with white individuals having the highest mean scores.
The average length and frequency of homelessness episodes increased between 2016 and 2021.
引用
"Now, large language models (LLMs) hold the potential not only to identify more complex newsworthy patterns in datasets but also to generate news angles with greater flexibility and creativity, overcoming the limitations of standard templates."
"Similar to recent work on generative literary translation and text interpretation, we developed AI agents with specialized roles designed to perform distinct subtasks and offer mutual feedback."