This workshop aims to outline the practice of AI red teaming, drawing on historical insights to understand its trajectory and structure. The organizers prioritize understanding the humans involved in AI red teaming and how their roles influence the development of AI systems. The workshop will focus on three key themes:
Conceptualization of Red Teaming: Participants will discuss the complexities of red teaming in depth and consider its impact within broader frameworks of Responsible AI.
Labor of Red Teaming: Researchers will investigate the stakeholders involved in red teaming practices and examine the labor arrangements and power dynamics that shape AI systems.
Well-being of and Harms Against Red Teamers: The workshop will identify strategies and interventions to mitigate potential harms from exposure to harmful content during red teaming activities, with the goal of fostering a culture of well-being within the AI red teaming community.
The workshop will include a red teaming exercise, panel discussions, and collaborative artifact development activities to synthesize key insights and establish an AI red teaming research network. The organizers aim to publish a post-workshop report to inform practitioners and researchers in this emerging field.
Key insights extracted from
Alice Qian Z..., arxiv.org, 09-12-2024
https://arxiv.org/pdf/2407.07786.pdf