- Problem addressed: The paper aims to explain how and why people attack large language models (LLMs), and the crucial role the community plays in this activity.
- Key idea: The paper presents a grounded theory of how and why people attack large language models, that is, of LLM red teaming in the wild.
- Other highlights: The paper uses a formal qualitative methodology, interviewing dozens of practitioners from a broad range of backgrounds. It connects the motivations, goals, strategies, and techniques of LLM red teaming practitioners. The abstract does not mention experiments or datasets, but the paper highlights the importance of the community in this activity. It frames attacking LLMs as a novel activity and a significant development in the field of AI.
- Related work: Not mentioned in the abstract.
Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild