“Why White-Box Redteaming Makes Me Feel Weird” by Zygi Straznickas

LessWrong (Curated & Popular)

CHAPTER

Ethical Dilemmas in White Box Red Teaming

This chapter examines the ethical challenges of white-box red teaming, particularly when the AI models under test might experience distress. It asks whether intentionally inducing harm for research purposes is morally acceptable, and reflects on the responsibilities AI creators have toward their systems.


Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner