
915: How to Jailbreak LLMs (and How to Prevent It), with Michelle Yi
Super Data Science: ML & AI Podcast with Jon Krohn
00:00
Navigating Trustworthy AI
This chapter examines the challenges of ensuring trustworthy AI systems amidst their growing prevalence. It explores the concept of world models as a solution to improve AI reliability and the complexities of updating Large Language Models (LLMs) while addressing potential vulnerabilities. The discussion also highlights the ethical implications of generative AI, manipulation of narratives, and the quest for recognition in the field of data science.
Transcript
Play full episode