Machine Learning Street Talk (MLST) cover image

Want to Understand Neural Networks? Think Elastic Origami! - Prof. Randall Balestriero

Machine Learning Street Talk (MLST)

CHAPTER

Navigating Neural Network Complexities

This chapter examines the challenges surrounding steerability, alignment, and interpretability in advanced neural networks, particularly focusing on the evolution of 'concept scrubbing' and its limitations. It further investigates Reinforcement Learning from Human Feedback (RLHF) and the implications of jailbreaking large language models, underscoring the critical need for improved methodologies in managing high-dimensional spaces.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner