Machine Learning Street Talk (MLST) cover image

Want to Understand Neural Networks? Think Elastic Origami! - Prof. Randall Balestriero

Machine Learning Street Talk (MLST)

00:00

Navigating Neural Network Complexities

This chapter examines the challenges surrounding steerability, alignment, and interpretability in advanced neural networks, particularly focusing on the evolution of 'concept scrubbing' and its limitations. It further investigates Reinforcement Learning from Human Feedback (RLHF) and the implications of jailbreaking large language models, underscoring the critical need for improved methodologies in managing high-dimensional spaces.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app