29 - Science of Deep Learning with Vikrant Varma

AXRP - the AI X-risk Research Podcast

Introduction

This chapter explores the complexities of training AI systems so that their outputs align with human preferences, and the risks of reaching superhuman performance without a clear understanding of the systems' inner workings. It also introduces the concept of eliciting latent knowledge as a route to safer use of AI.
