LessWrong (Curated & Popular) cover image

“Anomalous Tokens in DeepSeek-V3 and r1” by henry

LessWrong (Curated & Popular)

00:00

Exploring the Anomalies of DeepSeek's Behavior

This chapter examines the unique tendency of DeepSeek towards endless repetition, proposing it as an intrinsic trait. It invites listeners to engage with its behavior, highlighting user interactions and overlooked factors like Chinese tokens, to enhance understanding in AI exploration.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app