LessWrong (Curated & Popular) cover image

“Anomalous Tokens in DeepSeek-V3 and r1” by henry

LessWrong (Curated & Popular)

00:00

Exploring Non-English Tokens in Deep Learning Models

This chapter explores the intricacies of analyzing deep learning tokens, particularly non-English glitch tokens in Cebueno and other regional Filipino languages. It highlights the challenges of translating these tokens and the unpredictable outputs they can generate within the system.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app