The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI with Albert Gu - #693

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

NOTE

Processing Raw Data for Machine Learning

Observations in the community highlight issues caused by tokenizers in language processing models, leading to difficulties like arithmetic and spelling. Tokenizers operating at a higher level than characters may hinder models' ability to reason over individual characters. Advancing towards processing raw data end-to-end is deemed crucial for optimal machine learning outcomes.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner