
Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI with Albert Gu - #693
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Processing Raw Data for Machine Learning
Observations in the community highlight issues caused by tokenizers in language processing models, leading to difficulties like arithmetic and spelling. Tokenizers operating at a higher level than characters may hinder models' ability to reason over individual characters. Advancing towards processing raw data end-to-end is deemed crucial for optimal machine learning outcomes.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.