Machine Learning Street Talk (MLST) cover image

Transformers Need Glasses! - Federico Barbero

Machine Learning Street Talk (MLST)

00:00

Exploring Graph Neural Networks and Attention Mechanisms

This chapter explores the intricate relationship between graph neural networks (GNNs) and spectral graph theory concepts, particularly focusing on commute time and its implications for information flow in neural architectures. It discusses the complexities of attention mechanisms in models with large token contexts, highlighting the influence of token length and architecture on model performance and stability. Additionally, the chapter emphasizes the significance of entropy in tokenization and the challenges faced by self-attention transformers regarding numerical precision and sequence representation.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app