Get the app
Federico Barbero
Lead author of "Transformers Need Glasses!" at DeepMind/Oxford, researching architectural bottlenecks in transformers and information fidelity.
Best podcasts with Federico Barbero
Ranked by the Snipd community
72 snips
Mar 8, 2025
• 1h 1min
Transformers Need Glasses! - Federico Barbero
chevron_right
Federico Barbero, a lead author at DeepMind/Oxford, dives into the quirks of transformers and why large language models falter at tasks like counting. He reveals fascinating architectural bottlenecks that affect their performance. By drawing parallels with graph neural networks, he sheds light on the softmax function's role in limiting decision-making clarity. But not all hope is lost! Federico shares innovative 'glasses' to enhance transformer performance, including input tweaks and structural modifications to boost their clarity and efficiency.
The AI-powered Podcast Player
Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
Get the app