Latent Space: The AI Engineer Podcast cover image

ICLR 2024 — Best Papers & Talks (Benchmarks, Reasoning & Agents) — ft. Graham Neubig, Aman Sanger, Moritz Hardt)

Latent Space: The AI Engineer Podcast

NOTE

Importance of Architecture in Large Language Models

The similarity in architectures among large language models is notable, with most models utilizing a common architecture such as llama architecture. Although architecture is crucial, the focus has shifted towards optimizing existing architectures as evidenced by the superior scalability of newer models like the Wama transformer compared to the original transformer.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner