Latent Space: The AI Engineer Podcast cover image

Cursor.so: The AI-first Code Editor — with Aman Sanger of Anysphere

Latent Space: The AI Engineer Podcast

00:00

Speculating on the Size of Copilot's Dataset

The speaker speculates that OpenAI's Codex, which powers Github Copilot, is trained on trillions of tokens. They believe this due to the large amount of public code on GitHub and the need to track the data set to parameter ratio. They also mention the possibility of Copilot being a mixture of experts.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner
Get the app