3min snip

Latent Space: The AI Engineer Podcast — Practitioners talking LLMs, CodeGen, Agents, Multimodality, AI UX, GPU Infra and all things Software 3.0 cover image

No Moat: Closed AI gets its Open Source wakeup call — ft. Simon Willison

Latent Space: The AI Engineer Podcast — Practitioners talking LLMs, CodeGen, Agents, Multimodality, AI UX, GPU Infra and all things Software 3.0

NOTE

LoRA Retraining Language Models and the Value of Open Source Models

Laura stands for low rank adaptation, where instead of retraining the whole model, you can freeze a part of it and train the smaller part. This approach is being used in various areas of research, including computer science and language models. The advantage of having a base model like llama is that subsequent Laura work can be done around the same rank, which is compatible with the architecture. However, one challenge with Laura is that if the base model is retrained, the loras built on top of it may become invalid. Open source models that have been extensively iterated upon can outperform newer models. This demonstrates the value of open source models and the switching costs involved in adopting new architectures.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode