4min snip

📅 ThursdAI Nov 02 - ChatGPT "All Tools", Bidens AI EO, many OSS SOTA models, text 2 3D, distil-whisper and more AI news 🔥

ThursdAI - The top AI news from the past week

NOTE

Mistral YaRN: Enabling Long Context in Already-Trained Models

Emozilla announces the release of Mistral YaRN, a model built with YaRN, a method that extends already-trained models to handle longer context windows. A model's positional embeddings normally limit it to its trained context length, but the YaRN paper found a way to unlearn this behavior without changing the model's world knowledge. Support for YaRN was also added to llama.cpp, making long context accessible on average compute. The Mistral model is robust and performs on par with or slightly better than Llama 2 13B. Emozilla is also working on training a YaRN Llama 2 70B model.
