

Luca Soldaini
Engineer and research lead at AI2 specializing in pre-training, data curation, and building open model pipelines such as the Olmo family.
Top 3 podcasts with Luca Soldaini
Ranked by the Snipd community

32 snips
Nov 20, 2025 • 1h 28min
Can America Win the Open Source AI Race? — Olmo 3 with Ai2’s Nathan Lambert & Luca Soldaini
Nathan Lambert and Luca Soldaini from AI2 dive into the groundbreaking OLMo 3 release, showcasing their approach to open-source AI with full transparency. They discuss the significance of releasing comprehensive model data and the distinction between base, instruct, and thinking models. The conversation highlights the impact of Meta's retreat from the open-source space, leading to the rise of Chinese models. Nathan and Luca also explore the challenges of reasoning in AI, emphasizing the need for U.S. innovation and broader engagement in shaping AI's future.

28 snips
Dec 23, 2024 • 42min
2024 in Open Models [LS Live @ NeurIPS]
Luca Soldaini, a research scientist at the Allen Institute for AI, and Sophia Yang, head of Developer Relations at Mistral AI, dive into the explosive rise of open models in 2024. They discuss breakthrough models like Llama 3 and the MOE model, highlighting the competitive dynamics in AI. Key challenges such as regulatory hurdles and limited training data access are explored. The conversation also emphasizes the need for collaboration and open-source methodologies to foster innovation in a rapidly evolving landscape.

11 snips
Jan 22, 2025 • 1h 13min
Interviewing OLMo 2 leads: Open secrets of training language models
Luca Soldaini, the Data lead for the OLMo project at AI2, joins the discussion to unveil the intricacies of training language models. He shares tales of overcoming challenges in pretraining efficiency and the quest for stability, especially after a significant 70B model attempt. The conversation dives into the strategic decisions behind building effective language modeling teams, the intricate balance of deep versus wide network architectures, and the importance of community-driven advancements in AI.


