Latent Space: The AI Engineer Podcast

swyx + Alessio
394 snips
Oct 11, 2024 • 1h 57min

Production AI Engineering starts with Evals — with Ankur Goyal of Braintrust

Ankur Goyal, founder of Impira and former head of Figma AI, discusses the evolving landscape of AI engineering. He highlights the challenges in LLM Ops, emphasizing the importance of evaluations in systematic AI development. Ankur reflects on his journey navigating career pressures as a first-generation Indian student and shares insights from his experiences in tech startups. He dives into advancements in data extraction, the integration of software engineering principles with AI, and the significance of community feedback for product improvements.
16 snips
Oct 3, 2024 • 2h 9min

Building AGI in Real Time (OpenAI Dev Day 2024)

Sam Altman, CEO of OpenAI, leads a team of experts including Kevin Weil and Michelle Pokrass, who unveil groundbreaking advancements in AI at Dev Day. They discuss the game-changing **Realtime API**, highlighting its capabilities for voice interaction and function calling. Alistair Pullen shares insights on developing AI models for software engineering. The conversation also dives into the challenges of AI fine-tuning and ethical considerations in AI communications, envisioning a future where seamless human-AI interaction transforms productivity.
111 snips
Sep 27, 2024 • 1h 30min

Language Agents: From Reasoning to Acting

Harrison Chase, founder of LangChain, LangSmith, and LangGraph, teams up with Shunyu Yao, an AI researcher known for his work on ReAct. They discuss the evolution of AI language agents, highlighting advancements in reasoning and acting. Shunyu shares insights on the ReAct framework and its impact on decision-making in AI. They delve into the challenges of benchmarking and interactive problem-solving in coding agents, while also exploring the future of AI model scaling and user experience in customer service applications.
263 snips
Sep 20, 2024 • 1h 9min

The Ultimate Guide to Prompting

Sander Schulhoff, author of The Prompt Report and creator of LearnPrompting.org, dives deep into the intricacies of prompt engineering and AI safety. He discusses the extensive research landscape with over 1,600 papers on prompting, clarifying popular methodologies like chain-of-thought and zero-shot prompting. Schulhoff also tackles ethical dilemmas surrounding AI-generated academic work and introduces innovative AI tools to streamline research methodologies. Further, he covers advancements in multimodal prompting, illustrating the complexities and potential of AI interactions.
247 snips
Sep 13, 2024 • 2h 4min

From API to AGI: Structured Outputs, OpenAI API platform and O1 Q&A — with Michelle Pokrass & OpenAI Devrel + Strawberry team

Michelle Pokrass leads the API Platform at OpenAI and has an impressive background in building scalable platforms at leading tech companies. The discussion delves into the significance of structured outputs in AI, emphasizing their reliability for developers. The conversation covers everything from the latest advancements in OpenAI's capabilities, including the O1 model, to navigating database challenges and enhancing user experience through innovations in OpenAI's APIs. It also touches on the complexities of cognitive biases in decision-making related to AI.
8 snips
Sep 3, 2024 • 1h 5min

Efficiency is Coming: 3000x Faster, Cheaper, Better AI Inference from Hardware Improvements, Quantization, and Synthetic Data Distillation

Nyla Worker, a Senior PM at Nvidia with a background in optimizing AI models at Google and eBay, shares insights on dramatic advancements in AI efficiency and inference. The discussion highlights a staggering reduction in costs and time for training models, with examples like the Cerebras platform achieving unheard-of speeds. They delve into optimizing large language models and the revolutionary potential of 3D conversational AI technology. Worker also touches on the future of digital personas and their applications in various sectors, including healthcare.
81 snips
Aug 29, 2024 • 1h 10min

Why you should write your own LLM benchmarks — with Nicholas Carlini, Google DeepMind

Nicholas Carlini, a research scientist at DeepMind specializing in AI security, discusses the power of personalized LLM benchmarks. He encourages focusing on individual use of AI tools, emphasizing that AI shines in automating mundane tasks. Carlini shares insights from his viral blog, detailing creative applications of AI in coding and problem-solving. He also navigates the dualities of LLMs, the importance of critical evaluation, and the ongoing need for robust, domain-specific benchmarks to truly gauge AI performance.
55 snips
Aug 22, 2024 • 1h 5min

Is finetuning GPT4o worth it? — with Alistair Pullen, Cosine (Genie)

Alistair Pullen, Co-founder and CEO of Cosine, discusses the groundbreaking advancements of Cosine Genie, the top coding agent that utilizes fine-tuned GPT-4o technology. He shares insights on the innovative training techniques that enable the model to learn from real software engineers, enhancing coding efficiency. The conversation also delves into the challenges of fine-tuning models, the importance of synthetic data, and future innovations in AI tooling, revealing the transformative potential of advanced language models in software development.
117 snips
Aug 16, 2024 • 59min

AI Magic: Shipping 1000s of successful products with no managers and a team of 12 — Jeremy Howard of Answer.ai

Jeremy Howard, Founder of Answer.ai and a prominent figure in deep learning and fast.ai, joins the conversation to share innovative insights. He discusses revolutionary AI model training techniques that allow anyone to use minimal resources to achieve maximum output. Howard emphasizes collaboration within diverse teams, steering clear of traditional hierarchies, to foster creativity. They also explore the FastHTML framework, showcasing how it simplifies web development. The podcast dives into the ethics surrounding AI governance and the promise of dialogue engineering in transforming coding environments.
15 snips
Aug 7, 2024 • 1h 4min

Segment Anything 2: Demo-first Model Development

Joseph Nelson, a computer vision expert at Roboflow, and Nikhila Ravi, Research Engineering Manager at Facebook AI, share their insights on the groundbreaking Segment Anything Model 2 (SAM2). They discuss its remarkable efficiency in video segmentation, achieving better accuracy with significantly fewer interactions. The conversation highlights the model's revolutionary role in real-time object tracking and its open-source commitment. They also touch on the importance of user-friendly demonstrations and community involvement in evolving AI technologies.
