Latent Space: The AI Engineer Podcast — Practitioners talking LLMs, CodeGen, Agents, Multimodality, AI UX, GPU Infra and all things Software 3.0 cover image

Latent Space: The AI Engineer Podcast — Practitioners talking LLMs, CodeGen, Agents, Multimodality, AI UX, GPU Infra and all things Software 3.0

Latest episodes

undefined
Jun 25, 2024 • 1h 22min

State of the Art: Training >70B LLMs on 10,000 H100 clusters

Return guests Kanjun and Jonathan discuss Imbue and Databricks' 70B LLM, outperforming GPT-4o. Topics include infrastructure needs, cost-aware hyperparameter optimizer CARBs, and challenges in training large models. They delve into MFU monitoring, bug tracing, and optimizing infrastructure for deep learning. The podcast explores evaluation metrics and AI model practical applications, emphasizing the importance of creating useful tools for customers.
undefined
Jun 25, 2024 • 50min

[High Agency] AI Engineer World's Fair Preview

Shawn Wang discusses the AI Engineer role, team composition, trends in AI research, and advice for product creators with a focus on AI Engineer World Fair.
undefined
Jun 21, 2024 • 1h 4min

How To Hire AI Engineers — with James Brady & Adam Wiggins of Elicit

Guests from Elicit share insights on hiring AI engineers, blending conventional skills with ML knowledge. Topics include fault tolerance in coding, model shadowing, sourcing engineers, and challenges in AI engineering.
undefined
Jun 11, 2024 • 55min

How AI is eating Finance — with Mike Conover of Brightwave

Guest Mike Conover from Brightwave discusses using large language models in finance, challenges in long context windows, and the evolution of AI research. Topics include implications of polarizing data, value of employee-led datasets, complexities of data temporality in finance, AI in financial services, AI hedge funds, and the evolution of large language models.
undefined
Jun 10, 2024 • 4h 29min

ICLR 2024 — Best Papers & Talks (Benchmarks, Reasoning & Agents) — ft. Graham Neubig, Aman Sanger, Moritz Hardt)

Expert guests Graham Neubig and Aman Sanger discuss AI topics like Code Edits, Sandboxes, Academia vs Industry. They delve into Benchmarks like SWEBench, Dataset Contamination Detection, and GAIA Benchmark. The conversation also touches on Reasoning - Self-RAG, Let's Verify Step By Step, and developments in multi-agent systems with MetaGPT.
undefined
May 30, 2024 • 58min

How to train a Million Context LLM — with Mark Huang of Gradient.ai

Mark Huang of Gradient.ai discusses training a 1 million context LLM, highlighting challenges like memory scaling and the need for techniques such as curriculum learning. The podcast explores strategies for training large language models, evaluating ring attention implementations in PyTorch, and enhancing value in AI technology through early fusion models and wise resource investment.
undefined
May 27, 2024 • 3h 38min

ICLR 2024 — Best Papers & Talks (ImageGen, Vision, Transformers, State Space Models) ft. Durk Kingma, Christian Szegedy, Ilya Sutskever

Christian Szegedy, Ilya Sutskever, and Durk Kingma discuss the most notable topics from ICLR 2024, including expansion of deep learning models, latent variable models, generative models, unsupervised learning, adversarial machine learning, attention maps in vision transformers, efficient model training strategies, and optimization in large GPU clusters.
undefined
May 16, 2024 • 54min

Emulating Humans with NSFW Chatbots - with Jesse Silver

Jesse Silver discusses the rise of NSFW AI chatbots in adult entertainment, exploring challenges and potential in automating interactions. The conversation covers maximizing creator profit, emulating human interactions realistically, and managing creator income streams. It also delves into technical aspects like context, long memory, and personality profiling in generating interactions aligned with creators' brands.
undefined
Apr 27, 2024 • 54min

WebSim, WorldSim, and The Summer of Simulative AI — with Joscha Bach of Liquid AI, Karan Malhotra of Nous Research, Rob Haisfield of WebSim.ai

Guests Joscha Bach, Karan Malhotra, and Rob Haisfield discuss the evolution of generative AI, GANs, GPT-2, and simulative AI. They explore the potential of AI for game experiences, chat interactions, and world simulations, highlighting advancements in text generation and creative applications of AI technology.
undefined
Apr 19, 2024 • 52min

High Agency Pydantic > VC Backed Frameworks — with Jason Liu of Instructor

Jason Liu, Co-founder of Instructor, discusses the evolution of recommendation frameworks with GPT-3 embeddings at Stitch Fix, the importance of function calling for structured outputs, optimizing workflows for model fine-tuning, the role of prompts in AI models, and exploring hiring practices and innovation in AI engineering.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode