"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

The Dawn of Dynamic AI: RFT Comes Online, w/ Predibase CEO Dev Rishi, from Inference by Turing Post

250 snips

Jul 16, 2025

Dev Rishi, CEO and co-founder of Predibase, dives into the revolutionary shift from static to continuously learning AI systems. He explains how reinforcement learning can adapt via ongoing user feedback, showcasing its potential in healthcare and finance. Rishi also discusses the challenges of implementing these dynamic models, like reward hacking and maintaining quality. The conversation highlights the possibilities of 'practical specialized intelligence' as a more stable alternative to traditional AGI, and how it can reshape various economic niches.

Ask episode

AI Snips

Chapters

Books

Transcript

Episode notes

INSIGHT

Reinforcement Fine-Tuning Impact

Reinforcement fine-tuning (RFT) enables improving models with small data via reward signals instead of labeled data.
This method will shift from one-off tuning to continuous learning inside production feedback loops.

ANECDOTE

Healthcare Uses Continuous Learning

Some healthcare companies use a feedback pipeline combining expert annotations and model judges to improve AI assistants in production.
This early real-world implementation shows dynamic learning from user and expert feedback is feasible today.

ADVICE

Build Feedback Data Pipelines

Collect prompts and responses automatically from production to build feedback datasets.
Use small amounts of user feedback to fine-tune models continuously with techniques like Direct Preference Optimization (DPO).

Get the Snipd Podcast app to discover more snips from this episode

Get the app

This crossover episode from Inference by Turing Post features CEO Dev Rishi of Predibase discussing the shift from static to continuously learning AI systems that can adapt and improve from ongoing user feedback in production. Rishi provides grounded insights from deploying these dynamic models to real enterprise customers in healthcare and finance, exploring both the massive potential upside and significant safety challenges of reinforcement learning at scale. The conversation examines how "practical specialized intelligence" could reshape the AI landscape by filling economic niches efficiently, potentially offering a more stable alternative to AGI development. This discussion bridges theoretical concepts with real-world deployment experience, offering a practical preview of AI systems that "train once and learn forever."

Turing Post channel: @RealTuringPost Turpin Post website: https://www.turingpost.com

Sponsors:

Google Gemini 2.5 Flash :

Build faster, smarter apps with customizable reasoning controls that let you optimize for speed and cost. Start building at https://aistudio.google.com

Labelbox:

Labelbox pairs automation, expert judgment, and reinforcement learning to deliver high-quality training data for cutting-edge AI. Put its data factory to work for you, visit https://labelbox.com

Oracle Cloud Infrastructure:

Oracle Cloud Infrastructure (OCI) is the next-generation cloud that delivers better performance, faster speeds, and significantly lower costs, including up to 50% less for compute, 70% for storage, and 80% for networking. Run any workload, from infrastructure to AI, in a high-availability environment and try OCI for free with zero commitment at https://oracle.com/cognitive

The AGNTCY:

The AGNTCY is an open-source collective dedicated to building the Internet of Agents, enabling AI agents to communicate and collaborate seamlessly across frameworks. Join a community of engineers focused on high-quality multi-agent software and support the initiative at https://agntcy.org

NetSuite by Oracle:

NetSuite by Oracle is the AI-powered business management suite trusted by over 42,000 businesses, offering a unified platform for accounting, financial management, inventory, and HR. Gain total visibility and control to make quick decisions and automate everyday tasks—download the free ebook, Navigating Global Trade: Three Insights for Leaders, at https://netsuite.com/cognitive

PRODUCED BY:

https://aipodcast.ing

CHAPTERS:

(00:00) Sponsor: Google Gemini 2.5 Flash

(00:31) About the Episode

(03:46) Training Models Continuously

(05:03) Reinforcement Fine-Tuning Revolution

(09:31) Agentic Workflows Challenges (Part 1)

(12:51) Sponsors: Labelbox | Oracle Cloud Infrastructure

(15:28) Agentic Workflows Challenges (Part 2)

(15:41) ChatGPT Pivot Moment

(19:59) Planning AI Future

(24:45) Open Source Gaps (Part 1)

(28:35) Sponsors: The AGNTCY | NetSuite by Oracle

(30:50) Open Source Gaps (Part 2)

(30:54) AGI vs Specialized

(35:26) Happiness and Success

(37:04) Outro