Role of Human Feedback in Rewards

David explains how rewards can be automated or human-derived and when human feedback is used with PPO, DPO, and GRPO.

Play episode from 24:51

chevron_right

Transcript

chevron_right

Transcript

Episode notes

David Corbitt is the cofounder and Chief Product Officer of OpenPipe, a Y Combinator-backed AI startup launched in 2023 by brothers Kyle and David Corbitt. OpenPipe helps developers convert expensive, slow GPT-3.5/4 prompts into customized fine-tuned language models that deliver similar or better performance at a fraction of the cost and latency.

Before starting OpenPipe, David worked at Palantir and Qualtrics and also cofounded GenerationalStory, a video legacy startup focused on preserving family histories. With a background in both technical roles and entrepreneurship, he now helps engineers easily fine-tune and deploy AI models without requiring deep machine learning expertise.

The Collective Intelligence Community Podcast by the AI Collective brings the brightest minds in AI to the table to discuss the field’s most pressing topics! Each interview provides a unique perspective on this industry, providing sharp insights and informed commentaries shared to our growing community of 70,000+ founders, funders, and thought leaders.

PODCAST CREATED BY THE AI COLLECTIVE:

Website: https://www.aicollective.com/

LinkedIn: https://www.linkedin.com/company/aicollective/

Twitter: https://x.com/_ai_collective

Slack: https://join.slack.com/t/genai-collective/shared_invite/zt-1ya6vi6ti-h8nzjNxISgPfEtya89F2Kw

HOST:

Magnus Kubo-Allen

LinkedIn: https://www.linkedin.com/in/magnuskuboallen/

Twitter: https://www.linkedin.com/in/magnuskuboallen/

GUEST:

David Corbitt

LinkedIn: https://www.linkedin.com/in/davidcorbitt/

Twitter: https://x.com/dvdcrbt

MUSIC:

"jiglr - Odyssey" is under a Creative Commons (BY-SA 3.0) license:

https://creativecommons.org/licenses/by-sa/3.0/

https://soundcloud.com/jiglrmusic

Music powered by BreakingCopyright: • 🤖 Technology & Synthwave (Free Music) - "ODYSSEY" by Jiglr

“The Labyrinth” by DaniHaDani

https://artlist.io/royalty-free-music/artist/danihadani/2295

The Labyrinth

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app

Home Top podcasts Popular guests Top books