The Data Exchange with Ben Lorica cover image

Fine-tuning and Preference Alignment in a Single Streamlined Process

The Data Exchange with Ben Lorica

00:00

Intro

Ji-Woo Hong and Noah Lee from Kaist AI discuss their paper on ORPO, explaining how it combines supervised fine-tuning and preference alignment to streamline learning processes for building AI applications in industry.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app