
Fine-tuning and Preference Alignment in a Single Streamlined Process
The Data Exchange with Ben Lorica
Ji-Woo Hong and Noah Lee of KAIST AI discuss their paper on ORPO (Odds Ratio Preference Optimization), explaining how it combines supervised fine-tuning and preference alignment into a single training stage, streamlining the process of building AI applications in industry.