Noah Lee
Co-author of ORPO: Monolithic Preference Optimization without Reference Model.
Best podcasts with Noah Lee
Ranked by the Snipd community
Jun 13, 2024 • 36 min
Fine-tuning and Preference Alignment in a Single Streamlined Process
Jiwoo Hong and Noah Lee from KAIST AI discuss their method, ORPO, which combines supervised fine-tuning and preference alignment in a single step. They highlight the advantages of their approach, such as minimal data requirements, bias prevention, and enhanced adaptability of language models. ORPO has received positive feedback from the research community and industry for efficient alignment and for scaling models with smaller datasets.