
Big Data, Reinforcement Learning and Aligning Models
The AI Buzz from Lightning AI
00:00
How do you fine-tune models for conversation?
Luca describes supervised fine-tuning with question–answer pairs to condition models to respond rather than just continue text.
Transcript
Play full episode