
Big Data, Reinforcement Learning and Aligning Models
The AI Buzz from Lightning AI
00:00
How do you fine-tune models for conversation?
Luca describes supervised fine-tuning with question–answer pairs to condition models to respond rather than just continue text.
Play episode from 11:39
Transcript


