
Scaling LLMs and Accelerating Adoption with Aidan Gomez at Cohere
Gradient Dissent: Conversations on AI
How to Fine-Tune a Model to Respond to Commands
A model before it's had this sort of specific human feedback to make it respond to commands? Feels a little bit like schizophrenic. It's hard to get it to do what you want. You're trying to give it instructions and it might break in some very unpredictable way. And then what exactly is happening in that fine-tuning phase with the human feedback? What exactly are you doing?"
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.