
Training superhuman coding models at Cursor
Cursor
00:00
Fast Offline-Online Reward Loops
Panel recommends shortening training-to-deployment loops and retraining reward models frequently with real feedback.
Play episode from 29:53
Transcript

Panel recommends shortening training-to-deployment loops and retraining reward models frequently with real feedback.