

Nous Hermes 3 and exploiting underspecified evaluations
Aug 16, 2024
The discussion opens with the launch of a new model and asks what defines a 'frontier model.' Comparisons with Llama 3.1 bring out the importance of transparent evaluation metrics. The conversation then covers lessons learned from the training process of Hermes 3 and closes with the broader implications for technology policy, emphasizing the need for integrity in AI evaluations.