
#134 - Text-to-Speech, Gartner Hype Cycle, AI2 OLMo, AlphaStar Unplugged, China Regulations, AI Porn Marketplace
Last Week in AI
00:00
Advancements in Dataset Releases and Language Model Evaluation
This chapter introduces the new 115 billion token dataset, Obelix, created from various publicly available resources. It also explores ArthurBench, a benchmark tool suite for evaluating large language models, underlining the significance of open-source solutions in model integration and evaluation.
Transcript
Play full episode