晚点聊 LateTalk

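103: Tracing the History of Large-Model Optimization Through Attention, with a Detailed Look at the Latest Attention Mechanism Improvements from DeepSeek and Kimi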

Feb 26, 2025