Maohao Shen

PhD student at MIT. His research focuses on making AI systems more intelligent and reliable, with a focus on uncertainty quantification and developing AI systems that can think more like humans.

Best podcasts with Maohao Shen

Ranked by the Snipd community

149 snips

Apr 8, 2025 • 52min

Teaching LLMs to Self-Reflect with Reinforcement Learning with Maohao Shen - #726

Maohao Shen, a PhD student at MIT specializing in AI reliability, discusses his groundbreaking work on 'Satori.' He reveals how it enhances language model reasoning through reinforcement learning, enabling self-reflection and exploration. The podcast dives into the innovative Chain-of-Action-Thought approach, which guides models in complex reasoning tasks. Maohao also explains the two-stage training process, including format tuning and self-corrective techniques. The conversation highlights Satori’s impressive performance and its potential to redefine AI reasoning capabilities.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app