Maohao Shen
PhD student at MIT. His research aims to make AI systems more intelligent and reliable, with a focus on uncertainty quantification and on developing AI systems that think more like humans.
Best podcasts with Maohao Shen
Ranked by the Snipd community
149 snips • Apr 8, 2025 • 52 min
Teaching LLMs to Self-Reflect with Reinforcement Learning with Maohao Shen - #726
Maohao Shen, a PhD student at MIT specializing in AI reliability, discusses his groundbreaking work on 'Satori.' He reveals how it enhances language model reasoning through reinforcement learning, enabling self-reflection and exploration. The podcast dives into the innovative Chain-of-Action-Thought approach, which guides models in complex reasoning tasks. Maohao also explains the two-stage training process, including format tuning and self-corrective techniques. The conversation highlights Satori’s impressive performance and its potential to redefine AI reasoning capabilities.