#147 Yilun Du: AI Debates, Reinforcement Learning, & The Power of Generative Models

Oct 22, 2023

Topics discussed include using AI agents debating to enhance language models, applications of generative models in creating intelligent agents, reinforcement learning in GPT models, using a debate strategy to improve language models, barriers to open source AI, importance of physically intelligent AI agents, exploring multi-agent debate for language models, shift from academia to industry, and AI in enterprises and environmentally friendly cloud computing.

Ask episode

Chapters

Transcript

Episode notes

Introduction

00:00 • 4min

Generative Models, Reinforcement Learning, and Language Models

03:39 • 4min

Reinforcement Learning in GPT Models

07:55 • 20min

Applying Debate Strategy for Language Model Improvement

27:44 • 16min

Barriers to Open Source AI and Importance of Physically Intelligent AI Agents

44:06 • 5min

Exploring the Potential of Multi-Agent Debate for Language Models

49:18 • 2min

Shift from Academia to Industry: Research Freedom in the Age of Language Models

50:53 • 2min

AI in Enterprises and Environmentally Friendly Cloud Computing

52:30 • 3min

This episode is sponsored by Crusoe. Crusoe Cloud is a scalable, clean, high-performance cloud, optimized for AI and HPC workloads, and powered by wasted, stranded or clean energy. Crusoe offers virtualized compute and storage solutions for a range of applications - including generative AI, computational biology, and rendering.

Visit https://crusoecloud.com/ to see what climate-aligned computing can do for your business

This episode is sponsored by Celonis ,the global leader in process mining. AI has landed and enterprises are adapting. To give customers slick experiences and teams the technology to deliver. The road is long, but you're closer than you think. Your business processes run through systems. Creating data at every step. Celonis recontrusts this data to generate Process Intelligence. A common business language. So AI knows how your business flows. Across every department, every system and every process.

Go to https:/celonis.com/eyeonai/ to find out more.

Welcome to episode 147 of the Eye on AI podcast. In this episode, host Craig Smith sits down with Yilna Du, a final year PhD student at MIT EECS with a background in research at leading institutions like OpenAI, FAIR, and Google Deepmind.

Yilun's extensive expertise spans generative models, decision making, robot learning, and embodied agents, making him a valuable voice in the AI domain.

Our conversation kicks off with a brief on Yilun's academic journey, leading into a deep dive into Reinforcement Learning with AI feedback (RLHF) - its history, inception, and challenges. We then touch upon the effectiveness of RLHF, the intriguing concept of multi-agent debate, and the PAPES procedure.

Craig and Yilun further explore the vast realm of AI, debating the gaps between open-source and proprietary models, the need for more compute resources, and the future of robotics interlaced with AI. Yilun provides a glimpse into his vision of decentralized AI systems, contrasting the industry's commercial trajectory with academia.

Craig Smith Twitter: https://twitter.com/craigss

Eye on A.I. Twitter: https://twitter.com/EyeOn_AI

(00:00) Preview, Celonis and Crusoe Ad

(04:06) Yilun's Academic Background

(05:52) Origin and Applications of RLHF

(12:16) ROHF and the Multi-Agent Debate Method

(17:32) AI Model Interaction without Human Intervention?

(20:41) Applicability and Inconsistency Detection

(28:43) The Future of AI Training

(45:26) Robotics and Decentralized AI Systems