
#063 - Prof. YOSHUA BENGIO - GFlowNets, Consciousness & Causality
Machine Learning Street Talk (MLST)
00:00
Exploration vs. Exploitation in G-FlowNets
This chapter explores the trade-off between exploration and exploitation within G-FlowNets, particularly in multi-armed bandit scenarios. It discusses the application of bandit algorithms and the upper confidence bound (UCB) objective to refine exploration strategies for drug discovery. The speakers emphasize the role of G-FlowNets in enhancing decision-making under uncertainty, enabling diverse sampling paths, and improving gameplay in reinforcement learning contexts like chess.
Transcript
Play full episode