
How to Build and Optimize AI Research Agents
The Data Exchange with Ben Lorica
00:00
Optimizing Prompts Without Full RL
Ben asks about a reinforcement learning paper; Jakub introduces GEPA (Genetic-Pareto optimization) as a simpler alternative to RL for evolving prompts.