
Reinforcement Learning in the Era of LLMs
Deep Papers
00:00
Exploring Data Embeddings and Prompt Optimization in Reasoning Systems
Exploring the mathematical measurability of data embeddings through various distance metrics and the process of prompt engineering in reasoning systems. Discussing challenges in prompt comparison and optimization through reinforcement learning.
Transcript
Play full episode