
How AI Is Built #045 RAG As Two Things - Prompt Engineering and Search
13 snips
Mar 6, 2025 In this discussion, John Berryman, an expert who transitioned from aerospace engineering to search and machine learning, explores the dual nature of retrieval-augmented generation (RAG). He emphasizes separating search from prompt engineering for optimal performance. Berryman shares insights on effective prompting strategies using familiar structures, testing human evaluations, and managing token limits. He dives into the differences between chat and completion models and highlights practical techniques for tackling AI applications and workflows. It's a deep dive into enhancing interactions with AI!
AI Snips
Chapters
Books
Transcript
Episode notes
Agents Are A Spectrum, Not A Single Thing
- 'Agent' lacks a fixed definition and sits on a spectrum between assistants and workflows.
- True agency implies independent decision-making and learning, not just tool use.
Decompose Workflows Horizontally
- Break complex tasks horizontally into well-defined steps with clear inputs/outputs.
- Evaluate each step independently and add error-state routing to improve reliability.
Slice Vertically For Specialized Cases
- Slice vertically when tasks contain distinct case types and specialize per type.
- Route inputs to specialized sub-flows or tools for better accuracy on diverse data.


