Why Digital Work is the Perfect Training Ground for AI Agents

10 snips

Sep 18, 2025

Andrew Rabinovich, CTO and Head of AI at Upwork, dives into the synergy between digital work marketplaces and AI agents. He reveals how Upwork's unique Reinforcement Learning from Experience (RLEF) methods optimize agent training. The discussion highlights the importance of user evaluation in shaping AI performance and explores innovative multimodal feedback systems. Rabinovich also envisions a future where human workers and AI coalesce seamlessly, providing expedited services and new opportunities for freelancers.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

INSIGHT

UMA Orchestrates Digital Work End-to-End

Upwork built UMA, a meta-agent that facilitates client–freelancer interactions end-to-end on the platform.
UMA helps clients express needs, finds matching freelancers, and orchestrates contracts and work.

INSIGHT

Mixture Of Fine-Tuned Skills With RAG

Upwork layers fine-tuning: a base personality for UMA and many narrow skill models for specific tasks.
They use RAG to bring dynamic platform data into UMA's skills that need up-to-date context.

ADVICE

Route For Latency; Use Narrow Models

Prioritize latency-sensitive routing and reserve heavy reasoning only for tasks that need it.
Train narrow, task-specific models instead of relying on general reasoning models for real-time interactions.

Get the Snipd Podcast app to discover more snips from this episode

Get the app