
From Spark to Eventual: Reinventing Data for the AI Era (Chat with Sammy from Eventual)
The Infra Pod
00:00
Handling stragglers, failures, and quarantine
Sammy discusses data-level fault tolerance: tracing failures, quarantining bad assets, and improving GPU utilization and reliability.
Play episode from 13:30
Transcript


