
Ep. #7, The March 2023 Datadog Outage with Laura de Vesine
Heavybit Podcasts
00:00
The Underlying Root Cause of the Problem in Ubuntu
In version 2204 of Ubuntu, there was a change made to system D network D where when it is restarted, it deletes IP tables rules that it doesn't know about. We use Silium to manage our Kubernetes network connectivity. And when Silium installs itself on aKubernetes node, it rewrites the IP tables in order to allow for routing to pods. An automated security update was made by Ubuntu that in no way was a problem in and of itself. But because the security update was to system D, it restarted system D. That caused the IP tables rules to be deleted that Silium had put in.
Transcript
Play full episode