Separating signal from noise and ensuring actionable alerts is crucial for both security and SRE.
Applying the concept of error budgets to security helps identify when action needs to be taken and enables effective prioritization.
Deep dives
Understanding the Impact of Alerting Noise
Alerting noise in both security and SRE causes personal, financial, and health problems for teams. It is crucial to separate signal from noise and ensure actionable alerts. Non-actionable alerts are vague, not well-described, or redundant. Keeping track of alerts and their actionable nature helps measure and improve the signal-to-noise ratio.
Using Error Budgets and SLOs to Prioritize
Error budgets, a concept from SRE, can be applied to security. It involves allowing a certain amount of failure and measuring the rate at which it is spent. Measuring the rate of spending the error budget helps identify when action needs to be taken. By separating symptoms from causes, security professionals can prioritize effectively and focus on the most critical issues.
Managing Risk and Automation
When it comes to risk tolerance, security and reliability have different approaches. Reliability focuses on opportunity risk and growth, while security deals with pure risk without any upside. However, finding the right balance is crucial as excessive risk can lead to project or product risk. Automation should be considered carefully, ensuring it does not compromise security goals.
Translating SRE Lessons to Security
Lessons from SRE can be adapted to security, but the presence of an active adversary adds complexity. Prioritization becomes key, focusing on the principle that security aims to protect, such as confidentiality and availability. Detection and response in security require additional steps due to the intelligent and persistent nature of adversaries. Collaboration between security professionals and engineers is crucial to understand the unique architecture and create effective solutions.