The organization emphasizes reliability and scale, willing to allocate more resources to ensure failover operations are seamless and data movements are efficient. Despite the consistency versus availability trade-offs and the risk of encountering unknown unknowns during database operations, the company invests heavily in creating playbooks for worst-case scenarios. Although there have been instances of successfully recovering from unexpected failures within minutes, the goal is to automate the process further. The focus currently lies on over-allocating resources to ensure six to 12 months of growth post-shard split, prioritizing reliability and scale over cost efficiency. Restart operations are designed to be transparent for application developers, minimizing the impact on their experience.
Sammy Steele is a Senior Staff Engineer at Figma, and the tech lead for their databases team. She previously worked at Dropbox, where she built out their petabyte-scale metadata storage and search systems.
Sammy recently published a blog called “How Figma’s databases team lived to tell the scale”. The blog went viral and made it to the top of Hacker News. We invited Sammy on the podcast to learn more, and she is our guest today.
Check out Figma Config at https://config.figma.com/.
Sean’s been an academic, startup founder, and Googler. He has published works covering a wide range of topics from information visualization to quantum computing. Currently, Sean is Head of Marketing and Developer Relations at Skyflow and host of the podcast Partially Redacted, a podcast about privacy and security engineering. You can connect with Sean on Twitter @seanfalconer .
The post Database Scaling at Figma with Sammy Steele appeared first on Software Engineering Daily.