Networking Stories
Networking incidents are famous for looking like somebody else’s problem at first. These stories cover DNS confusion, routing issues, connection behavior, and the debugging work required when packets become the mystery.
3 stories
How a Database Permissions Change Doubled a Feature File and Took Down a Global CDN for Six Hours
“We run one of the largest edge networks in the world — millions of requests per second, across hundreds of data centers in over 100 countries. Our network sits between users and th...”
The Empty DNS Record That Took Down 70 AWS Services for 14 Hours
“We operate one of the largest cloud infrastructure platforms in the world, running hundreds of interdependent services across dozens of regions. Our DynamoDB service in us-east-1 —...”
Two Silent Consul Bugs That Took Down a Gaming Platform for 73 Hours
“We run a gaming platform with 50 million daily active players, 18,000+ servers, and 170,000 containers. Our entire infrastructure — service discovery, container orchestration, secr...”