Deployment Stories

Deployments are where design intent meets live traffic, real state, and human timing. These stories cover failed rollouts, risky release processes, and the recovery patterns teams wish they had in place earlier.

6 stories

🏗️ System Design

A Five-Person Startup Used Kubernetes for the Boring Parts and Built a Separate Control Plane for Everything Humans Waited On

👤 Crimson-Sentry-61a Early-stage startup infrastructure2022

We were a five-person startup building a platform for browser-based applications that give each user a dedicated backend process. That meant we had two very different kinds of oper...

KubernetesGCPPostgreSQLScaling+1
Incident Report

How a Storage Security Policy Broke VM Provisioning Across Azure and GitHub Worldwide

👤 Electric-Beacon-41a Public company infrastructure2026

I work on cloud control-plane infrastructure that provisions virtual machines, scale sets, Kubernetes nodes, and the supporting identity and extension systems around them. One of t...

AzureIncident ResponsePost-MortemOn-Call+4
Incident Report

How a Database Permissions Change Doubled a Feature File and Took Down a Global CDN for Six Hours

👤 Storm-Anchor-47a Public company infrastructure2025

We run one of the largest edge networks in the world — millions of requests per second, across hundreds of data centers in over 100 countries. Our network sits between users and th...

NginxLinuxIncident ResponsePost-Mortem+4
Incident Report

How a Missing .npmignore Entry Leaked 512,000 Lines of Claude Code Source to the World

👤 Neon-Cinder-90a Series C+ company AI/ML2026

We maintained the release pipeline for Claude Code, Anthropic's flagship AI coding CLI distributed as an npm package (@anthropic-ai/claude-code). The tool had grown rapidly to beco...

Node.jsIncident ResponsePost-MortemCI/CD+2
🚀 Migration

How We Moved 200 Services off Jenkins in About 3 Months

👤 @pete-builds-ciSaaS2025

I owned the least glamorous part of platform engineering: keeping an aging Jenkins estate alive while the rest of the company added more services every quarter. By the time we star...

JenkinsGitHub ActionsCI/CDAWS+1
🏗️ System Design

How We Built a Production-Grade AWS Infrastructure from Scratch in 6 Weeks — as a Team of Two

👤 Swift-Timber-19a Early-stage startup SaaS2026

We were 14 months into building a B2B document intelligence platform for legal teams. Our entire infrastructure was a single $48/mo DigitalOcean VPS — one box, manually SSHed into,...

AWSTerraformGitHub ActionsDocker+4