From Nitro to Junction: Testing in Production at Scale
During my time at AWS, I learned that even the most rigorous pre-production testing has limits. I share how we built Nitro's reliability by treating production as part of the test loop—with proper saf...
At AWS, I led the team that built EC2's Nitro virtualization stack— C code deployed to 500K servers. Biggest lesson? Pre-production testing has limits. You must embrace safe production testing.
My new blog explains how Nitro did it, and how Junction brings this to any Kubernetes team.
20.05.2025 15:44
👍 8
🔁 3
💬 1
📌 0
DNS and the December 2024 OpenAI Outage
In December 2024, OpenAI faced a significant outage lasting approximately four hours. This incident highlighted a critical challenge in container orchestration: maintaining reliable service discovery ...
In my new blog post for Junction Labs, I explore service discovery by delving into the December OpenAI outage. I analyze root cause and discuss a principle we grasped during the development of EC2's Networking: static stability. Check out the full post here: www.junctionlabs.io/blog/dns-and...
28.01.2025 16:48
👍 5
🔁 2
💬 0
📌 0