"Why does our service keep crashing even after we reduce the load?"
metastable failure—your system is stuck in a vicious cycle. slow responses → timeouts → retries → more load → even slower. It can't escape on its own. #DistributedSystems (1/4)
Hi i’m Hannah, senior software engineer looking for my next role 🎯
8+ years in go. high-throughput distributed systems, microservices, kubernetes, cloud. billions of requests/hr. oauth/oidc, grpc, platform.
Please share!
#golang #hiring #distributedSystems #platformengineering
#Uber redesigned its #MySQL fleet with MySQL Group Replication:
✅ Cluster failover: minutes → seconds
✅ Leader election & failure detection inside DB
✅ Better availability & consistency across thousands of clusters.
🔗Learn more: bit.ly/4cGIzRb
#DistributedSystems #Clusters #Databases #InfoQ
Distributed systems are like Minesweeper.
Reliable message delivery is all about managing failure and trade-offs.
m.youtube.com/watch?v=dKaZ...
#DistributedSystems
At scale, database uptime becomes a consensus problem.
www.infoq.com/news/2026/03...
#DistributedSystems #MySQL
Multi-availability-zone database architectures improve availability but introduce tradeoffs in performance, consistency, and operations.
Read the full blog: https://monkeylink.co/4a3185
#CloudArchitecture #DistributedSystems
Is your Kafka lag dashboard lying? Learn how log compaction and retention silently manipulate reported timestamps and how to ensure your monitoring reflects actual system performance.
softwaremill.com/compaction-a...
#ApacheKafka #DataEngineering #DistributedSystems #EventStreaming #Observability
Not just @oreilly.bsky.social authors but @itrevolution.com too plus plenty of superb practitioners! We'll be covering a wide range of topics including #distributedsystems, #AI, #Ops, #Management, #Leadership, #organisations, #planetarycomputing, #security, #resilience, #sustainability, #ethics...
Ceph Public Network Migration (No Downtime) Ceph Public Network Migration (Proxmox) 172.16.0.0/16 → 10.50.0.0/24 No service downtime, no data loss 📌 Context This procedure documents a live Cep...
#distributedsystems #linux #networking #tutorial
Origin | Interest | Match
If your node is idle this weekend, put it to work. We need the data.
📖 Read why we do this: holo.host/blog/into-th...
#WindTunnel #DistributedSystems #EdgeComputing #Holochain
“Schemaless” doesn’t mean structureless. It means your data model can evolve without risky migrations or coordination overhead.
This blog explains:
• What schemaless actually means
• How it works
• When it’s the right choice
https://monkeylink.co/9acf51
#Databases #NoSQL #DistributedSystems
Learn how to scale load testing with Apache JMeter, Scribe, and TimescaleDB. Simulate millions of requests, centralize logs, and analyze performance in real tim #distributedsystems
AP systems prioritize availability during network partitions, serving stale data rather than going offline—and resolve conflicts after recovery. #distributedsystems
Genuinely love log statements, or harbour some bizarre obsession with them!?
NO? Then you may enjoy reading about the Log Quad: a semantic framework to evaluate your logging, and fine-tune its bill.
www.bahmanm.com/2026/03/log-...
#BrainWork_BrainToil #SoftwareEngineering #SRE #DistributedSystems
✍️ New blog post by Renaldi
Step Functions Distributed Map Best Practices for Large-Scale Batch Workloads
#architecture #aws #distributedsystems #serverless
Resilience. Keep Distributed Systems Alive
binarycoders.wordpress.com/2026/03/01/r...
#DistributedSystems #ResilienceEngineering #SystemDesign
Kubernetes Federation Taming the Multiverse: Your Deep Dive into Kubernetes Federation So, you've mastered Kubernetes. You're orchestrating your microservices like a maestro, your deploymen...
#cloud #devops #distributedsystems #kubernetes
Origin | Interest | Match
Designing Trading Systems Series by Code Maverick is the featured bundle of ebooks 📚 on Leanpub!
#SoftwareArchitecture #DistributedSystems #EventDrivenArchitecture #SystemIntegration #Microservices #MessageDriven #EngineeringManagement #ProductManagement #ProjectManagement #SystemsAnalysis
Line of Defense: Three Systems, Not One Three Systems, Not One "Rate limiting" gets used as a catch-all for anything that rejects or slows down requests. But there are actually three distin...
#systemdesign #distributedsystems #ratelimiter
Origin | Interest | Match
Amazon Web Services (AWS) EC2 placement groups use cluster, partition, and spread strategies to position instances for performance, redundancy, and isolation in distributed environments.
Read the full breakdown by Brien Posey: https://ow.ly/Wcoi50YmTnN
#AWS #EC2 #DistributedSystems
Apache Kafka 4.2.0 is here. The headliner? Queues for Kafka (KIP-932) is officially GA. But there is much more!
See below:
#ApacheKafka #Kafka42 #EventStreaming #DataEngineering #DistributedSystems
How do you trust a chaotic mobile P2P network? Enter "Nodes of Truth." #Virtual, duplicated #consensus entities spanning multiple devices. If a user drops offline, the virtual node survives, securing the network's state and reputation. 🛡️🕸️ #DistributedSystems #P2P
New on Foojay: Elie Hannouch writes about how MongoDB uses the Raft consensus algorithm to maintain data consistency across replica sets.
foojay.io/today/mongod...
#MongoDB #Raft #DistributedSystems #Java
5/x by adding a voluntary computational component to distributed clients (think SETI) the federated distributed network could rival the computational capability of the centralized oligoprocessopoly (data centers)
#science #computing #distributedsystems #distributedcomputing #FediFlops
Operational Lessons from Large-Scale Valkey Deployments
📈 “Scale exposes all truths.”
That’s the reality for teams running Valkey at large scale—where tail latency, payload shape, and traffic patterns start to matter more than peak numbers.
Read more from Allen Helton and Mike Callahan:
valkey.io/blog/operati...
#Valkey #DistributedSystems #SRE
🎉 We are delighted to confirm two keynote speakers for Euro-Par 2026:
• Luca Benini
• Michela Taufer
📌 Submission deadlines (AoE):
- Abstract: 27 February 2026
- Paper: 6 March 2026
More info: 2026.euro-par.org
#EuroPar2026 #HPC #ParallelComputing #DistributedSystems
許明輝氏の『分散セキュリティ』は、独立したセキュリティ特性から相乗的な信頼への画期的な転換をもたらします。複雑な分散システム向けに、堅牢で統合されたセキュリティを実現します!
#Cybersecurity #TrustModels #DistributedSystems #InformaQ
#Cybersecurity #TrustModels #DistributedSystems
Legacy security fails across clouds. Embrace Zero Trust for distributed SQL. 🔒 Check the 12-week roadmap and more: https://ow.ly/V4E150Ykn9r
#ZeroTrust #DatabaseSecurity #DistributedSystems
A deep dive into Amazon Dynamo’s architecture—CAP trade-offs, vector clocks, consistent hashing, and building highly available systems. #distributedsystems
A network partition is a communication break between nodes in a distributed system. Partition tolerance is necessary for any distributed deployment because network failures are unavoidable in practice.
#NetworkPartition #DistributedSystems #SystemDesign