This CSI driver enables mounting cloud storage providers supported by rclone as persistent volumes in Kubernetes clusters
➤ https://ku.bz/czJ7TMvBX
This CSI driver enables mounting cloud storage providers supported by rclone as persistent volumes in Kubernetes clusters
➤ https://ku.bz/czJ7TMvBX
https://miro.medium.com/v2/0*XIeJGF7iQrF91hUn.png
This case study describes rebuilding a fragile Kubernetes infrastructure into a production-grade platform for GPU-based ML workloads, improving deployment frequency from weekly to 10+ times daily
➜ https://ku.bz/lFNBx-8yT
"A lot of organizations don't have that choice. You basically have to take what your security team is telling you."
Mauro Morales on picking an OS for Kubernetes
📺: https://ku.bz/8cpgjFfjn
Argo Workflows is fire-and-forget. Once the job's done, pods terminate—no streaming
That's why Sriharsha Yayi and Derek Wang built NumaFlow
📺: https://ku.bz/dbFK-9Yxw
📣: https://ku.bz/PQ-hn3ZCm
This code lets containerd pull and run images directly from the Nix store instead of traditional OCI layers by giving containerd native awareness of Nix packages and a CRI image service for Kubernetes
➤ https://ku.bz/77WMMFTdq
https://github.com/skyhook-io/radar/raw/main/docs/screenshot.png
Radar provides Kubernetes cluster visibility through topology graphs, event timelines, and service traffic visualization running as a single binary that connects directly to the Kubernetes API without cluster-side installation
➜ https://ku.bz/MhXx4kyjB
"We are very excited to announce the launch of Hadron Linux — a Linux distribution engineered from scratch by the Kairos team."
Ettore Di Giacinto on the new minimal OS for edge Kubernetes
📺: https://ku.bz/wMhKpZ5bQ
📣: https://ku.bz/_9RmXnjDJ
https://miro.medium.com/v2/1*No0EJIUEz3dmwqN-tm607w.png
This case study walks through how the team moved Argo CD from an in-cluster Redis to AWS ElastiCache Serverless, which almost wiped out their Redis costs and removed all the maintenance work
➤ https://ku.bz/TmbtK2v0K
https://miro.medium.com/v2/1*WAqMiw6h23D2Snn5ykGyLg.png
This tutorial teaches how to build a home Kubernetes cluster using three Raspberry Pi 4 devices for hands-on learning and experimentation
➜ https://ku.bz/M2ctfBBpC
Korrel8r is a rule-based correlation engine that automatically discovers and graphs relationships between Kubernetes cluster resources and observability signals across Prometheus, Loki, Alertmanager, and the Kubernetes API
➤ https://ku.bz/tdFklzjTN
H8s is a home infrastructure project combining Kubernetes with Talos OS security, running on 2 N100 mini PCs with GitOps deployment via ArgoCD
➤ https://ku.bz/2VMpLQ5Tp
pwru is an eBPF-based tool for tracing network packets in the Linux kernel with advanced filtering capabilities
It allows fine-grained introspection of kernel state to facilitate debugging network connectivity issues
➜ https://ku.bz/Q3X1ngZGC
https://assets.learnk8s.io/linkedin-173.png
This week on the Learn Kubernetes Weekly:
🔥 Egress with Squid Proxy
💪 OS Migration Reduced Infra 30%
⚡ Auto & Load-based Scaling
🎯 Smart Scheduling: Cost Optimization
🤖 Claude Code Pilots K8s
⭐️ Hadron
Read it now: https://kube.today/issues/173
https://kube.careers/image-gen/digest?companies=Anthropic&companies=Parloa&companies=Anduril+Industries¤cy=USD&salaryFrom=171400&salaryTo=87242145&subtitle=This+week&title=6+Kubernetes+jobs+that+offer+relocation
This article shows how to migrate a Factorio game server from a basic VPS to Kubernetes with Longhorn storage, addressing lag issues and improving reliability
➤ https://ku.bz/GN8XQThYw
This article teaches how to make Kubernetes workloads survive AWS Spot Instance interruptions using NTH or Karpenter, PDBs, stateless design, and AWS FIS game days
➜ https://ku.bz/RZ4TYSl95
https://res.cloudinary.com/learnk8s/image/upload/v1772544380/gpu-sharing-problems-2026/slide-1.png
You want to share GPUs: one team runs inference, another trains models, and both need the same expensive cards.
The problem is that GPUs don't behave like CPU and RAM under contention.
(I will cover this on Thursday: ku.bz/multitenant26 )
🧵
"We assumed that implicit budget happens in Karpenter. It's not the case."
Adhi Sutandi on Karpenter's default 10% disruption budget catching his team off guard
📺: https://ku.bz/XyVfsSQPr
kinc is an open-source tool that lets you create a single-node, rootless Kubernetes cluster inside a container (using Podman + CRI-O + kubeadm)
➤ https://ku.bz/WNZh_5B-m
TNS CSI is a Kubernetes Container Storage Interface driver for TrueNAS Scale 25.10 and later, supporting both NFS for file storage and NVMe-oF for high-performance block storage
➤ https://ku.bz/8p4NdY5H7
This tutorial teaches how to build a minimal multi-node Kubernetes cluster on NixOS without K3s, K0s, or NixOS Kubernetes modules, using vanilla Kubernetes components managed by systemd
➤ https://ku.bz/2y7tYKrhd
https://github.com/clowdhaus/cookiecluster/raw/main/.github/demo.svg
cookiecluster is a CLI tool written in Rust that generates EKS cluster definitions in Terraform through an interactive configuration process
➜ https://ku.bz/HWhpFR_91
https://res.cloudinary.com/learnk8s/image/upload/v1772004192/kubex-book-2026/slide-1.png
Gulcan and I wrote a free book on right-sizing GPUs in Kubernetes.
Here's the short version (thread)
https://res.cloudinary.com/learnk8s/image/upload/v1772003454/kubex-book-2026/gpu-right-sizing.png
How much of your GPU spend is actually producing work?
We published a free book on right-sizing GPUs in Kubernetes, covering metrics to architecture decisions across 4 chapters.
ku.bz/KL4jRvsL4
This tutorial teaches how to transform observability data into reliability improvements through:
- Service Level Objectives,
- error budgets,
- runbooks
- and blameless post-mortems for Kubernetes environments
➤ https://ku.bz/8C3k8vR62
Kured is a Kubernetes daemonset that automates OS-level node reboots by monitoring reboot signals and coordinates safe rolling reboots using API server locks, draining, and optional Prometheus alert integration
➤ https://ku.bz/VcHmCqrXk
This article shows how SAP AI Core deploys production ML and generative AI models at scale using KServe on Kubernetes
➤ https://ku.bz/YhVdmcyyq
🗣️ Andrew Hillier, Co-founder CTO @ Densify, discusses why organizations are moving away from CPU limits in Kubernetes, arguing that premature throttling provides no benefit when nodes rarely reach capacity
Watch: https://ku.bz/-0wmZX03V
This article shares a backend engineer's journey deploying DeepSeek Coder for an on-premise security code review tool at a startup
➤ https://ku.bz/RRRBl42LQ
nix-csi mounts Nix store paths into Kubernetes pods using CSI ephemeral volumes that share pod lifetimes
It supports pulling pre-built store paths, evaluating flake references, or building Nix expressions directly within pod specifications
➤ https://ku.bz/jTXzVvVYB