Blog

Guías, novedades e ideas sobre inferencia LLM distribuida y autoalojada.

Más reciente EN

From docker sprawl to k3s: rebuilding my home inference fleet

12 de junio de 2026 · @briancaffey

A 'healthy' mesh-generation service sat wedged for three days while my agent.yaml described services that didn't exist. So I moved four GPU boxes — three RTX 4090s and a DGX Spark — onto k3s and taught the inference-club-agent to discover services from the Kubernetes API instead of a config file. Health checks lie; queues don't. Config is fiction; clusters are testimony.

#k3s #kubernetes #homelab #architecture #deep-dive