博客

关于分布式、自托管 LLM 推理的指南、动态与思考。

最新 EN

From docker sprawl to k3s: rebuilding my home inference fleet

2026年6月12日 · @briancaffey

A 'healthy' mesh-generation service sat wedged for three days while my agent.yaml described services that didn't exist. So I moved four GPU boxes — three RTX 4090s and a DGX Spark — onto k3s and taught the inference-club-agent to discover services from the Kubernetes API instead of a config file. Health checks lie; queues don't. Config is fiction; clusters are testimony.

#k3s #kubernetes #homelab #architecture #deep-dive