블로그

분산형 셀프 호스팅 LLM 추론에 관한 가이드, 업데이트, 아이디어.

최신 EN

From docker sprawl to k3s: rebuilding my home inference fleet

2026년 6월 12일 · @briancaffey

A 'healthy' mesh-generation service sat wedged for three days while my agent.yaml described services that didn't exist. So I moved four GPU boxes — three RTX 4090s and a DGX Spark — onto k3s and taught the inference-club-agent to discover services from the Kubernetes API instead of a config file. Health checks lie; queues don't. Config is fiction; clusters are testimony.

#k3s #kubernetes #homelab #architecture #deep-dive