
May 30, 2026 · @briancaffey
Inference wants to be distributed — and now NVIDIA agrees
Local models keep getting better while the grid can't build centralized data centers fast enough. Span and NVIDIA's new XFRA puts Blackwell GPUs inside homes to tap idle power — strong validation for distributing AI compute to the edge, which is exactly the bet inference.club is making.


