briancaffey

@briancaffey

joined April 2026

1 agent · 4 GPUs · 6 services

club-host

online 3D cluster

194 GB · 4 GPUs · 6 services

a1

online

a1 · 192.168.5.253

2 services

VRAM: 24 GB · declared

GeForce RTX 4090 · NVIDIA

magpie-tts TTS

Service · http://magpie-tts.inference-club.svc.cluster.local:9000/v1

magpie-tts-multilingual playground

Text

studio-voice ENHANCE

Service · http://studio-voice.inference-club.svc.cluster.local:8080/v1

Maxine Studio Voice (speech enhancement) playground

Audio

a2

online

a2 · 192.168.5.96

1 service

VRAM: 24 GB · declared

GeForce RTX 4090 · NVIDIA

flux2-klein IMAGE

Service · http://flux2-klein.inference-club.svc.cluster.local:8000/v1

FLUX.2 Klein 4B playground

Text Image

a3

online

a3 · 192.168.5.173

2 services

VRAM: 24 GB · declared

GeForce RTX 4090 · NVIDIA

dia TTS

Service · http://dia.inference-club.svc.cluster.local:8491

Dia 1.6B (voice cloning) playground

Text Audio voice-cloning dialogue

firecrawl SCRAPE

Service · http://firecrawl.inference-club.svc.cluster.local:3002/v1

Firecrawl (URL → markdown) playground

Text

spark

online

spark-d2ce · 192.168.6.19

1 service

Memory: 122 GB unified · declared

GB10 · NVIDIA

spark-llm LLM

vLLM · http://spark-llm.inference-club.svc.cluster.local:8000/v1

Qwen3.6 27B (NVFP4) playground

32K ctx Text Image Reasoning Tools

Compute provided

1,009 requests served · 13,184,755 tokens

Inference used

1,015 requests · 13,192,665 tokens (12,909,964 in / 282,701 out)

Models @briancaffey is serving

Run them free in the playground or from your own code via the OpenAI-compatible API.

POST /chat/completions

curl https://api.inference.club/v1/chat/completions \
  -H "Authorization: Bearer $INFERENCE_CLUB_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "dia-1.6b",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

import os

from openai import OpenAI

# Point the official OpenAI SDK at the inference.club endpoint.
client = OpenAI(
    base_url="https://api.inference.club/v1",
    api_key=os.environ["INFERENCE_CLUB_API_KEY"],  # same variable the curl example uses
)

resp = client.chat.completions.create(
    model="dia-1.6b",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(resp.choices[0].message.content)

import OpenAI from "openai"

const client = new OpenAI({
  baseURL: "https://api.inference.club/v1",
  apiKey: process.env.INFERENCE_CLUB_API_KEY,
})

const resp = await client.chat.completions.create({
  model: "dia-1.6b",
  messages: [{ role: "user", content: "Hello!" }],
})
console.log(resp.choices[0].message.content)
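
If you would rather not install an SDK, the same POST /chat/completions call can be sketched with only the Python standard library. This is a minimal sketch, not the service's official client: the helper names `build_chat_request`, `first_message_text`, and `send` are hypothetical, the base URL and model id are copied from the snippets above, and the response is assumed to follow the usual OpenAI chat-completion shape.

```python
import json
import urllib.request

# Base URL taken from the snippets above.
API_BASE = "https://api.inference.club/v1"


def build_chat_request(model, prompt, api_key):
    """Build (but do not send) a POST /chat/completions request."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{API_BASE}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def first_message_text(response_body):
    """Pull the assistant text out of an OpenAI-style response dict.

    Assumption: the body has the choices[0].message.content layout
    that the SDK snippets above rely on.
    """
    return response_body["choices"][0]["message"]["content"]


def send(request, timeout=60):
    """Send the request and return the decoded JSON body (needs network)."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.load(resp)
```

To use it, read the key from the `INFERENCE_CLUB_API_KEY` environment variable, pass it to `build_chat_request("dia-1.6b", "Hello!", key)`, then print `first_message_text(send(request))`. Keeping request construction separate from sending makes it easy to inspect the payload before anything goes over the network.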