아직 한국어로 제공되지 않아 영어로 표시합니다.

POST /v1/completions

The legacy text-completion endpoint. Same shape as OpenAI's /v1/completions. Most clients use chat completions instead, but completion-style models (older base models, code-completion models) still use this surface.

Request

curl https://api.inference.club/v1/completions \
  -H "Authorization: Bearer $INFERENCE_CLUB_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "code-llama-7b",
    "prompt": "def fizzbuzz(n):",
    "max_tokens": 100
  }'

Routing rules and authentication are identical to chat completions. The body is forwarded to <provider-callback-url>/completions unchanged, and streaming works the same way ("stream": true).

If you don't have a clear reason to use this endpoint, use chat completions — modern instruct-tuned models are trained for that format.