Service Overview

What Nexus Is

Nexus accepts requests from your client, validates your API key, routes each request to a compatible model, and returns the response in the format your client already expects.

For users, this means:

  • you do not need to set up separate access to foreign services;
  • you can keep using familiar CLI tools;
  • the key, balance, and model list are managed in one place;
  • support and documentation are available in Russian.

How Access Works

Each Nexus API key is configured separately. Every key has:

  • its own balance in rubles;
  • its own list of allowed endpoints;
  • its own list of allowed models.

As a result, two keys can see different models. To find out what a given key can use, always check the current list with GET /v1/models.
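
A minimal sketch of that check in Python, using only the standard library. The base URL, the Bearer authorization header, and the OpenAI-style response shape (a "data" list of objects with "id" fields) are assumptions made for illustration and are not confirmed on this page; build_models_request and parse_model_ids are hypothetical helper names.

```python
import json
import urllib.request

# Assumption: placeholder host. The real Nexus base URL is not given on this page.
BASE_URL = "https://nexus.example.invalid"


def build_models_request(api_key: str, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a GET /v1/models request for one API key."""
    return urllib.request.Request(
        url=f"{base_url}/v1/models",
        method="GET",
        # Assumption: Bearer-token authentication; check the authentication docs.
        headers={"Authorization": f"Bearer {api_key}"},
    )


def parse_model_ids(body: bytes) -> list[str]:
    """Extract model ids from a response body.

    Assumption: the body looks like {"data": [{"id": "..."}, ...]}.
    """
    payload = json.loads(body)
    return [item["id"] for item in payload.get("data", [])]
```

To actually run the check, the request could be passed to urllib.request.urlopen and the bytes it returns handed to parse_model_ids. Because the model list is per key, the result should be cached per key rather than shared between keys.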

What Is Supported

Nexus supports:

  • Claude Code via POST /v1/messages;
  • token counting for Claude-compatible clients via POST /v1/messages/count_tokens;
  • Codex via POST /v1/responses;
  • compact response workflows via POST /v1/responses/compact;
  • OpenAI-compatible clients via POST /v1/chat/completions.

The detailed endpoint list is on the Supported Endpoints page.
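
In client code, the endpoints listed above can be kept as a small lookup table. The sketch below is illustrative only: the dictionary keys and the endpoint_for helper are hypothetical names, while the methods and paths are the ones listed on this page.

```python
# Methods and paths as listed in this overview.
# The dictionary keys are hypothetical labels chosen for this example.
ENDPOINTS = {
    "claude_code": ("POST", "/v1/messages"),
    "claude_token_count": ("POST", "/v1/messages/count_tokens"),
    "codex": ("POST", "/v1/responses"),
    "responses_compact": ("POST", "/v1/responses/compact"),
    "openai_compatible": ("POST", "/v1/chat/completions"),
}


def endpoint_for(client: str) -> tuple[str, str]:
    """Return (method, path) for a client label, or raise ValueError."""
    try:
        return ENDPOINTS[client]
    except KeyError:
        known = ", ".join(sorted(ENDPOINTS))
        raise ValueError(f"unknown client {client!r}; expected one of: {known}") from None
```

Keep in mind that a particular key may be allowed to use only some of these endpoints.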

Streaming

Streaming is supported on:

  • POST /v1/messages;
  • POST /v1/responses;
  • POST /v1/responses/compact.

Streaming is not available on POST /v1/chat/completions; use the regular (non-streaming) JSON response mode there.
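
A client that talks to several endpoints can guard against requesting streaming where it is not supported. This is a sketch under one assumption: that streaming is toggled by a boolean "stream" field in the request body, as is common in Claude- and OpenAI-style APIs but not stated on this page. The with_stream_mode helper is a hypothetical name.

```python
# Endpoints that support streaming, as listed in this overview.
STREAMING_ENDPOINTS = frozenset({
    "/v1/messages",
    "/v1/responses",
    "/v1/responses/compact",
})


def with_stream_mode(path: str, payload: dict, want_stream: bool) -> dict:
    """Return a copy of payload with streaming enabled only where supported.

    Assumption: a boolean "stream" field in the body controls streaming.
    """
    result = dict(payload)  # shallow copy so the caller's dict is not mutated
    result["stream"] = want_stream and path in STREAMING_ENDPOINTS
    return result
```

With this guard, a request to /v1/chat/completions always goes out with streaming disabled, even if the caller asked for it.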

How Pricing Works

Pricing is based on actual model usage: input tokens, output tokens, and supported cache types.

Before the request starts, Nexus temporarily reserves an amount equal to the maximum possible request cost. After the request completes, the reserve is released and the actual cost in rubles is charged. If the cost cannot be calculated reliably, the request is sent for manual review.
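
The reserve-then-charge flow described above can be modeled in a few lines. This is an illustrative sketch, not the Nexus billing implementation: the class and function names are hypothetical, rejecting a request whose maximum cost exceeds the available balance is an assumption, and so is releasing the reserve when a request goes to manual review.

```python
from dataclasses import dataclass
from decimal import Decimal
from typing import Optional


@dataclass
class KeyBalance:
    available: Decimal                  # rubles free to spend
    reserved: Decimal = Decimal("0")    # rubles held for in-flight requests


def reserve(balance: KeyBalance, max_cost: Decimal) -> None:
    """Hold the maximum possible request cost before the request starts."""
    # Assumption: a request that cannot be fully reserved is rejected.
    if max_cost > balance.available:
        raise ValueError("insufficient balance for the maximum request cost")
    balance.available -= max_cost
    balance.reserved += max_cost


def settle(balance: KeyBalance, max_cost: Decimal,
           actual_cost: Optional[Decimal]) -> str:
    """Release the reserve, then charge the actual cost if it is known."""
    balance.reserved -= max_cost
    balance.available += max_cost
    if actual_cost is None:
        # Cost could not be calculated reliably: nothing is charged here.
        # Assumption: the reserve is still released in this case.
        return "manual_review"
    balance.available -= actual_cost
    return "charged"
```

For example, with 100 rubles available and a 30-ruble maximum cost, the reserve leaves 70 rubles spendable while the request runs; if the actual cost turns out to be 12.50 rubles, the balance after settlement is 87.50 rubles. Decimal is used instead of float so that ruble amounts do not pick up rounding errors.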

More details: Pricing and Balance