Service Overview¶
What Nexus Is¶
Nexus accepts requests from your client, validates the key, sends the request to a compatible model, and returns the response in a familiar format.
For users, this means:
- no need to set up separate foreign access;
- you can keep using familiar CLI tools;
- the key, balance, and model list are managed in one place;
- support and documentation are available in Russian.
How Access Works¶
Each Nexus API key is configured separately. Every key has:
- its own balance in rubles;
- its own list of allowed endpoints;
- its own list of allowed models.
That means two keys can see different models. Always check the current list with GET /v1/models.
What Is Supported¶
Nexus supports:
- Claude Code via
POST /v1/messages; - token counting for Claude-compatible clients via
POST /v1/messages/count_tokens; - Codex via
POST /v1/responses; - compact response workflows via
POST /v1/responses/compact; - OpenAI-compatible clients via
POST /v1/chat/completions.
The detailed endpoint list is on the Supported Endpoints page.
Streaming¶
Streaming is supported on:
POST /v1/messages;POST /v1/responses;POST /v1/responses/compact.
For POST /v1/chat/completions, use the regular JSON response mode.
How Pricing Works¶
Pricing is based on actual model usage: input tokens, output tokens, and supported cache types.
Before the request starts, Nexus temporarily reserves an amount equal to the maximum possible request cost. After the request completes, the reserve is released and the actual cost in rubles is charged. If the cost cannot be calculated reliably, the request is sent for manual review.
More details: Pricing and Balance