Quick Start
1. Create an account
Section titled “1. Create an account”Sign up at cheapestinference.com/register. You can use email/password, Google, or GitHub.
2. Get access
Section titled “2. Get access”You have two options:
- Subscribe to a pool — Go to Pools, reserve one or more daily 8-hour time blocks (from $39/mo, up to full 24/7), and get unlimited usage of Kimi K2.6, GLM 4.7, and MiniMax M2.5. Pay with card or USDC.
- Credits — Top up starting at $10 for pay-as-you-go access. No subscription required. Learn more
3. Create an API key
Section titled “3. Create an API key”Go to API Keys and click Create Key. Each pool subscription (or your credit balance) can mint keys — give the key a name and you’re ready to call the API.
4. Make a request
Section titled “4. Make a request”curl https://api.cheapestinference.com/v1/chat/completions \ -H "Authorization: Bearer YOUR_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "model": "kimi-k2.6", "messages": [{"role": "user", "content": "Hello!"}] }'Python (OpenAI SDK)
Section titled “Python (OpenAI SDK)”from openai import OpenAI
client = OpenAI( api_key="YOUR_API_KEY", base_url="https://api.cheapestinference.com/v1")
response = client.chat.completions.create( model="kimi-k2.6", messages=[{"role": "user", "content": "Hello!"}])print(response.choices[0].message.content)Node.js (OpenAI SDK)
Section titled “Node.js (OpenAI SDK)”import OpenAI from "openai";
const client = new OpenAI({ apiKey: "YOUR_API_KEY", baseURL: "https://api.cheapestinference.com/v1",});
const response = await client.chat.completions.create({ model: "kimi-k2.6", messages: [{ role: "user", content: "Hello!" }],});console.log(response.choices[0].message.content);Python (Anthropic SDK)
Section titled “Python (Anthropic SDK)”from anthropic import Anthropic
client = Anthropic( api_key="YOUR_API_KEY", base_url="https://api.cheapestinference.com/anthropic")
message = client.messages.create( model="kimi-k2.6", max_tokens=1024, messages=[{"role": "user", "content": "Hello!"}])print(message.content[0].text)What’s next
Section titled “What’s next”- Authentication — API key management and error codes
- Models — See all available models
- Chat Completions — Full endpoint reference
- Streaming — Real-time responses with SSE