Skip to content

Unlimited Subscriptions

Unlimited subscriptions give you a dedicated model with guaranteed throughput — no budget caps, no token counting. You pick which time blocks you need and subscribe instantly. Your API key gets guaranteed RPM and tok/s during your selected hours.

All endpoints require authentication. Use your management key (mk_):

Authorization: Bearer mk_your_key

GET /api/pools

Returns all active Unlimited plans.

Terminal window
curl https://api.cheapestinference.com/api/pools \
-H "Authorization: Bearer mk_your_key"
{
"success": true,
"data": [
{
"id": "pool_uuid",
"slug": "minimax-m25",
"modelId": "MiniMax-M2.5",
"modelName": "MiniMax M2.5",
"description": "Dedicated MiniMax M2.5 with unlimited usage.",
"status": "active",
"totalSlots": 10,
"minPricePerDay": "39.00",
"manualProvisioning": false,
"annualDiscount": 0.15
}
]
}
FieldDescription
statusdraft (not listed), funding (accepting pledges), activating (spinning up), active (open for subscriptions), or paused
minPricePerDayLowest monthly price across all time blocks (USD, per seat)
manualProvisioningIf true, key is delivered manually (usually within hours, always within 24h)
annualDiscountFraction off when billed yearly (0.15 = 15% off). Applied when billingCycle: "year" is passed to subscribe.

GET /api/pools/:id

Returns full plan info including all hour slots. :id can be the UUID or slug.

Terminal window
curl https://api.cheapestinference.com/api/pools/minimax-m25 \
-H "Authorization: Bearer mk_your_key"
{
"success": true,
"data": {
"id": "pool_uuid",
"slug": "minimax-m25",
"modelId": "MiniMax-M2.5",
"status": "active",
"hourSlots": [
{ "id": "slot_uuid", "hour": 0, "slotIndex": 0, "pricePerDay": "4.88", "status": "available" },
{ "id": "slot_uuid", "hour": 1, "slotIndex": 0, "pricePerDay": "4.88", "status": "available" }
]
}
}

Each hourSlot represents one hour of one seat. Prices are monthly amounts distributed across the 8 hours in a block.


POST /api/pools/:id/subscribe

Subscribe using block names. Your card is charged immediately and you receive your API key.

Each day is divided into three fixed UTC blocks:

BlockHours (UTC)Region
asia00:00–07:59Asia-Pacific
europe08:00–15:59Europe / Middle East
americas16:00–23:59Americas
ParameterTypeRequiredDescription
blocksstring[]YesOne or more: "asia", "europe", "americas"
quantityintegerNoSeats per block (1–20, default 1)
billingCyclestringNo"month" (default) or "year" — annual charges upfront with the pool’s annualDiscount applied
Terminal window
curl -X POST https://api.cheapestinference.com/api/pools/minimax-m25/subscribe \
-H "Authorization: Bearer mk_your_key" \
-H "Content-Type: application/json" \
-d '{"blocks": ["americas"]}'

Example — all blocks, 2 seats, billed annually

Section titled “Example — all blocks, 2 seats, billed annually”
Terminal window
curl -X POST https://api.cheapestinference.com/api/pools/minimax-m25/subscribe \
-H "Authorization: Bearer mk_your_key" \
-H "Content-Type: application/json" \
-d '{"blocks": ["asia", "europe", "americas"], "quantity": 2, "billingCycle": "year"}'

Annual amount = monthlyPrice × 12 × (1 − pool.annualDiscount). The discount fraction is returned on the pool detail response (see below). Charges occur upfront; subscription renews yearly until canceled.

{
"success": true,
"data": {
"id": "subscription_uuid",
"poolId": "pool_uuid",
"status": "active",
"monthlyPrice": "39.00",
"hours": [
{ "hour": 0, "slotIndex": 0, "pricePerDay": "4.88" },
{ "hour": 1, "slotIndex": 0, "pricePerDay": "4.88" }
],
"key": null
}
}

Note: No API key is created automatically. Use POST /api/keys/subscription to create your key when ready, optionally passing {"subscriptionId": "subscription_uuid"} to target a specific subscription.

Unlimited throughput — no RPM or TPM caps. The only limit is 1 concurrent request per key. Your key is live 24/7 during your subscribed blocks.

If the plan has manual provisioning enabled, status will be pending_key and key will be null. You’ll receive an email when your key is ready.

monthlyPrice = sum of prices for all selected slots. Each block has a fixed monthly price. With quantity: 2 and two blocks, you pay 2 × (block_a + block_b).


GET /api/pools/:id/my-subscriptions

Returns all your active subscriptions for this pool (you can have multiple if you purchased additional blocks).

Terminal window
curl https://api.cheapestinference.com/api/pools/minimax-m25/my-subscriptions \
-H "Authorization: Bearer mk_your_key"
{
"success": true,
"data": [
{
"id": "subscription_uuid",
"status": "active",
"monthlyPrice": "39.00",
"billingCycle": "month",
"cancelAtPeriodEnd": false,
"currentPeriodEnd": "2026-05-14T00:00:00.000Z",
"hours": [
{ "hour": 16, "slotIndex": 0, "pricePerDay": "4.88" }
],
"key": {
"id": "key_uuid",
"apiKey": "sk_pool_abc123...",
"isActive": true
}
}
]
}
FieldMeaning
statusactive, pending_key, past_due or canceled
billingCycle"month" or "year" — chosen at subscribe time
cancelAtPeriodEndtrue when the user has scheduled cancellation; access remains until currentPeriodEnd

status semantics:

statusMeaning
activeSubscription running, key is live
pending_keyPaid, key being set up (manual provisioning)
past_duePayment failed, key suspended
canceledSubscription ended

GET /api/pools/:id/my-keys

Returns all API keys from all your subscriptions on this pool.

Terminal window
curl https://api.cheapestinference.com/api/pools/minimax-m25/my-keys \
-H "Authorization: Bearer mk_your_key"

Use any apiKey as Authorization: Bearer on any /v1/* or /anthropic/* inference endpoint.


DELETE /api/pools/:id/pledge

Cancels your most recent subscription. Schedules cancellation at the end of the current billing period. Your key keeps working until currentPeriodEnd; after that Stripe stops renewing, the key is revoked, and the slots are freed. No refund is issued for unused time.

Terminal window
curl -X DELETE https://api.cheapestinference.com/api/pools/minimax-m25/pledge \
-H "Authorization: Bearer mk_your_key"
{
"success": true,
"data": { "canceled": true }
}

  1. Browse — pick a model and select your time blocks (Asia, Europe, Americas).
  2. Subscribe — your card is charged immediately. No API key is created automatically — you create it yourself.
  3. Create key — call POST /api/keys/subscription to generate your API key. You can have multiple keys if you have multiple subscriptions.
  4. Use — your key gets unlimited throughput during your selected blocks. No budget limits. The only limit is 1 concurrent request per key.
  5. Renew — subscriptions recur monthly. Cancel anytime — access continues to the end of the paid period.