Alaya NeW Cloud

Billing & plans

Subscription tiers, quota math, rate limits

AlayaCode is subscription-based — fixed monthly quota, no per-call surprise charges. This page covers four things: picking a tier, how tokens are counted, when rate limits trigger, and what happens when you run out.

Tiers

Tier       | Monthly price | Token quota / mo | RPM    | TPM    | Models                          | For
Free       | ¥0            | 1 M              | 60     | 30 K   | minimax-m2.1, glm-5 (throttled) | Trial, personal hacks
Pro        | ¥X.XX         | 30 M             | 300    | 150 K  | + minimax-m2.5, glm-5.1         | Daily-driver for individual devs
Team       | ¥X.XX         | 200 M            | 600    | 500 K  | + deepseek-v4-pro (1M ctx)      | 5-20 person teams
Enterprise | Contact sales | Custom           | Custom | Custom | All + private custom models     | Large teams / high-compliance

Live prices are in the console; promotional credits, quarterly commitment rebates, etc. follow the marketing calendar.

¥X.XX is a placeholder. Pricing is set by ops, not docs — we don't hard-code numbers here. The console (AlayaCode → Plan) shows the live rate.
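To make the tier choice concrete, here is a minimal sketch that returns the smallest self-serve tier whose quota, RPM, and TPM cover an expected workload. The limits are copied from the tier table; the names `Tier`, `TIERS`, and `smallest_fitting_tier` are illustrative and not part of any AlayaCode API. The sketch ignores price and model availability, so treat the result as a starting point only.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Tier:
    name: str
    monthly_quota: int  # quota tokens per month
    rpm: int            # requests per minute
    tpm: int            # tokens per minute


# Limits copied from the tier table, smallest first.
# Enterprise is left out because its limits are custom.
TIERS = [
    Tier("Free", 1_000_000, 60, 30_000),
    Tier("Pro", 30_000_000, 300, 150_000),
    Tier("Team", 200_000_000, 600, 500_000),
]


def smallest_fitting_tier(monthly_quota: int, peak_rpm: int, peak_tpm: int) -> Optional[str]:
    """Return the first tier that covers all three needs, or None."""
    for tier in TIERS:
        if (monthly_quota <= tier.monthly_quota
                and peak_rpm <= tier.rpm
                and peak_tpm <= tier.tpm):
            return tier.name
    return None  # beyond Team: an Enterprise conversation
```

For example, a workload of 5 M quota tokens a month peaking at 100 RPM and 50 K TPM does not fit Free but fits Pro.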

How tokens are counted

Input and output tokens cost the same, scaled by the model's pricing factor:

quota_used = (input_tokens + output_tokens) × model_factor

Sample factors:

Model           | Factor
minimax-m2.5    | 1.0
minimax-m2.1    | 0.6
glm-5           | 1.0
glm-5.1         | 1.2
deepseek-v4-pro | 2.0 (long-context model)

Example: on Pro (30 M tokens / mo), a glm-5.1 call with 500 input + 800 output tokens costs (500 + 800) × 1.2 = 1560 quota tokens.
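The same arithmetic as a small sketch, using the sample factors listed above. The function and dictionary names are illustrative, and `Decimal` is used only to keep the multiplication exact; how the service rounds fractional results is not stated on this page, so none is applied here.

```python
from decimal import Decimal

# Sample factors from the table above; the live list may differ.
MODEL_FACTORS = {
    "minimax-m2.5": Decimal("1.0"),
    "minimax-m2.1": Decimal("0.6"),
    "glm-5": Decimal("1.0"),
    "glm-5.1": Decimal("1.2"),
    "deepseek-v4-pro": Decimal("2.0"),
}


def quota_used(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """quota_used = (input_tokens + output_tokens) x model_factor"""
    return (input_tokens + output_tokens) * MODEL_FACTORS[model]


print(quota_used("glm-5.1", 500, 800))  # 1560.0
```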

Rate limit triggers

Two dimensions, whichever hits first:

  • RPM (requests per minute), 60 s rolling window
  • TPM (tokens per minute), 60 s rolling window, summing input + output
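A client can approximate both windows locally to avoid sending requests that are likely to be rejected. The sketch below is one possible client-side tracker, not the service's implementation: the class name, the eviction boundary (an event leaves the window exactly 60 s after it was recorded), and the use of estimated token counts are all assumptions.

```python
from collections import deque


class RollingWindow:
    """Tracks (timestamp, tokens) events over the last `window` seconds."""

    def __init__(self, rpm_limit: int, tpm_limit: int, window: float = 60.0):
        self.rpm_limit = rpm_limit
        self.tpm_limit = tpm_limit
        self.window = window
        self.events = deque()      # oldest event on the left
        self.tokens_in_window = 0  # input + output tokens still in the window

    def _evict(self, now: float) -> None:
        # Drop events that are at least `window` seconds old.
        while self.events and now - self.events[0][0] >= self.window:
            _, tokens = self.events.popleft()
            self.tokens_in_window -= tokens

    def would_exceed(self, now: float, tokens: int) -> bool:
        """True if one more request of `tokens` would break RPM or TPM."""
        self._evict(now)
        return (len(self.events) + 1 > self.rpm_limit
                or self.tokens_in_window + tokens > self.tpm_limit)

    def record(self, now: float, tokens: int) -> None:
        """Register a request that was actually sent."""
        self._evict(now)
        self.events.append((now, tokens))
        self.tokens_in_window += tokens
```

Output tokens are only known after a response arrives, so a caller would pass an estimate to `would_exceed` and the measured total to `record`.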

Once either limit is hit, the next request returns 429 rate_limit_exceeded with a Retry-After header. Rate-limited requests do not consume quota.
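A client would typically wait for the Retry-After interval before trying again. The sketch below shows one way to do that under stated assumptions: `send` is a hypothetical callable returning `(status, headers, body)`, the error code is assumed to appear as `body["error"]`, and Retry-After is assumed to be a number of seconds. None of these shapes are confirmed by this page.

```python
import time


def call_with_retry(send, max_attempts=5, sleep=time.sleep):
    """Call `send()` and retry only on 429 rate_limit_exceeded."""
    for attempt in range(max_attempts):
        status, headers, body = send()
        # Anything other than a rate-limit 429 is returned as-is.
        # A 429 with a different code is assumed not to clear up
        # within a Retry-After interval, so it is not retried.
        if status != 429 or body.get("error") != "rate_limit_exceeded":
            return status, body
        if attempt + 1 < max_attempts:
            # Assumed: Retry-After carries seconds; default to 1 s if absent.
            sleep(float(headers.get("Retry-After", 1)))
    return status, body
```

Passing `sleep` as a parameter keeps the function easy to test without real delays.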

Quota exhausted ≠ service down

When your monthly token quota is spent, we don't error by default — we drop you into "throttled mode":

Mode      | Behavior
Normal    | RPM / TPM at tier ceiling
Throttled | RPM cut to 60, TPM cut to 30 K (same as Free tier)

Throttled-mode requests come from a separate "throttle pool" (refills 100 K / day, same across tiers). If even that drains, you get 429 quota_exhausted.
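The progression from normal service to throttled mode to a hard 429 can be summarized as a small decision function. This is an illustrative model of the behavior described above, not the service's code; the function names and state labels are made up for the example.

```python
def account_state(quota_left: int, throttle_pool_left: int) -> str:
    """Classify an account from its remaining monthly quota and throttle pool."""
    if quota_left > 0:
        return "normal"
    if throttle_pool_left > 0:
        return "throttled"       # served from the 100 K / day throttle pool
    return "quota_exhausted"     # requests get 429 quota_exhausted


def effective_limits(tier_rpm: int, tier_tpm: int, state: str) -> tuple:
    """Return the (RPM, TPM) ceiling that applies in the given state."""
    if state == "normal":
        return tier_rpm, tier_tpm
    if state == "throttled":
        return 60, 30_000        # Free-tier limits, per the table above
    return 0, 0                  # nothing is served until the pool refills
```

For a Pro account with no monthly quota left but tokens remaining in the throttle pool, `effective_limits(300, 150_000, "throttled")` gives `(60, 30000)`.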

Want to avoid throttling? Two options:

  1. Buy a one-shot top-up in the console — quoted in tokens, billed at the tier's factor
  2. Upgrade tier — proration applies for the rest of the month

Usage monitoring

AlayaCode → Usage in the console shows:

  • Spent / total quota this month
  • Consumption split by model
  • Daily call volume curve
  • Headroom before throttling
  • Hourly rate-limit-trigger count

Email alerts can fire at 80% / 95% quota.

Invoices & enterprise payment

Subscription billing draws from the main account balance; top-up methods and invoice requests live on the billing reference.

Cancellation

In the console, go to AlayaCode → Plan → Downgrade / Cancel. The current cycle's fee is not refunded; the change takes effect at the next billing cycle.
