


Alibaba Cloud's AI Coding Plan gives developers access to Qwen, Kimi, GLM, and MiniMax from $10/month. Complete guide with setup, pricing, and comparison.
Head of Growth & Customer Success
AI-assisted coding has become essential to modern software development. But the price tag remains a serious barrier for most developers. Claude Code runs between $100 and $200 per month for heavy use. GitHub Copilot Pro+ costs $39 per month. API-based billing can spike without warning when agents loop unexpectedly. For independent developers, small teams, and startups in markets outside the US and Western Europe, these costs are prohibitive.
Alibaba Cloud's AI Coding Plan attacks this problem directly. Launched through its Model Studio platform, the plan offers flat-rate access to multiple frontier AI models starting at $10 per month, with a promotional first month at just $3. The approach is fundamentally different from per-token billing: you pay a fixed price, get a defined number of API requests, and never face a surprise bill at the end of the month.
This is not a stripped-down free tier with aggressive limitations. It is a multi-model subscription that works with the tools developers already use, including Claude Code, Cursor, Cline, OpenCode, and Codex.
The AI Coding Plan is a monthly subscription that provides a fixed number of API requests across multiple state-of-the-art language models. It comes in two tiers.
$3 for the first month, $5 for the second
18,000 requests per month
Capped at 1,200 requests per 5-hour window and 9,000 per week
Access to all included models
$15 for the first month, $25 for the second
90,000 requests per month
Capped at 6,000 requests per 5-hour window and 45,000 per week
Access to all included models
The critical differentiator is multi-model access from a single API key. You can switch between models without managing separate accounts, API keys, or billing systems.
The plan bundles models from four different Chinese AI labs, each with distinct strengths.
Model | Developer | Context Window | Max Output | Key Strength |
|---|---|---|---|---|
Qwen3.5-Plus (vision) | Alibaba | 1,000,000 tokens | 65,536 tokens | Most versatile, supports image input |
Kimi K2.5 (vision) | Moonshot AI | 262,144 tokens | N/A | Strong long-context handling |
GLM-5 | Zhipu AI | 202,752 tokens | N/A | Deep thinking, reasoning |
MiniMax M2.5 | MiniMax | 196,608 tokens | N/A | Balanced reasoning |
Qwen3-Coder-Next | Alibaba | 262,144-1M tokens | N/A | Specialized code generation |
Qwen3-Coder-Plus | Alibaba | 262,144-1M tokens | N/A | Specialized code generation |
GLM-4.7 | Zhipu AI | 202,752 tokens | N/A | High demand; access restricted due to popularity |
Qwen3.5-Plus is the default workhorse for most coding tasks. It handles everything from function generation to debugging to multi-file refactoring. The vision capability means it can also process screenshots, diagrams, and UI mockups.
GLM-4.7 deserves special attention. Zhipu AI had to cap subscriptions due to overwhelming demand, primarily from developers in the US and China who chose it over tools like GPT and Claude for routine coding tasks.
One important detail that affects real-world value: requests count the model's internal invocations, not just your messages. A simple task like generating a function might produce 5 to 10 internal invocations. A complex task involving multi-file analysis could generate 10 to 30.
On the Lite plan, 18,000 monthly requests translate to roughly 600 to 3,600 completed tasks, depending on complexity. For the Pro plan, that range expands to 3,000 to 18,000 tasks per month.
Understanding this distinction is essential for choosing the right tier and managing usage throughout the month.
The most popular configuration is plugging the Alibaba plan into Claude Code. This gives you the full Claude Code interface and workflow while routing requests through Alibaba's models instead of Anthropic's infrastructure.
Go to the Coding Plan page in Alibaba Cloud Model Studio. Choose Lite or Pro. Complete payment.
After subscribing, you will receive an exclusive API key in the format sk-sp-xxxxx. This is different from the standard Model Studio API key (sk-xxxxx). Do not mix them up, or your requests will be billed outside the subscription.
On macOS or Linux, edit ~/.claude/settings.json:
{
"env": {
"ANTHROPIC_BASE_URL": "https://coding-intl.dashscope.aliyuncs.com/apps/anthropic",
"ANTHROPIC_AUTH_TOKEN": "sk-sp-YOUR_API_KEY",
"ANTHROPIC_MODEL": "qwen3.5-plus"
}
}Then edit or create ~/.claude.json and set "hasCompletedOnboarding": true.
Open a new terminal, launch Claude Code, and type /status to confirm the configuration. Use /model to switch between models on the fly.
Install OpenCode via npm install -g opencode-ai. Create the configuration file at ~/.config/opencode/opencode.json with the bailian-coding-plan provider, the base URL https://coding-intl.dashscope.aliyuncs.com/apps/anthropic/v1, and your API key. Launch OpenCode and use /models to select your model.
The plan also works with Cursor, Cline, Codex, the Claude Code VS Code plugin, and JetBrains IDEs.
The pricing comparison tells a striking story, but the value equation depends on what you prioritize.
Feature | Alibaba Lite | Alibaba Pro | GitHub Copilot Pro+ | Claude Code Max (20x) |
|---|---|---|---|---|
Monthly price | $10 | $50 | $39 | $200 |
Monthly requests/premium actions | 18,000 | 90,000 | 1,500 premium | Unlimited |
Model access | 4+ model families | 4+ model families | GPT, Claude, Gemini | Claude Sonnet/Opus |
Best for | Budget-conscious devs | Daily agent users | IDE autocomplete | Complex architecture |
Code quality ceiling | Good (85-90% of US models) | Good (85-90% of US models) | Excellent | Best-in-class |
The numbers make it clear: for raw request volume per dollar, the Alibaba plan is unmatched. A five-person team on Alibaba Pro costs $250 per month total, compared to $1,000 per month for Claude Code Max. But volume is not quality. According to a RAND report, Chinese models operate at roughly one-sixth to one-quarter the cost of comparable US systems, with a corresponding quality trade-off on complex reasoning tasks.
For routine development tasks (function generation, test writing, refactoring, boilerplate scaffolding), Qwen3.5-Plus and GLM-5 perform well. Developers using the plan report that quality is "generally good" for these tasks.
The gap appears on complex tasks: multi-step reasoning across large codebases, nuanced software architecture decisions, and sophisticated debugging that requires deep understanding of code interdependencies. On these tasks, Claude Sonnet 4.5 and Opus maintain a meaningful edge.
A practical test showed Qwen3.5-Plus generating a complete Astro.js website in about 2 minutes. GLM-5 took approximately 15 minutes for a similar task. Model selection within the plan matters significantly.
Alibaba Cloud's international servers are in Singapore and Virginia. European developers will experience some additional latency compared to native Claude Code or Copilot. From the Americas, speed is "generally decent" based on user reports, faster than GLM via other providers but slightly slower than Anthropic or OpenAI directly.
The smartest approach for most developers is not an either/or decision. Use the Alibaba plan as your primary tool for routine coding: scaffolding projects, writing tests, generating boilerplate, and rapid iteration. Keep a Claude Code subscription (or even the free tier) for the critical moments when you need the absolute best reasoning quality for complex architectural decisions.
This hybrid model delivers the best cost-to-output ratio. You get volume where volume matters and quality where quality matters, at a fraction of the cost of running Claude Code Max full time.
Hobbyists and learners exploring AI-assisted coding for the first time
Indie developers building side projects who want to experiment with multiple models
Budget-conscious developers who outgrew free tiers but cannot justify $200/month
Active developers using coding agents daily who need headroom for 90,000 requests
Small teams and startups wanting generous access without enterprise pricing
Developers who want to benchmark which model family performs best for their stack
You work on complex software architecture where reasoning quality is non-negotiable
Minimal latency is critical to your workflow
Enterprise compliance (SSO, audit logs, IP indemnity) is required
You need Claude Opus for deep multi-step reasoning
Alibaba's plan exists within a broader market dynamic. Since DeepSeek demonstrated that a high-performing model could be trained for $5.6 million versus the typical $100 million in the US, Chinese companies have systematically undercut AI pricing. Alibaba's Qwen model surpassed 700 million downloads on Hugging Face by early 2026, making it the world's most widely used open-source AI system.
The strategy is clear: deliver models at 85-90% of the performance of American models at 10% of the price, and bet on open source and mass adoption. Whether you end up using Alibaba's plan or not, its existence is pushing every AI coding provider to offer better value.
This democratization of AI coding tools mirrors what is happening across the entire AI productivity stack. Just as Alibaba makes AI-assisted coding accessible to developers on tight budgets, tools like Maylee are bringing AI-powered productivity to email, where features like auto-drafting replies and intelligent inbox classification were previously only available through expensive enterprise solutions. The trend is consistent: powerful AI capabilities are becoming accessible at a fraction of what they cost even a year ago.
Before subscribing, understand three non-negotiable constraints:
Interactive use only. The plan is restricted to coding tools (IDEs, coding agents). Automated scripts, application backends, and batch API calls are prohibited. Alibaba reserves the right to suspend accounts for misuse.
Non-refundable. The subscription cannot be cancelled or refunded once purchased.
Structured rate limits. Caps are not just monthly. The Lite plan limits you to 1,200 requests per 5-hour window. During an intensive coding session, you can hit this limit and be forced to wait.
Alibaba's AI Coding Plan is the best value-for-volume deal in AI-assisted coding today. It is not the best quality-for-money deal if your work demands the absolute frontier of AI reasoning. The distinction matters.
For the majority of developers whose daily work consists of routine coding tasks, the plan offers genuine, substantial savings. For developers pushing the boundaries of what AI can do with complex software systems, it is a powerful complement to premium tools, not a replacement.
The AI coding market is in full flux. Prices are falling. Models are improving rapidly. Competition between the US and China is directly benefiting developers worldwide. A year ago, accessing four frontier coding models for $10 per month would have been unthinkable. Today it is the new normal.
The plan includes Qwen3.5-Plus (vision), Kimi K2.5 (vision), GLM-5, MiniMax M2.5, Qwen3-Coder-Next, Qwen3-Coder-Plus, and GLM-4.7, all accessible with a single API key.
Yes. You configure Claude Code to route requests through Alibaba's API by setting environment variables in your Claude Code settings file. You keep the full Claude Code interface while using Alibaba's models.
The Lite plan at $10/month offers 18,000 requests versus Copilot Pro's 300 premium requests. However, Copilot integrates natively with IDE autocomplete, while the Alibaba plan works through terminal-based coding agents. They serve different workflows.
For routine tasks like function generation, test writing, and refactoring, quality is generally good. For complex software architecture, multi-step reasoning, and deep debugging, Claude Sonnet 4.5 and Opus remain superior.
Requests count the model's internal invocations, not your messages. A simple task generates 5-10 invocations, a complex one 10-30. On the Lite plan, this translates to roughly 600-3,600 completed tasks per month.
The subscription is non-refundable and cannot be cancelled once purchased. You can choose not to renew at the end of the billing cycle.
There is no free trial, but the first month is heavily discounted: $3 for Lite and $15 for Pro. This serves as a low-risk way to evaluate the plan.
Key limitations include interactive-use-only restrictions (no batch API calls), structured rate limits (1,200 requests per 5-hour window on Lite), some additional latency for non-Asian users, and a quality gap on complex reasoning tasks compared to frontier US models.