Qwen Pricing 2026 thumbnail featuring bold white and yellow headline text with “Free App, API Costs,” alongside a glowing purple-and-white 3D Qwen logo on a dark blue and purple neon background.

Qwen Pricing 2026: Free App, API Costs, and What You Actually Pay

Qwen pricing splits into two very different paths, and most people only need the free one. The Qwen Chat app is completely free with no subscription, while developers pay per token through the API, where prices run from $0.05 per million input tokens for the cheapest model up to $3.75 per million output tokens for the flagship. That spread, roughly 75x between the cheapest and most expensive text models, is the single most important thing to understand before you choose a plan.

This guide breaks down every part of Qwen pricing for 2026, including the free Qwen Chat app, the pay-as-you-go API rates for each model, the free token trial for new developers, and how Qwen stacks up against DeepSeek, ChatGPT, and Claude on cost. We also cover the change that caught many developers off guard, the end of the free API tier on April 15, 2026, and where Qwen still gives you the best value.

The Key Takeaways

  • Qwen Chat is free, with no consumer subscription like ChatGPT Plus or Claude Pro.
  • API pricing runs from $0.05 / $0.20 per million tokens (Qwen-Turbo) to $1.25 / $3.75 (Qwen3.7-Max on a 50% promo; list price $2.50 / $7.50).
  • The mid-tier Qwen-Plus costs $0.40 / $1.20 per million tokens, the best balance of price and capability for most apps.
  • The free developer API tier ended April 15, 2026, replaced by a one-time 70M-token trial for new accounts.
  • A batch API and prompt caching can cut real costs by roughly 50% to 90% on the right workloads.

Qwen Pricing at a Glance

Qwen is Alibaba’s family of AI models, and you can use it in two ways. The consumer Qwen Chat app at chat.qwen.ai is free, and the developer API is billed per token through Alibaba Cloud Model Studio (the platform formerly branded DashScope). There is no flat monthly consumer plan in between, which makes Qwen unusual next to Western rivals.

Here is what the main text models cost on the API, priced per million tokens. Input is what you pay for the text you send in, and output is what you pay for the text the model generates back.

ModelInput ($/1M)Output ($/1M)Context windowBest for
Qwen-Turbo$0.05$0.201M tokensHigh-volume, simple tasks
Qwen-Plus$0.40$1.201M tokensThe everyday workhorse
Qwen3.7-Plusfrom $0.32from $1.281M tokensMultimodal apps
Qwen3-Max$1.20$6.001M tokensHard reasoning, coding
Qwen3.7-Max$1.25$3.751M tokensFlagship, top benchmarks

The flagship Qwen3.7-Max rate of $1.25 / $3.75 is a 50% promotional discount; the standard list price is $2.50 / $7.50 per million tokens. That promo is the reason the flagship currently undercuts its own predecessor, Qwen3-Max, on output cost.

One thing to know about reading these numbers, Model Studio tiers some rates by the input size of each request, so the figures above are the standard headline rates and your effective cost can land lower on short prompts. Always confirm the live rate for your model and region in the Alibaba Cloud Model Studio pricing docs before you build a budget.

Is Qwen Free?

Yes. The Qwen Chat app at chat.qwen.ai is free to use with no subscription, much like the free tier of ChatGPT or Gemini. You get access to Qwen’s strong models for everyday chat, writing, coding help, and image tasks without paying anything, and Alibaba has clear commercial reasons to keep the consumer app free.

Paid pricing only kicks in when you build on the Qwen API through Alibaba Cloud Model Studio, which charges per token. So if you just want to talk to Qwen, you pay nothing. If you want to plug Qwen into your own product, you pay by usage.

Qwen API Pricing by Model

Below we broke down API pricing by each of Qwen models for you to better understand the costs.

Qwen-Turbo

Qwen-Turbo is the cheapest tier at $0.05 input / $0.20 output per million tokens. It is built for speed and high volume, so it suits classification, simple Q&A, tagging, and any task where you send a lot of requests and do not need flagship-level reasoning. At these rates you can process millions of tokens for a few dollars.

Qwen-Plus

Qwen-Plus sits in the middle at $0.40 input / $1.20 output per million tokens and is the model most apps should start with. It handles the bulk of real work, summarizing, drafting, coding assistance, and retrieval tasks, at a price that stays reasonable at scale. Thinking mode, where the model reasons step by step before answering, raises the output rate but improves accuracy on harder problems.

Qwen3-Max and Qwen3.7-Max

The flagship tier is where you pay for top performance. Qwen3-Max lists at $1.20 input / $6.00 output, while the newer Qwen3.7-Max runs $1.25 / $3.75 on its current promotion (list $2.50 / $7.50). Qwen3.7-Max is the one to reach for on the hardest reasoning, math, and software-engineering tasks; it scored 56.6 on the Artificial Analysis Intelligence Index and 80.4 on SWE-Verified, landing it in the global top 10 at launch. For the full benchmark breakdown, see our Qwen3.7-Max review.

The Free Tier Change Developers Need to Know

Qwen used to offer a standing free API tier, and that ended on April 15, 2026. The free developer OAuth tier and the free Qwen Code CLI access were both removed on the same date, so older tutorials that promise “free Qwen API access” are now out of date.

What replaced it is a one-time onboarding trial. New Alibaba Cloud Model Studio accounts get roughly 70 million free tokens, structured as 1 million tokens per model across most proprietary Qwen models, valid for 90 days on the Singapore endpoint. That is generous for prototyping, but it is a trial, not a permanent free plan, so plan your budget for when it runs out.

How to Cut Your Qwen Bill

Two built-in features lower real costs well below the headline rates. The asynchronous batch API processes non-urgent jobs at roughly 50% off standard rates, which is ideal for bulk summarization, data labeling, or overnight processing where you do not need an instant reply.

Prompt caching is the second lever. When you reuse the same long prompt or system context across many requests, cached input reads cost only a small fraction of the normal input rate, in some cases around 10% of it. For chatbots and agents that repeat a large system prompt on every call, this is the difference between a workable bill and an expensive one.

Qwen vs DeepSeek, ChatGPT, and Claude on Cost

Qwen’s appeal is price-to-performance, and it competes directly with other low-cost models. Against DeepSeek, another Chinese open-weight family known for aggressive pricing, Qwen-Plus and Qwen-Turbo are in the same budget bracket, so the choice comes down to which model performs better on your specific tasks. Both sit far below US flagship pricing.

Compared with ChatGPT and Claude, Qwen is dramatically cheaper at the API level. Premium US flagship models often cost several dollars per million input tokens and far more on output, while Qwen-Plus delivers strong results for cents. If raw cost per token is your deciding factor, Qwen and DeepSeek win; if you need the absolute top of the benchmark charts or specific ecosystem features, the picture is closer. You can see where the US labs stand in our breakdown of how Claude and GPT compare, and check current promos in our guide to AI free trials. If you want Qwen alongside other models in one place, aggregators like OpenRouter also resell Qwen on pay-as-you-go terms.

A Simpler Alternative to Per-Token Billing

Per-token API pricing is powerful, but it is overkill if you just want to use top AI models day to day. Tracking input and output rates across models, watching your free trial expire, and managing separate bills for each provider adds friction that most people do not want.

This is where Fello AI fits. For one flat $9.99 per month, you get access to many leading models in a single Mac app, including Claude, ChatGPT, Gemini, Grok, and DeepSeek, with no per-token math and no juggling multiple subscriptions. Instead of metering every request, you pick the best model for the job and switch freely. If you want flagship-grade AI without API billing, that one-price, many-models setup is far simpler; you can compare the lineup in our roundup of the best AI models, and see other budget options in our MiniMax pricing guide.

Conclusion

For most people, Qwen is effectively free, the Qwen Chat app costs nothing and covers everyday use. For developers, Qwen is one of the best value APIs available, from $0.05 per million tokens on Qwen-Turbo up to a flagship that still undercuts US rivals. Just remember the free API tier is gone as of April 2026, so build your budget around pay-as-you-go rates and lean on batch processing and prompt caching to keep costs down. If you would rather skip per-token billing entirely, a flat-rate multi-model app like Fello AI gives you Qwen-class value alongside Claude, ChatGPT, and Gemini for one monthly price.

FAQ

How much does Qwen cost?

The Qwen Chat app is free. On the API, prices range from $0.05 input / $0.20 output per million tokens for Qwen-Turbo up to $1.25 / $3.75 for the flagship Qwen3.7-Max on its current promo (list $2.50 / $7.50). The popular mid-tier Qwen-Plus is $0.40 / $1.20.

Is Qwen free to use?

Yes, the consumer Qwen Chat app at chat.qwen.ai is free with no subscription. You only pay if you use the Qwen API through Alibaba Cloud Model Studio, which bills per token.

Does Qwen have a subscription plan?

No, there is no flat-rate consumer subscription like ChatGPT Plus or Claude Pro. You either use the free app or pay per token on the API. Alibaba does offer fixed monthly Model Studio plans aimed at high-volume developer workloads.

Is there still a free Qwen API tier?

Not as a standing plan. The free developer API tier ended on April 15, 2026. New accounts instead get a one-time trial of about 70 million tokens (1 million per model) valid for 90 days on the Singapore endpoint.

Is Qwen cheaper than ChatGPT?

At the API level, yes, significantly. Qwen-Plus and Qwen-Turbo cost a fraction of what premium US flagship models charge per token, which is why Qwen and DeepSeek are popular choices for cost-sensitive projects.

Share Now!

Facebook
X
LinkedIn
Threads
Email

Ricevi suggerimenti esclusivi sull'intelligenza artificiale nella tua casella di posta!

Rimanete al passo con le intuizioni degli esperti di IA, fidati dei migliori professionisti del settore tecnologico!