Current State of Chinese LLMs: Market Leaders, Open Models, and Business Models

This is a summary of the current Chinese LLM scene based on research shared on r/LocalLLaMA. The analysis categorizes major players by their proprietary models, open-weight offerings, and business approaches.
Major Companies and Their Models
ByteDance: Their proprietary model dola-seed (also called Doubao) is described as the current market leader, playing a role similar to OpenAI. They also have an open-source Seed OSS 36B model, but the source notes it doesn't get much discussion.
Alibaba: Their proprietary model Qwen Max is reportedly not widely used. However, Alibaba is noted as strongest in open-weight offerings, especially small models, and leads in text-to-image (T2I) and text-to-video (T2V) capabilities.
Tencent: Their proprietary model Hunyuan is also not widely used. Their T2I and T2V efforts are considered second to Alibaba's.
Baidu: Their proprietary model Ernie is not widely used, with Baidu being stronger in autonomous driving.
Xiaomi: Their proprietary model is Mimo V2 Pro, and they have an open-weight model Mimo V2 Flash 309B-A15B.
DeepSeek: The Innovative Side Project
DeepSeek is described as a side project from an algorithmic trading firm. Current usage in China is reportedly a close second to ByteDance's Doubao, with about half the users. The source highlights DeepSeek as "the most innovative among all Chinese LLM companies," having invented techniques like MLA, MTP, DSA, and GRPO. The analysis suggests its business model might be similar to the 'Six AI Small Tigers,' but speculates the project might be more for attracting investments and gaining political access.
The Six AI Small Tigers
This group is characterized by highly similar business models: releasing big open-weight models to gain recognition while providing cheap inference services. The source questions their long-term viability.
- Zhipu: IPOed in Hong Kong. Their current
GLM-5model is described as a derivative of DeepSeek. - Minimax: IPOed in Hong Kong. They have a proprietary
MiniMax 2.7model and an open-weightMiniMax 2.5model, which is described as a "vanilla MoE 229B-A10B." This architecture reportedly gives them significantly lower inference costs than others. - Moonshot: Their
Kimiopen-weight model is described as a derivative of DeepSeek. - Stepfun: Their open-weight
Step 3.5 flashmodel uses a mixture of full attention and sliding window attention (SWA) layers at a 1:3 ratio. It's described as a 196B-A11B model with a business model similar to Minimax's, though their model is reportedly not as good. - Baichuan: Their
Baichuan-M3 235Bis described as a medical-enhanced open-weight model based onQwen3Moe. - 01 AI: Their last open-weight model was
Yi-34B, published in November 2024. They now reportedly focus on enterprise AI agent systems, making them "irrelevant to people here."
📖 Read the full source: r/LocalLLaMA
👀 See Also

GPU Power Consumption Deviates from Token Predictor Theory in Small LLMs
An experiment testing the 'stochastic parrot' theory on four 8B-parameter models found GPU power consumption often scales non-linearly with token count, with divergence rates ranging from 7.7% to 36.7%. The study also revealed persistent residual heat after philosophical queries and order-dependent effects.

Pentagon Sets Friday Deadline for Anthropic to Drop AI Ethics Rules
The Pentagon has given Anthropic until Friday to abandon its AI ethics rules, according to a Politico report. The article received 15 points and 3 comments on Hacker News.

Claude Skills vs. MCP: A Developer's Practical Boundary Question
A developer questions where MCP's value becomes decisive versus Claude Skills after the Skills release made tool integration reasoning harder, noting that well-structured instructions can often suffice without protocol boundaries.

Claude Connection Failures for Organizations Blocking GitHub by IP Address
An automatic status update reports connection failures for organizations that restrict GitHub access by IP address, with ongoing incident tracking via status.claude.com.