VCs eye $12B opportunity as China’s AI chip upstarts crack training
Key Takeaways
- As China’s top AI labs begin training models on domestic silicon instead of Nvidia, the shift creates a huge opening for chip startups, model optimization platforms, and training-as-a-service ventures.
- The first multimodal model fully trained on Huawei Ascend chips signals a decoupling that could reshape the entire AI hardware supply chain.
Mentioned
Key Intelligence
Key Facts
- 1No top Chinese AI model is yet known to have been pre-trained on domestic chips; domestic chips are widely used only for inference.
- 2Zhipu AI’s GLM-Image, open-sourced in January 2026, was the first state-of-the-art multimodal model trained entirely on Chinese hardware (Huawei Ascend 910).
- 3Training image models requires far less compute than large language models (LLMs), and Zhipu has not yet achieved full LLM pre-training on domestic chips.
- 4Washington’s export controls and Beijing’s self-sufficiency push are forcing Chinese AI labs to migrate pre-training and post-training onto indigenous accelerators.
- 5Natixis economist Gary Ng said Chinese labs may develop slower with domestic suppliers, but in the long run they are building a rare complete AI supply chain.
- 6The article profiles five AI models using local silicon, signaling an accelerating trend but a remaining compute gap.
Zhipu AI
Company- Founded
- 2019
- Latest Valuation
- $2.5B (estimated)
Beijing-based AI lab backed by major Chinese VCs, known for pushing the frontier of domestic chip training. Open-sourced GLM-Image in Jan 2026.
Startup opportunity as domestic chip training scales
Analysis
For startups, the most compelling number isn't performance teraflops — it's the $12 billion that China's AI infrastructure market is projected to reach by 2027. With Washington choking off the latest Nvidia GPUs, Chinese AI labs are finally booting up domestic chips for pre-training, not just inference. The Zhipu GLM-Image milestone proves it's possible, and where there's a compute gap, there's a startup gold rush: custom silicon, optimization middleware, and cloud-based training clusters are all up for grabs.
What to Watch
The quest to replace Nvidia in China’s AI ecosystem is entering a new phase, as domestic labs begin shifting model training — not just inference — onto homegrown silicon. The South China Morning Post article examines five AI models trained on Chinese chips, revealing both significant progress and a stubborn compute gap. While Chinese AI models are increasingly competitive with US counterparts, none of the country’s top-tier large language models (LLMs) have been pre-trained entirely on local hardware. Instead, adoption has been concentrated in the less demanding inference stage. This disparity traces back to the three-tiered AI development pipeline: pre-training (the most computationally intensive phase requiring thousands of GPUs running for months), post-training (fine-tuning with less compute), and inference (deploying the model). Driven by Washington’s escalating export controls and Beijing’s push for technological self-sufficiency, Chinese AI labs are now experimenting with early-stage training on indigenous accelerators. The article highlights Zhipu AI’s GLM-Image, an image generation model open-sourced in January 2026, as the first state-of-the-art multimodal model trained entirely on domestic hardware — specifically Huawei’s Ascend Atlas 800T A2 servers powered by the Ascend 910 AI accelerator and the MindSpore deep learning framework. However, training image models demands significantly less compute than LLMs, and Zhipu has yet to achieve full LLM pre-training on domestic chips. Natixis economist Gary Ng notes that while dependence on indigenous suppliers may slow development efficiency, China is constructing an entire domestic AI supply chain — a rare global feat. The long-term strategic play is clear: if Chinese labs can crack high-end training on local silicon, it would fundamentally alter the global AI hardware market, which Nvidia currently dominates with an estimated 80%–90% share in the data center GPU segment. The implications are profound for investors and startups alike. For Nvidia, the growth story in China — which contributed 20–25% of its data center revenue before recent restrictions — faces structural headwinds. For the startup ecosystem, the shift opens a massive opportunity for chip design firms, AI optimization software, and model-training-as-a-service platforms tailored to domestic hardware ecosystems. The transition will be gradual; current domestic chips lack the performance and software maturity of Nvidia’s CUDA ecosystem, but the forced decoupling is accelerating investment. The article identifies five models that mark the early inroads: GLM-Image, and presumably others detailed in the full report. As of mid-2026, no Chinese LLM has been fully pre-trained on local silicon, but the technical trajectory suggests that milestone could be reached within 12–18 months. Once that happens, expect a wave of updates from cloud providers like Alibaba Cloud and Huawei Cloud offering domestic training clusters, and a re-rating of Chinese AI chip startups. For global markets, a successful decoupling would reduce China’s reliance on Nvidia, potentially softening its pricing power and reshaping geopolitical risk premiums.
How we covered this story
Every story in our startup coverage is assembled from multiple primary sources, cross-referenced for factual consistency, and scored along three independent dimensions: sentiment, operational impact, and source-cluster confidence. Single-source rumors and unverifiable claims do not pass our editorial gate. When a story shows "Verified by N sources" with N≥2, the development is independently corroborated; when N=1, we mark it explicitly so readers can weigh the signal accordingly.
Impact scoring uses a 1-10 scale weighted toward regulatory, financial, and operational consequence rather than coverage volume. A topic that runs in every outlet but moves no real decisions ranks lower than a niche regulatory filing that reshapes how operators in the startup space have to behave. Read our full methodology for the scoring rubric, our glossary for term definitions, and our trends index for the longitudinal view across the beat.
| Signal on this page | What it tells you |
|---|---|
| Verified by N sources | Independent corroboration count. N≥2 is our confidence floor; N=1 is marked explicitly. |
| Impact score (1-10) | Regulatory + financial + operational weight. 8+ signals an experienced-operator action item. |
| Sentiment | Five-tier classification trained on labeled startup-specific corpora. |
| Timeline | Where applicable, the related-events sequence that contextualizes today's development. |