Market analysis
Analysis
Positioning
Competitors
- NVIDIA CorporationIncumbent merchant data-center GPU leader (~80-92% revenue share)
Defines the category through CUDA + NVLink + first-priority CoWoS allocation; FY2025 Q4 revenue $39.3B; data-center Q3 FY2026 $51.2B
- Advanced Micro Devices, Inc.Principal merchant-GPU challenger
Instinct MI300X / MI325X / MI355X; >$5B Instinct accelerator revenue FY2024; ROCm software stack
- Intel CorporationThird merchant accelerator vendor
Gaudi line; EMIB/Foveros packaging as alternative to TSMC CoWoS bottleneck
- Alphabet Inc. (Google TPU)Hyperscaler custom-ASIC competitor (and customer of merchant GPUs)
Longest-running custom AI ASIC program — TPU v1 2015 → v7 Ironwood 2025; co-designed with Broadcom
- Amazon (AWS Trainium/Inferentia)Hyperscaler custom-ASIC competitor (and customer of merchant GPUs)
Two-chip Trainium-for-training / Inferentia-for-inference cost play inside AWS; designed by Annapurna Labs (acq. 2015)
- Microsoft Corporation (Maia)Hyperscaler custom-ASIC competitor (and largest single NVIDIA customer)
Maia 100 announced Nov 2023; deployed inside Azure for first-party OpenAI workloads
- Meta Platforms, Inc. (MTIA)Hyperscaler custom-ASIC competitor (and large NVIDIA customer)
MTIA designed with Broadcom; talks reported Q4 2025 to additionally rent Google TPUs from 2026
- Broadcom Inc.Dominant custom-ASIC co-design partner + AI networking switch silicon
AI revenue $8.4B in Q1 FY2026 (+106% YoY); designs across Google TPU, Meta MTIA, OpenAI Titan, ByteDance
- Huawei Technologies (Ascend)PRC-domestic AI accelerator competitor under US export controls
Ascend 910C dual-chiplet + HBM2E DaVinci NPU; ~400k units planned in 2025; mass shipment from mid-2025
- Cerebras SystemsMerchant inference-specialist competitor
Wafer-scale WSE-3 + CS-3 supercomputers + inference cloud; IPO May 2026 at ~$56.4B FDV
- Groq, Inc.Merchant inference-specialist competitor
LPU ASIC + inference cloud; $6.9B valuation Sep 2025
- SambaNova SystemsMerchant inference-specialist competitor (enterprise / on-prem)
- TenstorrentMerchant accelerator competitor (RISC-V, open software)
SWOT
- Demand-side tailwind: hyperscaler capex commitments at $600-725B for 2026 (Big Four), ~75% directed at AI infrastructure Anchors a multi-year, multi-hundred-billion-dollar order book for the whole accelerator + packaging + memory + networking stack
- Mature software moat (CUDA) at the incumbent that compounds as the installed base grows CUDA + cuDNN + NCCL + TensorRT-LLM is the highest single switching cost in the industry; ROCm, XLA, Triton are closing the gap only slowly
- Diversified credible alternatives across multiple architectural approaches Merchant GPUs (AMD), wafer-scale (Cerebras), inference-only ASIC (Groq), reconfigurable (SambaNova), hyperscaler ASIC (TPU/Trainium/Maia/MTIA), and PRC-domestic NPU (Huawei) — collectively reduce single-vendor concentration risk to buyers
- Single-vendor revenue concentration at NVIDIA (~80-92% data-center share) Industry-level price discovery, software stack evolution, and supply allocation are all set by one firm — and that firm's decisions about which customers, geographies, and SKUs to prioritise become market structure
- Severe physical bottleneck at TSMC CoWoS + HBM Estimated ~90% of global advanced packaging and HBM consumed by the top four AI designers in 2025; ramp from 65-75K WPM (2025) to 90-110K WPM (2026) still trails demand
- Geographic concentration risk at TSMC / Taiwan Sole-source dependence on a single island for advanced AI accelerator production is the dominant supply-chain risk
- Software-fragmentation risk on the non-NVIDIA path ROCm, XLA, MLIR, vendor-specific compilers for TPU/Trainium/Maia/MTIA/Huawei increase porting cost; every additional accelerator increases developer-tooling debt
- Inference workload share rises faster than training, expanding the market faster than NVIDIA can serve it Test-time compute / agentic workloads (long-context, multi-step) push inference compute consumption per token higher; favors lower-cost ASICs and inference-specialist clouds
- Custom-ASIC programs at every major hyperscaler create a $30B+ ASIC sub-market growing >30% per year Broadcom and Marvell are direct beneficiaries; the structural shift dwarfs the merchant-challenger story in size
- PRC-domestic stack (Huawei Ascend + SMIC) opens a $10B+ parallel market under US export controls Permanent bifurcation gives Huawei a captive demand pool inside China at scale; 2025 production at ~400k Ascend 910C units
- Inference-cloud business model (Cerebras, Groq, neoclouds) lets challengers sell capacity rather than chips Decouples the customer adoption decision from CUDA software lock-in and removes the upfront capex barrier
- Hyperscaler capex sustainability if monetisable AI revenue lags $600-725B aggregate Big Four capex 2026 requires a commensurate increase in AI-monetised revenue downstream; if it slips, the merchant accelerator pricing structure compresses sharply
- US BIS export-control regime can swing in either direction with little notice Apr 2025 H20 freeze removed ~$16B / 1.3M-unit pipeline overnight; Jan 2026 brought a partial relaxation — the policy uncertainty is itself a structural cost
- Customer-base concentration risk at challengers (single-customer revenue) Cerebras revenue 85% G42 in 2024, 62% MBZUAI in 2025 — typical of the inference-specialist cohort and a clear IPO-risk-factor
- Taiwan geopolitical risk Any disruption to TSMC's Hsinchu / Tainan / Kaohsiung fabs would cause a worldwide AI chip shortage of unprecedented scale
Porter's Five Forces
Greenfield entry requires: a credible chip-design team (years), TSMC CoWoS allocation (allocation rights, not just dollars), HBM supply (allocation), $100M+ of mask + design costs per generation, and a software stack to compete with CUDA. The 2025-2026 wave of new entrants are essentially the existing hyperscalers using Broadcom/Marvell to bridge the design gap; no plausible standalone new entrant has emerged outside that pattern.
TSMC controls ~70% of foundry share and is sole-source for advanced AI accelerator packaging via CoWoS; HBM is a three-supplier oligopoly (SK hynix, Samsung, Micron). The dominant suppliers can — and do — allocate capacity by customer priority, and ramp constraints persist through 2027.
One vendor (NVIDIA) holds 80-92% revenue share, which structurally suppresses head-to-head price rivalry; the real rivalry happens architecturally between NVIDIA GPUs and hyperscaler custom ASICs at the buy-vs-build decision, plus within the merchant inference-specialist cohort fighting over a smaller pool.
Buyer base is highly concentrated (Big Four hyperscalers + a small group of frontier labs and sovereign AI buyers), which in principle confers buyer power. But supply scarcity inverts the relationship — buyers compete for allocation rather than negotiate price down. The largest buyers exercise their power through ASIC programs (build-instead-of-buy) rather than price negotiation.
Substitutes exist (hyperscaler ASICs, merchant challengers, PRC-domestic stack) and are gaining share at the margin, but the CUDA software moat plus inertia in production ML pipelines limits the speed of substitution. Inference workloads substitute more readily than training; PRC market is fully substituted by export-control mandate.