Business Models

How Does NVIDIA Make Money? GPUs, Data Centers, and the AI Infrastructure Business Model

NVIDIA generates $130.5B in annual revenue, driven by AI data center chips, gaming GPUs, networking, software, and automotive platforms.

By Swet Parvadiya·June 27, 2026

How Does NVIDIA Make Money?

NVIDIA generates $130.5B in annual revenue — a 122% year-over-year increase in FY2024 — driven almost entirely by its Data Center segment supplying AI training and inference chips to hyperscalers, cloud providers, and enterprises building AI infrastructure. NVIDIA's business model has shifted from gaming hardware to AI infrastructure in the span of three years, making it one of the most remarkable business pivots in technology history.

Data Center: The Revenue Engine

Data Center is NVIDIA's dominant segment, generating approximately $115B in FY2025 revenue — roughly 88% of total company revenue. This segment includes:

H100 and H200 GPUs (Hopper architecture): NVIDIA's H100 GPU, priced at $25,000–$40,000 per unit, became the primary training chip for large language models including GPT-4, Gemini, Llama, and Claude. Demand for H100s far exceeded supply through 2023–2024, with lead times of six months or more. Microsoft, Google, Meta, Amazon, and Oracle collectively purchased tens of billions of dollars of H100 and H200 units. The H200 (2024) added HBM3e memory, improving inference throughput for deployed AI models.
Blackwell architecture (B100, B200, GB200): NVIDIA's next-generation GPU platform, launched in 2024, delivers 4x the AI training performance and 30x the inference performance of Hopper. The GB200 NVL72 rack system — 72 Blackwell GPUs connected via NVLink — is the flagship AI training system, priced at approximately $3M per rack. Blackwell became the fastest product ramp in NVIDIA's history, with demand again exceeding production capacity through 2025.
DGX systems and HGX reference designs: NVIDIA sells complete server systems (DGX) pre-configured for AI workloads, as well as reference HGX motherboard designs used by OEM partners (Dell, HPE, Supermicro). DGX Cloud provides NVIDIA AI computing as a cloud service hosted in partnership with Microsoft Azure, Google Cloud, and Oracle Cloud.
Networking (InfiniBand, Ethernet): Following the Mellanox acquisition (2020, $6.9B), NVIDIA supplies high-bandwidth networking interconnects that link GPU clusters in AI data centers. InfiniBand is the dominant high-performance interconnect in large AI training clusters. This networking business generates approximately $10–12B annually and is bundled with GPU deployments in full-stack AI infrastructure sales.
CUDA software platform: CUDA — Compute Unified Device Architecture — is NVIDIA's GPU programming framework and ecosystem. Over 4 million developers write AI and HPC code in CUDA, and approximately 3,000 GPU-accelerated applications exist in the CUDA ecosystem. CUDA is not priced separately; it is the software moat that creates GPU lock-in. Developers who write CUDA code cannot run it on AMD or Intel GPUs, making NVIDIA GPU replacement extraordinarily difficult despite AMD's competitive hardware efforts.

Gaming: The Legacy Business

Gaming — NVIDIA's original core business — now represents approximately 9–10% of revenue at $10–13B annually. GeForce GPUs (RTX 4000 and 5000 series) are used in gaming PCs and laptops. The GeForce NOW cloud gaming service streams games to devices without dedicated GPUs. Gaming GPU gross margins are high (~60–65%) but the segment's relative contribution has declined as Data Center revenue exploded.

Professional Visualization

NVIDIA's RTX professional GPU line (used in 3D design, animation, CAD, and media production workstations) generates approximately $500M–1B quarterly. The Omniverse platform — for industrial digital twins and 3D simulation — represents NVIDIA's attempt to move up the stack from hardware into enterprise software for virtual factory simulation, robotics, and autonomous vehicle testing.

Automotive

NVIDIA's automotive segment (~$1.5–2B annually) supplies DRIVE Orin and DRIVE Thor chips to automakers for in-vehicle AI computing — driver assistance, autonomous driving, and digital cockpit systems. Design wins include Mercedes-Benz, Volvo, Li Auto, BYD, and others, with a $14B+ automotive revenue pipeline across multi-year vehicle programs. This is an early but significant long-term growth opportunity as vehicle software complexity increases.

Why NVIDIA's Margins Are Exceptional

NVIDIA's gross margin reached 74–75% in FY2025, well above typical semiconductor companies (Intel ~40%, AMD ~50%). This margin premium reflects several factors: NVIDIA designs chips but outsources manufacturing to TSMC (fabless model), eliminating factory capital costs; CUDA software lock-in means customers have no competitive alternative for large-scale AI training; demand has consistently exceeded supply, giving NVIDIA pricing power; and the full-stack offering (chips + networking + software) means NVIDIA captures more of the AI infrastructure budget than a component supplier would.

The CUDA Moat vs. Competition

AMD's MI300X and MI325X GPUs are competitive with NVIDIA's Hopper generation on paper and cheaper. Google's TPUs, Amazon's Trainium, and Microsoft's Maia are custom AI accelerators designed to reduce hyperscaler dependence on NVIDIA. Meta has invested in AMD GPUs to reduce vendor concentration. Despite this, NVIDIA has maintained 70–80%+ market share in AI training chips, primarily because CUDA's developer ecosystem is 15 years deep and its software tools (cuDNN, TensorRT, Triton Inference Server, NeMo, Megatron) are more mature. Moving a large AI training cluster from CUDA to ROCm (AMD's equivalent) requires significant software migration effort that most organizations have deferred given the pace of AI development.

Summary

NVIDIA makes money primarily through Data Center GPU sales ($115B, ~88% of revenue) — H100, H200, and Blackwell GPUs for AI training and inference — plus InfiniBand networking and the CUDA software ecosystem that locks developers into NVIDIA hardware. Gaming (~10% of revenue) is the historical core business. Automotive is a growing secondary segment. NVIDIA's 74% gross margins reflect the pricing power of being the only at-scale AI training chip supplier with a mature software ecosystem. Verify all figures against NVIDIA's current 10-K or most recent earnings release.

Related Reading

Continue the research with Nvidia company profile, how companies make money, business moats.

Disclaimer:Financial figures cited in this article are approximate and sourced from publicly available reports. Always verify against the company's current SEC filings (10-K, 10-Q) or earnings releases before using in investment or business analysis.

← Back to all articles

How Does NVIDIA Make Money?

Data Center: The Revenue Engine

Data Center is NVIDIA's dominant segment, generating approximately $115B in FY2025 revenue — roughly 88% of total company revenue. This segment includes:

H100 and H200 GPUs (Hopper architecture): NVIDIA's H100 GPU, priced at $25,000–$40,000 per unit, became the primary training chip for large language models including GPT-4, Gemini, Llama, and Claude. Demand for H100s far exceeded supply through 2023–2024, with lead times of six months or more. Microsoft, Google, Meta, Amazon, and Oracle collectively purchased tens of billions of dollars of H100 and H200 units. The H200 (2024) added HBM3e memory, improving inference throughput for deployed AI models.

Blackwell architecture (B100, B200, GB200): NVIDIA's next-generation GPU platform, launched in 2024, delivers 4x the AI training performance and 30x the inference performance of Hopper. The GB200 NVL72 rack system — 72 Blackwell GPUs connected via NVLink — is the flagship AI training system, priced at approximately $3M per rack. Blackwell became the fastest product ramp in NVIDIA's history, with demand again exceeding production capacity through 2025.

DGX systems and HGX reference designs: NVIDIA sells complete server systems (DGX) pre-configured for AI workloads, as well as reference HGX motherboard designs used by OEM partners (Dell, HPE, Supermicro). DGX Cloud provides NVIDIA AI computing as a cloud service hosted in partnership with Microsoft Azure, Google Cloud, and Oracle Cloud.

Networking (InfiniBand, Ethernet): Following the Mellanox acquisition (2020, $6.9B), NVIDIA supplies high-bandwidth networking interconnects that link GPU clusters in AI data centers. InfiniBand is the dominant high-performance interconnect in large AI training clusters. This networking business generates approximately $10–12B annually and is bundled with GPU deployments in full-stack AI infrastructure sales.

CUDA software platform: CUDA — Compute Unified Device Architecture — is NVIDIA's GPU programming framework and ecosystem. Over 4 million developers write AI and HPC code in CUDA, and approximately 3,000 GPU-accelerated applications exist in the CUDA ecosystem. CUDA is not priced separately; it is the software moat that creates GPU lock-in. Developers who write CUDA code cannot run it on AMD or Intel GPUs, making NVIDIA GPU replacement extraordinarily difficult despite AMD's competitive hardware efforts.

Gaming: The Legacy Business

Professional Visualization

Automotive

Why NVIDIA's Margins Are Exceptional

The CUDA Moat vs. Competition

Summary