Description:
CES 2026 may be remembered less for its gadgets than for the moment the AI industry's shift to an industrial footing became impossible to ignore. Nate B. Jones breaks down what Nvidia's announcements (particularly the Vera Rubin rack-scale platform) actually signal about where AI infrastructure investment is heading, and why inference, not training, is now the dominant cost center for every major AI lab.
The Rubin platform is a six-chip rack-scale system that Nvidia claims cuts inference token generation costs by a factor of 10 while supporting 10-million-token context windows. Crucially, the platform ships with a dedicated inference context memory storage tier — essentially externalizing the KV cache from the GPU itself — which Jones reads as an explicit acknowledgment that inference scaling is now as much a memory and data-movement problem as a compute problem. Sam Altman’s October 2025 figure of 800 million weekly active ChatGPT users illustrates the permanent serving load that now dwarfs any individual training run.
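To make the memory-and-data-movement point concrete, here is a rough KV-cache sizing sketch in Python. Every model parameter below is an illustrative assumption (a hypothetical fp16 decoder with 80 layers, 8 grouped-query KV heads, and a head dimension of 128), not a figure from the video or from Nvidia:

```python
# Back-of-envelope KV-cache sizing. All model parameters here are
# illustrative assumptions, not figures from the video or from Nvidia.

BYTES_FP16 = 2      # bytes per element in fp16/bf16
NUM_LAYERS = 80     # hypothetical decoder depth
NUM_KV_HEADS = 8    # hypothetical grouped-query-attention KV heads
HEAD_DIM = 128      # hypothetical per-head dimension

def kv_cache_bytes(seq_len: int) -> int:
    """KV-cache size for one sequence: K and V tensors, each of
    shape [layers, kv_heads, seq_len, head_dim], in fp16."""
    per_token = 2 * NUM_LAYERS * NUM_KV_HEADS * HEAD_DIM * BYTES_FP16
    return per_token * seq_len

for ctx in (128_000, 1_000_000, 10_000_000):
    print(f"{ctx:>12,} tokens -> {kv_cache_bytes(ctx) / 1e9:,.1f} GB")
```

Under these assumptions the cache grows from roughly 42 GB at a 128K-token context to about 3.3 TB at 10 million tokens, far beyond any single GPU's HBM. That is the arithmetic behind moving inference context into its own storage tier rather than squeezing it into GPU memory.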
Jones connects the hardware story to OpenAI's supply chain positioning: a $38 billion AWS capacity lock, a multi-billion-dollar CoreWeave deal, and the Stargate project's new partnership with Samsung and SK Hynix targeting 900,000 DRAM wafers per month. Reuters data showing DRAM prices up over 300% in Q4 2025 underscores how tight the supply chain has become, and why the companies that locked in capacity agreements early are structurally advantaged heading into the scale-out phase of 2026.
📺 Source: AI News & Strategy Daily | Nate B Jones · Published January 08, 2026
🏷️ Format: News Analysis