A full Petaflop in the Palm of Your Hand – The Dell Pro Max with GB10

Foundation Models · 4 months ago

Description:

Dave’s Garage host Dave puts Dell’s GB10-based system through three practical workloads to assess whether Nvidia’s compact Blackwell superchip is ready for serious edge AI development. At the hardware level, the GB10 pairs a 20-core ARM CPU (10 Cortex-X925 performance cores + 10 Cortex-A725 efficiency cores) with 6,144 CUDA cores and a shared pool of 128GB LPDDR5 memory. The CPU and GPU are linked over NVLink C2C and address that same pool, which eliminates the host-to-device memory shuffling that complicates discrete GPU setups. Nvidia rates the chip at roughly 1 petaflop of FP4 AI compute; the unit draws about 230W through an external power brick.

The first workload runs large language models through Ollama compiled with CUDA and linked against TensorRT-LLM. It relies on Blackwell’s FP4 quantization path to keep 120B-parameter models resident in memory; according to Nvidia’s calibration documentation, FP4 accuracy stays within about one percentage point of FP8. The second builds a reinforcement learning system that trains a game-playing agent for the arcade title Tempest, with the GB10 taking over from two Threadripper towers equipped with dual RTX 6000 cards. The third deploys a fully local vehicle detection pipeline using YOLO and DeepStream over RTSP. It compares embeddings against a household vehicle gallery and generates SMS alerts only for unfamiliar cars, with no cloud upload required.
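The gallery comparison in the third workload can be illustrated with a small sketch. This is not the code from the video: it assumes each detected vehicle has already been reduced to an embedding vector, and it uses cosine similarity with an arbitrary threshold of 0.8. The names `HOUSEHOLD_GALLERY`, `best_match`, and `is_unfamiliar`, along with the toy three-dimensional vectors, are hypothetical.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def best_match(embedding, gallery):
    """Return (label, similarity) for the closest gallery entry, or (None, -1.0) if the gallery is empty."""
    best_label, best_score = None, -1.0
    for label, reference in gallery.items():
        score = cosine_similarity(embedding, reference)
        if score > best_score:
            best_label, best_score = label, score
    return best_label, best_score

def is_unfamiliar(embedding, gallery, threshold=0.8):
    """True when no gallery entry reaches the similarity threshold (assumed value)."""
    _, score = best_match(embedding, gallery)
    return score < threshold

# Hypothetical gallery: toy embeddings standing in for real model output.
HOUSEHOLD_GALLERY = {
    "family_suv": [0.9, 0.1, 0.3],
    "commuter_sedan": [0.1, 0.8, 0.5],
}

if __name__ == "__main__":
    seen = [0.88, 0.12, 0.31]      # close to "family_suv"
    stranger = [0.0, 0.1, -0.9]    # unlike anything in the gallery
    print(best_match(seen, HOUSEHOLD_GALLERY)[0], is_unfamiliar(seen, HOUSEHOLD_GALLERY))
    print(is_unfamiliar(stranger, HOUSEHOLD_GALLERY))
```

In a setup like this, only detections for which `is_unfamiliar` returns `True` would go on to trigger an alert. The threshold would need tuning against real embeddings.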

Dave is candid about the tradeoffs: a discrete RTX 4090 outperforms the GB10 on raw FP8 throughput, but the unified memory architecture makes the GB10 more practical for multi-model pipelines that would otherwise require constant model swapping.
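The memory argument comes down to simple arithmetic, sketched below for the 120B-parameter model and 128GB pool mentioned in the description. The estimate counts weights only. The 16GB of headroom reserved for the operating system, KV cache, and activations is an assumed figure, and the function names are hypothetical. Real FP4 formats also store scale factors, so the actual footprint would sit somewhat above the 4-bit figure.

```python
def weight_footprint_gb(num_params, bits_per_param):
    """Weights-only footprint in decimal gigabytes (ignores scale factors and runtime buffers)."""
    return num_params * bits_per_param / 8 / 1e9

def fits_in_pool(num_params, bits_per_param, pool_gb=128.0, headroom_gb=16.0):
    """True if the weights fit in the pool after reserving headroom (headroom is an assumption)."""
    return weight_footprint_gb(num_params, bits_per_param) <= pool_gb - headroom_gb

if __name__ == "__main__":
    params = 120e9  # 120B parameters, as in the description
    for name, bits in (("FP16", 16), ("FP8", 8), ("FP4", 4)):
        size = weight_footprint_gb(params, bits)
        print(f"{name}: {size:.0f} GB, fits={fits_in_pool(params, bits)}")
```

Under these assumptions the FP4 weights take about 60 GB, roughly half the pool, while FP8 weights alone come to about 120 GB and leave little room for anything else.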

📺 Source: Dave’s Garage · Published January 11, 2026
🏷️ Format: Deep Dive

Tags

Blackwell CUDA Nvidia

