DeepMind’s New AI: A Gift To Humanity

DeepMind’s New AI: A Gift To Humanity

More

Descriptions:

Dr. Károly Zsolnai-Fehér of Two Minute Papers provides a detailed technical breakdown of Google DeepMind’s Gemma 4, an open-weight model family notable for running entirely offline on consumer hardware — including phones and, as demonstrated in the video, a first-generation Nintendo Switch. The core argument is that capable locally-owned AI represents genuine independence from cloud subscription terms that can be revoked without notice.

Four technical improvements explain Gemma 4’s surprising efficiency. First, highly curated training data rather than bulk internet ingestion. Second, a hybrid attention mechanism combining a local sliding window for fine-grained detail with global attention for broader document context. Third, native aspect ratio image processing — Gemma 3 squished all inputs to squares, losing information; Gemma 4 does not. Fourth, a shared KV-cache that borrows intermediate memory from earlier network layers rather than recomputing it, reducing redundant work. The 31B dense model (which activates all parameters, unlike mixture-of-experts architectures) ranks third among open models and outperforms some models ten times its size on select benchmarks.

Beyond architecture, Gemma 4 doubles the context window to 256k tokens versus Gemma 3, demonstrates strong agentic capabilities including tool use and local coding, and ships under an Apache 2.0 license — a significant upgrade from the restrictive Gemma license that constrained commercial use of earlier versions. Within days of release, community developers had already built offline translation apps and real-time browser-based image classification tools.


📺 Source: Two Minute Papers · Published April 16, 2026
🏷️ Format: Deep Dive

1 Item

Channels

1 Item

Companies