Description:
Dave Palmer, a retired Microsoft software engineer known for work going back to the MS-DOS and Windows 95 era, recounts the moment his custom reinforcement learning agent surpassed his own official world record playing Tempest — the 1981 Atari arcade game — on its most punishing ‘extreme’ difficulty settings. The video is an unusually detailed hobbyist RL engineering story, covering a full year of development, a frustrating plateau, and the two changes that finally broke through it.
The system reads live game state directly from MAME’s 6502 emulator memory via a Lua scripting bridge, transmitting 195 game-state floats per frame over a socket to a Python-based training loop. Training runs on a Dell workstation that achieves 3,000–4,000 frames per second and 50 training steps per second, using only 10% CPU and 20% GPU — a claimed 5x improvement in AI throughput over an Apple Mac Pro Ultra. The replay buffer holds 15 million frames (roughly six days of gameplay) with prioritized sampling weighted toward surprising or high-error experiences.
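The receiving side of that bridge amounts to reading a fixed-size packet of packed floats each frame. A minimal sketch of what the Python end might look like — the function name, byte order, and framing are assumptions; only the 195-floats-per-frame figure comes from the video:

```python
import socket
import struct

FLOATS_PER_FRAME = 195              # game-state values per frame, per the video
FRAME_BYTES = FLOATS_PER_FRAME * 4  # assuming 32-bit little-endian floats

def read_frame(conn: socket.socket) -> tuple[float, ...]:
    """Read exactly one frame of packed floats from the emulator-side sender."""
    buf = b""
    while len(buf) < FRAME_BYTES:
        chunk = conn.recv(FRAME_BYTES - len(buf))
        if not chunk:
            raise ConnectionError("emulator closed the socket mid-frame")
        buf += chunk
    return struct.unpack(f"<{FLOATS_PER_FRAME}f", buf)
```

The loop around `recv` matters: TCP delivers a byte stream, not messages, so a single `recv` can return a partial frame at these rates.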
The breakthrough came from two architectural changes: adding an attention mechanism and switching the spatial representation from a flat 2D grid to polar/radial coordinates that match how Tempest’s geometry actually works. Palmer also covers reward shaping (score delta plus a subjective lane-safety signal), the deliberate use of a ‘not very good’ teacher for imitation learning bootstrapping, and a frank discussion of where vibe-coding with AI assistants helps versus where serious ML engineering still requires manual precision.
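The intuition behind the polar/radial switch is that Tempest's playfield is a closed tube: lane 0 and the last lane are physical neighbors, which a flat 2D grid cannot express. A hedged sketch of one common way to encode that (the lane count, function name, and sin/cos encoding are illustrative assumptions, not Palmer's actual feature layout):

```python
import math

NUM_LANES = 16   # illustrative lane count for a closed Tempest tube
MAX_DEPTH = 1.0  # depth already normalized to [0, 1] in this sketch

def to_polar_features(lane: int, depth: float) -> tuple[float, float, float]:
    """Encode a position on the tube as (sin(theta), cos(theta), depth).

    The sin/cos pair makes the first and last lanes numerically close,
    matching the wrap-around geometry of the playfield.
    """
    theta = 2 * math.pi * lane / NUM_LANES
    return (math.sin(theta), math.cos(theta), depth / MAX_DEPTH)
```

With this encoding, the distance between lane 0 and lane 15 is the same small step as between any adjacent pair, so the network no longer has to learn the wrap-around seam from data.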
📺 Source: Dave’s Garage · Published February 28, 2026
🏷️ Format: Deep Dive
