we just arrived at the “WTF” moment in AI

Business & Strategy · 4 months ago

Descriptions:

Wes Roth covers what he characterizes as a genuine capability milestone: autonomous AI systems solving previously unsolved problems from the famous Erdős collection, a database of 1,135 open mathematical challenges. Terence Tao (widely considered one of the greatest living mathematicians, the youngest IMO gold medalist in history, and a UCLA professor) publicly validated the breakthrough, confirming that Erdős problem #728 was solved “more or less autonomously” by AI. The result is a novel proof not previously found in the literature and not based on a trivial loophole.

The video systematically categorizes the wave of recent AI math achievements: fully autonomous solutions to open problems, human-AI collaborative proofs, new proofs of previously-solved problems, AI-powered literature reviews, and formalized proofs. GPT-5.2 (released December 11, 2025) and Grok 4.20 are highlighted as the primary drivers. Scott Aaronson is cited as having published a paper in September 2025 where a key technical proof step came directly from GPT-5 Thinking — his own characterization of the event.

Roth situates all of this within a steep capability curve concentrated in November 2025 through January 2026. Tao’s framing — “a genuine increase in capability of these tools in recent months” — anchors the analysis, and the video also explores AI’s emerging ability to rapidly generate expository mathematical writing as a secondary development. The pace and clustering of these breakthroughs across multiple independent models and research groups is presented as the central signal.

📺 Source: Wes Roth · Published January 12, 2026
🏷️ Format: News Analysis

Channels: Wes Roth
Companies: OpenAI
Tags: ChatGPT, Google, GPT-5.2 Pro, OpenAI
