Description:
Nate B. Jones breaks down a pattern Anthropic published for building reliable long-running AI coding agents, addressing what he identifies as the core failure mode of most real-world agent deployments: amnesia. Even capable models like Claude Opus 4.5, Gemini 3, or GPT 5.1 start each session with no memory of prior work, causing them to re-derive goals, contradict earlier decisions, and loop indefinitely without making meaningful forward progress.
The solution is a two-agent architecture organized around persistent domain memory. An initializer agent transforms a high-level user prompt into a set of structured artifacts: a JSON feature list with every item initially marked ‘failing,’ a progress log, scaffolding instructions, and explicit test criteria defining what counts as success. A separate coding agent then runs in repeated sessions — each time reading the progress log, selecting a single failing feature, implementing it, running end-to-end tests, updating the feature status, writing a progress note, and committing. The state persists across sessions; the coding agent never guesses where it left off.
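The video does not show the exact artifact schema, but a minimal sketch of what the initializer might write to disk, with hypothetical file names (`features.json`, `progress.md`) and fields, could look like this:

```python
import json
from pathlib import Path

# Hypothetical artifact layout; the actual schema used in Anthropic's pattern
# is not shown in the video, so field names here are illustrative only.
workdir = Path("agent_state")
workdir.mkdir(exist_ok=True)

# Feature list: every item starts as "failing" and carries an explicit,
# end-to-end test criterion that defines what counts as success.
features = [
    {
        "id": "feat-001",
        "description": "User can create an account with email and password",
        "test": "POST /signup returns 201 and a usable login token",
        "status": "failing",
    },
    {
        "id": "feat-002",
        "description": "User can reset a forgotten password",
        "test": "Reset email is sent and the new password works on /login",
        "status": "failing",
    },
]
(workdir / "features.json").write_text(json.dumps(features, indent=2))

# Progress log: an append-only record the coding agent reads before every session.
(workdir / "progress.md").write_text("## Progress log\n\n(no sessions yet)\n")
```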
Jones is careful to emphasize that this architecture pattern is not limited to software. Any domain where agents need to operate across multiple sessions — content production, research, operations — can benefit from the same principle: a persistent, structured representation of goals, constraints, prior attempts, and current status. The harness, not raw model intelligence, is what makes long-horizon agent work reliable, and that harness is something builders can construct today using the Claude Agent SDK or comparable frameworks.
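As a rough, framework-agnostic illustration of that harness idea: the sketch below assumes the artifact files from the earlier example, and `run_coding_session` is a hypothetical stand-in for whatever agent invocation the Claude Agent SDK or a comparable framework actually provides.

```python
import json
from pathlib import Path

def run_coding_session(feature: dict, progress: str) -> bool:
    """Placeholder for one agent session: implement the feature, run its
    end-to-end test, and return True only if the test passes. A real harness
    would call into the Claude Agent SDK or a comparable framework here."""
    raise NotImplementedError

def harness_loop(workdir: Path = Path("agent_state")) -> None:
    features_path = workdir / "features.json"
    progress_path = workdir / "progress.md"

    while True:
        features = json.loads(features_path.read_text())
        failing = [f for f in features if f["status"] == "failing"]
        if not failing:
            break  # every feature passes; the long-running task is done

        feature = failing[0]  # one failing feature per session, never more
        passed = run_coding_session(feature, progress_path.read_text())

        # Persist the outcome so the next session never guesses where it left off.
        feature["status"] = "passing" if passed else "failing"
        features_path.write_text(json.dumps(features, indent=2))
        with progress_path.open("a") as log:
            log.write(f"- {feature['id']}: {'passed' if passed else 'still failing'}\n")
        # A real harness would also commit the code and artifacts here.
```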
📺 Source: AI News & Strategy Daily | Nate B Jones · Published December 08, 2025
🏷️ Format: Deep Dive
