Field noteJan 23, 2026Sam Gaddis

Ralph Wiggum vs. Claude Code Tasks

Claude Code tasks changed the context-window problem. Smaller task packets make agent work easier to review, retry, and trust.

The short version: a constraint that has been limiting AI-assisted coding just got significantly smaller. The people who know how to use these tools can now tackle much larger features with more autonomy. Development timelines should shrink. Expectations for what a skilled engineer can ship should go up.

If you want to understand why, keep reading.

The Problem These Tools Solve

The context window has always been the ceiling. Fill it up, and the model loses the thread. For small features, manageable. For big ones (a full Twilio integration, an auth system refactor), it's a hard limit.

Both Ralph Wiggum and Claude's native tasks solve this the same way: externalize the plan into files that live in your repo. Build an 11-step implementation plan, and those steps persist as files. Clear the context, start fresh, and the agent reads the plan to pick up where it left off.

Ralph Wiggum: The Iterative Approach

Geoffrey Huntley created Ralph in mid-2025. His description: "Ralph is a Bash loop." It runs through your terminal, feeding prompts to the agent until the job is done. Named after the Simpsons character because the philosophy is the same: keep trying until it works.

By late 2025, Anthropic's Head of Claude Code, Boris Cherny, had formalized Ralph into an official plugin. So while it started as a community hack, it's now part of the official toolkit.

What makes Ralph different is the self-review loop:

Planning mode: Agent creates a plan, reviews it, refines. I run 3 iterations.
Build mode: Agent builds, reviews, fixes, repeats. I run 10 iterations.

This catches problems that single-pass execution misses. It also burns tokens like crazy.

Claude Code Tasks: The Native Version

The recent tasks update builds these ideas directly into Claude Code. Same core mechanic (plans as persistent files), but more token-efficient and less iterative than Ralph. One quirk: Claude doesn't always realize it can do this. You sometimes have to push it into task mode. Expect that to smooth out soon.

The Tradeoff

Ralph: Better results on complex features. Higher token cost. More setup.

Native tasks: More efficient. Less iteration. Already built in.

If you're doing serious Claude Code work today, Ralph is worth learning. If you'd rather wait, give it a month. These things tend to converge fast.

The Runpoint Letter

The worldview behind the letter →