What is loop engineering?

Loop engineering is the practice of designing autonomous cycles where an AI agent performs a task, verifies the output against a defined criterion (tests pass, spec is met, review approves), and retries if the check fails — rather than relying on a single prompt and manual review. The term was popularised by Peter Steinberger in June 2026 and championed by Boris Cherny of Anthropic's Claude Code team.

How is loop engineering different from prompt engineering?

Prompt engineering optimises a single instruction you type. Loop engineering optimises the autonomous system that decides what to prompt, when to retry, and whether the result is acceptable. Prompt engineering ends when Claude replies; loop engineering ends when a verifiable condition is satisfied — or when you run out of tokens.

What does Boris Cherny say about loop engineering?

Boris Cherny, who leads Claude Code at Anthropic, argues that engineers who stop prompting and start building loops ship dramatically more code — citing that Claude authors 80%+ of production code at Anthropic using this approach. His core claim is that verification gates, not better prompts, are what make AI coding reliable at scale.

What are the risks of self-improvement loops?

Matt Pocock (Total TypeScript author) argues that automatic self-improvement loops — where the agent writes its own CLAUDE.md updates or memory entries — tend to produce low-quality suggestions that the agent then over-indexes on in subsequent sessions. The loop amplifies whatever bias is in the first suggestion. He recommends human review before any auto-generated memory is applied.

What Claude Code commands support loop engineering?

Claude Code's /loop command runs a prompt on a schedule or until a condition is met. The /goal command sets a long-running objective with intermediate checkpoints. Together they are the primary infrastructure for loop engineering in Claude Code. See the Loop Engineering implementation guide for working patterns.

Loop Engineering Goes Mainstream: The AI Skill Everyone Is Talking About in June 2026 | explainx.ai Blog

Update — July 18, 2026: Peter Steinberger's next wave — graph engineering — arrived with a single tweet ("Are we still talking loops or did we shift to graphs yet?") that hit 575K views. Graphs make multi-agent orgs programmable; this post covers the loop layer below it.

Update — June 30, 2026: Andrew Ng's three-loop framing (The Batch) stacks agentic coding (minutes), developer feedback (hours), and external feedback (days) for 0→1 products — complementing the Boris Cherny / Peter Steinberger harness-level loop engineering discourse below.

From Niche Technique to Twitter Trend

Two weeks ago, most developers encountered "loop engineering" as a term in a single viral tweet. This week it is a trending topic with over 2,200 posts, a Grok summary, multiple tutorial threads, and a notable critical response from one of the most-followed TypeScript educators on the platform.

The speed of that transition tells you something. Loop engineering did not go viral because it is new — the underlying idea (agent → check → retry) predates the term by years. It went viral because the gap between what one-shot prompts can do and what production software actually needs has become impossible to ignore, and loop engineering is the most legible name for the solution.

Here is what the discourse actually says.

The Core Idea in One Paragraph

Loop engineering is the practice of designing cycles where an AI agent performs a task, evaluates the output against a verifiable criterion — tests pass, lint is clean, a spec is met, a human approves — and automatically retries if the check fails. The loop runs until success or until a token budget or time limit terminates it. You define the task, the check, and the exit condition. The agent handles everything in between.

That is it. The power is in the verifiable check. Without it, you have a one-shot prompt. With it, you have a system.

Video course: Introduction to Loop Engineering (~1 hour) — loop types, Claude Code /loop in Cursor, termination, memory, and guardrails. Course hub →

A walkthrough of loop engineering — what it is and why it matters for AI-assisted development.

Who Is Driving the Conversation

Peter Steinberger (OpenAI)

@steipete's June 8 tweet — "stop making prompts, start designing loops" — has now cleared 6.5 million views and is the canonical origin point for the current wave. Steinberger's framing was intentionally provocative: prompt engineering is a skill you will not need in 18 months; loop engineering is the skill that replaces it.

Boris Cherny (Anthropic, Claude Code)

@bcherny runs Claude Code at Anthropic and has been the most detailed advocate for the practice. His argument is operational: at Anthropic, Claude now authors more than 80% of production code, and that only became possible once engineers stopped reviewing individual responses and started building loops that verify results programmatically. The human role shifted from "review what Claude wrote" to "design the check that determines whether what Claude wrote is acceptable."

Claude Code's /loop and /goal commands are the direct infrastructure expression of this philosophy — covered in depth in our implementation guide.

0xMarioNawfal and Mike (@mikenevermiss)

The retweeted summary that pushed the concept to its widest audience this week: "Loops are the meta right now. If you're having issues engineering loops you need to bookmark this post and read it." Simple, algorithmic, effective — exactly the kind of post that turns a technique into a trend.

The Honest Account: Dan Bochman's 13-Hour Loop

Dan Bochman (@DanBochman), co-founder at fashn.ai, posted the funniest and most accurate description of what loop engineering looks like in practice:

Typical coding day with Claude (Opus 4.8):

explain to Claude the task (5 minutes)

Claude implements task (10 minutes)

me: "Why is this necessary?"

Claude: "You're right to push back! I over-engineered this!"

Repeat ×87 times (13 hours)

This is not a critique of Claude. It is a description of what happens when you design a loop without a proper exit condition. The check is "does Yash think this is reasonable?" — which is not a check, it is a conversation. A real loop has a programmatic criterion. Tests pass. The diff is under 300 lines. The output scores above 0.8 on an eval. The human only enters when those checks are satisfied.

Bochman's post got thousands of likes precisely because everyone recognised themselves in it. The 87-iteration back-and-forth is not a loop engineering failure — it is the absence of loop engineering.

The Backlash: Matt Pocock's Warning

Matt Pocock (@mattpocockuk), author of Total TypeScript and one of the most careful thinkers in the TypeScript community, pushed back on a specific variant: self-improvement loops.

"I have a deep distrust of almost any 'self-improvement' loop in coding agents — automatically created memories, CLAUDE.md suggestions applied after every session. Often the suggestions themselves are shit. But even if they're good, the agent often over-indexes on them."

This is an important distinction. Pocock is not arguing against verification loops (retry until tests pass). He is arguing against loops that let the agent rewrite its own instructions — specifically auto-generated CLAUDE.md updates and memory entries.

His concern is calibrated: a bad suggestion in a self-improvement loop does not just produce one bad response. It gets written into the agent's permanent context, where it biases every subsequent response. The loop amplifies the error. The damage compounds.

The practical implication: use loops for task verification, not for unsupervised self-modification. Human review before any agent-written instruction becomes persistent context.

This is consistent with what the best loop engineering practitioners actually do — the check in a well-designed loop is external and objective, not the agent's own assessment of its output.

What Makes a Loop Work

Based on the discourse and the underlying practice, the elements that separate functional loops from expensive infinite-retry cycles:

1. A verifiable exit criterion Not "does this look good?" — tests pass, diff under N lines, eval score above threshold, API call returns 200. Something the system can check without a human.

2. A cheap check Token costs accumulate inside a loop. If your verification step is "run Claude again to review the output," you are paying frontier model prices for a judge. Use deterministic checks first: compilation, lint, unit tests, type checking. AI-as-judge only for what those can't cover.

3. A hard exit Maximum iterations, maximum tokens, maximum wall-clock time. Every loop needs a ceiling. The worst outcome is not a loop that fails — it is a loop that runs for 6 hours and $40 before anyone notices.

4. Human gates at the right level Boris Cherny's harness engineering framework places humans at the spec-definition and result-acceptance layer, not inside the retry cycle. You define what success looks like. The loop handles getting there. You review the final output, not each intermediate step.

Why Now

The technique itself is not new. Retry-until-pass patterns have existed since the first coding agents. What changed:

Models got reliable enough to make it worth it. With Claude Opus 4.8, the per-iteration hit rate on non-trivial coding tasks is high enough that a loop converges in a reasonable number of turns. Six months ago the same loop might spin 40 times before finding a solution; now it often lands in 3-5.

Context windows got large enough. A loop that loads a full codebase on each iteration was previously impractical. At 1M tokens, the full project fits. The agent has complete context on every retry.

Costs came down enough. Loop engineering is inherently more expensive than a single prompt. The cost drop in 2025-2026 made the math work for more use cases.

The Fable 5 export ban surfaced the question. The sudden removal of the most capable model from most of the world's developers — detailed in our Fable 5 ban coverage — forced the question: are you getting value from your AI tools, or are you just prompting and hoping? Loop engineering is the answer to that question at the methodology level.

The Skill Stack in 2026

If you are mapping what to learn, the picture the discourse is painting looks like this:

Layer	Skill	Who does it
Task definition	Writing precise specs and acceptance criteria	Engineer/PM
Check design	Writing fast, cheap, deterministic verification	Engineer
Loop architecture	`/loop`, `/goal`, cron harnesses, retry logic	Platform/DevOps
Exit handling	Token budgets, hard timeouts, escalation paths	Engineer
Human review	Accepting/rejecting the final output	Engineer/QA

Prompt engineering — the skill of writing a single instruction to get a good response — sits below all of this. It is still useful inside each loop iteration. But the meta-skill is now the loop design, not the individual prompt.

What This Means for Your Workflow

If you ship with Claude Code today:

Use /loop for tasks with a deterministic success condition (all tests pass, PR comments resolved, lint clean)
Use /goal for longer-horizon objectives with intermediate checkpoints
Do not auto-apply CLAUDE.md suggestions generated by the agent — review them first
Set token budgets before starting any multi-step loop
Let the loop retry; review the final output, not each attempt

If you are building agent infrastructure:

Design the verification layer before designing the prompt — what does "done" look like in machine-readable terms?
Treat AI-as-judge as a last resort, not the default check
Build your logs around loop iteration count, not just input/output pairs

Introduction to Loop Engineering (Udemy course) — beginner video: /loop, guardrails, and loop types
Anthropic overtakes OpenAI in U.S. business adoption (Ramp AI Index) — why enterprises paying for Claude Code drove the May 2026 crossover
Andrew Ng's Three Loops for 0-to-1 Products — agentic coding, developer feedback, external feedback (June 30, 2026)
Claude Code Loops Official Guide (July 2026)
What Is Loop Engineering? The New Paradigm Beyond Prompt Engineering
Loop Engineering: Coding Agent Loops That Run While You Sleep (Full Guide)
Anthropic Engineer: Stop Prompting, Build Loops (Harness Engineering)
Fable 5 Loop Design: Self-Correction and Memory Patterns
Agent Harness Engineering: Seven Planes

From Niche Technique to Twitter Trend

Here is what the discourse actually says.

The Core Idea in One Paragraph

That is it. The power is in the verifiable check. Without it, you have a one-shot prompt. With it, you have a system.

Video course: Introduction to Loop Engineering (~1 hour) — loop types, Claude Code /loop in Cursor, termination, memory, and guardrails. Course hub →

A walkthrough of loop engineering — what it is and why it matters for AI-assisted development.

Who Is Driving the Conversation

Peter Steinberger (OpenAI)

Boris Cherny (Anthropic, Claude Code)

Claude Code's /loop and /goal commands are the direct infrastructure expression of this philosophy — covered in depth in our implementation guide.

0xMarioNawfal and Mike (@mikenevermiss)

The Honest Account: Dan Bochman's 13-Hour Loop

Dan Bochman (@DanBochman), co-founder at fashn.ai, posted the funniest and most accurate description of what loop engineering looks like in practice:

Typical coding day with Claude (Opus 4.8):

explain to Claude the task (5 minutes)

Claude implements task (10 minutes)

me: "Why is this necessary?"

Claude: "You're right to push back! I over-engineered this!"

Repeat ×87 times (13 hours)

The Backlash: Matt Pocock's Warning

Matt Pocock (@mattpocockuk), author of Total TypeScript and one of the most careful thinkers in the TypeScript community, pushed back on a specific variant: self-improvement loops.

"I have a deep distrust of almost any 'self-improvement' loop in coding agents — automatically created memories, CLAUDE.md suggestions applied after every session. Often the suggestions themselves are shit. But even if they're good, the agent often over-indexes on them."

The practical implication: use loops for task verification, not for unsupervised self-modification. Human review before any agent-written instruction becomes persistent context.

This is consistent with what the best loop engineering practitioners actually do — the check in a well-designed loop is external and objective, not the agent's own assessment of its output.

What Makes a Loop Work

Based on the discourse and the underlying practice, the elements that separate functional loops from expensive infinite-retry cycles:

1. A verifiable exit criterion Not "does this look good?" — tests pass, diff under N lines, eval score above threshold, API call returns 200. Something the system can check without a human.

Why Now

The technique itself is not new. Retry-until-pass patterns have existed since the first coding agents. What changed:

Costs came down enough. Loop engineering is inherently more expensive than a single prompt. The cost drop in 2025-2026 made the math work for more use cases.

The Skill Stack in 2026

If you are mapping what to learn, the picture the discourse is painting looks like this:

Layer	Skill	Who does it
Task definition	Writing precise specs and acceptance criteria	Engineer/PM
Check design	Writing fast, cheap, deterministic verification	Engineer
Loop architecture	`/loop`, `/goal`, cron harnesses, retry logic	Platform/DevOps
Exit handling	Token budgets, hard timeouts, escalation paths	Engineer
Human review	Accepting/rejecting the final output	Engineer/QA

What This Means for Your Workflow

If you ship with Claude Code today:

Use /loop for tasks with a deterministic success condition (all tests pass, PR comments resolved, lint clean)
Use /goal for longer-horizon objectives with intermediate checkpoints
Do not auto-apply CLAUDE.md suggestions generated by the agent — review them first
Set token budgets before starting any multi-step loop
Let the loop retry; review the final output, not each attempt

If you are building agent infrastructure:

Design the verification layer before designing the prompt — what does "done" look like in machine-readable terms?
Treat AI-as-judge as a last resort, not the default check
Build your logs around loop iteration count, not just input/output pairs

Introduction to Loop Engineering (Udemy course) — beginner video: /loop, guardrails, and loop types
Anthropic overtakes OpenAI in U.S. business adoption (Ramp AI Index) — why enterprises paying for Claude Code drove the May 2026 crossover
Andrew Ng's Three Loops for 0-to-1 Products — agentic coding, developer feedback, external feedback (June 30, 2026)
Claude Code Loops Official Guide (July 2026)
What Is Loop Engineering? The New Paradigm Beyond Prompt Engineering
Loop Engineering: Coding Agent Loops That Run While You Sleep (Full Guide)
Anthropic Engineer: Stop Prompting, Build Loops (Harness Engineering)
Fable 5 Loop Design: Self-Correction and Memory Patterns
Agent Harness Engineering: Seven Planes

Loop Engineering Is Now the Most-Discussed AI Skill on Developer Twitter

From Niche Technique to Twitter Trend

The Core Idea in One Paragraph

Who Is Driving the Conversation

Peter Steinberger (OpenAI)

Boris Cherny (Anthropic, Claude Code)

0xMarioNawfal and Mike (@mikenevermiss)

The Honest Account: Dan Bochman's 13-Hour Loop

The Backlash: Matt Pocock's Warning

What Makes a Loop Work

Why Now

The Skill Stack in 2026

What This Means for Your Workflow

Loop Engineering Is Now the Most-Discussed AI Skill on Developer Twitter

From Niche Technique to Twitter Trend

The Core Idea in One Paragraph

Who Is Driving the Conversation

Peter Steinberger (OpenAI)

Boris Cherny (Anthropic, Claude Code)

0xMarioNawfal and Mike (@mikenevermiss)

The Honest Account: Dan Bochman's 13-Hour Loop

The Backlash: Matt Pocock's Warning

What Makes a Loop Work

Why Now

The Skill Stack in 2026

What This Means for Your Workflow

Related posts

Claude Code Loops Official Guide: Turn-Based, /goal, /loop, and /schedule (July 2026)

Context vs Prompt vs Loop vs Harness Engineering: The Four-Layer Agent Stack

Matthew Berman Loop Library: Free Agent Workflows for Developers (2026)

Related posts

Claude Code Loops Official Guide: Turn-Based, /goal, /loop, and /schedule (July 2026)

Context vs Prompt vs Loop vs Harness Engineering: The Four-Layer Agent Stack

Matthew Berman Loop Library: Free Agent Workflows for Developers (2026)

From Niche Technique to Twitter Trend

The Core Idea in One Paragraph

Who Is Driving the Conversation

Peter Steinberger (OpenAI)

Boris Cherny (Anthropic, Claude Code)

0xMarioNawfal and Mike (@mikenevermiss)

The Honest Account: Dan Bochman's 13-Hour Loop

The Backlash: Matt Pocock's Warning

What Makes a Loop Work

Why Now

The Skill Stack in 2026

What This Means for Your Workflow

Related Reading

From Niche Technique to Twitter Trend

The Core Idea in One Paragraph

Who Is Driving the Conversation

Peter Steinberger (OpenAI)

Boris Cherny (Anthropic, Claude Code)

0xMarioNawfal and Mike (@mikenevermiss)

The Honest Account: Dan Bochman's 13-Hour Loop

The Backlash: Matt Pocock's Warning

What Makes a Loop Work

Why Now

The Skill Stack in 2026

What This Means for Your Workflow

Related Reading

Related posts

Claude Code Loops Official Guide: Turn-Based, /goal, /loop, and /schedule (July 2026)

Context vs Prompt vs Loop vs Harness Engineering: The Four-Layer Agent Stack

Matthew Berman Loop Library: Free Agent Workflows for Developers (2026)

Related posts

Claude Code Loops Official Guide: Turn-Based, /goal, /loop, and /schedule (July 2026)

Context vs Prompt vs Loop vs Harness Engineering: The Four-Layer Agent Stack

Matthew Berman Loop Library: Free Agent Workflows for Developers (2026)