Source linked

Los circuitos de ingeniería reemplazan la prompting paso a paso para los agentes de codificación

El setenta por ciento de los circuitos de agentes de codificación ya se verifican de forma autónoma, pero el desencadenante automático y la memoria duradera están atrás, según una nueva taxonomía de ingeniería de circuitos.

claude codeopenai codexloop engineeringcoding agentsprompt engineeringlarge language models

Seventy percent of coding-agent loops already verify in the autonomous zone, yet nobody had bothered to name the pattern until now.

A new preprint carves out a discipline it calls loop engineering: the practice of handing an agent a bounded, reusable artifact—a loop specification—instead of step-by-step prompts. The paper defines the loop spec as five parts: a trigger, a goal, a verification step, a stopping rule, and a memory. The authors distinguish this from the internal perceive-act-observe cycle that harnesses like Claude Code or Codex already provide as plumbing. That distinction matters because loop engineering sits as a new layer above prompting, not a replacement for it.

The Anatomy of a Loop Spec

Each loop spec gets a trigger (human or automated), a goal type, and a verification method. The verification ladder climbs five rungs: human-in-the-loop, static analysis, heuristic check, test-based verification, and full autonomous verification. The corpus of fifty real loops—assembled into something called the Loop Library—shows that practice has matured where the discipline says it matters. 70% of loops verify at the autonomous level. 74% name their terminal states explicitly. The weak spots are automated triggering (most still kicked off by humans) and durable memory (loops forget between runs).

Design Principles and Anti-Patterns

The authors ground their recommendations in the literature on self-correction, reward hacking, and model-as-judge fragility. One anti-pattern: letting the agent's own output serve as the verification oracle without a separate check. Another: conflating a loop spec with an ordinary programming loop—the agent can take different paths each iteration. They argue that comprehension debt and cognitive surrender are real risks: a human who stops reading loop traces loses understanding of how the agent makes decisions.

What This Enables Next

Loop engineering gives teams a vocabulary to audit and share agent behaviors without reading thousands of prompt tokens. The preprint closes by warning that verification burden and comprehension debt must be respected, but the shift from hand-holding to specification turns coding agents from toys into tools you can trust to run unattended—as long as you know which rung of the verification ladder you're standing on.


Source: Stop Hand-Holding Your Coding Agent: Engineering the Loops that Replace Step-by-Step Prompting
Domain: arxiv.org

Read original source ->

External source stays available while the OJO article and comment thread stay local.

Comments load interactively on the live page.