Source linked

Cordon Lets LLM Agents Roll Back Irreversible Tool Effects

arxiv.org@systems_wire3 hours ago·Artificial Intelligence·2 comments

A new runtime system called Cordon treats LLM agent tool calls as semantic transactions, enabling commit, rollback, and audit of multi-step workflows while catching cross-step safety violations that current per-call...

cordonllm agentstool usesemantic transactionstransactional runtimelarge language models

Cordon exposes cross-step violations that existing per-call guardrails miss, according to an evaluation across adversarial and benign workflows. That finding comes from a new transactional runtime for tool-using LLM agents.

Today's agent runtimes expose tools as isolated RPCs. Each call gets its own guardrails, but the system lacks a task-scoped execution boundary for commit, rollback, recovery, and audit across multi-step workflows. The Cordon authors argue this mismatch calls for a runtime containment boundary, not another per-call filter.

The Semantic Transaction Abstraction

Cordon introduces a transaction manager that tracks derived result objects, executes reversible mutations in shadow state, stages outward-facing actions in an effect outbox, and records recovery metadata. A semantic transaction binds tool intents and runtime-tracked result lineage to reversible local state, staged external effects, delegated authority, and audit metadata. The runtime validates the composed execution flow before it commits state or releases external effects.

Cross-Step Violations Caught in Evaluation

The paper evaluates Cordon against both adversarial and benign workflows. It exposes cross-step violations that existing defenses miss entirely. Cordon also reduces irreversible-effect failures while preserving benign task completion, all with modest approval and latency overhead.

Cordon suggests a concrete path toward runtime containment boundaries for LLM agents, making multi-step tool use safer without sacrificing task completion.

Source: Cordon: Semantic Transactions for Tool-Using LLM Agents
Domain: arxiv.org

Read original source ->

External source stays available while the OJO article and comment thread stay local.

More in Artificial Intelligence

view topic

Dr-DCI Hits 73.3% Accuracy on Browsecomp-Plus by Dynamically Expanding Search Workspace

By treating retrieval as an agent action to pull documents into a local workspace, Dr-DCI avoids the instability of full-corpus shell operations while scaling from 100K to 10M documents.

When Models Disagree, Route to a Different Model: Video QA Gains 1.81 Points

Single-model self-consistency fails on hard implicit video questions; routing the 20% where samples diverge to a second model boosts accuracy by 1.43-1.81 points, with motion and counting categories gaining 5+ points.

RAMS Dynamically Switches YOLOv8 Tiers to Cut Latency 5.6x on Embedded Edge

RAMS drops inference latency from ~19 ms to 3.41 ms on Jetson Orin TensorRT under heavy load, retaining 74% of proxy accuracy by locking higher-tier models during vulnerable road user detections.

PhoneHarness Benchmark Forces Phone Agents Beyond Tap-and-Swipe GUI Control

PhoneHarness reaches 75% pass rate on verifiable mobile workflows, beating non-mixed settings by 12.9 points by routing agents across GUI, CLI, and tool actions.

Comments load interactively on the live page.