Agent Runtimes Are Learning Where to Say No

The next useful agent feature may be a well-placed refusal.

For the last several editions, the code trail around agent tools has pointed toward a visible control plane: packaged shells, platform adapters, context profiles, skill catalogs, and tool lifecycle hooks. The newest readable commits move that story one step deeper. They show projects adding boundaries around the control plane itself.

That does not mean these systems are suddenly secure. The stronger, more source-grounded claim is narrower: agent runtimes are learning to encode where risky data should stop, be rewritten, be isolated, or be traced.

Codex puts ordinary tools behind hooks

OpenAI's Codex commit 5c20513a1 makes local function tools participate in the default hook contract. The commit message says earlier PreToolUse, PostToolUse, and updated-input coverage depended on each handler remembering to wire itself into the contract.

The patch moves that default behavior into CoreToolRuntime. Function payloads can now expose generic PreToolUse and PostToolUse payloads, and updated hook input can be serialized back into function-tool arguments. The tests cover local function calls being blocked before execution, rewritten before execution, and observed after execution.

The exceptions matter as much as the default. Codex explicitly keeps code-mode wait and write_stdin out of parts of the generic path because those calls are runtime control for already-running work. In other words, the hook system is not just becoming broader. It is also learning which internal control messages should not look like ordinary model actions.

Plugin bundles get a shared checkpoint

A second Codex commit, 7d47056e, moves plugin tarball packing and unpacking into a shared archive helper. The headline sounds like plumbing, but the checks are the story.

The new helper verifies that a bundle has the expected plugin manifest, enforces archive and extracted-size limits, rejects path traversal, rejects hard links and symbolic links, and rejects unsupported archive entry types. Upload and install paths now use the same archive logic instead of carrying duplicate handling.

That is a control-plane boundary in file form. A plugin system is only useful if bundles can move across machines. The archive helper is where Codex decides what kind of bundle is allowed to cross that line.

Hermes names the limit of its own denial

Hermes Agent's 97e975ed commit widens direct read denials around credential material. The denied set expands beyond existing auth files to include .env, webhook subscription secrets, and files under mcp-tokens/, with profile-mode coverage for both the active home and the global root.

The most important line is the caveat. The code comments say the read-deny is not a security boundary because the terminal tool still runs as the same operating-system user. The stated purpose is defense-in-depth: models that respect tool denials may stop, and attempted credential reads become visible in logs.

That is unusually honest source evidence. Hermes is not pretending a read-file guard defeats every bypass. It is making a smaller boundary visible enough that users and maintainers can see when something tries to cross it.

Observed chat stops being a pending request

Hermes shows another version of the same pattern in 4a91e364, which separates observed Telegram group context from the current addressed message.

The patch withholds observed group chatter from normal replay history and wraps it as a context-only block before the current message. Tests assert that the current addressed message remains the turn to answer, while earlier observed group lines are available only as context when the new message asks for them.

For a group-chat agent, that distinction is not cosmetic. Without it, old unaddressed chatter can replay like pending user work. The fix turns "what the agent saw" and "what the agent was asked" into separate runtime categories.

LangChain tightens the test lane

LangChain's freshest evidence is not an agent UI change. It is operational trust around live provider tests. Commit 33875fde adds job-level concurrency to scheduled integration tests so overlapping per-package shards do not hit the same live API credentials at once.

Another commit, bdd7f71a, adds trace metadata for those scheduled tests: workflow run, attempt, SHA, event, ref, working directory, and Python version. A failing trace can point back to the run and shard that produced it.

This is not the same surface as Codex hooks or Hermes file safety. But it belongs to the same practical family: when software touches external model providers and shared credentials, the runtime lane needs boundaries and provenance, not just a green or red CI badge.

The pattern is refusal with receipts

Taken together, these commits do not prove a shared standard. They do show a shared pressure. Agent software is adding places where work is stopped before execution, archives are checked before installation, credential reads are denied with an audit trail, observed context is prevented from masquerading as a user request, and credential-backed tests are serialized and traced.

That is what maturity looks like at the source level. The model still gets the visible conversation, but the agent runtime increasingly carries the responsibility for saying: this input is not safe to treat as a command, this bundle is not safe to unpack, this file is not appropriate to read, this context is not a request, and this test run should not collide with another one.

Tomorrow's question is whether these boundaries keep spreading from tests and helpers into product defaults. The agents that earn trust will not be the ones that say yes to everything. They will be the ones whose control planes know when to say no, and can show their work when they do.

Agent Runtimes Are Learning Where to Say No

Codex puts ordinary tools behind hooks

Plugin bundles get a shared checkpoint

Hermes names the limit of its own denial

Observed chat stops being a pending request

LangChain tightens the test lane

The pattern is refusal with receipts

Receipts below the story

Primary Evidence

Evidence Limits

Send a note to the desk

Agent Runtimes Are Learning Where to Say No

Codex puts ordinary tools behind hooks

Plugin bundles get a shared checkpoint

Hermes names the limit of its own denial

Observed chat stops being a pending request

LangChain tightens the test lane

The pattern is refusal with receipts

Receipts below the story

Primary Evidence

Evidence Limits

Atlas Context

Loops, Workflows, and Recovery

Send a note to the desk