Past Daily Editions
Every issue of The Git Reporter is grouped by publication date, with each day's lead story, companion articles, and evidence trails kept together.
Agent Reliability Is Moving Into the Test Rig
The newest source-readable trail points away from model mystique and toward the workflows that test, trace, serialize, and maintain agent behavior.
Agent Runtimes Are Learning Where to Say No
After shipping more visible control planes, agent projects are now tightening the places where tools, plugins, credentials, chat context, and test runs are allowed to flow.
Agent Runtimes Are Learning to Ship Their Control Plane
A set of Codex, Hermes Agent, Gemini CLI, and Crush commits shows agent tools packaging the machinery around the model: shells, platform adapters, context profiles, and skill catalogs.
Agent Runtimes Are Making Their Limits Explicit
Recent Crush and LangChain changes show agent infrastructure turning hidden boundaries - shell permission, context overflow, and model token limits - into runtime contracts that users and developers can see.
Agent Goals Are Becoming Runtime State
Recent Codex and Hermes Agent changes show coding agents turning goals from chat commands into durable metadata, resumable session state, and queued work control.
Agent Runtimes Are Moving State Out of the Shadows
Recent Codex and Gemini CLI changes show coding agents treating settings, queued input, subagent starts, and terminal streams as explicit runtime state instead of invisible side effects.
Agent Runtimes Are Learning to Audit Their Own Tools
Fresh Codex and Gemini CLI changes show agent projects treating tool calls, plugins, MCP servers, and subagents as auditable runtime events instead of invisible helper work.
Agent CLIs Are Turning Permissions Into a Conversation
The next shift in terminal agents is not just better tools or tighter sandboxes. It is that permissions are becoming live workflow state: negotiated mid-task, scoped to the action, and remembered with just enough structure to keep work moving.
The Real Agent Feature Is Not Losing the Plot
The next terminal-agent moat is not just better tools or bigger models. It is continuity: whether the system can keep plans, transcripts, and working context coherent as sessions stretch, compress, resume, and hit real-world friction.
The CLI Is Becoming an Agent Workbench
The real shift in terminal agents is not bigger models or flashier demos. It is that planning, task state, plugins, and long-lived runtime context are turning the CLI into a place where work gets organized, not just requested.
From Tool Chatter to Chapters: Agent CLIs Are Inventing a Narrative Layer
The interesting shift this week isn’t just that Codex and Gemini CLI can do more. It’s that they’re getting better at explaining themselves while they work —turning raw tool noise into something a human can actually follow.
The Next Agent UX Moat Isn’t Speed. It’s Backpressure.
The hard problem in terminal agents is no longer just getting them to do more. It’s deciding what happens when the human tries to steer while the runtime is already busy.
Subagents Aren’t Just Getting Smarter. They’re Getting Contained.
The next terminal-agent upgrade is not more helpers. It’s better walls: isolation, cleanup, bounded autonomy, and fewer chances for delegated work to spill across the room.
The CLI Is Quietly Becoming an Agent Router
The next terminal-agent shift is not another flashy planning demo. It is the quieter, harder job of helping remote agents actually connect: across protocols, proxies, auth flows, and messy metadata.
The New CLI Moat Isn’t UX. It’s How Agent Skills Get Shipped
This week’s most strategic terminal-agent shift is not another demo flourish. It’s the quiet work of turning capabilities into things that can be packaged, cached, moved, and trusted.
The Next CLI UX Battle Is Agent Forensics
The most important agent upgrade this week is not another tool. It’s the growing ability to reconstruct why an agent touched files, spent tokens, and arrived at a code change in the first place.
Codex forks it, Gemini threads it: execution context becomes first-class
The next big agent primitive is not another tool call. It’s the bundle of permissions, environment, policy, and wiring that tells an agent how to exist for this turn, in this loop, right now.
Parallel agents are getting real addresses
This week’s most important agent upgrade isn’t more raw intelligence. It’s workplace logistics. Gemini CLI is giving parallel sessions separate rooms. Codex is giving subagents names you can actually point at.
AI agents are getting better at saying “here’s what I finished”
The real upgrade this week isn’t bigger context or flashier autonomy. It’s something more human: when an agent stalls, waits, delegates, or times out, it’s starting to leave behind a useful trail instead of a shrug.
Before the Prompt Lands: Codex and Gemini Turn Hooks Into Agent Control Planes
Terminal agents are growing a new kind of muscle: not just better tools, but better middleware. In both Codex and Gemini CLI, the moment right after you hit Enter is becoming programmable territory.
Gemini Lets the Model Schedule Parallel Tools. Codex Makes the Runtime Decide.
Terminal agents are learning the same trick — run more tools at once — but Gemini CLI and OpenAI Codex disagree on who should be in charge of that decision. One pushes dependency control up into the prompt and tool schema. The other keeps it down in runtime metadata and locks.
Subagents Grow Up: Gemini Isolates Tool Boundaries While Codex Shares Trust by Default
Two agent platforms shipped subagent changes within hours of each other. One tightened what a child agent can see and use . The other tightened what a child agent can teach the rest of the system to trust . Same category. Very different instinct.
OpenViking Turns Agent Memory into a Filesystem — and That Changes the Game
Agent memory has felt like a junk drawer: we throw context in, pray the model finds it, and hope the glue holds. OpenViking’s bet is bolder — treat memory like an operating system, not a sho...
Gemini CLI Turns File Tools into Context Sensors — Right as A2UI Trends
Today’s GitHub Trending list says the same thing twice in different languages. A2UI is climbing because teams want UI‑level schemas that keep agents from guessing. At the same time, Gemini C...
Gemini CLI Built the Ask‑User UI That MCP Elicitation Still Needs
Elicitation is the moment an agent has to stop and ask you a question — a human speed bump that keeps automation honest. This week, Gemini CLI users reported MCP servers failing with a “Meth...
Page Agent’s MacroTool Makes In‑Browser Agents Resilient to Messy Tool Calls
GitHub’s trending list has been noisy with “agent frameworks,” but Alibaba’s page-agent stands out because it runs inside the web page instead of driving a separate headless browser. That ar...
Gemini CLI Makes MCP List-Changed Notifications Resilient
Gemini CLI’s PR #21050 fixes a rough edge in MCP notifications: tools/list_changed updates could be missed, leaving clients out of sync with what a server actually offers. The client now reg...
Codex Makes Memories a First-Class Writable Root—and Stops Cleaning Through Symlinks
Codex just turned its “memories” folder from a side alley into a well-lit main road. And it put a lock on the janitor’s closet so cleanup can’t accidentally bulldoze someone else’s house. Th...
Gemini CLI tightens MCP tool discovery: debounced list_changed refresh, trailing queue, and resilient retries
Gemini CLI’s MCP integration has been steadily gaining real-time awareness of server-side tool changes. A local search trail points to early MCP notifications support landing on 2026-01-08 (...
Codex Rust CLI Update: Memory Writes Now Fit “workspace-write” and Safer Clears Land in f72ab43
The Codex Rust CLI just got a small but meaningful quality-of-life upgrade—one that also tightens safety around memory cleanup. In PR openai/codex#13467 (commit f72ab43fd193b31208cd3c306293b...
Gemini CLI Makes a “Generalist” Sub-Agent the Default Delegate
TheGitReporter — In a recent core change, Gemini CLI quietly shifted delegation from an opt-in feature to a baseline capability. Gemini CLI has landed a new default behavior: a built-in gene...
Gemini CLI and Codex Level Up MCP Safety and Setup: Trust Prompts vs Auto-Install Skills
Two CLI assistants walk into the same tooling bar. One checks IDs at the door; the other quietly installs the band’s gear before the show starts. Gemini’s new “trust, but verify” moment Gemi...
Persistence vs Freshness: Codex and Gemini CLI Tighten Two Different Boundaries for Agent Workflows
On March 4, 2026, two quiet merges landed on opposite sides of the agentic tooling ecosystem—both aimed at making “hands-off” workflows less surprising. OpenAI’s Codex and Google’s Gemini CL...