Evidence Trail

The Next CLI UX Battle Is Agent Forensics

March 24, 2026 / Daily Edition / 3 source signals.

repo openai/codex main
3 source signals 2 repos e07eaff
> e07eaff / March 24, 2026 / Daily Edition
Read Story Open Edition

Reporter Notes

Daily Article Notes — 2026-03-24

Chosen angle

**The next CLI UX battle is agent forensics**

Thesis: both Codex and Gemini CLI are adding the plumbing needed to reconstruct agent behavior after the fact. Codex is measuring turns as debuggable cost/action units; Gemini is exposing how context was discovered and attaching trace identifiers to edit offers.

Candidate angles considered

1. **Agent observability moves inside the runtime**

  • Strong evidence, but a bit generic.

2. **Context becomes measurable**

  • Good, but too close to the 2026-03-14 context-sensor piece.

3. **The next UX battle is agent forensics** ✅

  • Freshest frame, best reader payoff, least overlap.

Why this is different from yesterday

Yesterday was about execution context becoming first-class (forks vs threads, explicit loop context).

Today is about **post-hoc reconstruction**: cost, tool activity, discovered context, and traceable edit offers.

Grounding evidence

Codex

  • Repo: openai/codex
  • GitHub repo metadata via gh repo view
  • pushedAt: 2026-03-24T05:00:35Z
  • stars: 67170
  • Commit e07eaff0d32cfeat: add metric for per-turn tool count and add tmp_mem flag (#13456)
  • emits codex.turn.tool.call
  • tags metrics with tmp_mem_enabled
  • Commit 49634b7f9c07add metric for per-turn token usage (#13454)
  • snapshots total token usage at turn start
  • diffs totals at turn end to emit per-turn histograms
  • Code evidence from codex-rs/core/src/tasks/mod.rs
  • lines ~243-301 emit:
  • codex.turn.tool.call
  • codex.turn.token_usage with token_type=total/input/cached_input/output/reasoning_output
  • all tagged with tmp_mem_enabled

Gemini CLI

  • Repo: google-gemini/gemini-cli
  • GitHub repo metadata via gh repo view
  • pushedAt: 2026-03-24T04:59:00Z
  • stars: 98849
  • Commit 7b4a822b0ebafeat(core): instrument file system tools for JIT context discovery (#22082)
  • adds discoverJitContext()
  • appends --- Newly Discovered Project Context --- to tool outputs
  • wired into file tools like read-file
  • Commit d7d53981f3c1feat(core): add trajectoryId to ConversationOffered telemetry (#22214)
  • recordConversationOffered(... trajectoryId)
  • createConversationOffered(... trajectoryId)
  • includes trajectoryId in returned telemetry payload
  • Code evidence from:
  • packages/core/src/tools/jit-context.ts
  • packages/core/src/tools/read-file.ts
  • packages/core/src/code_assist/telemetry.ts

gsio evidence

Query: tmp_mem tool count token usage

Top codex hits included:

  • e07eaff0d32c — per-turn tool count + tmp_mem flag
  • 49634b7f9c07 — per-turn token usage metrics

Query: JIT context trajectory telemetry parallel tools

Top gemini-cli hits included:

  • 7b4a822b0eba — shared JIT context discovery utility
  • d7d53981f3c1 — trajectoryId on ConversationOffered telemetry
  • additional tracing / context / memory commits clustered nearby

Web data used

  • GitHub commit pages fetched for:
  • openai/codex@e07eaff0d32c...
  • google-gemini/gemini-cli@7b4a822b0eba...
  • google-gemini/gemini-cli@d7d53981f3c1...
  • GitHub repo pages / API metadata via gh repo view

llm review summary (gpt-5.4)

Winner: **The next UX battle is agent forensics**

Reason: freshest framing; avoids overlap with context-sensor and execution-context pieces by centering on after-action explainability.

Sources — 2026-03-24

Repo metadata

  • gh repo view openai/codex --json name,description,url,defaultBranchRef,stargazerCount,pushedAt
  • gh repo view google-gemini/gemini-cli --json name,description,url,defaultBranchRef,stargazerCount,pushedAt

Git history and code

  • git -C projects/agentic-workflows-gemini-vs-codex/repos/codex log --oneline -n 20
  • git -C projects/agentic-workflows-gemini-vs-codex/repos/gemini-cli log --oneline -n 30
  • git -C .../codex show --stat --summary e07eaff0d32caaec47cc4b0616d1b32c086e904f
  • git -C .../codex show --stat --summary 49634b7f9c0702598bbd5a233ddcb8bad5302e79
  • git -C .../gemini-cli show --stat --summary 7b4a822b0ebaa44f6fe12dd1acc6f956d39cfc1e
  • git -C .../gemini-cli show --stat --summary d7d53981f3c12e5d00963cb9a276732857ec53e8
  • sed -n '1,220p' .../gemini-cli/packages/core/src/tools/jit-context.ts
  • sed -n '1,220p' .../codex/codex-rs/core/src/tasks/mod.rs
  • sed -n '1,220p' .../gemini-cli/packages/core/src/code_assist/telemetry.ts
  • targeted line extraction for read-file.ts, telemetry.ts, tasks/mod.rs

gsio

  • gsio search q 'tmp_mem tool count token usage' -p openai/codex --output summary
  • gsio search q 'JIT context trajectory telemetry parallel tools' -p google-gemini/gemini-cli --output summary

Web fetches

Model review

  • llm -m gpt-5.4 with candidate angles A/B/C and overlap constraints from past articles