codex

mirror of https://github.com/openai/codex.git synced 2026-05-01 03:42:05 +03:00

Author	SHA1	Message	Date
Charles Cunningham	8fda4e0fc2	tui: add slash command help page Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:50 -07:00
Charles Cunningham	ad586ba24c	tui: share slash command shlex codec Centralize slash command draft serialization on top of shlex so popup flows and queued replay use the same quoting and tokenization rules. Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:50 -07:00
Charles Cunningham	7ea03de12b	tui: resume queued replay after idle slash actions Pause queued replay on local slash actions that need app-side updates or popup dismissal, then resume once the app event queue and bottom pane are idle. Add regression coverage for queued theme replay and popup-gated resume. Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:50 -07:00
Charles Cunningham	bcb3555e70	tui: reject repeated model modifiers Track whether /model effort and scope modifiers were provided so repeated sentinel values like default and global are rejected consistently. Validation: cargo test -p codex-tui; just fix -p codex-tui; just fmt Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:50 -07:00
Charles Cunningham	8c45c1acfc	tui: canonicalize interactive slash drafts Route interactive slash-command pickers through canonical serialized drafts so live dispatch and queued replay share one parser/executor path. Validation: cargo test -p codex-tui; just fix -p codex-tui; just fmt Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:50 -07:00
Charles Cunningham	217da0c113	tui: make slash command helpers exhaustive Switch supports_inline_args() and requires_interaction() to exhaustive matches so newly added SlashCommand variants must be classified explicitly. Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:50 -07:00
Charles Cunningham	bb3e61843a	tui: clarify queue replay stop comment Explain that the busy-path Stop return in dispatch_command() only matters for live dispatch because queued replay never reaches that branch. Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:50 -07:00
Charles Cunningham	173d0ef4e3	tui: resolve interactive slash commands before queueing Mark popup-opening slash commands as requiring interaction, open them immediately instead of queueing bare drafts while a task is running, and make queued replay restore any legacy bare interactive command draft instead of opening UI mid-drain. Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:50 -07:00
Charles Cunningham	8267155494	tui: document slash dispatch replay contract Explain above dispatch_command that live callers usually ignore its return value while queued replay uses QueueReplayControl to decide whether draining can continue. Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:50 -07:00
Charles Cunningham	393740dbed	tui: clarify live slash queueing branch Document that the busy-path queueing in dispatch_command is only reached during live slash dispatch, not queued replay, so the returned drain-control value is effectively a throwaway for that branch. Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:50 -07:00
Charles Cunningham	ed546bc217	tui: preserve queued slash draft intent Serialize custom /review instructions with an explicit custom marker so queued replay cannot reinterpret branch-like text as a structured review target, and rename queued replay to better reflect that it drains inputs until blocked. Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:49 -07:00
Charles Cunningham	122e1475d5	tui: simplify interrupt status plumbing Send interrupt ops directly from the bottom pane instead of routing them through the status indicator widget, and remove the now-unused event wiring from the widget and its tests. Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:49 -07:00
Charles Cunningham	fa50564579	tui: restore interactive slash queue behavior Keep bare /model and /review interactive while preserving serialized queue replay, restore queued slash drafts into the composer on interrupt, and align queued slash parsing with the same feature-gated lookup used by the composer. Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:49 -07:00
Charles Cunningham	35e8aa9ce0	Simplify queued slash command replay Unify queued slash commands as serialized drafts, route popup actions through the same replay path, and stop replay after commands that submit a turn. Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:49 -07:00
Charles Cunningham	9ab49d2c37	narrow esc interrupts in tui Limit Esc interrupts to an empty composer when no popup is active, while preserving the pending-steer interrupt path. This keeps dialog dismissal on Esc working normally without interrupting the running conversation. Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:49 -07:00
Charles Cunningham	e5f1b8435d	queue interactive slash command selections Keep interactive slash flows usable while a turn is running, and queue the resulting action instead of the bare slash token. Also let Esc interrupt through popup-active states so queued drafts restore without dropping queued slash actions. Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:49 -07:00
Charles Cunningham	a8f1f43c1f	preserve queued slash actions on interrupt Restore only queued user-message drafts into the composer when a turn is interrupted, keep queued slash-command actions replayable, and cover the interrupt replay behavior in codex-tui tests. Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:49 -07:00
Charles Cunningham	072d5d9e49	remove dead tui helper methods Drop the unused bottom-pane mention-binding drain helpers introduced by the queueing refactor and switch the affected tests to the existing non-destructive composer mention accessor. Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:49 -07:00
Charles Cunningham	9478b34e55	queue slash commands in tui Allow slash commands entered during a running turn to be queued and replayed after the turn completes, including /review and inline slash-command variants tested in codex-tui. Co-authored-by: Codex <noreply@openai.com>	2026-03-15 23:32:49 -07:00
Matthew Zeng	029aab5563	fix(core): preserve tool_params for elicitations (#14769 ) - [x] Preserve tool_params keys.	2026-03-15 23:15:52 -07:00
Charley Cunningham	6fdeb1d602	Reuse guardian session across approvals (#14668 ) ## Summary - reuse a guardian subagent session across approvals so reviews keep a stable prompt cache key and avoid one-shot startup overhead - clear the guardian child history before each review so prior guardian decisions do not leak into later approvals - include the `smart_approvals` -> `guardian_approval` feature flag rename in the same PR to minimize release latency on a very tight timeline - add regression coverage for prompt-cache-key reuse without prior-review prompt bleed ## Request - Bug/enhancement request: internal guardian prompt-cache and latency improvement request --------- Co-authored-by: Codex <noreply@openai.com>	2026-03-15 22:56:18 -07:00
friel-openai	ba463a9dc7	Preserve background terminals on interrupt and rename cleanup command to /stop (#14602 ) ### Motivation - Interrupting a running turn (Ctrl+C / Esc) currently also terminates long‑running background shells, which is surprising for workflows like local dev servers or file watchers. - The existing cleanup command name was confusing; callers expect an explicit command to stop background terminals rather than a UI clear action. - Make background‑shell termination explicit and surface a clearer command name while preserving backward compatibility. ### Description - Renamed the background‑terminal cleanup slash command from `Clean` (`/clean`) to `Stop` (`/stop`) and kept `clean` as an alias in the command parsing/visibility layer, updated the user descriptions and command popup wiring accordingly. - Updated the unified‑exec footer text and snapshots to point to `/stop` (and trimmed corresponding snapshot output to match the new label). - Changed interrupt behavior so `Op::Interrupt` (Ctrl+C / Esc interrupt) no longer closes or clears tracked unified exec / background terminal processes in the TUI or core cleanup path; background shells are now preserved after an interrupt. - Updated protocol/docs to clarify that `turn/interrupt` (or `Op::Interrupt`) interrupts the active turn but does not terminate background terminals, and that `thread/backgroundTerminals/clean` is the explicit API to stop those shells. - Updated unit/integration tests and insta snapshots in the TUI and core unified‑exec suites to reflect the new semantics and command name. ### Testing - Ran formatting with `just fmt` in `codex-rs` (succeeded). - Ran `cargo test -p codex-protocol` (succeeded). 
- Attempted `cargo test -p codex-tui` but the build could not complete in this environment due to a native build dependency that requires `libcap` development headers (the `codex-linux-sandbox` vendored build step); install `libcap-dev` / make `libcap.pc` available in `PKG_CONFIG_PATH` to run the TUI test suite locally. - Updated and accepted the affected `insta` snapshots for the TUI changes so visual diffs reflect the new `/stop` wording and preserved interrupt behavior. ------ [Codex Task](https://chatgpt.com/codex/tasks/task_i_69b39c44b6dc8323bd133ae206310fae)	2026-03-15 22:17:25 -07:00
Matthew Zeng	d4af6053e2	[apps] Improve search tool fallback. (#14732 ) - [x] Bypass tool search and stuff tool specs directly into model context when either a. Tool search is not available for the model or b. There are not that many tools to search for.	2026-03-15 21:41:55 -07:00
Matthew Zeng	49edf311ac	[apps] Add tool call meta. (#14647 ) - [x] Add resource_uri and other things to _meta to shortcut resource lookup and speed things up.	2026-03-14 22:24:13 -07:00
Colin Young	d692b74007	Add auth 401 observability to client bug reports (#14611 ) CXC-392 [With 401](https://openai.sentry.io/issues/7333870443/?project=4510195390611458&query=019ce8f8-560c-7f10-a00a-c59553740674&referrer=issue-stream) <img width="1909" height="555" alt="401 auth tags in Sentry" src="https://github.com/user-attachments/assets/412ea950-61c4-4780-9697-15c270971ee3" /> - auth_401_: preserved facts from the latest unauthorized response snapshot - auth_: latest auth-related facts from the latest request attempt - auth_recovery_: unauthorized recovery state and follow-up result Without 401 <img width="1917" height="522" alt="happy-path auth tags in Sentry" src="https://github.com/user-attachments/assets/3381ed28-8022-43b0-b6c0-623a630e679f" /> ###### Summary - Add client-visible 401 diagnostics for auth attachment, upstream auth classification, and 401 request id / cf-ray correlation. - Record unauthorized recovery mode, phase, outcome, and retry/follow-up status without changing auth behavior. - Surface the highest-signal auth and recovery fields on uploaded client bug reports so they are usable in Sentry. - Preserve original unauthorized evidence under `auth_401_` while keeping follow-up result tags separate. ###### Rationale (from spec findings) - The dominant bucket needed proof of whether the client attached auth before send or upstream still classified the request as missing auth. - Client uploads needed to show whether unauthorized recovery ran and what the client tried next. - Request id and cf-ray needed to be preserved on the unauthorized response so server-side correlation is immediate. - The bug-report path needed the same auth evidence as the request telemetry path, otherwise the observability would not be operationally useful. ###### Scope - Add auth 401 and unauthorized-recovery observability in `codex-rs/core`, `codex-rs/codex-api`, and `codex-rs/otel`, including feedback-tag surfacing. 
- Keep auth semantics, refresh behavior, retry behavior, endpoint classification, and geo-denial follow-up work out of this PR. ###### Trade-offs - This exports only safe auth evidence: header presence/name, upstream auth classification, request ids, and recovery state. It does not export token values or raw upstream bodies. - This keeps websocket connection reuse as a transport clue because it can help distinguish stale reused sessions from fresh reconnects. - Misroute/base-url classification and geo-denial are intentionally deferred to a separate follow-up PR so this review stays focused on the dominant auth 401 bucket. ###### Client follow-up - PR 2 will add misroute/provider and geo-denial observability plus the matching feedback-tag surfacing. - A separate host/app-server PR should log auth-decision inputs so pre-send host auth state can be correlated with client request evidence. - `device_id` remains intentionally separate until there is a safe existing source on the feedback upload path. ###### Testing - `cargo test -p codex-core refresh_available_models_sorts_by_priority` - `cargo test -p codex-core emit_feedback_request_tags_` - `cargo test -p codex-core emit_feedback_auth_recovery_tags_` - `cargo test -p codex-core auth_request_telemetry_context_tracks_attached_auth_and_retry_phase` - `cargo test -p codex-core extract_response_debug_context_decodes_identity_headers` - `cargo test -p codex-core identity_auth_details` - `cargo test -p codex-core telemetry_error_messages_preserve_non_http_details` - `cargo test -p codex-core --all-features --no-run` - `cargo test -p codex-otel otel_export_routing_policy_routes_api_request_auth_observability` - `cargo test -p codex-otel otel_export_routing_policy_routes_websocket_connect_auth_observability` - `cargo test -p codex-otel otel_export_routing_policy_routes_websocket_request_transport_observability`	2026-03-14 15:38:51 -07:00
viyatb-oai	9060dc7557	fix: fix symlinked writable roots in sandbox policies (#14674 ) ## Summary - normalize effective readable, writable, and unreadable sandbox roots after resolving special paths so symlinked roots use canonical runtime paths - add a protocol regression test for a symlinked writable root with a denied child and update protocol expectations to canonicalized effective paths - update macOS seatbelt tests to assert against effective normalized roots produced by the shared policy helpers ## Testing - just fmt - cargo test -p codex-protocol - cargo test -p codex-core explicit_unreadable_paths_are_excluded_ - cargo clippy -p codex-protocol -p codex-core --tests -- -D warnings ## Notes - This is intended to fix the symlinked TMPDIR bind failure in bubblewrap described in #14672. Fixes #14672	2026-03-14 13:24:43 -07:00
Michael Bolin	4b31848f5b	Add argument-comment Dylint runner (#14651 )	2026-03-14 08:18:04 -07:00
Channing Conger	70eddad6b0	dynamic tool calls: add param `exposeToContext` to optionally hide tool (#14501) This extends dynamic_tool_calls to allow us to hide a tool from the model context but still use it as part of the general tool calling runtime (for example, from js_repl/code_mode)	2026-03-14 01:58:43 -07:00
sayan-oai	e389091042	make defaultPrompt an array, keep backcompat (#14649) make plugins' `defaultPrompt` an array, but keep backcompat for strings. the array is limited by app-server to 3 entries of up to 128 chars (drops extra entries, `None`s-out ones that are too long) without erroring if those invariants are violated. added tests, tested locally.	2026-03-14 06:13:51 +00:00
sayan-oai	8ca358a13c	Refresh Python SDK generated types (#14646 ) ## Summary - regenerate `sdk/python` protocol-derived artifacts on latest `origin/main` - update `notification_registry.py` to match the regenerated notification set - fix the stale SDK test expectation for `GranularAskForApproval` ## Validation - `cd sdk/python && python scripts/update_sdk_artifacts.py generate-types` - `cd sdk/python && python -m pytest`	2026-03-14 05:50:33 +00:00
Eric Traut	ae0a6510e1	Enforce errors on overriding built-in model providers (#12024) We receive bug reports from users who attempt to override one of the three built-in model providers (openai, ollama, or lmstudio). Currently, these overrides are silently ignored. This PR makes it an error to override them. ## Summary - add validation for `model_providers` so `openai`, `ollama`, and `lmstudio` keys now produce clear configuration errors instead of being silently ignored	2026-03-13 22:10:13 -06:00
sayan-oai	d272f45058	move plugin/skill instructions into dev msg and reorder (#14609 ) Move the general `Apps`, `Skills` and `Plugins` instructions blocks out of `user_instructions` and into the developer message, with new `Apps -> Skills -> Plugins` order for better clarity. Also wrap those sections in stable XML-style instruction tags (like other sections) and update prompt-layout tests/snapshots. This makes the tests less brittle in snapshot output (we can parse the sections), and it consolidates the capability instructions in one place. #### Tests Updated snapshots, added tests. `<AGENTS_MD>` disappearing in snapshots is expected: before this change, the wrapped user-instructions message was kept alive by `Skills` content. Now that `Skills` and `Plugins` are in the developer message, that wrapper only appears when there is real project-doc/user-instructions content. --------- Co-authored-by: Charley Cunningham <ccunningham@openai.com>	2026-03-13 20:51:01 -07:00
viyatb-oai	7f571396c8	fix: sync split sandbox policies for spawned subagents (#14650 ) ## Summary - reapply the live split filesystem and network sandbox policies when building spawned subagent configs - keep spawned child sessions aligned with the parent turn after role-layer config reloads - add regression coverage for both config construction and spawned child-turn inheritance	2026-03-14 03:03:49 +00:00
viyatb-oai	6dc04df5e6	fix: persist future network host approvals across sessions (#14619 ) ## Summary - apply persisted execpolicy network rules when booting the managed network proxy - pass the current execpolicy into managed proxy startup so host approvals selected with "allow this host in the future" survive new sessions	2026-03-14 02:46:10 +00:00
Charley Cunningham	bbd329a812	Fix turn context reconstruction after backtracking (#14616 ) ## Summary - reuse rollout reconstruction when applying a backtrack rollback so `reference_context_item` is restored from persisted rollout state - build rollback replay from the flushed rollout items plus the rollback marker, avoiding the extra reread/fallback path - add regression coverage for rollback after compaction so turn-context diffing stays aligned after backtracking Co-authored-by: Codex <noreply@openai.com>	2026-03-13 19:28:31 -07:00
Ahmed Ibrahim	69c8a1ef9e	Fix Windows CI assertions for guardian and Smart Approvals (#14645 ) - Normalize guardian assessment path serialization to use forward slashes for cross-platform stability. - Seed workspace-write defaults in the Smart Approvals override-turn-context test so Windows and non-Windows selection flows are consistent. --------- Co-authored-by: Codex <noreply@openai.com> Co-authored-by: Charles Cunningham <ccunningham@openai.com>	2026-03-14 02:15:58 +00:00
Eric Traut	4b9d5c8c1b	Add openai_base_url config override for built-in provider (#12031) We regularly get bug reports from users who mistakenly have the `OPENAI_BASE_URL` environment variable set. This PR deprecates this environment variable in favor of a top-level config key `openai_base_url` that is used for the same purpose. By making it a config key, it will be more visible to users. It will also participate in all of the infrastructure we've added for layered and managed configs. Summary - introduce the `openai_base_url` top-level config key, update schema/tests, and route the built-in openai provider through it - fall back to the deprecated `OPENAI_BASE_URL` env var, but warn the user of the deprecation, when no `openai_base_url` config key is present - update CLI, SDK, and TUI code to prefer the new config path (with a deprecated env-var fallback) and document the SDK behavior change	2026-03-13 20:12:25 -06:00
Michael Bolin	b859a98e0f	refactor: make unified-exec zsh-fork state explicit (#14633)
## Why
The unified-exec path was carrying zsh-fork state in a partially flattened way.
First, the decision about whether zsh-fork was active came from feature selection in `ToolsConfig`, while the real prerequisites lived in session state. That left the handler and runtime defending against partially configured cases later.
Second, once zsh-fork was active, its two runtime-only paths were threaded through the runtime as separate arguments even though they form one coherent piece of configuration.
This change keeps unified-exec on a single session-derived source of truth and bundles the zsh-fork-specific paths into a named config type so the runtime can pass them around as one unit.
In particular, this PR introduces this enum so the `ZshFork` variant can carry the appropriate state with it:
```rust
#[derive(Debug, Clone, Eq, PartialEq)]
pub enum UnifiedExecShellMode {
    Direct,
    ZshFork(ZshForkConfig),
}

#[derive(Debug, Clone, Eq, PartialEq)]
pub struct ZshForkConfig {
    pub(crate) shell_zsh_path: AbsolutePathBuf,
    pub(crate) main_execve_wrapper_exe: AbsolutePathBuf,
}
```
This cleanup was done in preparation for https://github.com/openai/codex/pull/13432.
## What Changed
- Replaced the feature-only `UnifiedExecBackendConfig` split with `UnifiedExecShellMode` in `codex-rs/core/src/tools/spec.rs`.
- Derived the unified-exec mode from session-backed inputs when building turn `ToolsConfig`, and preserved that mode across model switches and review turns.
- Introduced `ZshForkConfig`, which stores the resolved zsh-fork `AbsolutePathBuf` values for the configured `zsh` binary and `execve` wrapper.
- Threaded `ZshForkConfig` through unified-exec command construction and the zsh-fork preparation path so zsh-fork-specific runtime code consumes a single config object instead of separate path arguments.
- Added focused tests for constructing zsh-fork mode only when session prerequisites are available, and updated the zsh-fork expectations to be target-platform aware.
## Testing
- `cargo test -p codex-core zsh_fork --lib`
---
[//]: # (BEGIN SAPLING FOOTER)
Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/openai/codex/pull/14633).
* #13432
* __->__ #14633	2026-03-13 17:20:01 -07:00
Ahmed Ibrahim	7fa5201365	Use parser-specific realtime voice enum (#14636 ) Model realtime session output voices with an enum and map by parser so v1 uses fathom and v2 uses alloy. Co-authored-by: Codex <noreply@openai.com>	2026-03-13 16:17:13 -07:00
Ahmed Ibrahim	e9050e3e64	Fix realtime transcription session.update tools payload (#14635 ) Only attach session tools for Realtime v2 conversational sessions, and omit tools in transcription mode so realtime startup no longer fails with unknown parameter errors. Co-authored-by: Codex <noreply@openai.com>	2026-03-13 16:08:58 -07:00
Andrei Eternal	9a44a7e499	[hooks] stop continuation & stop_hook_active mechanics (#14532)
Stop hooks now receive `stop_hook_active` and enable stop hooks to loop forever if they'd like to. In the initial hooks PR, we implemented a simpler mechanic that the stop-blocking could only happen once in a row
- support stop hook adding a continuation prompt to add a further task
- if multiple stop-blocks happen that have continuation prompts, they are concatenated
example run:
```
› hey :)

• Running SessionStart hook: lighting the observatory

SessionStart hook (completed)
  warning: Hi, I'm a session start hook for wizard-tower (startup).
  hook context: Startup context: A wimboltine stonpet is an exotic cuisine from hyperspace

• Aloha :) Happy to jam with you. What are we building today?

• Running Stop hook: updating the guards

Stop hook (blocked)
  warning: Wizard Tower Stop hook continuing conversation
  feedback: cook the stonpet

• Aloha, here’s the hyperspace move for cooking a wimboltine stonpet:

  1. Sear the stonpet in a hot pan with moon-oil until the edges shimmer.
  2. Add star-lime, black salt, and a little fermented nebula paste.
  3. Lower the heat and let it braise for 8 cosmic minutes with a splash of comet broth.
  4. Finish with sky herbs and serve over warm asteroid rice.

  The vibe: crispy outside, tender center, deep interdimensional savor.

• Running Stop hook: updating the guards

Stop hook (completed)
  warning: Wizard Tower Stop hook saw a second pass and stayed calm to avoid a loop.
```
.codex/config.toml
```
[features]
codex_hooks = true
```
.codex/hooks.json
```
{
  "hooks": {
    "SessionStart": [
      {
        "matcher": "startup|resume",
        "hooks": [
          {
            "type": "command",
            "command": "/usr/bin/python3 .codex/hooks/session_start_demo.py",
            "timeoutSec": 10,
            "statusMessage": "lighting the observatory"
          }
        ]
      }
    ],
    "Stop": [
      {
        "hooks": [
          {
            "type": "command",
            "command": "/usr/bin/python3 .codex/hooks/stop_demo_block.py",
            "timeoutSec": 10,
            "statusMessage": "updating the guards"
          }
        ]
      }
    ]
  }
}
```
.codex/hooks/session_start_demo.py
```
#!/usr/bin/env python3
import json
import sys
from pathlib import Path


def main() -> int:
    payload = json.load(sys.stdin)
    cwd = Path(payload.get("cwd", ".")).name or "wizard-tower"
    source = payload.get("source", "startup")
    source_label = "resume" if source == "resume" else "startup"
    source_prefix = (
        "Resume context:" if source == "resume" else "Startup context:"
    )
    output = {
        "systemMessage": (
            f"Hi, I'm a session start hook for {cwd} ({source_label})."
        ),
        "hookSpecificOutput": {
            "hookEventName": "SessionStart",
            "additionalContext": (
                f"{source_prefix} A wimboltine stonpet is an exotic cuisine from hyperspace"
            ),
        },
    }
    print(json.dumps(output))
    return 0


if __name__ == "__main__":
    raise SystemExit(main())
```
.codex/hooks/stop_demo_block.py
```
#!/usr/bin/env python3
import json
import sys


def main() -> int:
    payload = json.load(sys.stdin)
    stop_hook_active = payload.get("stop_hook_active", False)
    last_assistant_message = payload.get("last_assistant_message") or ""
    char_count = len(last_assistant_message.strip())
    if stop_hook_active:
        system_message = (
            "Wizard Tower Stop hook saw a second pass and stayed calm to avoid a loop."
        )
        print(json.dumps({"systemMessage": system_message}))
    else:
        system_message = (
            f"Wizard Tower Stop hook continuing conversation"
        )
        print(json.dumps({"systemMessage": system_message, "decision": "block", "reason": "cook the stonpet"}))
    return 0


if __name__ == "__main__":
    raise SystemExit(main())
```	2026-03-13 15:51:19 -07:00
Charley Cunningham	467e6216bb	Fix stale create_wait_tool reference (#14639 ) ## Summary - replace the stale `create_wait_tool()` reference in `spec_tests.rs` - use `create_wait_agent_tool()` to match the actual multi-agent tool rename from `#14631` - fix the resulting `codex-core` spec-test compile failure on current `main` ## Context `#14631` renamed the model-facing multi-agent tool from `wait` to `wait_agent` and renamed the corresponding spec helper to `create_wait_agent_tool()`. One `spec_tests.rs` call site was left behind, so current `main` fails to compile `codex-core` tests with: - `cannot find function create_wait_tool` Using `create_wait_agent_tool()` is the correct fix here; `create_exec_wait_tool()` would point at the separate exec wait tool and would not match the renamed multi-agent toolset. ## Testing - not rerun locally after the rebase Co-authored-by: Codex <noreply@openai.com>	2026-03-13 15:35:25 -07:00
Charley Cunningham	bc24017d64	Add Smart Approvals guardian review across core, app-server, and TUI (#13860) ## Summary - add `approvals_reviewer = "user" | "guardian_subagent"` as the runtime control for who reviews approval requests - route Smart Approvals guardian review through core for command execution, file changes, managed-network approvals, MCP approvals, and delegated/subagent approval flows - expose guardian review in app-server with temporary unstable `item/autoApprovalReview/{started,completed}` notifications carrying `targetItemId`, `review`, and `action` - update the TUI so Smart Approvals can be enabled from `/experimental`, aligned with the matching `/approvals` mode, and surfaced clearly while reviews are pending or resolved ## Runtime model This PR does not introduce a new `approval_policy`. Instead: - `approval_policy` still controls when approval is needed - `approvals_reviewer` controls who reviewable approval requests are routed to: - `user` - `guardian_subagent` `guardian_subagent` is a carefully prompted reviewer subagent that gathers relevant context and applies a risk-based decision framework before approving or denying the request. The `smart_approvals` feature flag is a rollout/UI gate. Core runtime behavior keys off `approvals_reviewer`. When Smart Approvals is enabled from the TUI, it also switches the current `/approvals` settings to the matching Smart Approvals mode so users immediately see guardian review in the active thread: - `approval_policy = on-request` - `approvals_reviewer = guardian_subagent` - `sandbox_mode = workspace-write` Users can still change `/approvals` afterward. 
Config-load behavior stays intentionally narrow: - plain `smart_approvals = true` in `config.toml` remains just the rollout/UI gate and does not auto-set `approvals_reviewer` - the deprecated `guardian_approval = true` alias migration does backfill `approvals_reviewer = "guardian_subagent"` in the same scope when that reviewer is not already configured there, so old configs preserve their original guardian-enabled behavior ARC remains a separate safety check. For MCP tool approvals, ARC escalations now flow into the configured reviewer instead of always bypassing guardian and forcing manual review. ## Config stability The runtime reviewer override is stable, but the config-backed app-server protocol shape is still settling. - `thread/start`, `thread/resume`, and `turn/start` keep stable `approvalsReviewer` overrides - the config-backed `approvals_reviewer` exposure returned via `config/read` (including profile-level config) is now marked `[UNSTABLE]` / experimental in the app-server protocol until we are more confident in that config surface ## App-server surface This PR intentionally keeps the guardian app-server shape narrow and temporary. It adds generic unstable lifecycle notifications: - `item/autoApprovalReview/started` - `item/autoApprovalReview/completed` with payloads of the form: - `{ threadId, turnId, targetItemId, review, action? }` `review` is currently: - `{ status, riskScore?, riskLevel?, rationale? }` - where `status` is one of `inProgress`, `approved`, `denied`, or `aborted` `action` carries the guardian action summary payload from core when available. This lets clients render temporary standalone pending-review UI, including parallel reviews, even when the underlying tool item has not been emitted yet. These notifications are explicitly documented as `[UNSTABLE]` and expected to change soon. This PR does not persist guardian review state onto `thread/read` tool items. 
The intended follow-up is to attach guardian review state to the reviewed tool item lifecycle instead, which would improve consistency with manual approvals and allow thread history / reconnect flows to replay guardian review state directly. ## TUI behavior - `/experimental` exposes the rollout gate as `Smart Approvals` - enabling it in the TUI enables the feature and switches the current session to the matching Smart Approvals `/approvals` mode - disabling it in the TUI clears the persisted `approvals_reviewer` override when appropriate and returns the session to default manual review when the effective reviewer changes - `/approvals` still exposes the reviewer choice directly - the TUI renders: - pending guardian review state in the live status footer, including parallel review aggregation - resolved approval/denial state in history ## Scope notes This PR includes the supporting core/runtime work needed to make Smart Approvals usable end-to-end: - shell / unified-exec / apply_patch / managed-network / MCP guardian review - delegated/subagent approval routing into guardian review - guardian review risk metadata and action summaries for app-server/TUI - config/profile/TUI handling for `smart_approvals`, `guardian_approval` alias migration, and `approvals_reviewer` - a small internal cleanup of delegated approval forwarding to dedupe fallback paths and simplify guardian-vs-parent approval waiting (no intended behavior change) Out of scope for this PR: - redesigning the existing manual approval protocol shapes - persisting guardian review state onto app-server `ThreadItem`s - delegated MCP elicitation auto-review (the current delegated MCP guardian shim only covers the legacy `RequestUserInput` path) --------- Co-authored-by: Codex <noreply@openai.com>	2026-03-13 15:27:00 -07:00
Charley Cunningham	e3cbf913e8	Fix wait_agent expectations in core tests (#14637 ) ## Summary - update stale core tool-spec expectations from `wait` to `wait_agent` - update the prompt-caching tool-name assertion to match the renamed tool - fix the Bazel regressions introduced after #14631 renamed the multi-agent wait tool ## Testing - cargo test -p codex-core tools::spec::tests - cargo test -p codex-core suite::prompt_caching::prompt_tools_are_consistent_across_requests Co-authored-by: Codex <noreply@openai.com>	2026-03-13 15:15:59 -07:00
pakrym-oai	cb7d8f45a1	Normalize MCP tool names to code-mode safe form (#14605 ) Code mode doesn't allow `-` in names and it's better if function names and code-mode names are the same.	2026-03-13 14:50:16 -07:00
Ruslan Nigmatullin	f8f82bfc2b	app-server: add v2 filesystem APIs (#14245 ) Add a protocol-level filesystem surface to the v2 app-server so Codex clients can read and write files, inspect directories, and subscribe to path changes without relying on host-specific helpers. High-level changes: - define the new v2 fs/readFile, fs/writeFile, fs/createDirectory, fs/getMetadata, fs/readDirectory, fs/remove, fs/copy RPCs - implement the app-server handlers, including absolute-path validation, base64 file payloads, recursive copy/remove semantics - document the API, regenerate protocol schemas/types, and add end-to-end tests for filesystem operations, copy edge cases Testing plan: - validate protocol serialization and generated schema output for the new fs request, response, and notification types - run app-server integration coverage for file and directory CRUD paths, metadata/readDirectory responses, copy failure modes, and absolute-path validation	2026-03-13 14:42:20 -07:00
Ahmed Ibrahim	36dfb84427	Stabilize multi-agent feature flag (#14622 ) - make multi_agent stable and enabled by default - update feature and tool-spec coverage to match the new default --------- Co-authored-by: Codex <noreply@openai.com>	2026-03-13 14:38:15 -07:00
Ahmed Ibrahim	cfd97b36da	Rename multi-agent wait tool to wait_agent (#14631 ) - rename the multi-agent tool name the model sees to wait_agent - update the model-facing prompts and tool descriptions to match --------- Co-authored-by: Codex <noreply@openai.com>	2026-03-13 14:38:05 -07:00
Won Park	6720caf778	Slash copy osc52 wsl support (#13201 ) This PR is a followup to the /copy feature to support WSL and SSH!	2026-03-13 14:00:58 -07:00
pakrym-oai	477a2dd345	Add code_mode_only feature (#14617 ) Summary - add the code_mode_only feature flag/config schema and wire its dependency on code_mode - update code mode tool descriptions to list nested tools with detailed headers - restrict available tools for prompt and exec descriptions when code_mode_only is enabled and test the behavior Testing - Not run (not requested)	2026-03-13 13:30:19 -07:00
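
The validation described in commit ae0a6510e1 (overriding a built-in model provider is now an error rather than being silently ignored) could be sketched roughly as follows. This is an illustrative Python sketch, not the Rust code from the commit: `RESERVED_PROVIDER_IDS`, `ConfigError`, and `validate_model_providers` are hypothetical names, and only the three reserved keys (`openai`, `ollama`, `lmstudio`) come from the commit message.

```python
# Hypothetical sketch: reject user config that redefines a built-in provider.
# The three reserved ids are taken from the commit message; everything else
# (names, error wording) is an assumption made for illustration.
RESERVED_PROVIDER_IDS = ("openai", "ollama", "lmstudio")


class ConfigError(ValueError):
    """Raised when the user configuration is invalid (hypothetical type)."""


def validate_model_providers(model_providers: dict) -> None:
    """Raise ConfigError if any key collides with a built-in provider id."""
    conflicts = sorted(k for k in model_providers if k in RESERVED_PROVIDER_IDS)
    if conflicts:
        raise ConfigError(
            "model_providers may not override built-in providers: "
            + ", ".join(conflicts)
        )


if __name__ == "__main__":
    # A custom provider id is accepted.
    validate_model_providers({"my-proxy": {"base_url": "http://localhost:8080/v1"}})
    # A built-in id is rejected with an explicit error.
    try:
        validate_model_providers({"openai": {"base_url": "http://localhost:8080/v1"}})
    except ConfigError as err:
        print(err)
```

Failing loudly at config-load time matches the motivation given in the commit: a silently ignored override looks like a bug to the user.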


4582 Commits
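
The `defaultPrompt` handling described in commit e389091042 (a string is still accepted, the array is capped at 3 entries of up to 128 characters, extra entries are dropped, and over-long entries become `None` without raising an error) can be illustrated with a small Python sketch. The function name `normalize_default_prompt` and the constants are hypothetical, and the order of operations (truncate the list first, then blank out long entries) is an assumption; the commit message does not specify it.

```python
from typing import List, Optional, Union

# Limits quoted in the commit message; the constant names are hypothetical.
MAX_ENTRIES = 3
MAX_CHARS = 128


def normalize_default_prompt(
    value: Union[str, List[str], None],
) -> List[Optional[str]]:
    """Normalize a plugin's defaultPrompt without raising on bad input.

    Assumed behavior (a sketch, not the app-server implementation):
    - a bare string is wrapped into a one-element list (backcompat),
    - entries beyond MAX_ENTRIES are dropped,
    - entries longer than MAX_CHARS are replaced with None.
    """
    if value is None:
        return []
    entries = [value] if isinstance(value, str) else list(value)
    entries = entries[:MAX_ENTRIES]
    return [e if len(e) <= MAX_CHARS else None for e in entries]


if __name__ == "__main__":
    print(normalize_default_prompt("Summarize this repo"))
    print(normalize_default_prompt(["a", "b", "c", "d"]))
    print(normalize_default_prompt(["x" * 129, "ok"]))
```

Replacing an over-long entry with `None` instead of removing it keeps the positions of the remaining entries stable, which is one plausible reading of "`None`s-out" in the commit message.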