codex

mirror of https://github.com/openai/codex.git synced 2026-05-04 13:21:54 +03:00

Author	SHA1	Message	Date
Ahmed Ibrahim	cd591dc457	Preserve null developer instructions (#16976 ) Preserve explicit null developer-instruction overrides across app-server resume and fork flows.	2026-04-07 09:32:14 -07:00
Eric Traut	feb4f0051a	Fix nested exec thread ID restore (#16882 ) Addresses #15527 Problem: Nested `codex exec` commands could source a shell snapshot that re-exported the parent `CODEX_THREAD_ID`, so commands inside the nested session were attributed to the wrong thread. Solution: Reapply the live command env's `CODEX_THREAD_ID` after sourcing the snapshot.	2026-04-07 09:26:22 -07:00
Eric Traut	82506527f1	Fix read-only apply_patch rejection message (#16885 ) Addresses #15532 Problem: Nested read-only `apply_patch` rejections report in-project files as outside the project. Solution: Choose the rejection message based on sandbox mode so read-only sessions report a read-only-specific reason, and add focused safety coverage.	2026-04-07 09:25:39 -07:00
Eric Traut	3b32de4fab	Stabilize flaky multi-agent followup interrupt test (#16739 ) Problem: The multi-agent followup interrupt test polled history before interrupt cleanup and mailbox wakeup were guaranteed to settle, which made it flaky under CI scheduling variance. Solution: Wait for the child turn's `TurnAborted(Interrupted)` event before asserting that the redirected assistant envelope is recorded and no plain user message is left behind.	2026-04-07 09:24:14 -07:00
jif-oai	4cc6818996	chore: keep request_user_input tool to persist cache on multi-agents (#17009 )	2026-04-07 16:53:31 +01:00
pakrym-oai	413c1e1fdf	[codex] reduce module visibility (#16978 ) ## Summary - reduce public module visibility across Rust crates, preferring private or crate-private modules with explicit crate-root public exports - update external call sites and tests to use the intended public crate APIs instead of reaching through module trees - add the module visibility guideline to AGENTS.md ## Validation - `cargo check --workspace --all-targets --message-format=short` passed before the final fix/format pass - `just fix` completed successfully - `just fmt` completed successfully - `git diff --check` passed	2026-04-07 08:03:35 -07:00
jif-oai	89f1a44afa	feat: /feedback cascade (#16442 ) Example here: https://openai.sentry.io/issues/7380240430/?project=4510195390611458&query=019d498f-bec4-7ba2-96d2-612b1e4507df&referrer=issue-stream	2026-04-07 12:47:37 +01:00
jif-oai	99f167e6bf	chore: hide nickname for debug flag (#17007 )	2026-04-07 11:31:13 +01:00
jif-oai	68e16baabe	chore: send_message and followup_task do not return anything (#17008 )	2026-04-07 11:26:36 +01:00
jif-oai	2a8c3a2a52	feat: drop agent ID from v2 (#17005 )	2026-04-07 10:56:01 +01:00
jif-oai	e2bb45bb24	chore: debug flag to hide some parameters (#17002 )	2026-04-07 10:42:19 +01:00
jif-oai	51f75e2f56	feat: empty role ok (#16999 )	2026-04-07 10:34:08 +01:00
starr-openai	741e2fdeb8	[codex] ez - rename env=>request in codex-rs/core/src/unified_exec/process_manager.rs (#16724 )	2026-04-07 10:17:31 +01:00
Won Park	90320fc51a	collapse dev message into one (#16988 ) collapse image-gen dev message into one	2026-04-06 23:49:47 -07:00
Ahmed Ibrahim	24c598e8a9	Honor null thread instructions (#16964 ) - Treat explicit null thread instructions as a blank-slate override while preserving omitted-field fallback behavior. - Preserve null through rollout resume/fork and keep explicit empty strings distinct. - Add app-server v2 start/fork coverage for the tri-state instruction params.	2026-04-07 04:10:19 +00:00
pakrym-oai	4bb507d2c4	Make AGENTS.md discovery FS-aware (#15826 ) ## Summary - make AGENTS.md discovery and loading fully FS-aware and remove the non-FS discover helper - migrate remote-aware codex-core tests to use TestEnv workspace setup instead of syncing a local workspace copy - add AGENTS.md corner-case coverage, including directory fallbacks and remote-aware integration coverage ## Testing - cargo test -p codex-core project_doc -- --nocapture - cargo test -p codex-core hierarchical_agents -- --nocapture - cargo test -p codex-core agents_md -- --nocapture - cargo test -p codex-tui status -- --nocapture - cargo test -p codex-tui-app-server status -- --nocapture - just fix - just fmt - just bazel-lock-update - just bazel-lock-check - just argument-comment-lint - remote Linux executor tests in progress via scripts/test-remote-env.sh	2026-04-06 20:26:21 -07:00
Ruslan Nigmatullin	232db0613a	app-server: Fix compilation of a test in mcp_resource (#16972 )	2026-04-06 20:17:08 -07:00
viyatb-oai	9d13d29acd	[codex] Add danger-full-access denylist-only network mode (#16946 ) ## Summary This adds `experimental_network.danger_full_access_denylist_only` for orgs that want yolo / danger-full-access sessions to keep full network access while still enforcing centrally managed deny rules. When the flag is true and the session sandbox is `danger-full-access`, the network proxy starts with: - domain allowlist set to `*` - managed domain `deny` entries enforced - upstream proxy use allowed - all Unix sockets allowed - local/private binding allowed Caveat: the denylist is best effort only. In yolo / danger-full-access mode, Codex or the model can use an allowed socket or other local/private network path to bypass the proxy denylist, so this should not be treated as a hard security boundary. The flag is intentionally scoped to `SandboxPolicy::DangerFullAccess`. Read-only and workspace-write modes keep the existing managed/user allowlist, denylist, Unix socket, and local-binding behavior. This does not enable the non-loopback proxy listener setting; that still requires its own explicit config. This also threads the new field through config requirements parsing, app-server protocol/schema output, config API mapping, and the TUI debug config output. ## How to use Add the flag under `[experimental_network]` in the network policy config that is delivered to Codex. The setting is not under `[permissions]`. ```toml [experimental_network] enabled = true danger_full_access_denylist_only = true [experimental_network.domains] "blocked.example.com" = "deny" ".blocked.example.com" = "deny" ``` With that configuration, yolo / danger-full-access sessions get broad network access except for the managed denied domains above. The denylist remains a best-effort proxy policy because the session may still use allowed sockets to bypass it. Other sandbox modes do not get the wildcard domain allowlist or the socket/local-binding relaxations from this flag.
## Verification - `cargo test -p codex-config network_requirements` - `cargo test -p codex-core network_proxy_spec` - `cargo test -p codex-app-server map_requirements_toml_to_api` - `cargo test -p codex-tui debug_config_output` - `cargo test -p codex-app-server-protocol` - `just write-app-server-schema` - `just fmt` - `just fix -p codex-config -p codex-core -p codex-app-server-protocol -p codex-app-server -p codex-tui` - `just fix -p codex-core -p codex-config` - `git diff --check` - `cargo clean`	2026-04-06 19:38:51 -07:00
viyatb-oai	806e5f7c69	fix: warn when bwrap cannot create user namespaces (#15893 ) ## Summary - add a Linux startup warning when system `bwrap` is present but cannot create user namespaces - keep the Linux-specific probe, sandbox-policy gate, and stderr matching in `codex-sandboxing` - polish the missing-`bwrap` warning to point users at the sandbox prerequisites and OS package-manager install path ## Details - probes system `bwrap` with `--unshare-user`, `--unshare-net`, and a minimal bind before command execution - detects known bubblewrap setup failures for `RTM_NEWADDR`, `RTM_NEWLINK`, uid-map permission denial, and `No permissions to create a new namespace` - preserves the existing suppression for sandbox-bypassed policies such as `danger-full-access` and `external-sandbox` - updates the Linux sandbox docs to call out the user-namespace requirement --------- Co-authored-by: Codex <noreply@openai.com>	2026-04-06 19:19:35 -07:00
Matthew Zeng	5fe9ef06ce	[mcp] Support MCP Apps part 1. (#16082 ) - [x] Add `mcpResource/read` method to read mcp resource.	2026-04-06 19:17:14 -07:00
Eric Traut	ee12772e80	Validate exec input before starting app-server (#16890 ) Addresses #16443 This was a regression introduced when we moved exec on top of the app server APIs. Problem: codex exec resolved prompt/stdin and output schema after starting the in-process app-server, so early `process::exit(1)` paths could bypass session shutdown. Solution: Resolve prompt/stdin and output schema before app-server startup so validation failures happen before any exec session is created.	2026-04-06 18:13:05 -07:00
Ruslan Nigmatullin	b34a3a6e92	app-server: Unify config changes handling a bit (#16961 )	2026-04-06 18:04:00 -07:00
pakrym-oai	0de7662dab	Add setTimeout support to code mode (#16153 ) The implementation is less than ideal: it starts a thread per timer. A better approach might be to switch to tokio and use its timer implementation.	2026-04-06 17:46:28 -07:00
pakrym-oai	1f2411629f	Refactor config types into a separate crate (#16962 ) Move config types into a separate crate because their macros expand into a lot of new code.	2026-04-07 00:32:41 +00:00
Curtis 'Fjord' Hawthorne	d2df7c54b2	Promote image_detail_original to experimental (#16957 )	2026-04-06 17:25:16 -07:00
starr-openai	a504d8f0fa	Disable env-bound tools when exec server is none (#16349 ) ## Summary - make `CODEX_EXEC_SERVER_URL=none` map to an explicit disabled environment mode instead of inferring from a missing URL - expose environment capabilities (`exec_enabled`, `filesystem_enabled`) so tool building can gate behavior explicitly and future multi-environment work has a clearer seam - suppress env-backed tools when the relevant capability is unavailable, including exec tools, `js_repl`, `apply_patch`, `list_dir`, and `view_image` - keep handler/runtime backstops so disabled environments still reject execution if a tool path somehow bypasses registration ## Testing - `just fmt` - `cargo test -p codex-exec-server` - `cargo test -p codex-tools disabled_environment_omits_environment_backed_tools` - `cargo test -p codex-tools environment_capabilities_gate_exec_and_filesystem_tools_independently` - remote devbox Bazel build via `codex-applied-devbox`: `//codex-rs/cli:cli`	2026-04-06 17:22:06 -07:00
Eric Traut	9f737c28dd	Speed up /mcp inventory listing (#16831 ) Addresses #16244 This was a performance regression introduced when we moved the TUI on top of the app server API. Problem: `/mcp` rebuilt a full MCP inventory through `mcpServerStatus/list`, including resources and resource templates that made the TUI wait on slow inventory probes. Solution: add a lightweight `detail` mode to `mcpServerStatus/list`, have `/mcp` request tools-and-auth only, and cover the fast path with app-server and TUI tests. Testing: Confirmed slow (multi-second) response prior to change and immediate response after change. I considered two options: 1. Change the existing `mcpServerStatus/list` API to accept an optional "details" parameter so callers can request only a subset of the information. 2. Add a separate `mcpServer/list` API that returns only the servers, tools, and auth but omits the resources. I chose option 1, but option 2 is also a reasonable approach.	2026-04-06 16:27:02 -07:00
rhan-oai	756c45ec61	[codex-analytics] add protocol-native turn timestamps (#16638 ) --- [//]: # (BEGIN SAPLING FOOTER) Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/openai/codex/pull/16638). * #16870 * #16706 * #16659 * #16641 * #16640 * __->__ #16638	2026-04-06 16:22:59 -07:00
Eric Traut	e88c2cf4d7	tui: route device-code auth through app server (#16827 ) Addresses #7646 Also enables device code auth for remote TUI sessions Problem: TUI onboarding handled device-code login directly rather than using the recently-added app server support for device auth. Also, auth screens kept animating while users needed to copy login details. Solution: Route device-code onboarding through app-server login APIs and make the auth screens static while those copy-oriented flows are visible.	2026-04-06 15:47:26 -07:00
Eric Traut	54faa76960	Respect residency requirements in mcp-server (#16952 ) Addresses #16951 Problem: codex mcp-server did not apply the configured residency requirement, so requests from non-US regions could miss the `residency` header and fail with a 401. Solution: Set the default client residency requirement after loading config in the MCP server startup path, matching the existing exec and TUI behavior.	2026-04-06 15:46:55 -07:00
xl-openai	e62d645e67	feat: refresh non-curated cache from plugin list. (#16191 ) 1. Use versions for non-curated plugin (defined in plugin.json) for cache refresh 2. Trigger refresh from plugin/list roots	2026-04-06 15:40:00 -07:00
xl-openai	03edd4fbee	feat: fallback curated plugin download from backend endpoint. (#16947 ) Add one more fallback for downloading the curated plugin repo from chatgpt.com. It has to be the last fallback for now because it is a lagging backup.	2026-04-06 15:36:20 -07:00
viyatb-oai	36cd163504	[codex] Allow PyTorch libomp shm in Seatbelt (#16945 ) ## Summary - Add a targeted macOS Seatbelt allow rule for PyTorch/libomp KMP registration shared-memory objects. - Scope the rule to read/create/unlink operations on names matching `^/__KMP_REGISTERED_LIB_[0-9]+$`. - Add a base-policy regression assertion in `seatbelt_tests.rs`. ## Why Importing PyTorch on macOS under the Codex sandbox can abort when libomp attempts to create the KMP registration POSIX shm object and Seatbelt denies `ipc-posix-shm-write-create`. ## Validation - `just fmt` - `cargo test -p codex-sandboxing` - `cargo clippy -p codex-sandboxing --all-targets` - `just argument-comment-lint` - `git diff --check` - End-to-end PyTorch import under `codex sandbox macos` exited `0` with no KMP shm denial - `cargo clean`	2026-04-06 22:12:30 +00:00
Ruslan Nigmatullin	73dab2046f	app-server: Add transport for remote control (#15951 )	2026-04-06 14:55:59 -07:00
joeytrasatti-openai	03c07956cf	Revert "[codex-backend] Make thread metadata updates tolerate pending backfill" (#16923 ) Reverts openai/codex#16877	2026-04-06 21:25:05 +00:00
Matthew Zeng	756ba8baae	Fix clippy warning (#16939 ) - [x] Fix clippy warning	2026-04-06 14:08:55 -07:00
Ruslan Nigmatullin	1525bbdb9a	app-server: centralize AuthManager initialization (#16764 ) Extract a shared helper that builds AuthManager from Config and applies the forced ChatGPT workspace override in one place. Create the shared AuthManager at MessageProcessor call sites so that upcoming new transport's initialization can reuse the same handle, and keep only external auth refresher wiring inside `MessageProcessor`. Remove the now-unused `AuthManager::shared_with_external_auth` helper.	2026-04-06 12:46:55 -07:00
Owen Lin	9bb813353e	fix(sqlite): don't hard fail migrator if DB is newer (#16924 ) ## Description This PR makes the SQLite state runtime tolerate databases that have already been migrated by a newer Codex binary. Today, if an older CLI sees migration versions in `_sqlx_migrations` that it doesn't know about, startup fails. This change relaxes that check for the runtime migrators we use in `codex-state` so older binaries can keep opening the DB in that case. ## Why We can end up with mixed-version CLIs running against the same local state DB. In that setup, treating "the database is ahead of me" as a hard error is unnecessarily strict and breaks the older client even when the migration history is otherwise fine. ## Follow-up We still clean up versioned `state_.sqlite` and `logs_.sqlite` files during init, so older binaries can treat newer DB files as legacy. That should probably be tightened separately if we want mixed-version local usage to be fully safe.	2026-04-06 12:16:31 -07:00
Owen Lin	bd30bad96f	fix(guardian): fix ordering of guardian events (#16462 ) Guardian events were emitted a bit out of order for CommandExecution items. This would make it hard for the frontend to render a guardian auto-review, which has this payload: ``` pub struct ItemGuardianApprovalReviewStartedNotification { pub thread_id: String, pub turn_id: String, pub target_item_id: String, pub review: GuardianApprovalReview, // FYI this is no longer a json blob pub action: Option<JsonValue>, } ``` There is a `target_item_id` the auto-approval review is referring to, but the actual item had not been emitted yet. Before this PR: - `item/autoApprovalReview/started` - `item/autoApprovalReview/completed`, and if approved... - `item/started` - `item/completed` After this PR: - `item/started` - `item/autoApprovalReview/started` - `item/autoApprovalReview/completed` - `item/completed` This lines up much better with existing patterns (i.e. human review in `Default mode`, where app-server would send a server request to prompt for user approval after `item/started`), and makes it easier for clients to render what guardian is actually reviewing. We do this following a similar pattern as `FileChange` (aka apply patch) items, where we create a FileChange item and emit `item/started` if we see the apply patch approval request, before the actual apply patch call runs.	2026-04-06 19:14:27 +00:00
Owen Lin	ded559680d	feat(requirements): support allowed_approval_reviewers (#16701 ) ## Description Add requirements.toml support for `allowed_approvals_reviewers = ["user", "guardian_subagent"]`, so admins can now restrict the use of guardian mode. Note: If a user sets a reviewer that isn’t allowed by requirements.toml, config loading falls back to the first allowed reviewer and emits a startup warning. The table below describes the possible admin controls.
| Admin intent | `requirements.toml` | User `config.toml` | End result |
|---|---|---|---|
| Leave Guardian optional | omit `allowed_approvals_reviewers` or set `["user", "guardian_subagent"]` | user chooses `approvals_reviewer = "user"` or `"guardian_subagent"` | Guardian off for `user`, on for `guardian_subagent` + `approval_policy = "on-request"` |
| Force Guardian off | `allowed_approvals_reviewers = ["user"]` | any user value | Effective reviewer is `user`; Guardian off |
| Force Guardian on | `allowed_approvals_reviewers = ["guardian_subagent"]` and usually `allowed_approval_policies = ["on-request"]` | any user reviewer value; user should also have `approval_policy = "on-request"` unless policy is forced | Effective reviewer is `guardian_subagent`; Guardian on when effective approval policy is `on-request` |
| Allow both, but default to manual if user does nothing | `allowed_approvals_reviewers = ["user", "guardian_subagent"]` | omit `approvals_reviewer` | Effective reviewer is `user`; Guardian off |
| Allow both, and user explicitly opts into Guardian | `allowed_approvals_reviewers = ["user", "guardian_subagent"]` | `approvals_reviewer = "guardian_subagent"` and `approval_policy = "on-request"` | Guardian on |
| Invalid admin config | `allowed_approvals_reviewers = []` | anything | Config load error |	2026-04-06 11:11:44 -07:00
joeytrasatti-openai	4ce97cef02	[codex-backend] Make thread metadata updates tolerate pending backfill (#16877 ) ### Summary Fix `thread/metadata/update` so it can still patch stored thread metadata when the list/backfill-gated `get_state_db(...)` path is unavailable. What was happening: - The app logs showed `thread/metadata/update` failing with `sqlite state db unavailable for thread ...`. - This was not isolated to one bad thread. Once the failure started for a user, branch metadata updates failed 100% of the time for that user. - Reports were staggered across users, which points at local app-server / local SQLite state rather than one global server-side failure. - Turns could still start immediately after the metadata update failed, which suggests the thread itself was valid and the failure was in the metadata endpoint DB-handle path. The fix: - Keep using the loaded thread state DB and the normal `get_state_db(...)` fallback first. - If that still returns `None`, open `StateRuntime::init(...)` directly for this targeted metadata update path. - Log the direct state runtime init error if that final fallback also fails, so future reports have the real DB-open cause instead of only the generic unavailable error. - Add a regression test where the DB exists but backfill is not complete, and verify `thread/metadata/update` can still repair the stored rollout thread and patch `gitInfo`. Relevant context / suspect PRs: - #16434 changed state DB startup to run auto-vacuum / incremental vacuum. This is the most suspicious timing match for per-user, staggered local SQLite availability failures. - #16433 dropped the old log table from the state DB, also near the timing window. - #13280 introduced this endpoint and made it rely on SQLite for git metadata without resuming the thread. - #14859 and #14888 added/consumed persisted model + reasoning effort metadata. 
I checked these because of the new thread metadata fields, but this failure happens before the endpoint reaches thread-row update/load logic, so they seem less likely as the direct cause. ### Testing - `cargo fmt -- --config imports_granularity=Item` completed; local stable rustfmt emitted warnings that `imports_granularity` is unstable - `cargo test -p codex-app-server thread_metadata_update` - `git diff --check`	2026-04-06 13:07:19 -04:00
Eric Traut	54dbbb839e	(tui): Decode percent-escaped bare local file links (#16810 ) Addresses #16622 Problem: bare local file links in TUI markdown render percent-encoded path bytes literally, unlike file:// links. Solution: decode bare path targets before local-path expansion and add regression coverage for spaces and Unicode.	2026-04-06 08:52:18 -07:00
Eric Traut	f44eb29181	Annotate skill doc reads with skill names (#16813 ) Addresses #16303 Problem: Skill doc reads render as plain `Read SKILL.md`, so the TUI hides which skill was opened. Solution: Best-effort annotate exact `SKILL.md` reads with the matching loaded skill name from `skills_all` before rendering exec cells. Before: ``` • Explored └ Read SKILL.md ``` After: ``` • Explored └ Read SKILL.md (pr-babysitter skill) ```	2026-04-06 08:51:34 -07:00
Eric Traut	4294031a93	Fix resume picker timestamp labels and stability (#16822 ) Problem: The resume picker used awkward "Created at" and "Updated at" headers, and its relative timestamps changed while navigating because they were recomputed on each redraw. Solution: Rename the headers to "Created" and "Updated", and anchor relative timestamp formatting to the picker load time so the displayed ages stay stable while browsing.	2026-04-06 08:51:13 -07:00
Eric Traut	fb41a79f37	[regression] Fix ephemeral turn backfill in exec (#16795 ) Addresses #16781 Problem: `codex exec --ephemeral` backfilled empty `turn/completed` items with `thread/read(includeTurns=true)`, which app-server rejects for ephemeral threads. This is a regression introduced in the recent conversion of "exec" to use app server rather than call the core directly. Solution: Skip turn-item backfill for ephemeral exec threads while preserving the existing recovery path for non-ephemeral sessions.	2026-04-06 08:45:58 -07:00
Eric Traut	ab58141e22	Fix TUI fast mode toggle regression (#16833 ) Addresses #16832 Problem: After `/fast on`, the TUI omitted an explicit service-tier clear on later turns, so `/fast off` left app-server sessions stuck on `priority` until restart. Solution: Always submit the current service tier with user turns, including an explicit clear when Fast mode is off, and add a regression test for the `/fast on` -> `/fast off` flow.	2026-04-06 08:43:35 -07:00
Eric Traut	82b061afb2	Fix CJK word navigation in the TUI composer (#16829 ) Addresses #16584 Problem: TUI word-wise cursor movement treated entire CJK runs as a single word, so Option/Alt+Left and Right skipped too far when editing East Asian text. Solution: Use Unicode word-boundary segments within each non-whitespace run so CJK text advances one segment at a time while preserving separator and delete-word behavior, and add regression coverage for CJK and mixed-script navigation. Testing: Manually tested solution by pasting text that includes CJK characters into the composer and confirmed that keyboard navigation worked correctly (after confirming it didn't prior to the change).	2026-04-06 08:37:42 -07:00
Thibault Sottiaux	624c69e840	[codex] add response proxy subagent header test (#16876 ) This adds end-to-end coverage for `responses-api-proxy` request dumps when Codex spawns a subagent and validates that the `x-codex-window-id` and `x-openai-subagent` are properly set.	2026-04-06 08:18:46 -07:00
Eric Traut	e65ee38579	Clarify `codex exec` approval help (#16888 ) Addresses #13614 Problem: `codex exec --help` implied that `--full-auto` also changed exec approval mode, even though non-interactive exec stays headless and does not support interactive approval prompts. Solution: clarify the `--full-auto` help text so it only describes the sandbox behavior it actually enables for `codex exec`.	2026-04-05 23:31:15 -07:00
Eric Traut	d9b899309d	Fix misleading codex exec help usage (#16881 ) Addresses #15535 Problem: `codex exec --help` advertised a second positional `[COMMAND]` even though `exec` only accepts a prompt or a subcommand. Solution: Override the `exec` usage string so the help output shows the two supported invocation forms instead of the phantom positional.	2026-04-05 22:09:19 -07:00
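
Note on ded559680d (#16701): the reviewer rules described in that commit message can be read as a small resolution function. The following is only a sketch under stated assumptions, not the codex-rs implementation. The names `Reviewer`, `Resolved`, `resolve_reviewer`, and `guardian_enabled` are hypothetical. It also assumes that an omitted `approvals_reviewer` behaves like `user`, and that no warning is emitted when the user set nothing; the commit message only specifies the warning for an explicitly set, disallowed reviewer.

```rust
/// Hypothetical types for illustration; not taken from the codex-rs crates.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Reviewer {
    User,
    GuardianSubagent,
}

#[derive(Debug, PartialEq, Eq)]
pub struct Resolved {
    pub reviewer: Reviewer,
    /// Startup warning text, present only when a user-set value was overridden.
    pub warning: Option<String>,
}

/// `allowed` models `allowed_approvals_reviewers` from requirements.toml
/// (`None` = key omitted). `requested` models `approvals_reviewer` from the
/// user's config.toml (`None` = key omitted).
pub fn resolve_reviewer(
    allowed: Option<&[Reviewer]>,
    requested: Option<Reviewer>,
) -> Result<Resolved, String> {
    // Assumption: an omitted user value defaults to manual (`user`) review.
    let wanted = requested.unwrap_or(Reviewer::User);

    let allowed = match allowed {
        // Admin left Guardian optional: the user's choice stands.
        None => return Ok(Resolved { reviewer: wanted, warning: None }),
        Some(list) => list,
    };

    // An empty allow-list is an invalid admin config.
    let first = match allowed.first() {
        Some(first) => *first,
        None => return Err("allowed_approvals_reviewers must not be empty".to_string()),
    };

    if allowed.contains(&wanted) {
        return Ok(Resolved { reviewer: wanted, warning: None });
    }

    // Fall back to the first allowed reviewer; warn only if the user asked
    // for something explicitly (assumption, see lead-in).
    let warning = requested
        .map(|r| format!("approvals_reviewer {r:?} is not allowed; using {first:?}"));
    Ok(Resolved { reviewer: first, warning })
}

/// Guardian is on only when the effective reviewer is the guardian subagent
/// and the effective approval policy is "on-request".
pub fn guardian_enabled(reviewer: Reviewer, approval_policy: &str) -> bool {
    reviewer == Reviewer::GuardianSubagent && approval_policy == "on-request"
}
```

For example, with an allow-list of only `User` and a requested `GuardianSubagent`, the sketch resolves to `User` and carries a warning; with an empty allow-list it returns an error, matching the "Invalid admin config" case.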


4551 Commits
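
Note on 54dbbb839e (#16810): that commit describes decoding percent-escaped bare path targets before local-path expansion, covering spaces and Unicode. A minimal standard-library sketch of such a decoding step is shown here. The function names `percent_decode_path` and `hex_val` are hypothetical, and the choices to keep malformed escapes literally and to return the input unchanged when the decoded bytes are not valid UTF-8 are assumptions, not behavior confirmed by the commit.

```rust
/// Decode `%XX` escapes in a bare local path target.
/// Hypothetical helper for illustration; not the codex-tui implementation.
pub fn percent_decode_path(target: &str) -> String {
    let bytes = target.as_bytes();
    let mut out = Vec::with_capacity(bytes.len());
    let mut i = 0;
    while i < bytes.len() {
        // A valid escape needs two hex digits after the '%'.
        if bytes[i] == b'%' && i + 2 < bytes.len() {
            if let (Some(hi), Some(lo)) = (hex_val(bytes[i + 1]), hex_val(bytes[i + 2])) {
                out.push(hi * 16 + lo);
                i += 3;
                continue;
            }
        }
        // Not an escape (or malformed): keep the byte as-is.
        out.push(bytes[i]);
        i += 1;
    }
    // Assumption: if decoding yields invalid UTF-8, keep the original text.
    match String::from_utf8(out) {
        Ok(decoded) => decoded,
        Err(_) => target.to_string(),
    }
}

/// Map one ASCII hex digit to its value, or `None` for any other byte.
fn hex_val(b: u8) -> Option<u8> {
    match b {
        b'0'..=b'9' => Some(b - b'0'),
        b'a'..=b'f' => Some(b - b'a' + 10),
        b'A'..=b'F' => Some(b - b'A' + 10),
        _ => None,
    }
}
```

Under these assumptions, `percent_decode_path("docs/My%20File.md")` yields `docs/My File.md`, and a name such as `100%.txt` passes through unchanged because the `%` is not followed by two hex digits.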