codex

mirror of https://github.com/openai/codex.git synced 2026-05-02 20:32:04 +03:00

Author	SHA1	Message	Date
Cooper Gamble	46615cc195	[codex-core] Trim inline compaction coverage [ci changed_files] - reduce the server-side compaction test matrix to the highest-signal cases - add comments around the deferred checkpoint rewrite and inline/preflight split Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	d298fbd6bb	[codex-core] Prefer full inline compaction prefix [ci changed_files] Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	9337146281	[codex-core] Avoid duplicate inline compaction context [ci changed_files] Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	488b53bceb	[codex-core] Fix inline compaction commit semantics [ci changed_files] Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	7065f6fa1d	[codex-core] Preserve raw inline compaction event order [ci changed_files] Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	29ddeb71ad	[codex-core] Fix inline compaction event ordering and token accounting [ci changed_files] Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	ab9a474c86	[codex-core] Preserve inline compaction checkpoint ordering [ci changed_files] Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	8da9b522f5	[codex-core] Preserve inline compaction turn prompt state [ci changed_files] Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	839143c6c4	[codex-core] Fix inline compaction checkpoint history layout [ci changed_files] Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	d9916b5e6d	[codex-core] Fix inline compaction retry and model-switch preflight [ci changed_files] Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	9d20427fc2	[core] Fix empty-history inline summary replacement [ci changed_files] Handle repeated inline compactions on turns that started from empty history by stripping leading compaction items after prefix calculation, and add regression coverage for the fresh-session case. Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	bdadb141f2	[core] Always inline OpenAI auto-compaction [ci changed_files] Ignore compact_prompt for OpenAI inline auto-compaction, remove the legacy compat downgrade path, and keep /compact on the point-in-time endpoint. Also skip previous-model preflight remote compaction when inline server-side compaction is available.\n\nCo-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	90ea0076b9	[codex-core] Remove redundant ThreadId clone in compaction test [ci changed_files] Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	2bcf6ebaa3	[core] Preserve inline compaction turn state [ci changed_files] Preserve current-turn history when inline compaction downgrades fail and replace prior same-turn compaction checkpoints instead of stacking them. Tests: - cargo test -p codex-core codex::tests::build_server_side_compaction_replacement_history_keeps_current_turn_inputs -- --exact - cargo test -p codex-core codex::tests::build_server_side_compaction_replacement_history_replaces_prior_same_turn_summary -- --exact - cargo test -p codex-core codex::tests::downgrade_known_inline_compaction_error_restores_current_turn_when_fallback_fails -- --exact Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	e54a4ae5e2	[codex-core] Refresh config schema for server-side compaction [ci changed_files] Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	c3a7189a0e	[codex-core] Harden inline compaction checkpoint handling [ci changed_files] Keep current-turn inputs in local inline compaction checkpoints and remember known backend incompatibilities after a compat downgrade so later turns skip the failed inline request path. Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	dea81dba3a	[codex-core] Preserve ghost snapshots in compaction fallback [ci changed_files] Keep same-turn ghost snapshots when pre-turn inline compaction downgrades to the legacy client-side path so undo state survives compatibility fallback. Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
Cooper Gamble	ea2a4be3d8	[codex-core] Add feature-flagged server-side compaction [ci changed_files] Move normal auto-compaction onto inline Responses API compaction behind a feature flag, keep the legacy path for manual and compatibility cases, and add observability plus integration coverage. Co-authored-by: Codex <noreply@openai.com>	2026-03-11 02:48:35 +00:00
pakrym-oai	816e447ead	Add snippets annotated with types to tools when code mode enabled (#14284 ) Main purpose is for code mode to understand the return type.	2026-03-10 19:20:15 -07:00
Ahmed Ibrahim	cc417c39a0	Split spawn_csv from multi_agent (#14282 ) - make `spawn_csv` a standalone feature for CSV agent jobs - keep `spawn_csv -> multi_agent` one-way and preserve restricted subagent disable paths	2026-03-11 01:42:50 +00:00
Ahmed Ibrahim	5b10b93ba2	Add realtime start instructions config override (#14270 ) - add `realtime_start_instructions` config support - thread it into realtime context updates, schema, docs, and tests	2026-03-10 18:42:05 -07:00
pakrym-oai	566897d427	Make unified exec session_id numeric (#14279 ) It's a number on the write_stdin input, make it a number on the output and also internally.	2026-03-10 18:38:39 -07:00
pakrym-oai	24b8d443b8	Prefix code mode output with success or failure message and include error stack (#14272 )	2026-03-10 18:33:52 -07:00
Ahmed Ibrahim	3f7cb03043	Stabilize websocket response.failed error delivery (#14017 ) ## What changed - Drop failed websocket connections immediately after a terminal stream error instead of awaiting a graceful close handshake before forwarding the error to the caller. - Keep the success path and the closed-connection guard behavior unchanged. ## Why this fixes the flake - The failing integration test waits for the second websocket stream to surface the model error before issuing a follow-up request. - On slower runners, the old error path awaited `ws_stream.close().await` before sending the error downstream. If that close handshake stalled, the test kept waiting for an error that had already happened server-side and nextest timed it out. - Dropping the failed websocket immediately makes the terminal error observable right away and marks the session closed so the next request reconnects cleanly instead of depending on a best-effort close handshake. ## Code or test? - This is a production logic fix in `codex-api`. The existing websocket integration test already exercises the regression path.	2026-03-10 17:59:41 -07:00
Ahmed Ibrahim	567ad7fafd	Show spawned agent model and effort in TUI (#14273 ) - include the requested sub-agent model and reasoning effort in the spawn begin event\n- render that metadata next to the spawned agent name and role in the TUI transcript --------- Co-authored-by: Codex <noreply@openai.com>	2026-03-11 00:46:25 +00:00
pakrym-oai	37f51382fd	Rename code mode tool to exec (#14254 ) Summary - update the code-mode handler, runner, instructions, and error text to refer to the `exec` tool name everywhere that used to say `code_mode` - ensure generated documentation strings and tool specs describe `exec` and rely on the shared `PUBLIC_TOOL_NAME` - refresh the suite tests so they invoke `exec` instead of the old name Testing - Not run (not requested)	2026-03-11 00:30:16 +00:00
maja-openai	16daab66d9	prompt changes to guardian (#14263 ) ## Summary - update the guardian prompting - clarify the guardian rejection message so an action may still proceed if the user explicitly approves it after being informed of the risk ## Testing - cargo run on selected examples	2026-03-10 17:05:43 -07:00
Celia Chen	295b56bece	chore: add a separate reject-policy flag for skill approvals (#14271 ) ## Summary - add `skill_approval` to `RejectConfig` and the app-server v2 `AskForApproval::Reject` payload so skill-script prompts can be configured independently from sandbox and rule-based prompts - update Unix shell escalation to reject prompts based on the actual decision source, keeping prefix rules tied to `rules`, unmatched command fallbacks tied to `sandbox_approval`, and skill scripts tied to `skill_approval` - regenerate the affected protocol/config schemas and expand unit/integration coverage for the new flag and skill approval behavior	2026-03-10 23:58:23 +00:00
pakrym-oai	18199d4e0e	Add store/load support for code mode (#14259 ) adds support for transferring state across code mode invocations.	2026-03-10 16:53:53 -07:00
Rasmus Rygaard	f8ef154a6b	Pass more params to compaction (#14247 ) Pass more params to /compact. This should give us parity with the /responses endpoint to improve caching. I'm torn about the MCP await. Blocking will give us parity but it seems like we explicitly don't block on MCPs. Happy either way	2026-03-10 16:39:57 -07:00
Leo Shimonaka	de2a73cd91	feat: Add additional macOS Sandbox Permissions for Launch Services, Contacts, Reminders (#14155 ) Add additional macOS Sandbox Permissions levers for the following: - Launch Services - Contacts - Reminders	2026-03-10 23:34:47 +00:00
pakrym-oai	8b33485302	Add code_mode output helpers for text and images (#14244 ) Summary - document how code-mode can import `output_text`/`output_image` and ensure `add_content` stays compatible - add a synthetic `@openai/code_mode` module that appends content items and validates inputs - cover the new behavior with integration tests for structured text and image outputs Testing - Not run (not requested)	2026-03-10 16:25:27 -07:00
Ahmed Ibrahim	bf936fa0c1	Clarify close_agent tool description (#14269 ) - clarify the `close_agent` tool description so it nudges models to close agents they no longer need - keep the change scoped to the tool spec text only Co-authored-by: Codex <noreply@openai.com>	2026-03-10 16:25:08 -07:00
gabec-openai	b73228722a	Load agent metadata from role files (#14177 )	2026-03-10 16:21:48 -07:00
pakrym-oai	e791559029	Add model-controlled truncation for code mode results (#14258 ) Summary - document that `@openai/code_mode` exposes `set_max_output_tokens_per_exec_call` and that `code_mode` truncates the final Rust-side output when the budget is exceeded - enforce the configured budget in the Rust tool runner, reusing truncation helpers so text-only outputs follow the unified-exec wrapper and mixed outputs still fit within the limit - ensure the new behavior is covered by a code-mode integration test and string spec update Testing - Not run (not requested)	2026-03-10 15:57:14 -07:00
pakrym-oai	c7e28cffab	Add output schema to MCP tools and expose MCP tool results in code mode (#14236 ) Summary - drop `McpToolOutput` in favor of `CallToolResult`, moving its helpers to keep MCP tooling focused on the final result shape - wire the new schema definitions through code mode, context, handlers, and spec modules so MCP tools serialize the exact output shape expected by the model - extend code mode tests to cover multiple MCP call scenarios and ensure the serialized data matches the new schema - refresh JS runner helpers and protocol models alongside the schema changes Testing - Not run (not requested)	2026-03-10 15:25:19 -07:00
Won Park	28934762d0	unifying all image saves to /tmp to bug-proof (#14149 ) image-gen feature will have the model saving to /tmp by default + at all times	2026-03-10 15:13:12 -07:00
Ahmed Ibrahim	2895d3571b	Add spawn_agent model overrides (#14160 ) - add `model` and `reasoning_effort` to the `spawn_agent` schema so the values pass through - validate requested models against `model.model` and only check that the selected model supports the requested reasoning effort --------- Co-authored-by: Codex <noreply@openai.com>	2026-03-10 14:04:04 -07:00
xl-openai	2544bd02a2	feat: Allow sync with remote plugin status. (#14176 ) Add forceRemoteSync to plugin/list. When it is set to True, we will sync the local plugin status with the remote one (backend-api/plugins/list).	2026-03-10 13:32:59 -07:00
Matthew Zeng	bda9e55c7e	add(core): arc_monitor (#13936 ) ## Summary - add ARC monitor support for MCP tool calls by serializing MCP approval requests into the ARC action shape and sending the relevant conversation/policy context to the `/api/codex/safety/arc` endpoint - route ARC outcomes back into MCP approval flow so `ask-user` falls back to a user prompt and `steer-model` blocks the tool call, with guardian/ARC tests covering the new request shape - update the TUI approval copy from “Approve Once” to “Allow” / “Allow for this session” and refresh the related snapshots --------- Co-authored-by: Fouad Matin <fouad@openai.com> Co-authored-by: Fouad Matin <169186268+fouad-openai@users.noreply.github.com>	2026-03-10 13:16:47 -07:00
pakrym-oai	46e6661d4e	Reuse McpToolOutput in McpHandler (#14229 ) We already have a type to represent the MCP tool output, reuse it instead of the custom McpHandlerOutput	2026-03-10 10:41:41 -07:00
pakrym-oai	e52afd28b0	Expose strongly-typed result for exec_command (#14183 ) Summary - document output types for the various tool handlers and registry so the API exposes richer descriptions - update unified execution helpers and client tests to align with the new output metadata - clean up unused helpers across tool dispatch paths Testing - Not run (not requested)	2026-03-10 09:54:34 -07:00
Eric Traut	e4edafe1a8	Log ChatGPT user ID for feedback tags (#13901 ) There are some bug investigations that currently require us to ask users for their user ID even though they've already uploaded logs and session details via `/feedback`. This frustrates users and increases the time for diagnosis. This PR includes the ChatGPT user ID in the metadata uploaded for `/feedback` (both the TUI and app-server).	2026-03-10 09:57:41 -06:00
Eric Traut	9a501ddb08	Fix Linux tmux segfault in user shell lookup (#13900 ) Replace the Unix shell lookup path in `codex-rs/core/src/shell.rs` to use `libc::getpwuid_r()` instead of `libc::getpwuid()` when resolving the current user's shell. Why: - `getpwuid()` can return pointers into libc-managed shared storage - on the musl static Linux build, concurrent callers can race on that storage - this matches the crash pattern reported in tmux/Linux sessions with parallel shell activity Refs: - Fixes #13842	2026-03-10 09:57:18 -06:00
Eric Traut	b90921eba8	Fix release-mode integration test compiler failure (#13603 ) Addresses #13586 This doesn't affect our CI scripts. It was user-reported. Summary - add `wiremock::ResponseTemplate` and `body_string_contains` imports behind `#[cfg(not(debug_assertions))]` in `codex-rs/core/tests/suite/view_image.rs` so release builds only pull the helpers they actually use	2026-03-10 08:30:56 -06:00
Ahmed Ibrahim	6b7253b123	Fix unified exec test output assertion (#14184 ) ## Summary - update the unified exec test to use truncated_output() instead of the removed output field - fix the compile failure on latest main after ExecCommandToolOutput changed shape	2026-03-09 23:12:36 -07:00
Ahmed Ibrahim	aa6a57dfa2	Stabilize incomplete SSE retry test (#13879 ) ## What changed - The retry test now uses the same streaming SSE test server used by production-style tests instead of a wiremock sequence. - The fixture is resolved via `find_resource!`, and the test asserts that exactly two outbound requests were sent. ## Why this fixes the flake - The old wiremock sequence approximated early-close behavior, but it did not reproduce the same streaming semantics the real client sees. - That meant the retry path depended on mock implementation details instead of on the actual transport behavior we care about. - Switching to the streaming SSE helper makes the test exercise the real early-close/retry contract, and counting requests directly verifies that we retried exactly once rather than merely hoping the sequence aligned. ## Scope - Test-only change.	2026-03-09 22:34:44 -07:00
Ahmed Ibrahim	2e24be2134	Use realtime transcript for handoff context (#14132 ) - collect input/output transcript deltas into active handoff transcript state - attach and clear that transcript on each handoff, and regenerate schema/tests	2026-03-09 22:30:03 -07:00
Channing Conger	c6343e0649	Implemented thread-level atomic elicitation counter for stopwatch pausing (#12296 ) ### Purpose While trying to build out CLI-Tools for the agent to use under skills we have found that those tools sometimes need to invoke a user elicitation. These elicitations are handled out of band of the codex app-server but need to indicate to the exec manager that the command running is not going to progress on the usual timeout horizon. ### Example Model calls universal exec: `$ download-credit-card-history --start-date 2026-01-19 --end-date 2026-02-19 > credit_history.jsonl` download-cred-card-history might hit a hosted/preauthenticated service to fetch data. That service might decide that the request requires an end user approval the access to the personal data. It should be able to signal to the running thread that the command in question is blocked on user elicitation. In that case we want the exec to continue, but the timeout to not expire on the tool call, essentially freezing time until the user approves or rejects the command at which point the tool would signal the app-server to decrement the outstanding elicitation count. Now timeouts would proceed as normal. ### What's Added - New v2 RPC methods: - thread/increment_elicitation - thread/decrement_elicitation - Protocol updates in: - codex-rs/app-server-protocol/src/protocol/common.rs - codex-rs/app-server-protocol/src/protocol/v2.rs - App-server handlers wired in: - codex-rs/app-server/src/codex_message_processor.rs ### Behavior - Counter starts at 0 per thread. - increment atomically increases the counter. - decrement atomically decreases the counter; decrement at 0 returns invalid request. - Transition rules: - 0 -> 1: broadcast pause state, pausing all active stopwatches immediately. - \>0 -> >0: remain paused. - 1 -> 0: broadcast unpause state, resuming stopwatches. - Core thread/session logic: - codex-rs/core/src/codex_thread.rs - codex-rs/core/src/codex.rs - codex-rs/core/src/mcp_connection_manager.rs ### Exec-server stopwatch integration - Added centralized stopwatch tracking/controller: - codex-rs/exec-server/src/posix/stopwatch_controller.rs - Hooked pause/unpause broadcast handling + stopwatch registration: - codex-rs/exec-server/src/posix/mcp.rs - codex-rs/exec-server/src/posix/stopwatch.rs - codex-rs/exec-server/src/posix.rs	2026-03-09 22:29:26 -07:00
Matthew Zeng	566e4cee4b	[apps] Fix apps enablement condition. (#14011 ) - [x] Fix apps enablement condition to check both the feature flag and that the user is not an API key user.	2026-03-09 22:25:43 -07:00

1 2 3 4 5 ...

2181 Commits