codex

mirror of https://github.com/openai/codex.git synced 2026-05-01 03:42:05 +03:00

Author	SHA1	Message	Date
Owen Lin	3582b74d01	fix(auth): isolate chatgptAuthTokens concept to auth manager and app-server (#10423 ) So that the rest of the codebase (like TUI) don't need to be concerned whether ChatGPT auth was handled by Codex itself or passed in via app-server's external auth mode.	2026-02-05 10:46:06 -08:00
Felipe Coury	b0e5a6305b	feat(tui): add /statusline command for interactive status line configuration (#10546 ) ## Summary - Adds a new `/statusline` command to configure TUI footer status line - Introduces reusable `MultiSelectPicker` component with keyboard navigation, optional ordering and toggle support - Implement status line setup modal that persist configuration to config.toml ## Status Line Items The following items can be displayed in the status line: - Model: Current model name (with optional reasoning level) - Context: Remaining/used context window percentage - Rate Limits: 5-day and weekly usage limits - Git: Current branch (with optimized lookups) - Tokens: Used tokens, input/output token counts - Session: Session ID (full or shortened prefix) - Paths: Current directory, project root - Version: Codex version ## Features - Live preview while configuring status line items - Fuzzy search filtering in the picker - Intelligent truncation when items don't fit - Items gracefully omit when data is unavailable - Configuration persists to `config.toml` - Validates and warns about invalid status line items ## Test plan - [x] Run `/statusline` and verify picker UI appears - [x] Toggle items on/off and verify live preview updates - [x] Confirm selection persists after restart - [x] Verify truncation behavior with many items selected - [x] Test git branch detection in and out of git repos --------- Co-authored-by: Josh McKinney <joshka@openai.com>	2026-02-05 08:50:21 -08:00
gt-oai	3b54fd7336	Add hooks implementation and wire up to `notify` (#9691 ) This introduces a `Hooks` service. It registers hooks from config and dispatches hook events at runtime. N.B. The hook config is not wired up to this yet. But for legacy reasons, we wire up `notify` from config and power it using hooks now. Nothing about the `notify` interface has changed. I'd start by reviewing `hooks/types.rs` Some things to note: - hook names subject to change - no hook result yet - stopping semantics yet to be introduced - additional hooks yet to be introduced	2026-02-05 16:49:35 +00:00
jif-oai	9ee746afd6	Leverage state DB metadata for thread summaries (#10621 ) Summary: - read conversation summaries and cwd info from the state DB when possible so we no longer rely on rollout files for metadata and avoid extra I/O - persist CLI version in thread metadata, surface it through summary builders, and add the necessary DB migration hooks - simplify thread listing by using enriched state DB data directly rather than reading rollout heads Testing: - Not run (not requested)	2026-02-05 16:39:11 +00:00
jif-oai	68e82e5dc9	nit: add DB version is discrepancy recording (#10762 )	2026-02-05 16:24:18 +00:00
jif-oai	901215e310	feat: repair DB in case of missing lines (#10751 )	2026-02-05 16:21:49 +00:00
jif-oai	41f3b1ba0b	feat: add memory tool (#10637 ) Add a tool for memory to retrieve a full memory based on the memory ID	2026-02-05 16:16:31 +00:00
jif-oai	4033f905c6	feat: resumable backfill (#10745 ) ## Summary This PR makes SQLite rollout backfill resumable and repeatable instead of one-shot-on-db-create. ## What changed - Added a persisted backfill state table: - state/migrations/0008_backfill_state.sql - Tracks status (pending\|running\|complete), last_watermark, and last_success_at. - Added backfill state model/types in codex-state: - BackfillState, BackfillStatus (state/src/model/backfill_state.rs) - Added runtime APIs to manage backfill lifecycle/progress: - get_backfill_state - mark_backfill_running - checkpoint_backfill - mark_backfill_complete - Updated core startup behavior: - Backfill now runs whenever state is not Complete (not only when DB file is newly created). - Reworked backfill execution: - Collect rollout files, derive deterministic watermark per path, sort, resume from last_watermark. - Process in batches (BACKFILL_BATCH_SIZE = 200), checkpoint after each batch. - Mark complete with last_success_at at the end. ## Why Previous behavior could leave users permanently partially backfilled if the process exited during initial async backfill. This change allows safe continuation across restarts and avoids restarting from scratch.	2026-02-05 14:34:34 +00:00
jif-oai	040ecee715	Update explorer role default model (#10748 ) Summary - switch the explorer role in core agent configuration to use `gpt-5.1-codex-mini` as the default model override - leave other role defaults untouched Testing - Not run (not requested)	2026-02-05 13:51:53 +00:00
pap-openai	b2424cb635	adding fork information (UI) when forking (#10246 ) - shows `/fork` command that ran in prev session - shows `session forked from name (uuid) \|\| uuid (if name is not set)` as an event in new session	2026-02-05 13:24:55 +00:00
jif-oai	aa46b5cf99	nit: backfill stronger (#10738 )	2026-02-05 12:30:16 +00:00
jif-oai	97582ac52d	Allow user shell commands to run alongside active turns (#10513 ) Summary - refactor user shell command execution into a shared helper and add modes for standalone vs active-turn execution - run user shell commands asynchronously when a turn is already active so they don’t replace or abort the current turn - extend the tests to cover the new behavior and add the generated Codex environment manifest Testing - Not run (not requested)	2026-02-05 11:11:00 +00:00
Dylan Hurd	fe8b474acd	fix(core,app-server) resume with different model (#10719 ) ## Summary When resuming with a different model, we should also append a developer message with the model instructions ## Testing - [x] Added unit tests	2026-02-05 00:40:05 -08:00
Charley Cunningham	dc7007beaa	Fix remote compaction estimator/payload instruction small mismatch (#10692 ) ## Summary This PR fixes a deterministic mismatch in remote compaction where pre-trim estimation and the `/v1/responses/compact` payload could use different base instructions. Before this change: - pre-trim estimation used model-derived instructions (`model_info.get_model_instructions(...)`) - compact payload used session base instructions (`sess.get_base_instructions()`) After this change: - remote pre-trim estimation and compact payload both use the same `BaseInstructions` instance from session state. ## Changes - Added a shared estimator entry point in `ContextManager`: - `estimate_token_count_with_base_instructions(&self, base_instructions: &BaseInstructions) -> Option<i64>` - Kept `estimate_token_count(&TurnContext)` as a thin wrapper that resolves model/personality instructions and delegates to the new helper. - Updated remote compaction flow to fetch base instructions once and reuse it for both: - trim preflight estimation - compact request payload construction - Added regression coverage for parity and behavior: - unit test verifying explicit-base estimator behavior - integration test proving remote compaction uses session override instructions and trims accordingly ## Why this matters This removes a deterministic divergence source where pre-trim could think the request fits while the actual compact request exceeded context because its instructions were longer/different. ## Scope In scope: - estimator/payload base-instructions parity in remote compaction Out of scope: - retry-on-`context_length_exceeded` - compaction threshold/headroom policy changes - broader trimming policy changes ## Codex author: `codex fork 019c2b24-c2df-7b31-a482-fb8cf7a28559`	2026-02-04 23:24:06 -08:00
Ahmed Ibrahim	cd5f49a619	Make steer stable by default (#10690 ) Promotes the Steer feature from Experimental to Stable and enables it by default. ## What is Steer mode? Steer mode changes how message submission works in the TUI: - With Steer enabled (new default): - `Enter` submits messages immediately, even when a task is running - `Tab` queues messages when a task is running (allows building up a queue) - With Steer disabled (old behavior): - `Enter` queues messages when a task is running - This preserves the previous "queue while a task is running" behavior ## How Steer vs Queue work The key difference is in the submission behavior: 1. Steer mode (`steer_enabled = true`): - Enter → `InputResult::Submitted` → sends immediately via `submit_user_message()` - Tab → `InputResult::Queued` → queues via `queue_user_message()` if a task is running - This gives users direct control: Enter for immediate submission, Tab for queuing 2. Queue mode (`steer_enabled = false`, previous default): - Enter → `InputResult::Queued` → always queues when a task is running - Tab → `InputResult::Queued` → queues when a task is running - This preserves the original behavior where Enter respects the running task queue ## Implementation details The behavior is controlled in `ChatComposer::handle_key_event_without_popup()`: - When `steer_enabled` is true, Enter calls `handle_submission(false)` (submit immediately) - When `steer_enabled` is false, Enter calls `handle_submission(true)` (queue) See `codex-rs/tui/src/bottom_pane/chat_composer.rs` for the implementation. ## Documentation For more details on the chat composer behavior, see: - [TUI Chat Composer documentation](docs/tui-chat-composer.md) - Feature flag definition: `codex-rs/core/src/features.rs`	2026-02-04 23:12:59 -08:00
Charley Cunningham	41b4962b0a	Sync collaboration mode naming across Default prompt, tools, and TUI (#10666 ) ## Summary - add shared `ModeKind` helpers for display names, TUI visibility, and `request_user_input` availability - derive TUI mode filtering/labels from shared `ModeKind` metadata instead of local hardcoded matches - derive `request_user_input` availability text and unavailable error mode names from shared mode metadata - replace hardcoded known mode names in the Default collaboration-mode template with `{{KNOWN_MODE_NAMES}}` and fill it from `TUI_VISIBLE_COLLABORATION_MODES` - add regression tests for mode metadata sync and placeholder replacement ## Notes - `cargo test -p codex-core` integration target (`tests/all`) still shows pre-existing env-specific failures in this environment due missing `test_stdio_server` binary resolution; core unit tests are green. ## Codex author `codex resume 019c26ff-dfe7-7173-bc04-c9e1fff1e447`	2026-02-04 23:03:28 -08:00
Dylan Hurd	e482978261	fix(core) switching model appends model instructions (#10651 ) ## Summary When switching models, we should append the instructions of the new model to the conversation as a developer message. ## Test - [x] Adds a unit test	2026-02-05 05:50:38 +00:00
Dylan Hurd	a05aadfa1b	chore(config) Default Personality Pragmatic (#10705 ) ## Summary Switch back to Pragmatic personality ## Testing - [x] Updated unit tests	2026-02-04 21:22:47 -08:00
Dylan Hurd	73f32840c6	chore(core) personality migration tests (#10650 ) ## Summary Adds additional tests for personality edge cases ## Testing - [x] These are tests	2026-02-04 19:03:14 -08:00
gt-oai	1f47e08d66	Cloud Requirements: increase timeout and retries (#10631 ) Add retries and an increased-length timeout for loading Cloud Requirements. Co-authored-by: alexsong-oai <alexsong@openai.com>	2026-02-05 01:52:12 +00:00
Josh McKinney	cddfd1e675	feat(core): add configurable log_dir (#10678 ) Adds a top-level `log_dir` config key (defaults to `$CODEX_HOME/log`) so one-off runs can redirect `codex-tui.log` via `-c`, e.g.: codex -c log_dir=./.codex-log Also resolves relative paths in CLI `-c/--config` overrides for `AbsolutePathBuf` values against the effective cwd (when available). Tests: - cargo test -p codex-core	2026-02-05 01:23:30 +00:00
pakrym-oai	0e8d359da9	Session-level model client (#10664 ) Make ModelClient a session-scoped object. Move state that is session level onto the client, and make state that is per-turn explicit on corresponding methods. Stop taking a huge Config object, instead only pass in values that are actually needed. --------- Co-authored-by: Josh McKinney <joshka@openai.com>	2026-02-04 16:58:48 -08:00
Owen Lin	5ea107a088	feat(app-server, core): allow text + image content items for dynamic tool outputs (#10567 ) Took over the work that @aaronl-openai started here: https://github.com/openai/codex/pull/10397 Now that app-server clients are able to set up custom tools (called `dynamic_tools` in app-server), we should expose a way for clients to pass in not just text, but also image outputs. This is something the Responses API already supports for function call outputs, where you can pass in either a string or an array of content outputs (text, image, file): https://platform.openai.com/docs/api-reference/responses/create#responses_create-input-input_item_list-item-function_tool_call_output-output-array-input_image So let's just plumb it through in Codex (with the caveat that we only support text and image for now). This is implemented end-to-end across app-server v2 protocol types and core tool handling. ## Breaking API change NOTE: This introduces a breaking change with dynamic tools, but I think it's ok since this concept was only recently introduced (https://github.com/openai/codex/pull/9539) and it's better to get the API contract correct. I don't think there are any real consumers of this yet (not even the Codex App). Old shape: `{ "output": "dynamic-ok", "success": true }` New shape: ``` { "contentItems": [ { "type": "inputText", "text": "dynamic-ok" }, { "type": "inputImage", "imageUrl": "data:image/png;base64,AAA" } ] "success": true } ```	2026-02-04 16:12:47 -08:00
Ahmed Ibrahim	f9c38f531c	add none personality option (#10688 ) - add none personality enum value and empty placeholder behavior\n- add docs/schema updates and e2e coverage	2026-02-04 15:40:33 -08:00
Eric Traut	7bcc552325	Added support for live updates to skills (#10478 ) Add a centralized FileWatcher in codex-core (using notify) that watches skill roots from the config layer stack (recursive) Send `SkillsChanged` events when relevant file system changes are detected On `SkillsChanged`: * Invalidate the skills cache immediately in ThreadManager * Emit EventMsg::SkillsUpdateAvailable to active sessions ~~* Broadcast a new app-server notification: SkillsListUpdatedNotification~~ This change does not inject new items into the event stream. That means the agent will not know about new skills, so it won't be able to implicitly invoke new skills. It also won't know about changes to existing skills, so if it has already read the contents of a modified skill, it will not honor the new behavior. This change also does not detect modifications to AGENTS.md. I plan to address these limitations in a follow-on PR modeled after #9985. Injection of new skills and AGENTS was deemed to risky, hence the need to split the feature into two stages. The changes in this PR were designed to easily accommodate the second stage once we have some other foundational changes in place. Testing: In addition to automated tests, I did manual testing to confirm that newly-created skills, deleted skills, and renamed skills are reflected in the TUI skill picker menu. Also confirmed that modifications to behaviors for explicitly-invoked skills are honored. --------- Co-authored-by: Xin Lin <xl@openai.com>	2026-02-04 15:25:03 -08:00
gt-oai	d452bb3ae5	Add /debug-config slash command (#10642 ) <img width="409" height="175" alt="image" src="https://github.com/user-attachments/assets/76efe9c5-8375-4af3-b6af-bd9e162c1bc3" />	2026-02-04 22:26:17 +00:00
Matthew Zeng	acdbd8edc5	[apps] Cache MCP actions from apps. (#10662 ) MCP actions take a long time to load for users with lots of apps installed. Adding a cache for these actions with 1hr expiration, given that they are almost always aren't going to change unless people install another app, which means they also need to restart codex to pick it up.	2026-02-04 13:51:57 -08:00
jif-oai	4922b3e571	feat: add phase 1 mem db (#10634 ) - Schema: thread_id (PK, FK to threads.id with cascade delete), trace_summary, memory_summary, updated_at. - Migration: creates the table and an index on (updated_at DESC, thread_id DESC) for efficient recent-first reads. - Runtime API (DB-only): - `get_thread_memory(thread_id)`: fetch one memory row. - `upsert_thread_memory(thread_id, trace_summary, memory_summary)`: insert/update by thread id and always advance updated_at. - `get_last_n_thread_memories_for_cwd(cwd, n)`: join thread_memory with threads and return newest n rows for an exact cwd match. - Model layer: introduced ThreadMemory and row conversion types to keep query decoding typed and consistent with existing state models.	2026-02-04 21:38:39 +00:00
Ahmed Ibrahim	7a253076fe	Persist pending input user events (#10656 ) - Persist user-message events for mid-turn injected input by emitting user message turn items when pending input is recorded.	2026-02-04 11:47:10 -08:00
viyatb-oai	ae4de43ccc	feat(linux-sandbox): add bwrap support (#9938 ) ## Summary This PR introduces a gated Bubblewrap (bwrap) Linux sandbox path. The curent Linux sandbox path relies on in-process restrictions (including Landlock). Bubblewrap gives us a more uniform filesystem isolation model, especially explicit writable roots with the option to make some directories read-only and granular network controls. This is behind a feature flag so we can validate behavior safely before making it the default. - Added temporary rollout flag: - `features.use_linux_sandbox_bwrap` - Preserved existing default path when the flag is off. - In Bubblewrap mode: - Added internal retry without /proc when /proc mount is not permitted by the host/container.	2026-02-04 11:13:17 -08:00
gt-oai	1b153a3d4a	Cloud Requirements: take precedence over MDM (#10633 ) Cloud Requirements should be applied before MDM requirements.	2026-02-04 18:40:56 +00:00
jif-oai	e9335374b9	feat: add phase 1 mem client (#10629 ) Adding a client on top of https://github.com/openai/openai/pull/672176	2026-02-04 17:59:36 +00:00
jif-oai	71e63f8d10	fix: flaky test (#10644 )	2026-02-04 17:59:22 +00:00
canvrno-oai	282f42c0ce	Add option to approve and remember MCP/Apps tool usage (#10584 ) This PR adds a new approval option for app/MCP tool calls: “Allow and remember” (session-scoped). When selected, Codex stores a temporary approval and auto-approves matching future calls for the rest of the session. Added a session-scoped approval key (`server`, `connector_id`, `tool_name`) and persisted it in `tool_approvals` as `ApprovedForSession`. On subsequent matching calls, approval is skipped and treated as accepted. - Updated the approval question options to conditionally include: - Accept - Allow and remember (conditional) - Decline - Cancel The new “Allow and remember” option is only shown when all of these are true: 1. The call is routed through the Codex Apps MCP server (codex_apps). 2. The tool requires approval based on annotations: - read_only_hint == false, and - destructive_hint == true or open_world_hint == true. 3. The tool includes a connector_id in metadata (used to build the remembered approval key). If no `connector_id` is present, the prompt still appears (when approval is required), but only with the existing choices (Accept / Decline / Cancel). Approval prompting in this path has an explicit early return unless server == `codex_apps`.	2026-02-04 09:38:41 -08:00
pakrym-oai	7f20357611	Stop client from being state carrier (#10595 ) I'd like to make client session wide. This requires shedding all random state it has to carry.	2026-02-04 09:05:37 -08:00
jif-oai	49dd67a260	feat: land unified_exec (#10641 ) Land `unified_exec` for all non-windows OS	2026-02-04 16:39:41 +00:00
pakrym-oai	0efd33f7f4	Update tests to stop using sse_completed fixture (#10638 ) Summary: - replace the `sse_completed` fixture and related JSON template with direct `responses::ev_completed` payload builders - cascade the new SSE helpers through all affected core tests for consistency and clarity - remove legacy fixtures that were no longer needed once the helpers are in place Testing: - Not run (not requested)	2026-02-04 08:38:06 -08:00
jif-oai	583e5d4f41	Migrate state DB path helpers to versioned filename (#10623 ) Summary - add versioned state sqlite filename helpers and re-export them from the state crate - remove legacy state files when initializing the runtime and update consumers/tests to use the new helpers - tweak logs client description and database resolution to match the new path	2026-02-04 14:31:12 +00:00
Rasmus Rygaard	df000da917	Add a codex.rate_limits event for websockets (#10324 ) When communicating over websockets, we can't rely on headers to deliver rate limit information. This PR adds a `codex.rate_limits` event that the server can pass to the client to inform them about rate limit usage. The client parses this data the same way we parse rate limit headers in HTTP mode. This PR also wires up the etag and reasoning headers for websockets	2026-02-04 06:01:47 -08:00
jif-oai	61aecdde66	fix: make sure file exist in `find_thread_path_by_id_str_in_subdir` (#10618 )	2026-02-04 13:01:17 +00:00
jif-oai	38f6c6b114	chore: simplify user message detection (#10611 ) We don't check anymore the response item with `user` role as they may be instructions etc	2026-02-04 11:14:53 +00:00
gt-oai	1eb21e279e	Requirements: add source to constrained requirement values (#10568 ) If we want to build `/debug-config`, we'll need to know the requirements sources that supplied the values. This PR adds those sources such that we can render them in the UI.	2026-02-04 11:09:48 +00:00
jif-oai	100eb6e6f0	Prefer state DB thread listings before filesystem (#10544 ) Summary - add Cursor/ThreadsPage conversions so state DB listings can be mapped back into the rollout list model - make recorder list helpers query the state DB first (archived flag included) and only fall back to file traversal if needed, along with populating head bytes lazily - add extensive tests to ensure the DB path is honored for active and archived threads and that the fallback works Testing - Not run (not requested) <img width="1196" height="693" alt="Screenshot 2026-02-03 at 20 42 33" src="https://github.com/user-attachments/assets/826b3c7a-ef11-4b27-802a-3c343695794a" />	2026-02-04 09:27:24 +00:00
pakrym-oai	56ebfff1a8	Move metadata calculation out of client (#10589 ) Model client shouldn't be responsible for this.	2026-02-03 21:59:13 -08:00
Anton Panasenko	fcaed4cb88	feat: log webscocket timing into runtime metrics (#10577 )	2026-02-03 18:04:07 -08:00
xl-openai	f38d181795	feat: add APIs to list and download public remote skills (#10448 ) Add API to list / download from remote public skills	2026-02-03 14:09:37 -08:00
viyatb-oai	08926a3fb7	chore(arg0): advisory-lock janitor for codex tmp paths (#10039 ) ## Description ### What changed - Switch the arg0 helper root from `~/.codex/tmp/path` to `~/.codex/tmp/path2` - Add `Arg0PathEntryGuard` to keep both the `TempDir` and an exclusive `.lock` file alive for the process lifetime - Add a startup janitor that scans `path2` and deletes only directories whose lock can be acquired ### Tests - `cargo clippy -p codex-arg0` - `cargo clippy -p codex-core` - `cargo test -p codex-arg0` - `cargo test -p codex-core`	2026-02-03 21:38:31 +00:00
gt-oai	8406bd7672	[codex] Default values from requirements if unset (#10531 ) If we don't set any explicit values for sandbox or approval policy, let's try to use a requirements-satisfying value.	2026-02-03 20:47:34 +00:00
Matthew Zeng	654fcb4962	[apps] Gateway MCP should be blocking. (#10289 ) Make Apps Gateway MCP blocking since otherwise app mentions may not work when apps are not loaded. Messages sent before apps become available will be queued. This only affects when `apps` feature is enabled.	2026-02-03 12:17:53 -08:00
Charley Cunningham	998eb8f32b	Improve Default mode prompt (less confusion with Plan mode) (#10545 ) ## Summary This PR updates `request_user_input` behavior and Default-mode guidance to match current collaboration-mode semantics and reduce model confusion. ## Why - `request_user_input` should be explicitly documented as Plan-only. - Tool description and runtime availability checks should be driven by the same centralized mode policy. - Default mode prompt needed stronger execution guidance and explicit instruction that `request_user_input` is unavailable. - Error messages should report the actual mode name (not aliases that can read as misleading). ## What changed - Centralized `request_user_input` mode policy in `core` handler logic: - Added a single allowed-modes config (`Plan` only). - Reused that policy for: - runtime rejection messaging - tool description text - Updated tool description to include availability constraint: - `"This tool is only available in Plan mode."` - Updated runtime rejection behavior: - `Default` -> `"request_user_input is unavailable in Default mode"` - `Execute` -> `"request_user_input is unavailable in Execute mode"` - `PairProgramming` -> `"request_user_input is unavailable in Pair Programming mode"` - Strengthened Default collaboration prompt: - Added explicit execution-first behavior - Added assumptions-first guidance - Added explicit `request_user_input` unavailability instruction - Added concise progress-reporting expectations - Simplified formatting implementation: - Inlined allowed-mode name collection into `format_allowed_modes()` - Kept `format_allowed_modes()` output for 3+ modes as CSV style (`modes: a,b,c`)	2026-02-03 12:08:38 -08:00

... 3 4 5 6 7 ...

1721 Commits