0.1.7 • Published 8h agoCLI

@galaxy-yearn/codex-deepseek-gateway

Licence

MIT

Version

0.1.7

Deps

Size

380 kB

Vulns

Weekly

Summary Dependency Versions

Codex DeepSeek Gateway

A lightweight local gateway for using DeepSeek models in Codex.

Codex keeps using the OpenAI Responses API wire format. The gateway translates requests to DeepSeek-compatible Chat Completions, calls DeepSeek, then translates the result back to Responses JSON or response.* SSE events.

Package: @galaxy-yearn/codex-deepseek-gateway

DeepSeek is a great company.

Requirements

Node.js 22 or newer
A DeepSeek API key
Codex CLI 0.142.0 or newer is recommended

Install

npm install -g @galaxy-yearn/codex-deepseek-gateway
codex-deepseek-gateway --version
codex-deepseek-gateway install

The runtime is copied to ~/.codex/deepseek-gateway. Put your DeepSeek API key in:

~/.codex/deepseek-gateway/config/gateway.local.json

{
  "upstreamApiKey": "sk-...",
  "codexPromptLanguage": "en"
}

install preserves an existing gateway.local.json. If this is your first install, add the key and start the gateway:

codex-deepseek-gateway start
codex-deepseek-gateway status

status should show "reachable": true.

Codex Config

Add this provider to ~/.codex/config.toml:

model_provider = "deepseek-gateway"
model = "deepseek-v4-pro"
model_reasoning_effort = "xhigh"
model_supports_reasoning_summaries = true
model_reasoning_summary = "auto"

[model_providers.deepseek-gateway]
name = "DeepSeek"
base_url = "http://127.0.0.1:3000/v1"
wire_api = "responses"

Restart Codex after editing config.toml.

Usage

Start a new Codex conversation with gateway overrides:

codex-deepseek-gateway new

Resume a Codex session from the current project:

codex-deepseek-gateway sessions

When using the gateway, prefer these new / sessions commands over plain codex / codex resume. The launcher adds the gateway provider, model catalog, model, and reasoning overrides.

In the interactive session picker, use Up/Down to move through a scrolling window of sessions. Press n to start a new conversation instead of resuming an existing session.

Useful non-interactive forms:

codex-deepseek-gateway new --model deepseek-v4-flash --reasoning-effort low  # start with explicit model and effort
codex-deepseek-gateway sessions --print                                      # list resume commands
codex-deepseek-gateway sessions --all                                        # include sessions from all projects
codex-deepseek-gateway sessions --exec <id-or-row>                           # resume directly by row or session id

new chooses a model, then Codex reasoning effort. sessions chooses a session first, then model and reasoning effort. Both launch Codex with:

codex -c model_provider=deepseek-gateway -c model=<model> -c model_reasoning_effort=<effort> -c model_supports_reasoning_summaries=true -c model_reasoning_summary=auto

The installed launcher also passes model_catalog_json pointing at the packaged gateway Codex catalog, so Codex-native multi-agent validation accepts the DeepSeek model aliases and low|medium|high|xhigh reasoning efforts. This Codex setting replaces the default model catalog for that Codex process; it is not merged into it.

Even if the deepseek-v4 model supports a 1M context window, setting context_window and max_context_window too high in the catalog is not recommended, because excessive context can sharply degrade instruction following and cause attention drift.

System prompt language is selected in ~/.codex/deepseek-gateway/config/gateway.local.json with codexPromptLanguage. Supported values are en and zh; invalid values fall back to en. The launcher chooses the matching packaged catalog:

en -> ~/.codex/deepseek-gateway/config/codex-model-catalog.json
zh -> ~/.codex/deepseek-gateway/config/codex-model-catalog.zh.json

Codex /personality continues to work with the catalog's personality_default, personality_friendly, and personality_pragmatic entries.

Inside a launcher-started Codex TUI, /model can switch between the packaged DeepSeek models and reasoning efforts.

Model aliases are read from:

~/.codex/deepseek-gateway/config/model-aliases.json

model-aliases.json is managed by this package and is refreshed on install. The packaged Codex catalog currently allows the default aliases deepseek-v4-flash and deepseek-v4-pro for Codex-native sub-agent validation.

Commands

codex-deepseek-gateway install    # copy runtime into ~/.codex/deepseek-gateway
codex-deepseek-gateway start      # start the local gateway
codex-deepseek-gateway stop       # stop the local gateway
codex-deepseek-gateway status     # show process and endpoint status
codex-deepseek-gateway doctor     # inspect config and request mapping
codex-deepseek-gateway new        # start a Codex conversation through the launcher
codex-deepseek-gateway sessions   # pick and resume a Codex session through the launcher
codex-deepseek-gateway uninstall  # remove the local runtime

doctor checks the active Codex config, DeepSeek request shape, reasoning mode, and optional web-search backend readiness.

To remove the local runtime and then uninstall the global package:

codex-deepseek-gateway uninstall
npm uninstall -g @galaxy-yearn/codex-deepseek-gateway

Reasoning

Codex effort maps to DeepSeek V4 thinking mode:

Codex effort	DeepSeek request
`low`	`thinking.type = disabled`
`medium`	`thinking.type = enabled`, `reasoning_effort = high`
`high`	`thinking.type = enabled`, `reasoning_effort = high`
`xhigh`	`thinking.type = enabled`, `reasoning_effort = max`

When DeepSeek returns reasoning_content, the gateway preserves the raw text for DeepSeek history and sends a display-only Markdown-cleaned copy through Codex's reasoning summary UI. The raw reasoning is not duplicated into visible message content.

Web Search

Web search is optional and off by default. Configure Tavily for search:

{
  "tavilyApiKey": "tvly-...",
  "tavilyWebSearchEnabled": true
}

Configure Firecrawl if you also want opened-page reading:

{
  "firecrawlApiKey": "fc-...",
  "firecrawlWebFetchEnabled": true
}

Codex can keep requesting web_search / web_search_preview. The gateway exposes compact internal web tools to DeepSeek, executes Tavily/Firecrawl calls itself, feeds tool results back to the model, and returns Codex-compatible web_search_call items. TAVILY_MAX_SEARCH_ROUNDS defaults to 10 and is only a runaway/cost guardrail; when reached, the gateway disables web tools for one final-answer turn.

Final answers should include useful source titles and URLs directly.

Limits

Chat Completions is not a full Responses API replacement.

Hosted tools without a local Codex executor are represented as function shims. Web search is the only hosted tool the gateway emulates directly.
Tavily/Firecrawl web emulation is text-focused; it does not provide browser control, screenshots, raw HTML, cookies, crawl jobs, or private-network access.
OpenAI file_id values are passed through; the gateway cannot fetch private OpenAI-hosted files.
In-memory previous_response_id / conversation history is lost when the gateway process restarts.
Plain codex commands do not automatically load the packaged model catalog. Use the launcher when you want TUI /model and sub-agent validation to use the DeepSeek catalog.

License

MIT. See LICENSE.

Keywords

codex deepseek openai responses-api chat-completions gateway proxy codex-cli openai-responses deepseek-v4 deepseek-v4-flash deepseek-v4-pro function-calling tool-calls web-search tavily firecrawl