Windmill AI
Windmill AI can use Codex Pooler through Windmill’s customai provider. This
keeps Windmill on its OpenAI-compatible chat path while Codex Pooler handles Pool
routing, upstream eligibility, limit checks, and request accounting.
Use a dedicated Pool API key for Windmill. Do not reuse operator MCP tokens, browser sessions, upstream account tokens, or imported account material.
Runtime shape
Section titled “Runtime shape”Windmill should call Codex Pooler’s OpenAI-compatible /v1 surface:
https://codex-pooler.example.com/v1For local setup, use:
http://localhost:4000/v1The setup uses Windmill’s customai resource type. It is a chat-completions
integration, not Codex backend compatibility and not full OpenAI API parity.
Store the Pool API key in Windmill
Section titled “Store the Pool API key in Windmill”Create a Windmill secret variable for the Pool API key. The path is only an example; use the owner or folder convention for your workspace.
wmill variable add '<pool-api-key>' \ u/<owner>/codex_pooler_windmill_codegen \ --workspace <workspace>If you mirror Windmill resources with wmill sync, update the remote secret
first and then pull the encrypted export. Do not put a raw Pool API key in a
tracked *.variable.yaml file.
Create a customai resource
Section titled “Create a customai resource”Create a Windmill customai resource whose base_url points at Codex Pooler’s
/v1 surface and whose api_key references the secret variable.
description: Codex Pooler API credentials for Windmill AIvalue: api_key: '$var:u/<owner>/codex_pooler_windmill_codegen' base_url: https://codex-pooler.example.com/v1 headers: {}resource_type: customaiFor local setup, change base_url to http://localhost:4000/v1.
Use headers only for synthetic routing metadata required by your gateway or
proxy. Do not put bearer tokens or account secrets in those headers.
Configure workspace AI
Section titled “Configure workspace AI”In Windmill, open the workspace settings and configure Windmill AI with the
customai provider. Use the customai resource path and model id your assigned
Pool can serve.
providers: customai: resource_path: u/<owner>/codex_pooler_windmill_codegen models: - gpt-5.5default_model: provider: customai model: gpt-5.5metadata_model: provider: customai model: gpt-5.5This enables Windmill AI chat, navigation chat, script and flow assistance, fixes, summaries, metadata generation, and form-filling features that use chat completion style requests.
Self-hosted private URL guard
Section titled “Self-hosted private URL guard”Self-hosted Windmill validates AI provider base URLs before sending requests. If your deployed Codex Pooler hostname resolves to a private or internal address from the Windmill app/server runtime, Windmill rejects the request unless the server explicitly opts in.
Set this environment variable on the Windmill app/server process:
ALLOW_PRIVATE_AI_BASE_URLS=trueFor the Windmill Helm chart, put it on the app/server container environment:
windmill: app: extraEnv: - name: ALLOW_PRIVATE_AI_BASE_URLS value: "true"This setting belongs to Windmill, not Codex Pooler. Use it only when the Codex Pooler base URL is intentionally private from Windmill’s point of view.
Connection Check Through Windmill
Section titled “Connection Check Through Windmill”After saving the resource and workspace AI settings, call Windmill’s AI proxy
route. Use a Windmill user token for the Windmill API request and let Windmill
resolve the Pool API key from its customai resource.
curl -sS -X POST \ -H "Authorization: Bearer <windmill-user-token>" \ -H "Content-Type: application/json" \ -H "X-Provider: customai" \ --data '{ "model": "gpt-5.5", "messages": [ { "role": "user", "content": "Reply with only: ok" } ], "stream": false, "max_completion_tokens": 16 }' \ "$WINDMILL_URL/api/w/<workspace>/ai/proxy/chat/completions"Windmill’s agent request field is max_completion_tokens; provider adapters map it to OpenAI Responses max_output_tokens or chat max_completion_tokens as needed. Do not use max_tokens for GPT-5/O-series Windmill AI requests.
A working setup returns an OpenAI-shaped chat completion. If the response says
the URL resolves to a private or internal IP address, set
ALLOW_PRIVATE_AI_BASE_URLS=true on the Windmill app/server runtime and restart
or roll out the app.
Compatibility notes
Section titled “Compatibility notes”Leave Windmill’s code completion model unset unless you have separately configured a provider with fill-in-the-middle autocomplete support. Windmill’s autocomplete path is distinct from chat/codegen, and generic chat-completions providers are not a drop-in replacement for a fill-in-the-middle completion endpoint.
Codex Pooler’s /v1 surface is narrow OpenAI-compatible support over Codex Pool
routing. It does not provide full OpenAI API parity, and OpenAI Realtime SDK
routes are unsupported.