Skip to content

Windmill AI

Windmill AI can use Codex Pooler through Windmill’s customai provider. This keeps Windmill on its OpenAI-compatible chat path while Codex Pooler handles Pool routing, upstream eligibility, limit checks, and request accounting.

Use a dedicated Pool API key for Windmill. Do not reuse operator MCP tokens, browser sessions, upstream account tokens, or imported account material.

Windmill should call Codex Pooler’s OpenAI-compatible /v1 surface:

https://codex-pooler.example.com/v1

For local setup, use:

http://localhost:4000/v1

The setup uses Windmill’s customai resource type. It is a chat-completions integration, not Codex backend compatibility and not full OpenAI API parity.

Create a Windmill secret variable for the Pool API key. The path is only an example; use the owner or folder convention for your workspace.

Terminal window
wmill variable add '<pool-api-key>' \
u/<owner>/codex_pooler_windmill_codegen \
--workspace <workspace>

If you mirror Windmill resources with wmill sync, update the remote secret first and then pull the encrypted export. Do not put a raw Pool API key in a tracked *.variable.yaml file.

Create a Windmill customai resource whose base_url points at Codex Pooler’s /v1 surface and whose api_key references the secret variable.

description: Codex Pooler API credentials for Windmill AI
value:
api_key: '$var:u/<owner>/codex_pooler_windmill_codegen'
base_url: https://codex-pooler.example.com/v1
headers: {}
resource_type: customai

For local setup, change base_url to http://localhost:4000/v1.

Use headers only for synthetic routing metadata required by your gateway or proxy. Do not put bearer tokens or account secrets in those headers.

In Windmill, open the workspace settings and configure Windmill AI with the customai provider. Use the customai resource path and model id your assigned Pool can serve.

providers:
customai:
resource_path: u/<owner>/codex_pooler_windmill_codegen
models:
- gpt-5.5
default_model:
provider: customai
model: gpt-5.5
metadata_model:
provider: customai
model: gpt-5.5

This enables Windmill AI chat, navigation chat, script and flow assistance, fixes, summaries, metadata generation, and form-filling features that use chat completion style requests.

Self-hosted Windmill validates AI provider base URLs before sending requests. If your deployed Codex Pooler hostname resolves to a private or internal address from the Windmill app/server runtime, Windmill rejects the request unless the server explicitly opts in.

Set this environment variable on the Windmill app/server process:

Terminal window
ALLOW_PRIVATE_AI_BASE_URLS=true

For the Windmill Helm chart, put it on the app/server container environment:

windmill:
app:
extraEnv:
- name: ALLOW_PRIVATE_AI_BASE_URLS
value: "true"

This setting belongs to Windmill, not Codex Pooler. Use it only when the Codex Pooler base URL is intentionally private from Windmill’s point of view.

After saving the resource and workspace AI settings, call Windmill’s AI proxy route. Use a Windmill user token for the Windmill API request and let Windmill resolve the Pool API key from its customai resource.

Terminal window
curl -sS -X POST \
-H "Authorization: Bearer <windmill-user-token>" \
-H "Content-Type: application/json" \
-H "X-Provider: customai" \
--data '{
"model": "gpt-5.5",
"messages": [
{ "role": "user", "content": "Reply with only: ok" }
],
"stream": false,
"max_completion_tokens": 16
}' \
"$WINDMILL_URL/api/w/<workspace>/ai/proxy/chat/completions"

Windmill’s agent request field is max_completion_tokens; provider adapters map it to OpenAI Responses max_output_tokens or chat max_completion_tokens as needed. Do not use max_tokens for GPT-5/O-series Windmill AI requests.

A working setup returns an OpenAI-shaped chat completion. If the response says the URL resolves to a private or internal IP address, set ALLOW_PRIVATE_AI_BASE_URLS=true on the Windmill app/server runtime and restart or roll out the app.

Leave Windmill’s code completion model unset unless you have separately configured a provider with fill-in-the-middle autocomplete support. Windmill’s autocomplete path is distinct from chat/codegen, and generic chat-completions providers are not a drop-in replacement for a fill-in-the-middle completion endpoint.

Codex Pooler’s /v1 surface is narrow OpenAI-compatible support over Codex Pool routing. It does not provide full OpenAI API parity, and OpenAI Realtime SDK routes are unsupported.