Hi, running OpenClaw 2026.4.14 on VPS 12 vCPU 24GB RAM no GPU, Ollama local with qwen2.5:7b.
Every agent turn returns 500 from Ollama after exactly 1m59s. Direct curl works fine in 7s.
openclaw --version: OpenClaw 2026.4.14 (323493f)
openclaw models status:
Default: ollama/qwen2.5:7b
Fallbacks: none
Auth: ollama:default api_key configured
openclaw config get models.providers.ollama:
baseUrl: http://127.0.0.1:11434
api: ollama (no /v1)
model: qwen2.5:7b
contextWindow: 32768
supportsTools: false
idleTimeoutSeconds: 300
openclaw config get agents.defaults:
primary: ollama/qwen2.5:7b
llm.idleTimeoutSeconds: 300
Ollama logs:
POST /api/chat returns 500 after 1m59s
completion request prompt=26848 tokens
num_ctx=32768
aborting completion request due to client closing the connection
System:
RAM total 23GB, available 21GB
Ollama model size 5.8 GiB
CPU only, no GPU
ollama list: qwen2.5:7b 4.7GB
Direct curl test works in 7s:
curl http://127.0.0.1:11434/api/generate -d {"model":"qwen2.5:7b","prompt":"hola","stream":false}
Response: Hola! Como estas?
Already tried: supportsTools false reduced prompt from 66K to 26K tokens but still 500. idleTimeoutSeconds 300 set and gateway restarted.
Question: why does Ollama return 500 after exactly 1m59s with 21GB free RAM, prompt 26K tokens under the 32K context window, and direct curl working fine?