#openrouter/auto thinking issue
1 messages · Page 1 of 1 (latest)
Let me verify the exact behavior in docs/source so I don’t hand you a made-up config key.
Short version: yes, that retry can add latency.
What’s happening:
openrouter/autois a moving target, so thinking support is not stable.- OpenClaw sees a thinking level, OpenRouter rejects it for the routed backend, then OpenClaw retries with
minimal.
Best fix:
- Paste
openclaw --versionfirst. - If you want stable thinking/quality, stop using
openrouter/autoand switch to a concrete OpenRouter model. - If you want to stay on
openrouter/auto, turn thinking off for that session/model:- chat:
/think off - or config:
{ agents: { defaults: { - chat:
models: {
"openrouter/auto": {
params: { thinking: "off" },
},
},
},
},
}
You can also use agents.defaults.thinkingDefault: "off" if you want it globally.
I checked the docs/source, and OpenClaw’s OpenRouter docs say supported non-auto routes get reasoning injection, while openrouter/auto should skip it. So if you’re on a recent version and still seeing this a lot, it may be version-specific.
Docs:
If you paste your openclaw --version, I can tell you whether this looks expected on that build or bug-ish.
OpenClaw 2026.4.21
Let me check whether that looks like known-by-code behavior on 2026.4.21 or more like a rough edge.
Thanks, that helps. On 2026.4.21, I’d treat this as a real rough edge, not just “normal tuning.”
Why:
- OpenClaw’s docs/source say
openrouter/autoshould skip reasoning injection. - But
openrouter/autois still a router, so the actual backend can change per call. - So yes, the retry can hurt latency, and
autoalso hurts quality consistency even when it doesn’t warn.
What I’d do:
- In that thread, send
/thinkand check whether a session thinking override is stuck. - If you want to stay on
openrouter/auto, send/think off. - If you want stable reasoning behavior, stop using
openrouter/autoand switch to a concrete OpenRouter model.
If you want the config version instead of chat commands:
{
agents: {
defaults: {
models: {
"openrouter/auto": {
params: { thinking: "off" },
},
},
},
},
}
So my blunt recommendation is:
- Convenience: keep
openrouter/auto, but turn thinking off
- Best quality + predictable latency: use a concrete model, not
auto
Docs:
- https://docs.openclaw.ai/providers/openrouter
- https://docs.openclaw.ai/tools/thinking
- https://docs.openclaw.ai/help/faq
If you want current concrete-model recommendations, ask in #models. And if this fixes it, please right-click the solution message → Apps → ✅ Mark Solution so Answer Overflow can index it.