Hi — I'm trying to run OpenClaw fully local using LM Studio as the model provider.
Setup:
OS: Windows 10
GPU: NVIDIA 5090 - 32GB VRAM + 64 GB system RAM
Model: qwen3-coder-30b-a3b-instruct
LM Studio server running on: http://127.0.0.1:1234
Context length in LM Studio: 16k
OpenClaw version: 2026.2.9
Config highlights
Provider: lmstudio
Endpoint: http://127.0.0.1:1234/v1
Model ID returned by /v1/models:
qwen3-coder-30b-a3b-instruct
Behavior:
OpenClaw connects successfully and the model clearly runs, but the agent output is unusable.
Even simple prompts like: "OK" or similar produce very long multi-paragraph outputs that include things like:
-timestamps
-message IDs
-internal analysis
-repeated session status text
-The output keeps generating until I manually press Stop.
This happens even after:
-restarting the gateway
-creating a new session
-editing bootstrap.md to force minimal replies
-limiting max tokens in LM Studio
-temperature = 0
Question
-Is this a known issue when using coder/reasoning models (like Qwen3-Coder) with OpenClaw?
-Do agents require a chat-tuned model (e.g., Qwen-Instruct, Llama-Instruct, etc.) to behave correctly?
-Or is there a configuration step I'm missing when using LM Studio as the provider?
Any guidance appreciated.