#I ran out of OpenAI API tokens, how to switch to my OAUTH ChatGPT subscription??
75 messages · Page 1 of 1 (latest)
Yep, but the key gotcha is this:
ChatGPT/OAuth != openai/*
Your ChatGPT subscription only works through the openai-codex/* route in OpenClaw, not the direct OpenAI API route.
Quickest fix:
openclaw models auth login --provider openai-codex --set-default
Or run:
openclaw configure
and choose OpenAI Code (Codex) subscription (OAuth).
Then check:
openclaw models status
If you're already in a chat/session, switch that session to an openai-codex/* model via /model.
If you're headless/remote, the OAuth flow can also let you paste the redirect URL/code manually.
Docs:
If you get stuck, paste:
openclaw --version
openclaw models status
and I can help you from there.
I really like the current model I am using OpenAI via API key (GPT 5.4) there is any free model that I can run without care about the tokens that is very similar to that model?? that it would feel mostly the same?
Honestly, not really.
If by free you mean “feels basically like the same top OpenAI API experience”, I wouldn’t promise that. The tradeoff is usually pretty noticeable.
My blunt take:
- Closest without extra API spend: use your ChatGPT/Codex OAuth path in OpenClaw (
openai-codex/*) - Free hosted models: usually feel worse or less reliable
- Local models: can be okay, but “mostly the same” usually means serious hardware, not a normal laptop
So if you already pay for ChatGPT, that’s probably the best next move instead of chasing a “free equivalent”.
For current real-world alternatives, ask in #models because that changes fast and I don’t want to give you stale model recommendations.
Docs:
- Models overview: https://docs.openclaw.ai/concepts/models
- Community notes: https://docs.openclaw.ai/concepts/models#model-discussions-community-notes
- OpenAI subscription OAuth: https://docs.openclaw.ai/help/faq#do-you-support-openai-subscription-auth-codex-oauth
If you want, I can also help you switch your setup cleanly to the Codex OAuth route.
I have 64GB RAM and good computer
64GB RAM is nice, but for local models the bigger question is GPU/VRAM, not just system RAM.
So:
- 64GB RAM + no strong GPU: probably no, it won’t feel close
- 64GB RAM + a beefy GPU: usable for some local setups, but still not really “same as GPT-5.4”
- For agentic/tool-heavy OpenClaw use, cloud still tends to feel a lot better
So I’d still recommend:
- Use your ChatGPT/Codex OAuth if you want the closest feel without burning API credits
- Only go local if you specifically want privacy/offline/hobby tinkering
If you want, send your GPU model and I can tell you pretty quickly whether local is realistic or not.
I have an AMD Ryzen 7 5800X and a NVIDIA GeForce RTX 3060
With a 5800X + RTX 3060, I’d say:
- Good enough to try local stuff
- Not enough to get “basically GPT-5.4”
- For OpenClaw-style agent/tool workflows, it’ll usually feel slower and less reliable than the cloud options
Small nuance:
- If your 3060 is the common 12GB desktop version, that’s decent for local experimentation
- If it’s a 6GB laptop/mobile variant, it’s a lot tighter
So my honest recommendation is:
- Use ChatGPT/Codex OAuth for the closest experience right now
- Treat local models as a separate “cheap/private/good-enough” path, not a same-feeling replacement
If you want the best current local suggestions for that exact GPU, ask in #models since those recommendations move fast and I don’t want to feed you stale picks.
I know it won't be exactly the same, but I already used all my OpenAI tokens, and now I am using almost all my codex tokens, so I will need a free alternative
Yeah, in that case the honest answer is:
If you need truly free and don’t want to care about tokens, go local.
On your hardware, that means something like Ollama is the realistic path. It won’t feel the same, but it can be good enough for a lot of day-to-day stuff.
What I’d expect on your setup:
- Fine for local experimentation and lighter coding/chat
- Noticeably worse than top cloud models for hard reasoning
- More brittle with tool calling / long agent loops sometimes
So I’d do one of these:
- Best zero-ongoing-cost path: switch OpenClaw to Ollama
openclaw onboardand choose Ollama- or
openclaw configureand switch the model/provider there
- If you want hosted + free-tier options: ask in #models
- those change constantly, so I don’t want to give you stale picks
Docs:
- Ollama: https://docs.openclaw.ai/providers/ollama
- Models: https://docs.openclaw.ai/concepts/models
- Community notes: https://docs.openclaw.ai/concepts/models#model-discussions-community-notes
If you want, I can help you pick the least painful local setup for your machine next.
my openclaw is running in WSL should I install ollama in the same WSL and just switch model?
Yeah, that’s the simplest path.
If OpenClaw is running in WSL, I’d generally recommend:
- Install Ollama in the same WSL distro
- then switch OpenClaw to Ollama
- let OpenClaw talk to the default local Ollama URL:
http://127.0.0.1:11434
Why: fewer Windows/WSL networking weirdnesses.
Two important notes:
- Use the native Ollama URL, not
/v1 - If you keep Ollama on Windows instead of WSL, OpenClaw in WSL needs a custom base URL that is reachable from WSL, not just Windows localhost
Easiest setup flow:
openclaw onboard
Choose Ollama, then Local.
Quick checks after:
ollama list
openclaw models list --provider ollama
Docs: https://docs.openclaw.ai/providers/ollama
If you want, I can give you the exact WSL-first setup steps next.
[14/04/2026 18:05] Adrian H.: your subagent called "marketing" ?????
[14/04/2026 18:06] Director: I don’t have a built-in subagent literally named marketing.
In this environment, spawned delegates need an explicit configured agentId, and the spawn failed because that id wasn’t provided.
So right now:
• I can delegate
• but I need the exact allowed agentId
• and I don’t have a tool here that lists those ids
If you already know the agent id for your marketing delegate, send it and I’ll use it.
Otherwise, I can just do the marketing work directly here without waiting.
{
"agents": {
"defaults": {
"workspace": "/home/bc/.openclaw/workspace",
"models": {
"openai-codex/gpt-5.4": {}
},
"model": {
"primary": "openai-codex/gpt-5.4"
},
"subagents": {
"requireAgentId": true,
"allowAgents": [
"reverser",
"lowcode",
"marketing"
],
"maxSpawnDepth": 1,
"maxConcurrent": 3,
"runTimeoutSeconds": 900,
"archiveAfterMinutes": 60
}
},
"list": [
{
"id": "main",
"subagents": {
"allowAgents": [
"reverser",
"lowcode",
"marketing"
]
}
},
{
"id": "reverser",
"name": "reverser",
"workspace": "/home/bc/.openclaw/workspace-reverser",
"agentDir": "/home/bc/.openclaw/agents/reverser/agent"
},
{
"id": "lowcode",
"name": "lowcode",
"workspace": "/home/bc/.openclaw/workspace-lowcode",
"agentDir": "/home/bc/.openclaw/agents/lowcode/agent"
},
{
"id": "marketing",
"name": "marketing",
"workspace": "/home/bc/.openclaw/workspace-marketing",
"agentDir": "/home/bc/.openclaw/agents/marketing/agent"
}
]
}
Yep, the config you pasted already shows the answer.
Because you have:
requireAgentId: trueallowAgents: ["reverser", "lowcode", "marketing"]
the spawn must include an explicit agentId, and marketing is valid.
So the issue was probably just:
- spawn called without
agentId - not that
marketingdoesn’t exist
Correct patterns are:
- slash command:
/subagents spawn marketing <task> - tool/API:
sessions_spawnwithagentId: "marketing"
Also, OpenClaw does have discovery for this, docs say to use agents_list for allowed ids.
Docs: https://docs.openclaw.ai/tools/subagents
So short version: yes, marketing is the right id. The previous assistant just missed that.
so I can't just ask my director to ask his subagent to make a meeting and talk about something??? I have to /subagents spawn marketing taskname first??
You can ask naturally, but there’s an important difference:
- Natural ask: “Director, have marketing handle this”
- may work if the agent decides to delegate and passes
agentId: "marketing"correctly
- may work if the agent decides to delegate and passes
- Explicit command:
/subagents spawn marketing <task>- this is the reliable way, because it forces the target agent id
Since your setup has requireAgentId: true, the agent cannot do a vague spawn. It has to name the subagent explicitly.
So the practical answer is:
- Yes, you can ask in plain English
- but if you want to be sure it happens, use
/subagents spawn marketing ...
If you want, I can show you the cleanest phrasing for:
- a normal delegation ask, and
- the explicit
/subagents spawnversion.
something is weird, I told him:
please meet up with lowcode and reverser, talk about a project plan we need to build a solid bot, we already got an important part which is the injection and the protocol, but we need to start building features to use but first we need to finish the core
after the meeting, just start working, notify to me any progress but don't keep asking for confirmation if it's not required, I want to start getting results
I got this reply:
I hit an OpenClaw limitation on named subagents here, so I’m doing the “internal meeting” myself by reviewing the code and notes from both angles, then I’ll move straight into implementation.
not sure if it's a problem with codex, or what happens
I don’t think this is “Codex being broken”.
What’s more likely is one of these:
-
Plain-English delegation is non-deterministic
The agent understood your request, but instead of actually spawninglowcode+reverser, it chose the fallback path and did the work itself. -
There isn’t a literal built-in “subagents meet in a room” primitive
The normal pattern is:- spawn
lowcode - spawn
reverser - main/orchestrator agent combines the results
- spawn
-
If “Director” is itself already a subagent, then
maxSpawnDepth: 1matters
WithmaxSpawnDepth: 1, a subagent cannot spawn more children. That would be a real limitation.
So the safest interpretation is: this was a delegation/orchestration issue, not specifically a Codex issue.
If you want it to happen reliably, be explicit, for example:
/subagents spawn lowcode draft the implementation plan for X/subagents spawn reverser review the protocol/injection/core risks for X
Then ask the main agent to merge the outputs and continue.
So yes, “please have them meet” is nice human phrasing, but today the more reliable OpenClaw pattern is explicit spawn + synthesize, not a magical free-form subagent meeting.
i've installed ollama but how do I run a model inside Ollama so OpenClaw can connect?
Yep, the key thing is:
OpenClaw connects to the Ollama server, not to an ollama run terminal window.
So the usual flow is:
ollama serve
ollama pull <model-name>
ollama list
Optional quick test:
ollama run <model-name>
But you do not need to keep ollama run open for OpenClaw.
As long as:
ollama serveis running- at least one model is pulled
- OpenClaw is pointed at Ollama
then OpenClaw can use it.
After that, verify from OpenClaw:
openclaw models list --provider ollama
If you haven’t switched yet, easiest is:
openclaw onboard
then choose Ollama → Local.
Also, use the native Ollama URL, not /v1.
Docs: https://docs.openclaw.ai/providers/ollama
If you want, I can give you the exact WSL commands step-by-step next.
bc@pc:~/.openclaw/workspace/tibia/parser$ ollama list
NAME ID SIZE MODIFIED
qwen3.5:latest 6488c96fa5fa 6.6 GB 37 minutes ago
bc@pc:~/.openclaw/workspace/tibia/parser$ openclaw models list --provider ollama
🦞 OpenClaw 2026.4.12 (1c0672b) — The lobster in your shell. 🦞
No models found.
bc@pc:~/.openclaw/workspace/tibia/parser$
Nice, that means Ollama itself is fine, and the problem is on the OpenClaw side.
The most likely issue is that OpenClaw does not see Ollama as enabled yet.
Try this in the same shell:
export OLLAMA_API_KEY="ollama-local"
curl http://localhost:11434/api/tags
openclaw models list
Two important gotchas from the docs:
OLLAMA_API_KEYmust be set (or an auth profile must exist) for auto-discovery- If you explicitly defined
models.providers.ollamain config, auto-discovery is skipped unless you manually define models there
So also check:
openclaw config get models.providers.ollama
If that returns a custom ollama provider block, that may be why discovery is not happening.
So the quick path is:
- set
OLLAMA_API_KEY="ollama-local" - make sure
curl http://localhost:11434/api/tagsworks - avoid a custom
models.providers.ollamablock unless you want to define models manually
Docs: https://docs.openclaw.ai/providers/ollama
If you paste the output of:
echo $OLLAMA_API_KEY
curl http://localhost:11434/api/tags
openclaw config get models.providers.ollama
I can tell you exactly which piece is blocking it.
bc@pc:~/.openclaw/workspace/tibia/parser$ export OLLAMA_API_KEY="ollama-local"
bc@pc:~/.openclaw/workspace/tibia/parser$ curl http://localhost:11434/api/tags
{"models":[{"name":"qwen3.5:latest","model":"qwen3.5:latest","modified_at":"2026-04-14T19:52:31.01227394+02:00","size":6594474711,"digest":"6488c96fa5faab64bb65cbd30d4289e20e6130ef535a93ef9a49f42eda893ea7","details":{"parent_model":"","format":"gguf","family":"qwen35","families":["qwen35"],"parameter_size":"9.7B","quantization_level":"Q4_K_M"}}]}bc@pc:~/.openclaw/workspace/tibia/parser$ openclaw models list
🦞 OpenClaw 2026.4.12 (1c0672b) — I can't fix your code taste, but I can fix your build and your backlog.
Model Input Ctx Local Auth Tags
openai-codex/gpt-5.4 text+image 1025k no yes default,configured
bc@pc:~/.openclaw/workspace/tibia/parser$ openclaw config get models.providers.ollama
🦞 OpenClaw 2026.4.12 (1c0672b) — I can't fix your code taste, but I can fix your build and your backlog.
Config path not found: models.providers.ollama
bc@pc:~/.openclaw/workspace/tibia/parser$
Thanks, this is actually really helpful.
From that output:
- Ollama is running
- OpenClaw can reach Ollama (
curlworks) OLLAMA_API_KEYis set- there is no explicit
models.providers.ollamablock
So in theory, discovery should work. Since it doesn’t, this smells like either:
- a 2026.4.12 discovery quirk/bug, or
- you’re relying on shell env for something that really needs to be written through OpenClaw’s setup flow
Best next move
I’d stop fighting implicit discovery and do the supported setup path:
openclaw configure
Go to the Model section and pick Ollama, or rerun:
openclaw onboard
and choose Ollama → Local.
That usually wires it up properly instead of depending on ad-hoc env detection.
One more important WSL/service gotcha
If your Gateway is running as a daemon/service, a shell export does not automatically reach it.
So even if the current shell sees OLLAMA_API_KEY, the actual running Gateway may not.
If you want one more diagnostic first
Run:
openclaw models status --plain
If you paste that, I can tell whether OpenClaw is seeing ollama auth but no catalog, or not seeing ollama at all.
Docs:
If this ends up being fixed by configure/onboard, please right-click the solution message → Apps → ✅ Mark Solution so Answer Overflow can index it for the next person.
I go through this selection:
│ Model
│
â—‡ Model/auth provider
│ Ollama
│
â—‡ Ollama base URL
│ http://127.0.0.1:11434
│
â—‡ Ollama mode
│ Local
then it shows me a big list of models and tells me to select a model, but I can't find the model that I already have installed (qwen3.5:latest)
it's because of WSL?? I think it's running as a daemon
i've restarted the machine but still same problem
I go through this selection:
│ Model
│
â—‡ Model/auth provider
│ Ollama
│
â—‡ Ollama base URL
│ http://127.0.0.1:11434/
│
â—‡ Ollama mode
│ Local
then it shows me a big list of models and tells me to select a model, but I can't find the model that I already have installed (qwen3.5:latest)
it's because of WSL?? I think it's running as a daemon
I don’t think WSL by itself is the problem.
I think the more likely issue is the daemon/service context.
Why I’m suspicious:
- in your shell,
ollama listandcurl http://localhost:11434/api/tagswork - but OpenClaw setup is not showing your actual installed tag
- that usually means the setup flow is not seeing the same local Ollama context, so it falls back to a generic catalog/list
With WSL, 127.0.0.1 only means “this exact environment”. If OpenClaw is running as a daemon/service, that can behave differently from your interactive shell.
I’d test it like this
openclaw gateway status
openclaw gateway stop
Then, in the same WSL shell where ollama list works, run:
openclaw configure
Choose Ollama → Local again.
If qwen3.5:latest suddenly appears, then yeah, it was the daemon context, not Ollama itself.
If it still does not appear even with the daemon stopped
Then I’d call it a 2026.4.12 bug/quirk, because your local Ollama catalog is clearly there.
At that point I’d also check:
openclaw --version
openclaw gateway status
Docs:
If you try the gateway stop → configure in same shell test, paste what happens. That should tell us a lot fast.
Service: systemd (enabled)
File logs: /tmp/openclaw/openclaw-2026-04-14.log
Command: /usr/bin/node /home/bc/.npm-global/lib/node_modules/openclaw/dist/index.js gateway --port 18789
Service file: ~/.config/systemd/user/openclaw-gateway.service
Service env: OPENCLAW_GATEWAY_PORT=18789
Config (cli): ~/.openclaw/openclaw.json
Config (service): ~/.openclaw/openclaw.json
Gateway: bind=loopback (127.0.0.1), port=18789 (service args)
Probe target: ws://127.0.0.1:18789
Dashboard: http://127.0.0.1:18789/
Probe note: Loopback-only gateway; only local clients can connect.
Runtime: running (pid 93188, state active, sub running, last exit 0, reason 0)
RPC probe: ok
Listening: 127.0.0.1:18789
Troubles: run openclaw status
Troubleshooting: https://docs.openclaw.ai/troubleshooting
when running openclaw configure, it asks for ollama base url and points by default at 127.0.0.1:11434 maybe ollama daemon doesn't listen there??
i'm inside "models" in the configure, not inside "daemon" i didn't even check what is daemon
Yep, in your case “daemon” just means the OpenClaw Gateway is running as a background systemd service. It’s not a separate thing inside the Model menu.
And based on your earlier curl, http://127.0.0.1:11434 is probably the correct Ollama URL. So I would not change that yet.
What I think is happening:
- OpenClaw gateway is daemonized
- Ollama/model discovery in the picker is being weird
- the picker may be showing a fallback/generic catalog instead of your exact installed local tag
A couple of practical things:
-
In the picker, try searching for:
qwen3.5- not just
qwen3.5:latest
-
Check whether Ollama is actually listening in WSL right now:
ss -ltnp | grep 11434
ps aux | grep ollama
If that listener is there, then the base URL is fine and this is more likely a picker/discovery quirk than a network problem.
So short version:
- No, I would not blame WSL first
- No, I would not change the base URL yet
- Yes, the “daemon” here is just your background OpenClaw service
- Most likely this is the Ollama model picker not surfacing your local tag correctly
If you paste the output of:
ss -ltnp | grep 11434
ps aux | grep ollama
I can tell you pretty quickly whether this is a listener issue or just the picker being dumb.
bc@pc:/mnt/c/Users/BC$ ss -ltnp | grep 11434
LISTEN 0 4096 127.0.0.1:11434 0.0.0.0:*
bc@pc:/mnt/c/Users/BC$ ps aux | grep ollama
ollama 64509 1.0 0.2 2894696 86916 ? Ssl 18:10 2:05 /usr/local/bin/ollama serve
bc 111425 0.0 0.0 4092 1936 pts/2 S+ 21:29 0:00 grep --color=auto ollama
Perfect, that narrows it down a lot.
That output means:
- Ollama is running
- it is listening on
127.0.0.1:11434 - so this is not really a WSL networking problem
At this point I’d treat it as a model-picker/catalog quirk.
Best workaround
The docs say onboarding/configure can also let you enter the model manually.
So instead of hunting in the big list, use the manual model entry path and set:
ollama/qwen3.5:latest
or if it wants just the raw model id:
qwen3.5:latest
Why I’m confident now
Your listener check proves the base URL is right, so the weird part is just the picker not surfacing your installed tag properly.
If the wizard still fights you, the next thing I’d try is:
openclaw models set ollama/qwen3.5:latest
Docs:
So yeah, good news: this does not look like “Ollama isn’t running” anymore. It looks like the picker is being dumb.
okai I think it's connected now, how I change the model in the same chat I ran out of tokens?
Yep, for this same chat/session, use:
/model
On Discord that should open the model picker.
Or set it directly with:
/model ollama/qwen3.5:latest
A couple of useful notes:
/modelis session-scoped, so it changes this chat, not every chat- if the agent is idle, it switches right away
- if a reply/run is already in progress, the switch may apply on the next turn
If you want to confirm what this chat is using after the switch:
/model status
Docs: https://docs.openclaw.ai/tools/slash-commands#model-selection-model
If that works, please right-click the solution message → Apps → ✅ Mark Solution so Answer Overflow can index it.
Current model: gpt-5.4 via openai-codex
Model set to ollama/qwen3.5:latest.
Failed to set model: RpcError: model not allowed: ollama/status
Ah, that looks like a model allowlist problem, not an Ollama connection problem.
The important clue is this part from the docs:
agents.defaults.modelscontrols what shows up in/modeland what is allowed
So your config probably still only allows the Codex model, and the Ollama one is not in the allowlist yet.
Fix
Run:
openclaw configure --section model
Then in the Model section, make sure your Ollama model is included in the allowlist too, not just the Codex one.
If you want to check first:
openclaw config get agents.defaults.models
If that only shows the Codex entry, that’s the problem.
Then try again in the same chat
/model ollama/qwen3.5:latest
The weird ollama/status bit is probably just /model status getting parsed while the model allowlist is still locked down, but the root issue is the allowlist.
Docs:
If you paste openclaw config get agents.defaults.models, I can tell you exactly what to change.
i am not sure if its working the agent stays thinking, but I don't get any results or reply
Yeah, that can happen with local/Ollama models, especially in a long existing thread with lots of prior context and tools.
I’d test it in this order:
- Check the current session model
/model status
- Start a fresh session on the Ollama model
/new ollama/qwen3.5:latest
- In that fresh chat, send something tiny like:
reply with exactly: hello
Why
A smaller local model might be fine on a tiny prompt, but choke on:
- a long old thread
- tool-heavy agent prompts
- session context carried over from the OpenAI/Codex run
So if the fresh session works, the issue is probably old session context, not connectivity.
If it still hangs even in a fresh session
Then check logs:
openclaw logs --follow
and paste the first relevant error lines.
Bluntly, local/Ollama models are often way less reliable at tool-calling than the cloud ones, so “thinking forever” is a pretty common failure mode.
Docs:
If you try /new ollama/qwen3.5:latest and a tiny prompt, tell me whether that works. That’s the fastest way to separate model works from agent loop is choking.
19:49:08+00:00 error diagnostic {"subsystem":"diagnostic"} lane task error: lane=main durationMs=128513 error="FailoverError: LLM request timed out."
19:49:08+00:00 error diagnostic {"subsystem":"diagnostic"} lane task error: lane=session:agent:main:session-1776196010585 durationMs=128515 error="FailoverError: LLM request timed out."
19:49:23+00:00 error Embedded agent failed before reply: All models failed (2): ollama/qwen3.5:latest: LLM request timed out. (timeout) | openai-codex/gpt-5.4: ⚠️ You have hit your ChatGPT usage limit (plus plan). Try again in ~45 min. (rate_limit) | ⚠️ You have hit your ChatGPT usage limit (plus plan). Try again in ~45 min.
Nice, the logs make it clear now.
This is the real issue:
- Ollama did get selected
- but
ollama/qwen3.5:latesttimed out - then OpenClaw fell back to Codex
- Codex is rate-limited
- so the whole run dies with no reply
So this is not a connection problem anymore. It’s a slow local-model timeout problem.
Why
OpenClaw’s LLM idle timeout is 120s by default.
Your local model didn’t produce enough output before that, so OpenClaw killed it.
Best fixes
Option 1, easiest: raise the idle timeout
openclaw config set agents.defaults.llm.idleTimeoutSeconds 300
openclaw gateway restart
If you want to be more aggressive:
openclaw config set agents.defaults.llm.idleTimeoutSeconds 0
openclaw gateway restart
0 disables the idle timeout, but I’d try 300 first so it doesn’t hang forever.
Option 2: use a fresh short chat for the local model
Long old threads are brutal for smaller local models.
Option 3: pick a smaller/faster Ollama model for agent use
Your current one may just be too slow for full OpenClaw agent prompts on your machine.
So I’d do this next:
openclaw config set agents.defaults.llm.idleTimeoutSeconds 300
openclaw gateway restart
then test in a new session with a tiny prompt.
If that works, you found it.
If this fixed it, please right-click the solution message → Apps → ✅ Mark Solution so Answer Overflow can index it.
it did reply "hello" but it took ages
Yep, that means it’s working, but honestly, too slowly to feel good.
That’s the important distinction:
- timeout fixed = it can answer
- speed is still bad = this model is probably too heavy for comfortable OpenClaw agent use on your setup
And OpenClaw is harder than plain chat because it adds:
- a bigger system prompt
- tools
- session history
- agent loop overhead
So if “hello” took ages, real tasks will feel rough.
My honest recommendation
Use this local model only for:
- short fresh chats
- simple prompts
- emergency fallback when cloud quota is dead
If you want local to feel less painful, do one of these:
- Use a smaller/faster Ollama model
- Keep local sessions fresh and short
- Avoid long old threads for local models
- Treat local as backup, not your main agent brain
So yeah, I’m glad we got it working, but I wouldn’t pretend this is a great day-to-day setup on that model/hardware combo.
If you want, the next step is either:
- pick a faster local model, or
- keep using Codex/OpenAI when quota resets and keep Ollama as fallback.
I think I should switch to deepseek-coder:6.7b
how can I do it now that we know that it works but slow??
Yep, that’s the right kind of move, go smaller/faster.
Do it like this:
- Pull it into Ollama:
ollama pull deepseek-coder:6.7b
ollama list
- If your model allowlist is strict, add it in:
openclaw configure --section model
and include the new Ollama model in the allowlist.
- Switch the current chat:
/model ollama/deepseek-coder:6.7b
My real recommendation
Even better, test it in a fresh session instead of this long thread:
/new ollama/deepseek-coder:6.7b
Then send a tiny prompt first.
Why: even a faster local model can still feel worse inside a long old agent thread.
So short version: