Hi Krill/helpers. I’m trying to get my Discord agent Charles responsive again after upgrading to OpenClaw 2026.5.7. Codex has been
debugging it locally with me, and we may have improved it, but I want a sanity check on whether we are configuring memory correctly or
over-correcting.
What I’m trying to do:
- Run Charles as a Discord channel agent.
- Keep durable memory/context via Hindsight + Lossless Claw.
- Avoid 8-13s delays on trivial messages like “you there?” / “you back?”
- Understand whether Active Memory is useful or redundant in this setup.
Setup:
- OpenClaw: 2026.5.7
- Hosting: local macOS LaunchAgent gateway
- Gateway command: node .../openclaw/dist/index.js gateway --port 18789
- Agent: cerebro / Charles
- Channel: Discord
- Main model: openai-codex/gpt-5.5
- Memory slot: hindsight-openclaw
- Context engine slot: lossless-claw
- Active Memory was configured for Charles using cerebras/gpt-oss-120b
- Cerebras provider/plugin is enabled now
- Hindsight is still enabled with autoRetain: true
- Currently disabled for latency testing:
  - Active Memory config enabled: false
  - Hindsight autoRecall: false
What was happening instead:
- Simple Discord messages took 8-13 seconds before Charles responded.
- Some replies printed quickly once they started, so most delay seemed pre-reply.
- At one point the capability runner returned:
    No text output returned for provider "cerebras"
Codex’s debugging/fixes so far:
- Verified active config/agent
  - Active agent is cerebro.
  - Main config is ~/.openclaw/openclaw.json.
  - Gateway is using that config.
- Fixed Cerebras plugin/auth shape
  Codex found that the Cerebras plugin/provider was not fully connected after the update. It enabled and allowlisted the cerebras plugin and corrected the Charles auth profile shape while preserving/redacting the secret.
  After that, the Cerebras failure changed from what looked like a disconnected-auth problem to the runner returning:
    No text output returned for provider "cerebras"
- Found Active Memory tool allowlist errors
  Logs showed:
    No callable tools remain after resolving explicit tool allowlist
    runtime toolsAllow: memory_recall, memory_search, memory_get
  This seemed related to upstream issue #77506 / PR #77515.
  Since my memory slot is Hindsight, not the bundled memory-core, Codex disabled Active Memory for Charles (the plugin entry is still enabled, but its config has enabled: false):
  "active-memory": {
    "enabled": true,
    "config": {
      "enabled": false,
      "agents": ["cerebro"],
      "allowedChatTypes": ["direct"],
      "model": "cerebras/gpt-oss-120b",
      "queryMode": "recent",
      "thinking": "off",
      "timeoutMs": 15000,
      "maxSummaryChars": 220,
      "persistTranscripts": false,
      "logging": true
    }
  }
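For context, here is how I currently understand that allowlist error, as a rough Python sketch. This is only my mental model, not OpenClaw's actual code: `resolve_allowlist` is a name I made up, and the registered tool names are placeholders, since I have not confirmed which tools the Hindsight slot really exposes. The idea is that an explicit allowlist gets intersected with whatever tools are registered, and if none of the three memory_* names are registered, nothing is left to call.

```python
# Hypothetical sketch of my understanding; NOT OpenClaw's real implementation.

def resolve_allowlist(tools_allow, registered_tools):
    """Keep only the allowlisted tool names that are actually registered."""
    registered = set(registered_tools)
    return [name for name in tools_allow if name in registered]


# Taken from the "runtime toolsAllow" line in my logs.
tools_allow = ["memory_recall", "memory_search", "memory_get"]

# Assumption: placeholder names standing in for whatever the active
# memory slot registers. I do not know the real tool names.
registered_tools = ["placeholder_tool_a", "placeholder_tool_b"]

callable_tools = resolve_allowlist(tools_allow, registered_tools)

if not callable_tools:
    # Same wording as the log line I saw.
    print("No callable tools remain after resolving explicit tool allowlist")
```

If that model is right, Active Memory could not work with my current slot setup no matter which model it runs on, which is part of why I am asking whether it is redundant here.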
- Measured latency from trajectory/logs
  For one slow message, Codex measured roughly:
  - ~4s Discord inbound debounce
  - ~3.4s Hindsight auto-recall
  - ~2.4s GPT-5.5 model generation
  - some routing/session overhead
  After later tuning, another simple message still measured around:
  - ~2.9s Hindsight recall
  - ~2.6s GPT-5.5 generation
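To make those numbers easier to compare, here is a small Python sketch that just adds up the stages I listed and shows each stage's share. The stage keys are my own labels, the values are the approximate measurements above, and the unmeasured routing/session overhead is left out, so the real totals are somewhat higher.

```python
# Approximate per-stage timings in seconds, copied from the measurements above.
# Routing/session overhead was not measured separately, so it is omitted.
first_message = {
    "discord_inbound_debounce": 4.0,
    "hindsight_auto_recall": 3.4,
    "model_generation": 2.4,
}
after_tuning = {
    "hindsight_recall": 2.9,
    "model_generation": 2.6,
}


def summarize(stages):
    """Return (total seconds, {stage: fraction of total}) for one message."""
    total = sum(stages.values())
    shares = {name: seconds / total for name, seconds in stages.items()}
    return total, shares


for label, stages in [("first", first_message), ("after tuning", after_tuning)]:
    total, shares = summarize(stages)
    print(f"{label}: ~{total:.1f}s before overhead")
    for name, share in shares.items():
        print(f"  {name}: {share:.0%}")
```

That works out to roughly 9.8s for the first message, which fits the 8-13s I was seeing, and roughly 5.5s after tuning. In both cases pre-reply work (debounce and/or recall) accounts for more than half of the measured time.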