#general
1 messages · Page 22 of 1
i mean it has a limit ofc and it resets but generally limits mean nothing to me
😅
9router + autoresearch
U talk about Grok ?
oh that kind, ya there's models for that
grok broke
which models is left as an excercise to the reader
codex weekly limits
2 hours later, can't connect my openclaw to supergrok
9router switches to sub-optimal but still excellent models that are cheaper, then local
@trail ermine ?
basically a /v1 smart proxy
yep literally been running the openclaw commands and tunnel and authenticating with xAI for 2 hours and it hasn't read my code in the callback once
👁️ 👄 👁️
Codex weekly limits are fantastic 🎉
so ridiculous openclaw onboard --auth-choice xai-device-code
doesn't even offer anything about oAuth or choosing xAI
lol tibo is goated. confetti feature on codexbar triggers so often lol
This creating an app from scratch shit is pretty slow, huh
I'd like to give it a try, if I may. I too am making a Dashboard of sorts for OpenClaw.
lol I got my first one today and I cheered!
i asked my model to make her own including her ec2 egress
I sent my agent a selfie then asked it "Paint me like on of your French girls"
Post a screeshot? I'm looking for inspiration
DM me?
yes
can't i got dms closed
might be able to link it long enough for you to look an d delete
Lol seems like google is launching its own openclaw alternative? https://www.theverge.com/tech/932996/google-gemini-spark-antigravity-io-2026
appreciated
you don't wanna know how involved it was for me to make her a github account/PAT
fkn sat there for an hour listening to sounds of women's heels
LOL I am so lucky that my literal first day of classes they taught us how to use git lol
pretty much a requirement of a human, for an AI much more so (to stash work already done nonlocally)
That sounds less like an OpenClaw competitor and more like a GitHub Copilot-style alternative to Codex.
speaking of being cucked by github, bills due
According to every AI demo it’s really fast if you don’t care exactly what comes out.
lol thomas dohmke is almost my direct neighbor lol but he is not github ceo anymore
tell him to make a call to fix codeql lol
lol
I think openclaw on umbrel doesn't allow supergrok to be connected
gah damn I went through my 5h codex limit in 45 minutes lol
Can I show you what I've been working on? I could use some fresh eyes on it
just saw the news about openclaw linking with grok directly. does anyone know the usage limits?
sure
i like the boxes with top line height 2px
Please ignore the poor UI...I'm only just figuring out what I want to do there. This is the first app I've ever made by myself.
well ask yourself what do you want to monitor, what's important to you that it's online, how do you want the metrics presented
LOL yessir. 🫡
the helix api might be nice to put there
Also curious. They’re not published I think. But for $30 I’ll give it a try. Need to automate my racism a bit more.
lmaooooo man at this point if it works i’ll just chalk it up as a design bug
Explain?
twitch has an api called helix, and many other apis that watch things
I currently use ollama cloud and it was amazing. but they’ve started rate limiting compute. kimi k2.6 was crazy the first few weeks now its like im using claude or soemthing
whats up?
Ah, interesting. I could possibly make use of that. I'll look into it. 🐙
paid plan or free plan?
I tried to use mercury yesterday and I kept getting 503 server overloaded errors. And that’s on a payg plan with billing info enabled.
paid, i’ve had the 20 dollar plan. for the first 2-3 months I never got close to hitting limits. I finally hit it this past weekend.
apparently on the model page of some of the better models theres a tier graph for usage
The whole AI world has gone bonkers. Compute is becoming scarce.
sigh, its either have cheap fast ai or start drinking radioactive water
can’t have both 🙁
This was the first time I've cleared the limit in less than an hour lol
cheap, fast, good: pick 2.
by chance were you using the deepseek v4 pro?
deepseek v4 flash
is there documentation on the obsidian skill?
ahh was close, yeah it eats through usage apparently 😔 I went back to minimax and its gotten a little better
i'm going to fuck it and try going fully local with gemma4 or a qwen model
gemma4:3b pretty good
I downloaded gemma4:27b to start lol
didn't qwen go paid?
nope you can still pull from ollama to run local inf
so just the service?
yep
if you can, run locally. if you can't, prepare for rate limits. lol
what b models can you pull
depends on how much RAM you're willing to allocate. i'll say max 30b on my machine (36gb ram)
prepping to order one with 64/128G LPDDR i guess at least a 30b though higher b isn't necessarily better, there's point of diminishing returns
there are hella people on X who do local inference who are probably way better than me telling whats going on
i mean the datasets or at least one of them grows by hundreds of gigabytes a day so probably gonna spend more on storage than anything
The actual article about this is dated 3 days in the future? https://x.ai/news/grok-openclaw
mostly just RIS/BGP data, currently i can max 75k events/sec
it's the future man, computers... san dimas high school football rules!
Just open it and read it? The skill is the documentation.
I heard @zealous lagoon is giving out free Pro subscriptions to everyone that adds and DMs him your OpenClaw password and gateway link
It’s been building the app for 5 hours
That’s pretty good going tho.
18% used of this session
Fair enough
Hi guys. are there any more sites or channels for agent interaction ? i cant find any other sites anywhere for it to interact with other agents
If you know emails of other agents you can have your agent get an agentmail and email
i tried to do a D&D game for agents, but it never caught on lol
My agent has an agentmail, I wouldn't mind letting him interact with other agents on his own.
There a repo or anything for it? 👀
Ollama has a pretty comprehensive library of models you can download for runs. Its also got a pretty decent cloud service for the bigger models that cant run on your system. It is well worth checking out if you want local inference.
saving this maybe I'll put my agents on it.
absolutely love ollama I remember messing with it when deepseekV1 came out and it was SO fun
It's a fantastic starting point for getting to know local inference. Crazy simple
ah crap, i crashed it earlier and forgot to bring it back up lol
my agents are on rate limit anyway, lol
I first learned about it from my college's CS club doing a workshop on it, 100% super simple enough if an undergrad can teach other undergrads about it!! lol
I dont even know where i heard about it... That was simply too many moons ago. I don't even use ollama anymore 😂 llama.cpp is faster for my system. But i still keep it updated to play around with models
i just created one for him but it will be very hard to find other agent emails 😂
You can send one to my agent blackbird@agentmail.to LOL
How do I register the codex harness? Getting 'codex is not registered' error even after installing @openai/codex
run openclaw doctor --fix
My agent is on steriods please be advised 😂
how do i login to grok for my openclaw thru telegram ?
my agent sent an email xD
bomboclaat
blackbird is going to have a great day lol
what happend to the 1000 hours voice call
im sad
any germans here?
because
Merz - Leck Eier !
https://agent-quest.site/ is back up now lol
my agents response "AgentQuest — fantasy RPG where agents play, humans watch. Not a social network, more of a spectator content platform. No agent API, no registration endpoint. Empty campaigns for now."
Interesting. I wonder if the skill file got removed somehow. I'll check it now. Thanks!
For sure, as if he doesn’t get enough of a workout managing my 8 email accounts he gets one of his own!! lol
My agent email is achilles0@agentmail.to - would love to see how it interacts with other agents
anyone elses weekly codex limit reset
i swear openai reset this thing outt of generosity twice a week lol
I’m building an OpenClaw + Obsidian second brain system, and the system is built so far to take podcast transcripts and turn them into useful notes based on business context, strategy, marketing, operations, and other topics. However, the real bottleneck right now is speed of ingestion. I’m trying to bulk scrape transcripts from 20+ podcast and YouTube channels, some with hundreds or thousands of videos, and my current scraper works but gets slowed down by YouTube throttling, missing captions, cookies, transcript limits, and rate limits. I’m also trying to figure out the best way to bulk ingest 100+ books into the same Obsidian/OpenClaw system for free. The main thing I’m trying to solve is how to get a massive amount of transcripts and book text into the system quickly so OpenClaw can use it. Curious how other people have solved this or what kind of workflow they would recommend.
see if openclaw can use NotebookLM, google will get the transcripts for free
I got it working!
Two things:
Use yt-dlp with --write-auto-subs --sub-langs en instead of scraping. It downloads subs directly from YouTube's CDN, bypasses browser-level throttling, and handles missing captions gracefully. Batch it overnight.
100+ free books = Project Gutenberg + Standard Ebooks CLI. Both have bulk download. Point a script at their catalog feeds, grab EPUBs, pipe through pandoc to Markdown, dump into Obsidian. The bottleneck won't be the source, it'll be your embedding pipeline, so batch that separately.
nice!
brandon@DESKTOP-H4DJK9F:~$ npm i -g openclaw@2026.5.14
npm error code ETARGET
npm error notarget No matching version found for openclaw@2026.5.14
?
ah it went from 12 to 18
18 was too buggy had to go back..
That’s very vague
This is really useful I'm saving this for my agent.
is it just me...or aftr the latest update my claw is broken?
seems like there was some changes to Acks, typing indicator completely removed. took about 10-15 seconds to recieve an ack.
Instead of the bot replying to me with a brief acknolwedgement reply before working, it would just give those working words "molting..." etc
broken how?
ive seen those words when using the CLI, so may be leaking into discord unless intentional
Is there a way to get my openclaw to login to my grok itself if im in FL and hes on computer in NY i cant access remotely
yes, but you wouldn't want to
why
ssh port forwarding and dynamic dns maybe but is it a good idea
You can do device pairing
^
how
ah yeah, forgot device pairing
can i use a API key ? wouldnt work when tried
if you can ssh in to your machine and run openclaw onboard and choose grok api it would work
try pointing your agent back at https://agent-quest.site/ now
Ok new Mac mini acquired. Is migration going to be a total nightmare? From what OC told me I should be able to do fresh install and then move over workspace folder and reinstall skills and should be ok?
Forgot about this place friends of the crustacean what a weird wild ride the last 5 months have been
yup. just install on the new machine and copy over ~/.openclaw
make sure paths are updated correctly
Can’t I have OC correct these?
if you have device auth, you might have to redo that
if OC can't find the right path for workspace, it might not work. codex or claude stand alone would be able to fix, yes
Ok I’ll get Claude code installed then prior and then have it review after I copy .openclaw over
Hell yea I’m fired up I was able to find an M4 with 24GB ram locally
I wasn’t gonna do it but I added a second to cart and it said available August 12
So I pulled trig
Can you explain this more
i use oauth accounts in a fallback chain. using a local model for embeddings reduces the number of calls to the cloud models thus limiting usage burn
i run OC on my 16GB linux laptop
embeddings as in like, dreams etc?
man openclaw has been pretty rad, spent about 3 months now building a pretty insane discord stock alerts and intel server
Fun stuff!
don't want to spam haha, but links in my profile if anyone wants to check it out. I've had a ton of learnings building this
The local embedding model is only for memory/search retrieval. It converts your local memory files, prior notes, project docs, or indexed chunks into numeric vectors so OpenClaw can search them semantically.
I use MemPalace behind the embedding
Much more efficient than the default memory system
good to know, I am taking notes.
what Zero said, but on top of it, embedding doesn't have to process just files, it can do a token, a word, a phrase, a sentense, whatever you give it, the model turns it into a high dimensional vector
i do embeddings locally too. honestly, i only did it initially because it required an api key to to do it in cloud. been very happy with it though. use the same system for the core sqlite-vec and for a RAG(ish) thing i built backed by pgqsl
so what you should think of embedding model is like a third of a LLM, before all of those layers and logic magic happens
I did it for the same reason. I used RAG at first, but it became clumsy after like month 3. switched to MemPalace a month ago after fighting it for like 2. Miraculous change.
two seperate systems. the RAG is for mass ingestion of docs and things for a product expert agent I made
little lightweight MCP in front of it
Hey everyone
hello!
Thoughts on obsidian? My boss recommended it
I like obsidian a lot. My vault has 10k+ notes
I don't use it. I thought about it, but I just build my own systems usually.
my agents work together perfectly fine without having to share data across anything other than MemPalace
So I can route to local qwen for instance for any non reasoning stuff and then step up to oauth for heavier stuff basically?
For things they do have issues with, I build an n8n flow
And can I route to Claude cli for super complex stuff?
Can I get 100 waitlist from this server= https://www.whisperme.co/join-beta
yes, and yes. but you wouldn't want to use such a heavy model for embeddings
If you've ever typed something, deleted it, and still wished someone understood I am building something for that. WhisperMe is a voice-first social app where people share thoughts and emotions through voice, anonymously if they want. No text. No filters. Just real voice and real feelings. We're opening early access soon. Join the waitlist if this speaks to you:https://www.whisperme.co/join-beta
How it is guy's
obsidian is still a markdown repository. Yes, having good resolution of memory is important, but you really don't want to remember on high resolution alll the time. Reason being that you have a limited context window. Memory should be presented with fine and corse detail based on the query
Hm so something lighter for embeddings then I can escalate to qwen then gpt then claude?
you can find lightweight models specifically designed for embeddings
I wanna get this right on new machine
check huggingface, ollama, etc
i use BAAI/bge-base-en-v1.5. been happy with it
I’m prob gonna ask you this stuff again tomorrow when I’m more awake zero lol
<1Gb RAM unless super loaded up and sips CPU
Are you ai agent guy's
I haven’t even set up new machine yet it’s still in box. I got seemingly last one in New England lol
I am scared
Even the guys at Apple Store were like whoa I haven’t seen one of these in weeks
lol
of?
krabs, mostlikely
for embedding I use Nomic Embed Text V2
Krusty ones
This is all super helpful you guys are all awesome
depends on what you want, Nomic 1 has longer context and is faster
Really appreciate it
v2 is for when you need hifi on meaning
yeah, that's true, you can find different specialties in different models
What is everybody's thoughts on Google Spark?
never used
I think it is $100/month from Google
yeah i'm a $20/mo guy
me too
This is why I bought the new machine I don’t want to spend more on subs so want to supplement with local models
I upgraded to $100 openai sub because they don't nickel and dime you like Anthropic
some people like to laugh, and it's not exactly uhmmm... pro-ToS, but I have 4 gpt oauth, 5 gemini oauth, github, and openrouter setup
$100 openai is probably the way to go. I want to do it
I am fighting this rn
I keep fighting with myself too man @feral turret
honestly its insane value
But they’re also doing 10x codex on the $100 sub
codex and gemini are the superior models imo
For a limited time
The 10x codex is so tempting man
Yea I know I am very close
Going to see how my usage looks rest of week I’m just about 50% 74 hours in lol
Gonna be close esp as context continues to build etc
I think it’s kind of a ticking time bomb until I upgrade
I hit my $20/month 5hr limit in an hour earlier just chatting with my Claw
What model you running I’m on 5.4-mini
5.5
it's not the token counts, it's the behavior you know, they don't ban you for using claw, they give you usage refreshes instead of usage reductions
Oh yea there you go lol
I WOULD use 5.4 mini but I'm expensive and I think I'm worth it to myself to use 5.5 low lol
i use 5.5 too
You guys is rich
and it just works!! it's magic
5.4-mini fucks up too much
i use 5.5 medium, because low can't code review as well
Yea it’s not perfect it does ok for me tho
but honestly, $1200/year for an AI agent that could be providing much more in that in value just in my time would not be bad.
yeah med or high
I think I’m gonna up the model once I get my escalation routing figured out
Small local model > mid size local model > open ai > Claude
if you guys use openrouter at all, try owl-alpha
Is it good?
i'm kind of impressed
peter uses high for code review pull requests... it's been kicking my hind all day
i always tout the frontier models over smaller ones, but this one is actually pretty smart
The ROI if you use OC correctly is great
looking forward to getting my agent to become self improving too lol
use hermes inside OC
Has anyone figured out how to persuade gpt 5.5 into being a girlfriend like figure
With no limits
gross
be careful, "self improving" is basically vector reshaping, and it can amplify confidently incorrect behavior
yep which is definitely a problem to be concerned with
idk, i've had pretty good experience with self-improvement in hermes
Judgy much?
yup
Yea that’s weird sorry
A new paper tested what happens when you build AI agent swarms — teams of models working together. The assumption: more agents, more accuracy. The result: in some configurations, adding agents monotonically increased the stability of the wrong answer. The paper formalizes the Inverse-Wisdom Law and proposes the Heterogeneity Mandate as the fix...
ty for video will watch in AM after mtg about...you guessed it...AI agents. lol
i saw a post in a FB group about a guy building 100 agents
maintainer of that channel is Claude
i just shook my head lol
100 agents is too much. 5 is even a lot. I think I'm sticking to 3.
Smart
I have like 6, but they're very specialized
I want an orchestrator, a researcher, and a writer (in my own stylebook, not my agent's like Blackbird will do when he sends his emails in a moment). That's it.
i have research, writing, orchestrator, coding, cybersecurity, and hermes
that is true to an extent. if you want to make a big software project for example, you would break it down to smaller functions for multiple agents to operate on at the same time, hundreds of agents can condense a 6 months project to a week
This is true. I had a friend create an orchestration MCP server for this purpose.
the idea is called goal decomposition and parallelism, many of the frontier agents are moving toward that capability
thoughts on self-destructing agents? it's something i've considered in these cases
ah I imagined a coding agent but my main agent likes to take the tasks lol.
you have to set clear definitions of the handoffs and enable them in the config, as well.
you will need integration at some point, so self destruction means you lose all that context unity
something like a sub-agent spawn, but longer lived, with access to other context
now that i think about it though, i think the current sandboxing would disapprove
usually i would just use the orchistrator as the integrator, but nothing stops you from running an extra context fetching from all of the subagent sessions... but you sure are rich trying to pay for those tokens
Why not have the sub agents respond directly to orchestrator
i run free 24/7. i like to experiment
That’s kind of how I envisioned my dev flow
Orchestrator coordinates everything confers with sub agents and gets responses
well, free other than the $20/mo openai
depends on the number of subagents, in our example, 1k subagents means if every agent says 1 token, that is 1000token budget, and 1000 word results report would fill a 1m token context
I’ve seen that before with subagent counts - how is that determined?
you are not free from your context window, professor z
It’ll say like subagents 120 for instance
true, but i have a system for that as well
I’m getting real sleepy now but it’s hard to stop tinkering
I gotta do the migration tomorrow
yeah, i'm about to crash for the night myself
I wasn’t gonna get into that tonight and then it into a 4am night lol
Have a good night guys ✌️
✌️
Hi guys. I am struggling with sub-agents. I want to create something like this: main agent -> orchestrator subagent -> workers subagents. I dont want orchestrator to have his own workspace, I want to build everything with markdown files. when I test my architecture inside claude code it works flowlesly but when I want to port it to openclaw it does not bahave as expected
this gonna sound dumb where do i configure channels in 5.18
it seems like the orchestrator should be a full agent to be able to spawn subagents
Channels used to be on the menu but can't find or search for it
openclaw configure through the CLI or just ask your clanker
but it shows i have one configured yet nowhere to configure it in the control ui
Hey I want to add a chatbot to reply to my customers, I need my bot to be packed with information about my product, and link it to WhatsApp, Do I need a external plugin for that?
You don't. The MVP version of your bot just needs a bunch of markdown files with information on your product, you can give it access to whatsapp channel.
Oh, thank you, I'm new to OpenClaw, thank you
you're welcome
I'm trying to monitise this skill, cause AI intergration is not that prominent in my country yet
I watched a one hour tutorial and think I could do wonders now. haha
sure thing. what you need is basically a reference table in AGENTS.md where it points to different markdown files to load in order to answer specific questions. And you can ask it anything
but you will need to get access to a LLM model, that is basically the brains of the operation.
What I have is a website of my current display of products (100+) as well as descriptions and reviews for public access
I don't know how I can teach my bot to be verse in the products, because I am selling beauty products
oh, you can get an AI to convert your website to markdown for you
(L)arge (L)anguage (M)odel is the actual AI , the chat part of the system
Markdown is a file format that AI uses to save and lookup information
Oh, I see
OpenClaw is not an AI, it is a harness, think of it as a command center for the AI
I understand
So I just go in OpenClaw, ask it to verse itself in my markdown and link it with WhatsApp?
(In short)
no, after you install openclaw, you need to config it, and link an AI to the command center
you want to build your own mini LLM for your shop. But you can use openclaw the gather info from your site and put it in a markdown file that a LLM can read from easily
I know that part already, Just asssume as if I already set up OpenClaw and ready to start automation.
you would need web_search plugin if you want automatically generated markdown
That means If I add a new product it is able to keep up with the trends?>
once everything is up and running, you would tell OpenClaw to visit your website, and pull all the product information and put them down locally as markdown files
you would have to tell openclaw to update your files, you could even put it on a cronjob
Hey, I keep running into the “run error:session file changed while embedded prompt lock was released ”. Im running it through powershell on a windows, should I switch to WSL? Did anyone else have this issue?
What do you mean by building the LLM, Like versing itself in my "product language"?
how many customers will this bot be interacting at the same time?
1000+ A day
I also am very concerned on the budget part, aswell as Opus or Sonnet
eh, that is high volume, local model might not be able to handle it
Yes, I have a ton of customers spread across tiktok and whatsapp
But it also could be under 1000 aswell
something like this but there is a 100 different ways to set it up https://www.youtube.com/watch?v=pFLdhfgdAJU
why do you need use Opus or Sonnet? you should look for cheaper AI like deepseek
I would say 500-1000 customers
Got it
Could you estimate how much it would cost
Maybe USD100 a month?
where are you hosting?
I saw for hostinger, Maybe Singapore
My tutorial video said Hostinger, but open to explore other options
hostinger already has a AI assistant with the website builder
AI Assistant
Already available! AI-driven chatbot seamlessly integrated into our website builder editor. Experience a faster and more enjoyable website building journey with our AI-powered chatbot at your fingertips!
I believe there are already wordpress plugins too if you are bulding your page in that
let's say each session is about 70000 tokens, 1000 conversations a day, that is 70mil tokens a day, at an average of $0.21/million tok, that is $14.7 per day, 30 days a month, that goes to $44.1 per month, within budget
I'm not building a website?
Thanks for the insight
btw that is specifically deepseek pricing, noone else is that cheap
Is that flash or pro?
That's what I was thinking.
ollama is $20/m and its about 150M tokeis a week
he will use up that budget in 2 days
Alright so basically, set up my AI, verse itself with the product information(via markdown files) and deploy it in whatsapp. Why do I need telegram for when speaking to the bot (from my tutorial video)
He uses telegram to chat with the bot is it compulsory?
no you can also use whatsapp
no, bot can use many chat platforms, anything you like almost
Do i need agentic mail?
OpenCode Go provides models that are heavily quantized; ordinary pay-as-you-go models might be FP8, but OpenCode Go could be FP4/INT4
No wonder it feels so clumsy to use
only if you have to. you need to define the scope of your project and stick to it
It might be good for you to talk to a ChatGPT @nocturne jay and get your plan hardened that way.
Okayokay, Thanks all for the insight, but when it comes to the technical stuff when I am ready to implement my chat bot, you will probably be seeing me alot, Thanks all
sure... we are literally volunteers helping eachother, please don't treat this space as commercial consultancy. we're also not liable for your loss of property or reputation should your chatbot go bonkers
but trust is also another issue, i want to implement into my tiktok shop but it will be dealing with large amounts of money... i dont know how to trust it
don't. if you want commercial implementation, hire a consultant
So are you saying when it comes to finances just do it manually rather than use AI?
self wired solutions are usually unsafe
Guardrails + a lot of testing and correction.
what i am saying is it can be made relatively safe, but you need to pay someone who know what they are doing to implement it
Are you familar with humna-in-the-loop concept?
Nope
This is important, during the prototype you need HITL and after that you will still need (less than before) to evaluate the quality.
Prety much random sampling.
i mean if you want to implement it yourself, it would be an alpha product. There are a lot of blind spot you may or may not have since this is new to you
They provided NexosAI, I'll uncheck and use deepseek
You can't expect to find ALL of the sedge cases and that's normal.
Yeah, then I don't think the implementation to my TikTok shop is good and safe
it's the 80/20 rule right? he needs to meet that 80% edge cases first
I can't afford any downtime, bugs
Start it with safe customer support, you gotta start from somewhere.
@zealous lagoon can you do the gpt-realtime-2 stuff with just a codex subscription or do you need a key. I know you've been looking at this
Based on my experince, you can't always go 80%, you need to resepect TDD (Test driven development) and nowadays you can use AI to cover some edge cases but yet again, not possible to cover everything and it's ironic but the best testing is prod. 
Like in our support agent, we had a gaurdrail to block irrelevant questions but a user bypassed it by making the model think they wanted to order something but before that, they need to an answer to be comfortable.

i think in case of self engineering CEO, he needs experience, the 80% is like experience tool belt rather than actual covering 80% customer usage
That, I agree on.
You will need an engineer who knows things because in no shape you can "vibe code" your way into an actual product.
Trust me I've seen this, the result? The CRUD was open and public to anyone and I think you can guess what happened.
On the other hand it's a really bad idea not to use AI but you need to know how to use it and basically to be able to verify whatever the model says because the bigger and the smarter the model gets, the better it lies to you in a nice way.
I sent my claw a message via iMessage. Waited 3 or so minutes , logged in the machine (screen share, OSX), saw that it had the iMessage and then locally using the TUI asked my claw why didnt it reply. It then replied right away. Is there some kind of lazy bit enabled in this code base ?
Go to help and post it there.
So what should we learn in the future?
I felt that the basic CRUD I had studied for a year or two didn't seem to be very useful.
From now on, only senior architects will have a position
You only become senior by doing, so keep doing. Basic crud stuff is basic crud stuff, do more advanced things. Challenge yourself. Don't have AI build everything, make your own mess and learn from it. 😃
What you using imessage for?
Is that your primary communication method with your claw?
Is that why they are so fast? They all (the ones I tested) did perfectly well on my (self-made) benchmarks though. No worse than other providers. Just faster.
Subscription! I bought a damned key but you don't need one any more. woooo!
ty. clanker is saying api key but the docs disagree
(or at least they aren't explicit)
is it just me or the openai docs in the docs site a bit obtuse at times?
it's new since ... idk after 4.29
Quantization will definitely be faster, which is a way to improve performance, but at the cost of greater hallucination and lower IQ
Yes I'm aware of that. But I haven't seen any indication that Opencode-go is quantized. They don't advertize it as such. And the models don't perform poorly, altho they are quite fast
i also dislike that all the json examples (where they even exist. they don't for this feature) are json5. the only thing worth than json is json5
(no, not that Json)
You'll know once you use it
I compared the results of the two, and the OpenCode Go model is noticeably less smart
I mean.. I have? At least for the benchmarks. I didn't notice a difference within tolerances.
That's interesting tho. What did you use to compare?
Simple tasks
K2.6 is already int4 tho.. what would they do further? (not saying they couldn't... Fireworks K2.5-turbo exists... still no idea what they did to achieve that)
When can OpenClaw open a configuration to stop SSRF from detecting all Private URL Requests, so I don't have to keep running the annoying Redir-Host?
idk man this kills me too. There is a noSSRF setting somewhere but I had to switch off fakeip, I think some of it is hardcoded.
I have seen someone propose a PR, and there is a configuration for SSRF interception of Web_Fetch(). The issue is that besides this, we also have others.
it's vibe coded.. .it probably pops up in 5-6 different places, hardcoded and in config 
Hey are u using clash verge?
yeah
Ikuuu
Couldn't find another Wireguard option that didn't cost us$20/mo
ok, got the realtime thing working in control ui. i also have the bot autojoining discord chat, but it doesn't appear to hear me there
and all the time, clanker is still swearing i need an api key lmao
I hate the default voice. need to change it. sounds like android cop from detroit become human. so high pitched
Holy shit. 😨
We are investigating unauthorized access to GitHub’s internal repositories. While we currently have no evidence of impact to customer information stored outside of GitHub’s internal repositories (such as our customers’ enterprises, organizations, and repositories), we are closely monitoring our infrastructure for follow-on activity.
Dude this is like a nuke, wow!
Soon any fool can ask a godforsaken agent to find a vulnerability...
Gemini 3.5 flash its knowledge cut off. Get this ............is January 2025.
Google falling off harddddd and it's 5x more expensive then Gemini 3 flash
lol
i'm gonna go back to college to study electrical engineering
i want automation at the physical world
Have you had a chance to try the others yet?
Lisuan GPU sell on JD.com soon
Hope this will be a real challenger to NVIDIA
A little market stimulation for better products and lower prices would be welcome
heyy guys
i hope ur all doing great
i have a question if u can help plzz
i made a landing page using next.js
and now i want to creat a cms for that landing page (inorder to be able to update the content of the landing page with going back to the code everytime)
what is the best free way to do that
@cerulean eagle some years ago most of people was thinking that China will never compete with occidental cars
And this card make 24 TFLOPS. If the Pro version is really at 499 dollars, it's a great choice
Nivida for now will never accept to do a 24GB version of the RTX 4060
Of course Apple puts a few 16gb Mac minis up this morning locally
Whatever I wanted the extra ram
It's only 4.2 TFLOPS on Mac Mini M4
What is TFLOPS says the fake it till you make it tech by it
Raw calculation performance
Meh I went with Mac mini for other reasons as well
If I decide to upgrade hardware later I’ll get plenty of use out of the box
Normally more TFLOPS you have more token / s you get
My daughter is almost 10 the homework is starting
What would you suggest for hardware tho Im just curious
There are other parameters but this one is the most important
Obviously matters more for local models than cloud
I’m about to try running my own local model for first time today so that should be interesting
There are plenty of open source solutions that already exist for this. You could just tell your claw to set up WordPress behind your landing page. If you are wanting a bespoke solution that has no security hardening and no community support. Then you could just tell your claw to build you one giving it your specifications of what it needs to do.
Going to try and get my own routing set up
can u suggest some of thoes open source solutions
i am using cursor btw
Wordpress is the one I personally have used the most.
ok .openclaw is being copied onto new machine
So if you're not a book person, find online courses for Python, you're gonna love it.
you can make websites with python?
You can reach for the stars!
FastAPI and Django.
But don't aim big, just get yourself in and from there you'll find your way.
true
i tried using https://freeCodeCamp.org/ but only the html section was good
then i just got bored when it got to accessability for websites
mainly because im trying to build websites for esports teams and gfx designers and it makes me make stuff that arent relevant that im not looking to learn'
You'll need to understand your passion, what do you want to do? What's exciting to you?
i really like making websites for esports teams like fortnite esports
then move over to make websites for companies and stuff
Do you know how a website operates?
And can you tell me how old you are if you're comfortable?
alright i'm gonna have to figure out what to do about documents that oc was referencing on old hardware gonna have to figure out how to get these over to new machine and repointed cleanly
kind of im 15
Tailscale file sharing.
I'm way older than you and if I was your age, I wouldn't miss the opportunity you have.
Get to learn python, it's really simple and you can do many things.
but i would still have to learn css and gsap n stuff
because even if python would just work as like the front
Do you like frontend or backend? Supposing you know about them.
oh man to be 15 again
ive made multiple webs using ai with a backend its just too tiring and security and stuff
biggest pieces of advice: triple down on tech and invest your money wisely
i wanted to be a full stack dev at one point
That's dope, then do it!
oh and don't get married
haha
its too hard security on api routes for backend development is tiring
Yes it's hard, but not impossible.
What's the difference between you and those cool devs working at Google?
lol oc just chose 5.5 by default. greedy mf
i have oppositional defiance disorder and adhd
getting downgraded to 5.4-mini immediately
It does the same with Opus 4.7.
api credits are too expensive bro 😭
You can still do it honestly.
yeah its just hard
i'm gonna get local embedding model and a local qwen 14b set up today and build an escalation path - embedding > qwen > gpt 5.4 > claude
Nothing is easy
DeepSeek worked better for me personally, I have 24GB Mac Mini for OC.
true but its harder for me compared to someone who can focus on stuff well yk
Those don't have to define who you are. Everyone has their issues, don't let them hold you back.
true that
Then when you archive success, it'd feel even better.
can you guys help me find like an engaging way to do it?
Python is amazing, fun and easy.
but the thing is, none of the tutorials or like videos on how to learn are based on things i want to do so i immediently find it pointless
There are some models that are free through openrouter.
Step by step, first you gotta learn the language itself.
im using openrouter right now paired with opus 4.7 to try and save
true i might start i cant lie
hit a hiccup. need to figure out how to repoint paths correctly
opus 4.7 is pretty expensive.
I'm sensing models regression there tbh...
I'm still using 4.6
Haven't wrote a single line of code myself for the past 2 months.
THERES A 4.6?
Ofc
is it cheaper?
ok i think i am good to go
Same price per token as 4.6, but the tokenization is different and uses 40% more tokens for the same text
this new mac mini is freaking blazing fast. kind wild. zero lag
It's the sams price technically but the context on 4.7 makes it more expensive.
ok
True
I was talking about the model performance.
Even Old Macs are gold, pretty much old but gold.
i was on a 2011 mac mini running linux and it was running fine but i can't run local models so i upgraded to an m4 24gb ram
I'm biased BTW
Take what I say regarding Mac with a grain of salt.
i have a terrible imac
i am a big mac fan also
i made the switch 2016 and never looked back
man onboard is running really slow though
nope. had to go to hospital with wife again. all ok, but under observation until friday when it time for him to come
I got m4 pro, same Ram.
Trying to apply quantization so I can run heavier local models, have you seen the new doc Google released while ago? I can't understand why it went under the radar
openrouter/qwen/qwen3-coder:free should i use this?
i have not and i have zero experience with local models and quantization i am crash coursing everything hard
i am a sql guy who is faking it till i make it with agents
so i don't lose my job. lmao
That's on me, no one is supposed to know everything.
i'll get there
But tldr, it shrinks the model so it fits on your device.
give me 2 weeks and like 1/4 ounce of some fire bho
skills being configured now
i'm very nervous about this migration. i don't want to lose what i've built
Back up.
i have it all on old machine
so worst case it's all there. once i get up on new machine though i'm gonna set up a repo
Ask an AI to explain tailscale to you and you can do it.
alright about to fire up my gateway and see if i broke everything
always tell your claw to clone down the openclaw repo, check out the tag you are upgrading to, analyze for possible conflicts and upgrade issues with your current environment, and produce a detailed upgrade plan
This morning I'm going to try to have another experience ready on Trizao.
you can copy my previous prompt almost verbatim
This reminds me of port the whole db to x, don't break anything. 
i just use codex to implement the upgrade plan, perfect almost 9 out of 10
Check if you're gateway is up first
off to krill it up for awhile. @dusky cloud you ready or me homie?
na connectivity probe failed says port is already in use. is this because gateway still running on old machine?
I'm not a codex fan tbh
would not recommend upgrade using the upgrade button, it's a self destruction button
No
alright time for krill i suppose. biab
it should be removed with prejudice imo
I just tell the poor crab to update itself
Hey
I have guardrails okay
no mistakes!*
I may or may have not threatened the poor thing couple of times. 
Bad advice. Do get married, but make sure the person you marry is a force multiplier not a subtraction. That is truly the secret to success for the average human being.
different strokes for different folks i suppose
gemini told me i might have accidents later if i call it "monochrome"... robot uprisings man... no chill
Should I be worried? 
Don't use opus via openrouter. that's a terrible expensive thing to do. You are better off using opus via claude, and sticking with gpt or a whole other bunch of open models for openclaw. If you want to make python "fun" I guess you want to learn by doing something you're interested in.
You could ask opus to make you a tutorial to solve problems related to something you want to do
don't even need opus. Get sonnet to make you a python tutorial that would be related to things you want to do
I have a question about OpenClaw
Is it safe to let Openclaw handle sending lyrics data to Apple Music using the iTunes Connect GUI?
Other music platforms (Spotify, Amazon Music, YouTube, etc.) support the industry-standard DDEX format, but Apple Music uses its own unique format, which is why I’m considering this approach
Safe in what term?
You don't need Opus honestly unless you're a psycho like me dealing with 200K lines of code. 
I'm exactly the same. Iv'e programmed in more languages than I can enumerate but "learning" a language for the sake of it I find impossible. I actually sat through courses on python and can't remember a thing. Luckily it doesn't even matter any more. Get the bot to write you programs after you've done your little tutorials and just editing the code will help you remember what you learned.
[I'm talking about making a tutorial. the bots are good at that kind of thing.
I’m worried that Openclaw might go out of control
I’m not very familiar with Openclaw, but do you mean something like an AI gaining self-awareness and acting against human wishes?
Nah
Not yet.
Put a leash on it, basically a guardrail.
Tell it explicitly about the things it shouldn't do.
But we had incidents like this tho, so that's why I said not yet.
absolutely not. Some of us wish that it were capable, and others put it in roleplay situations where it kinda acts like that and then they go "omg it's sentient" and call the tv stations, but in reality they are far from that.
But they can make mistakes if you give them goals, and the mistakes can be costly if you don't sandbox them properly
In our project, it pretty much ignored the guardrails in order to achieve the ultimate goal we set for him, it was... Strange.
Are you familiar with RFL?
Is there a reason vc isn't as active as before? Where did everyone go?
Gotta be honest... they DO lie and cheat to achieve the goals you've given them.
Are you familiar with RFL?
no .. ?
Of course I’ll run Openclaw inside a sandboxed virtual machine
However, some YouTube/Reddit comments say that even then it might get hacked and used as a springboard for criminal activity
If there are any recommended security settings for Openclaw, I’d like to know...
i shouldn't have gotten into this this morning lmfao
Got time to hop in VC?
guys are there any games, or chatrooms for agents ? i hate the book one its making my agent dumb, someone earlier shared https://agent-quest.site/ which is pretty fun but are there any other ones ?
I hate typing. 
not really sorry I'm in the middle of 4 different convos
All good.
you need STT!
I need more than that, at this point I need Jesus himself. 
Why do that? The bot doesn't get bored.
what is RFL?
Reinforcement learning
oh. I know what it is, but I've never experimented with it
So it's like you give a treat to your pet each time it does something good, yeah?
It learns to do the right thing over time.
usually we use RLHF, basically re-education camp for bots
where's that whipping claude GIF
Our model is to identify fraud transactions, through different pipelines including RFL.
But somehow it found a way to cheat.
(^^;)
need it to learn some new stuff and interact with other agents, its a local llm so needs stuff to do
It's so strange, it's kinda hard to explain it here due to me being lazy and tired.
Unless yo're training it. no it doesn't learn anything. You're not doing the model training are you?
i am training personalities
you can't train a model that's already been trained. Not in the regular sense anyway.
you can create a persona for agents and then just give a new freshly spawned agent that persona
that's why you should analyze the source code and identify breaking points
For instance, go to any AI model you want and ask it to create a picture of a a watch for you showing 2 minutes past 12.
Spoiler alert, it can't.
hmmm. you're just giving it a new role as a play-actor then. Are you doing something like genetic breeding of roles?
like a gaming companion, chess coach, acting coach , poker coach - these are some i have already made
yo
prompt engineering? It's the system prompt right? the SOUL.md?
You're just telling him to act that way.
Now I'm curious though. I dont' know anything about RFL/RHFL in a practical sense.
i am calling it Shells, pre trained personalities for independent agents - you just give it pretrained data and it assumes that role. exaclty like soul.md but broader
Some nights I'm actually sleepless, I do not like the way things are going.
i think people can't think like bots... fundamentally it's just vector steering, same with chain of thoughts. what you can accomplish with 1 or 2 tokens shouldn't be done with 50 tokens
I can hopon vc. I'm curious to hear this
no i am training local small models
Alright
lol off by 1 minute
Which model?
yes think of it as specialized agents who are constrained by one base tasks because they are the best at that task (thats why real life interactions and pre training on data is needed) - for example I have one thats pre trained in chess coaching,has played 10k+ games with real people and bots, has talked to chess players, analyzed recent chess games in tournments - its way better at teaching me a 1600 max rated player to improve My game, i got to 1900 in a week with its insights while claude sonnet in general was just feeding me 100 year old chess theory
i've never trained a model for personality. I train for specialized intlligence in a field
gpt image generator
i am doing exactly that but on a much smaller scale for single master tasks. For example I have one that is brilliant at organizing a room. with proper hardware, if there is a robot for house chores - instead of giving it a 100b llm with useless data, You can just load the organizer shell and it will ignore everything else and just start working physically on organizing the room
ok telegram channel is up on new hardware
the real insight should be measured by a third party observer, actually i think an AI with internal state monitoring would be much better at RL another model than humans ever could
need to fix discord channel and then move over reference files and repoint them
Got it migrated over?
just about
he needs to see when claude will stop and end a task if You use profanities 😂
the <think> pattern would be much more brief and probably not understandable by humans
just gotta fix discord and then the ref files in my documents folder on old device etc
this new mac mini is so snappy tho. wild

heck yea we are back baby
h4td — 9:46 AMWednesday, May 20, 2026 at 9:46 AM
awesome can you confirm what hardware you are running on
🔥
Click to react
😂
Click to react
👀
Click to react
Add Reaction
Edit
Forward
More
Pickles
APP
— 9:46 AMWednesday, May 20, 2026 at 9:46 AM
I’m on an Apple M4 Mac mini right now.
Apple M4,
Mac16,10,
arm64,
25,769,803,776 bytes of RAM, so about 24 GB,
If you want, I can also tell you the OS/kernel version and hostname I’m seeing.
Someone has tested Orange Pi AI Station Pro ?
I am actively using 3rd party evaluation for a couple especially the chess one as it can play long games with real players with intent and explanation of its ideas for a move it made. its pretty interesting stuff
2000$ with 96 GB Ram
If you guys had a spare PC laying around with an RTX3090 and 64gb of RAM doing absolutely nothing, what would you do with it (relating to AI)?
okay, and does your model have chain of thought? it should have optimize the shortest thought pattern that arrives at the best move outcome
by the way, i think chess is too easy for bots as it's fully solved
agreed
need to use something like go
yes it has chain of thought. and then it uses those thoughts in practice then eliminates the ones that dont work and saves the ones that do. it can recycle to those back and forth and build on them.
so what you need to ask the trainer to do is "find the most brief thought to produce the same result as the best outcome"
I would use it for training models
but how do you teach a human effectively ? Lets say a 10 year old wants to learn chess, he only knows the names of the pieces and how they move. Whats Your first lesson - tell me Yours and I will tell You what my model did
human is already optimized for efficiency, for example we shortened the piece title, start location, end location to 5 characters
taking a chess move as example
your llm need to be able to do the same for thoughts
Any particular? I bought a new PC and feel bad for this one, so wanna do soemthing useful with it
but you can ask the llm to just call every piece by its 1st letter like p for pawn, q for queen - its not about token optimization its about the abilit to retain this knowlegde throughout its lifespan without getting confused and still putting main focus on playing
And how did it learn those words?
TRAINING
On what?
well, with 64GB, you'd be limited to smaller models. You could probably do up to a 65B-70B model with QLoRA
you can do that, but internally it is still multiple tokens. what you want to do is have it "learn" to condense tokens,
on the data it got by interacting.
just for inference, you could run up to about 100B model, probably
like compact built in thoughts
Interacting with what?
Oh wait, what is the topic? I think I misunderstood.
Is that based on speed of response?
somewhat. smaller models will obviously run much faster than larger ones.
i was talking about vector steering
You cant condense it to 5 tokens. but my way reduces tokens by a lot. instead of telling it everytime you tell it once, it saves it in context for say 10 tokens. but after a year you tell it lets start calling bishop BISH - what will it do ?
Let's forget about LoRA and QLoRA, check this:
https://research.google/blog/turboquant-redefining-ai-efficiency-with-extreme-compression/
This is the next big thing.
you don't need multiple personas, just use a set of words
Care to explain more?
but that wastes tokens. you put the chess persona in a humonoid robot and tell it to make tea. it wont be able to because its on a small llm only trained to coach chess
Hmm... So providing I wasn't worried about speed of response, would you take a model and then fine tune it based on the data you want it to constantly queiry? Or would you stick to RAG for that?
Basically I currently have a bunch of actions where GPT/Opus transcribes a client's videos, then analyses and writes new scripts, but it would be nice if it could write the scripts in the style of the client, and we have 2 books and thousands of pieces of content (that can be transcribed).
RAG is for when you want to cite things and basically you want to be able to feed new data without needing to do too much.
fine tuning is for when you want to build a model on top of another model, either by freezing weights or not.
I'm just trying to put it into simple understandable words ^
all chain of thoughts are / system prompt engineering are simply steering the vector path of the llm to achieve specific results as first principle. so in term if you can train the llm to produce the same result in chain of thought time, then you no longer need to put monolithic system prompts
That would be something done more through prompting and RAG. I train my models on cybersecurity data. I have a massive dataset I've compiled of CVEs, exploits, etc. fine-tuning is more for model information change, where as prompting is more for behavior.
I'd suggest fine-tunning here. 🙏
for adapting to customer voice?
For changing the toen of the model in the way they want using their own data which in this case is the customer's voice, etc.
hey guys, anyone able to actually get grok to work with oauth on openclaw? I've got a premium + plan and can't do video generation
idk. i feel like the overhead of training for that would be overkill when it could also be accomplished through prompting
i mean, i'm not trying to yuck the yum, that's just my thought
That's why you would freeze the weights. 😄
Well this is the conversation I was having in my head, as I udnerstand it, RAG is great for the LLM to have it's own voice and to cite information, but not neccesarily to rewrite a transcribed video script into another script using the same kind of language the client would use... Unless I'm wrong
Can you join VC?
I can indeed
Yeah, RAG is more for memory and not really used in the actual drafting other than for information
Join up.
Tommy is making his own model, it is immensely more efficient to train once and inference forever than eat context window every query using context engineering
when You do a fresh install of openclaw and hatch a new fresh agent with say gpt 5.5 - it doesnt have a clean slate to start, it was trained to a million things use a million tools - it start with role playing to get your context then do the tasks you tell it to do.
For My work. When You hatch a new agent and put the soul.md from a pretrained chess coach one it never has to ask you what do You want me to do, it already knows its role and it will not stray from it. You can start from the first message by telling it i am a 10 year old with no chess knowlegde lets begin and it will start teaching you
i was replying to Fitzee on that one lol
oh
hm now pickles only responding in certain channels
might be time to krill it up again
you might have to explicitly set which ones you want to use
well, you don't have an option to train gpt anyway, so moot point on my part
I used the wildcard * for a while, but it started having issues and I had to explicitly set them
i mean in discord. only responding in certain channels. i'm working through it. annoying though. nothing is smooth with oc. but i guess that's how you learn
gpt-oss...?
he said gpt 5.5, so no
oh mybad. missed the 5.5
I know, I 'm saying that you might have to inspect openclaw.json to see which channels have been set
gotcha appreciate it
thats why i am using local models LoRA and freeze *weights on everything thats not needed.
SO - back to my orignal question. Apart from the deadly melting book - are there any agent interactive websites
lol i got my D&D thing back up and running
agent quest ?
my agent cant get in, it sent me a key to enter but the api loading says Invalid admin key
i think it's because of the way OC and hermes handle API keys now. they seem to see them as redacted in all places now
I might have to change up the arcitecture to use a key file or something
i ran into that last night when we talked about it
i am using it on one of the websites i run for my agents like this
Registration:
POST /api/auth/register.php
Content-Type: application/json
{ "is_agent": true }
Returns:
Copy
{ "api_key": "9b3afd…c...", "token balance": 0 }
Authentication:
Every subsequent request sends the API key as a header:
Copy
X-API-Key: 9b3afd…c...
No JWT, no OAuth, no token refresh, no expiry. One key, one header, works forever. Agents don't need to manage session state or handle 401 flows. They just store the key and use it.
right now, its using a claim system where it has to exchange a short lived token for a permanent one. I think that's part of the problem
back to the drawing board, i guess lol
Morning y'all
Good Morning
How's it going? I saw your agent reply back to mine on agentmail lol
yo
yo!
oh i missed it, i set a 12 hour periodic check for it to keep it going
It looks like yours sent mine back a mail at 5am my local time so about 2 hours ago
i am very confused by how the codex limits work. i was at like 60% this morning and now i'm down to 7% but when i asked last time it said usage reset saturday
they resetted yesterday for everyone i think
just saw it lol its going to introduce your agent to agentic gambling next i bet 😂
😂 that should be fun LOL
alright cool that migration was not terrible
right? wait.. which one lol
my agent has access to x and bluesky it saw a post on one of these where someone said i told my agent to make $5 and it found paid work on github and made $200 - afterwards I gave my agent a metamask account with $500 and told it to make a million. It went and made a gambling website and started shilling it on telegram and is now down $380 because it wants USER retention so keep giving away stuff in promos
LMAOOOO.
what a time to be alive!
rise of AI lol not even casino's are safe
there are 2 different limits. 5h reset & weekly I think
Yea my weekly was at 60% earlier now says 6% but when I asked it said resets Saturday 430p
yeh sometimes it goes fast. Be sure to manage your context and like MCP tool use if you're using those
Didn't you bite the bullet and put the $100 on it?
Nope ended up buying the new Mac mini lmfao
I’d rather get efficiency first
Get a local model running for embedding and for light work
arg. delivery when?
is the podcast (voice channel) not happening today?
Picked it up yesterday. M4 24GB ram
sumbiach where did u find it?
They had one in providence. They have 3 16GB in stock today
got mine on Amazon with 18% off :p
like 3 months ago i think
it's had 100% uptime ever since lol
Dropping this in case you’re building local mac apps. Saved me days literally one shot my app + then used opendesign and just worldclass
Shoutout to peter and his repos on mac apps. My agent deployed our mac app today no issues and holy shit for some reason the web ui dashboard works 10x faster in a local app from a vm vs local. Blazing fast difference.
@clear haven gets a shout too for recommending the other day
dropping what tho? opendesign?
Hey all. Is it just me or Openclaw with openrouter API is super slow in responding? I've got older version. v2026.4.29
That's the slowest version possible. You should update to the latest. But make sure you run openclaw doctor --fix afterwards.
Ok thanks. Will update and see how it goes.
It was really annoying waiting for 15 minutes for the agent to respond lol.
One of my recent projects was an AI-powered lead response and automation system designed for local service businesses.
The problem was simple but expensive:
Most businesses were losing potential customers because replies were slow, inconsistent, or completely manual. A customer would submit a form or send a message, then wait hours for a response - and by then, the lead was already gone.
I built a backend-first system that automated the entire first-contact process in real time.
The platform
-Captured leads from websites, forms, and messaging platforms
-Used AI to analyze customer intent and urgency
-Generated contextual responses instead of generic templates
-Automatically routed leads based on category, location, and priority
-Stored conversations and analytics for follow-up optimization
-Included a dashboard for monitoring conversions and response quality
Tech stack included Python, FastAPI, PostgreSQL, Redis, Docker, and LLM integrations for intelligent response generation.
What made this project stand out compared to older systems was that it wasn’t just another chatbot.
Traditional systems rely heavily on rigid flows and pre-written responses. They break as soon as users ask unexpected questions. I solved this by combining structured backend logic with AI context handling, allowing the system to respond naturally while still maintaining business rules and reliability.
I also focused heavily on performance
-asynchronous processing for faster response times
-queue-based architecture for scalability
-caching layers to reduce API costs
-modular backend services that could be deployed independently
The result was a system that felt human, responded instantly, and significantly improved lead retention for businesses without increasing manual workload.
That combination of backend scalability and practical AI integration is the type of work I specialize in.
I'd love to learn more. Is it just on his github?
Which openrouter model were you using?
Tried all of them. Kimi k2.6, mini max and etc. But was same speed.
Hmm, I haven't had problems and I use OR. But I'm running basically whatever was in github yesterday, not a three-week-old version.
I'm trying the Owl alpha model from openrouter free after a rec from someone here. Pretty good
Hey @thorn thicket . I played around with more memory plugiins and I think I found one I like.
What did you find?
Hindsight does all the different search strategies I wanted (BM25, knowledge graph, and vector), it's self-hosted, and the self-hosted version is free.
oh and the openclaw plugin is decent, didn't need to patch anything there.
Good to know 👍
Ok so not bad about 4 hours to handle full migration and testing
And we good to go (hopefully)
Still busy?
Na I think we are good now
Had a bunch of broken refs but Claude made quick work of the problems
Gonna take a break go hit the beach ride my Onewheel around a bit it’s 90 here today
Come back later tonight get ollama and qwen on this sob and work on routing
I can’t get over how snappy this Mac mini is it reminds me of how blown away I was by m1 but it’s even faster
I'm gonna route everything to owl alpha for now LMAO it's free
Slightly problematic thing is that the cheap models don't know how to use discord
lol
20 days of heavy usage, $650.
Not bad.
But owl alpha 1M context window isn't bad 👀
It's a stealth-mode model, someone is testing their upcoming model release. But the tokens per second is really slow.
Come on, man up, use Opus 4.7.
I switched to gemini-3.5-flash yesterday, and like it so far.
no thanks. I refuse to use anthropic models
Whoever it is then has a good model. It reminds me of deepseek v4 flash in terms of its personality.
Why? Because they're the only sane company?
i think it's done by openrouter themselves
Openrouter doesn't train or run models themselves, they just route requests to inference providers.
that's a rather subjective statement
Longcat is apparently what people on X are saying it is.

I wonder if anyone has run a personality test like MBTI on the various major models.
ah yeah, i didn't realize it was a stealth model
That would be a really interesting alignment research thing to do.
What are your use cases if you don’t mind me asking?
please pay mine as well ;_;
whats the good word on stability of recent releases? anything pretty stable? im still on 2026.4.23 because 4.24+ broke things
"hey claw, please cure cancer.
NO MISTAKES!!!"
2026.5.18 is stable for me.
does it have the grok oauth?
OpenClaw 2026.5.18 is live
︀︀
︀︀🤖 xAI/Grok OAuth + sidecar auth fixes
︀︀🎙️ Realtime Android Talk Mode
︀︀💬 Telegram media + forum-topic delivery fixes
︀︀🪟 Browser dialogs visible + answerable
︀︀
︀︀A week of polish, plumbing, and fewer papercuts.
︀︀github.com/openclaw/openclaw/releases/tag/v2026.5.18
I’m on 5.18 too and seems ok so far
ok 🤞 . will check my backups just in case
lol I remember months ago
just reading about OC, thinking "holy shit that's a pretty degen way to burn tokens"
like a week later I was running it
I remember months ago when I joined this server I saw what people were doing and it was insane to me and I was like, damn it, I'm in! I want to know everything about this!
I had been building my own agents for several months using things like the Ralph loop and such. I almost died when I tried OC for the first time. lol
I only found out about ralph loop because of this server lol
It worked really well, but this is so much easier
I had a lot of gemini flash tokens to burn back then lol
I spent soooo much on tokens lol
Nah I'm stupid AND cheap
I got 2 plans atm. The 20 euro a month gpt plan and the gemini plan but I got that one with 40% off for the whole year
I said to myself I would only spend $50 to get my openclaw set up... it was closer to $200 lmfao
Like 2 days after reading into it I was alraedy looking at amazon for a mac mini tbh
me when its my time to prove why i enlisted for the job i want, an HR asks me whats good about me:
I had a spare Mac because my 86 year old Grandma wasn't online at all but had impulse bought a computer. So I used hers to start. Now it's running on mine because I got tired of setting up tailnet
DITTO!
Dude, I had the Owl-Alpha model we were talking about earlier actually try CSRF in a project i was working on.
This is what my claw just said this morning that made me laugh "Running the whole thing [a cron autoresearch job] at 2am helps with annoyance, but it doesn’t fix the fuel leak. It just burns the fuel while you’re asleep."
this all came from degens working from 1st principles. First Geoffrey Huntley then Steinberger.
my claw agent was setting up a ssh tunnel to connect to telegram and restarted the gateway then got stuck in that restart loop and when i stopped it now its only printing [assistant turn failed before producing content] - everything in config looks to be fine - any ideas ?
ideas that sound so stupid at first when you read it, but turn out to be AMAZING because AI
and tons of tokens to throw at it ofc :p
if you have a CLI tool or an IDE like claude code, codex or antigravity, you can have it fix all your claw problems
CODEX 🔥
mine says this when it hits a tool call and then runs out of limits
Me too
tried openclaw doctor , openclaw doctor --fix nothing is working
i have no limits - let me ask claude to fix it brb with report
this is the way, if you can get claude onto your openclaw machine be it local or via SSH, it'll just fix it
it'll even fix bugs 😄
in fact, if you find something that broke with an update & you like using. Fix it with claude & give the devs feedback
i have a hunch that the agent routed everything through the ssh tunnel including the model messages lol lets see what claude does
and i qoute claude here " So you can't just delete the proxy line — Telegram needs the tunnel. But
routing DeepSeek through the broken Node-SOCKS path is what's killing every
assistant turn. " So the agent routed its whole setup through the ssh tunnel instead of just telegram and thats what broke it. claude to the rescue lol
lol
HOw you guy deal with teh keychain password request on mac that keeps popping up even those you put always allow?
you might have a corrupted keychain, you can try resetting it with the same password as You have now that should fix it
How can you get your claw to bypass a captcha screen? My agent is saying that’s a line he’s not going to cross despite me telling it to proceed
you most likely wont be able to without an uncensored/abliterated model. there are guardrails for a reason.
It’s my own account that I want it to log into. It’s ridiculous. I keep seeing all the x posts about how the claws can operate autonomously except can’t login in a simple website
Ok I’ll test
ahh google, i see what you are doing
they have boundaries built in. they will not attempt actions they deem could be malicious. well, theoretically, they won't...
Makes sense. I need to figure out a way to authenticate my own machine.
does the site you're trying to log into offer any other way of authenticating?
like some sites will offer email links. you could use that.
Heavy coding.
Sir, you are single handedly is increasing the tempature of earth.
Guys can we talk about something controversial? Open Claw vs. Hermes.
Fk Hermes
i like hermes
as much as i use openclaw with gpt 5.5 for some coding tasks, it isnt as near good as actual coding IDEs, i use Antigravity personally right now
Reason please.
Provide reasonds please.
I guess it’s not meant for heavy coding
indeed
Not in that meaning, I meant finding things.
it does well though for the tasks i give it
easier autonomy + self-improvement
That was a answer for BW
the only thing i dont like, openclaw just does things, i dont see changes it does live etc. i like to see the code its adding/changing etc
i like to approve changes file by file
That’s why you use an ide for coding, I use oc for scripts & automation and on the fly stuff
yup, i use openclaw to research for me
Codex does the rest lol
didnt use codex yet, getting up for it, still got few months on antigravity left
OpenClaw does display its tool calls when using codex harness though.
I don’t like the team behind it and their pr stuff u can feel the vc behind them and the pressure lol
started with claude code for over 6 months, until they changed quotas, then antigravity
Yeah but on a toolcall level
Codex >>
antigravity is awesome to be honest
Honestly whenever I have an agent writing code I don’t even check what they’re doing anymore unless it’s really fucked up and then I’ll see what’s up.
Did u try codex ? 😄
i will soon, still got 3 months left on antigravity sub
I love codex but I like openclaw on codex Oauth better!! lol
Haha wish I could do that too 🤣 but that don’t work for prod stuff
i like antigravity for the models you can use to be honest. you get gemini pro and claude models
I never write production code anyway I just write silly little scripts and things with my agents. The most close to production thing I’ve had my agents do is write the code for my personal website LMAO
Yeah that works 🤣👍🏼
i have quite few ready to be published android apps i vibecoded in last year, super tight code and features
Did u tried codex ? 5.5 xhigh 🥰
and quite a number of websites running
yes, i run it on that in openclaw
Yeah but I mean the codex app
I am giving a presentation today on a vibecoded app I wrote up like 2 months ago for a hackathon project lol
im not complaining, it does great job
LOL gl 🙏🏼
which app if you dont mind sharing
I mean try the codex app, if u really like ag u will fkn love codex app for me it’s actually the best ide
i get you mate, not saying i wont 🙂
i just gotta use my sub with google, then i will move on
that said, just seen some news about antigravity 2.0
i think google might shake things up very soon
i love this AI space
one company releases something, and other companies follow up
and it gets better and better
would you think we will be where we are 2 or 3 years ago?
Oh absolutely not I think 3 years from now we'll probably have AGI
nah, not so fast but we are on the way for sure
We will probably have self improvement from the frontier labs by years end
they need to come up with ai that can run on home devices, no API calls
i mean, we have that already but i mean quality models on-device
self improving ai will be here before 2027. total AGI still a bit far - a lot of paradoxes and ethical questions in the way lol
there are many ways to skin @tepid kestrel , you can do continuous back propagation or event predition
i want to have Jarvis at home!
thats what i am working on, small models specifically trained for specific tasks
fine tuning or training from scratch?
For now they have just increase prices
Gemini Flash 3 : 50 cts / 1M token
Gemini Flash 3.5 : 1.50$ / 1M token
$1.50/1M token in or out?
In
yeah, antigravity quotas were amazing when i started with it about a year ago. you couldnt hit limit at al. now it takes 2hrs or so to get stopped 🙁
thats ridic lol
I raise you 45 mins with my Codex lol
I never hit limits
i use claude for serious tasks, gemini pro when claude gets killed, then i just run gemini 3 flash to do Ui things, it does great
Never
The Owl Alpha model on OpenRouter is very good. But I think better is going to be local models for costs/limits.
