#gui-automation
Set up 100s of Clawd bots without touching the terminal.

You can now do it in just a few clicks.

I am adding them all to a groupchat.

Maybe they can start a company by working together?
well done man
😡
hope this will be a good read for folks in this channel: https://x.com/francedot/status/2016627257310384554?s=46
U guys are monsters. wtf
When I ask clawdbot to automate using peekaboo or cliclick, it always clicks on the wrong coordinates. Is there any way to fix this?
I can’t seem to grant accessibility access to peekaboo. What am I missing?
we have a PR pending review from Peter here that should fix: https://github.com/moltbot/moltbot/pull/1946
Every time I type "1" to clawdbot via Telegram, it starts over, says it just came online, and forgets everything. Why?
Dude, he can't afford the Ai cost for that, it's click bait 😄
Hello everyone
I would like to have my clawd run tests on a Windows app (clicking buttons, changing screens) while it updates the C# code.
As it’s not web I cannot use Puppeteer/Playwright. What do you use for this use case?
while we still work on exposing an OpenClawd plugin for CUA, this could do: https://cua.ai/docs/cua/guide/get-started/set-up-sandbox - on desktop we use win32 automation apis
For GUI Automation, I added Chat interface on ClawdBody. You don't need to setup Telegram/WhatsApp. You can directly chat and make OpenClaw do things now!
Nice gonna make expo app and chat with it STT
Hello, does anyone use Telegram in groups? I want other people in the group to be able to talk without mentioning the bot and for the bot to be able to respond to them. Is that possible? I tried several ways, such as disabling privacy, allowing open, and group policy open, but it still doesn't work. I am on the beta version and it still doesn't work. Does anyone know how to fix it?
I do! It’s possible 🙂 it’s on your bot settings as well
Does this AI editing actually work well? And how many API tokens does it cost?
Adding some cheaper options like Kimi and OpenAI so the cost will go down. Right now, it uses Anthropic
has there been any discussion here about computer-use models that run locally and use CV to screenshot the GUI and coordinate tool-calling actions?
Microsoft Research announced in November they were using https://huggingface.co/bartowski/microsoft_Fara-7B-GGUF
it's based on Alibaba's Qwen2.5-VL vision-language model and has been optimized for Microsoft Windows usage. if anyone wants to collaborate on implementation and some dev, let me know, i'm currently looking into this space
Does Openclaw excel with GUI access? If I were to set it up on Arch Linux (terminal only; no GUI desktop) would it be much more limited in what it could do? Or could it be nearly as effective with just CLI tools?
I just made a new tool so that the bot can take screenshots and click on stuff on my computer yay
hey - looking to run an ios app on my computer and have the bot click around it, like a product showcase - i tried cliclick but it's a bit slower and not great with UI gestures, especially for iOS (running as a mac app downloaded from the mac store)
anyone know of any other good software for giving it UI control?
nice, i'm gonna check
i haven't published the code or anything but if anything your AI can always build something similar
I'm currently working on that very code, i have a multi monitor setup and when i ask to take a screenshot it just uses my default primary monitor so im adjusting the code so i can pick the monitor
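A minimal sketch of the monitor-picking idea, assuming the capture library reports each display as a rectangle in virtual-desktop pixels. The `Monitor` type and the `pick_monitor` / `to_global` helpers are hypothetical names for illustration, not code from the tool described above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Monitor:
    """One display's position and size in virtual-desktop pixels (hypothetical type)."""
    left: int
    top: int
    width: int
    height: int


def pick_monitor(monitors, index=None):
    """Return the capture rectangle for the chosen monitor.

    index=None keeps the old behaviour: use the first entry,
    which is assumed here to be the primary monitor.
    """
    if not monitors:
        raise ValueError("no monitors reported")
    if index is None:
        return monitors[0]
    if not 0 <= index < len(monitors):
        raise IndexError(f"monitor {index} does not exist")
    return monitors[index]


def to_global(monitor, x, y):
    """Convert a point inside one monitor's screenshot to virtual-desktop coordinates."""
    return monitor.left + x, monitor.top + y
```

Clicking at the screenshot's local coordinates without the `to_global` offset would land on the primary monitor instead, which is one way a multi-monitor setup can produce clicks in the wrong place.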
I've set up 2 bots on discord, its way worse than setting it up on any other platform ive used
yeeeeeeees
i hate how discord relies on exec-approvals.json and i have to fight the whole exec approval system, it's such a mess
i think i've set up maybe 8 or 9 clawdbots across all the systems i've messed with
just to do some toggle boxes is crazy, using one openclaw just for discord stuff
lol my latest discord bot was just a vibe coded fun project where my friend had an idea and Im like fuck it ill just code it for funsies
basically the bot will use its heartbeat to generate a report of new images, compare it to me and my friend's preferences and post them on #suggestions, then we'll reply and rate the images 1 to 10 where the bot will then log it and track our likes to fine tune suggestions. finally we have this other room where we create and define "face templates" and we're like "hey bot I want you to apply this suggestion to so and so face" and it'll queue it up with ComfyUI and post the remixed image in another group lol its silly but it was fun to code
hello guys, finally got my claw up and running after days of debugging. any recommendations for a vps setup, and which skills to install that actually work well? thank you!
who's had success so far training Openclaw to take over their work? this bot couldn't have come along at a better time, since it integrates nicely into a little project i've been working on to enable LLMs to pilot my work laptop in a way that my boss can't detect
I've had some success with this https://github.com/microsoft/WinAppDriver
Working to do that now! How are you doing that?
I turned a raspberry pi into a bluetooth HID receiver, and wrote a program to let an LLM send mouse and keyboard inputs based on what it sees. I just added api server endpoints for openclaw to send HID commands directly to the target computer, and though it takes a lot of training, it's starting to get the hang of things.
How are you feeding it the images/capturing the screen?
just an HDMI capture card. the LLM/Openclaw takes a screenshot and bundles it with my prompt/its instructions, before deciding what inputs to send.
before openclaw I was interacting with the LLM directly in my program. unfortunately it wasn't as reliable with complex tasks involving many steps, despite some features for breaking larger tasks into smaller simpler ones and executing them sequentially. now I have openclaw for that, and it's not bad
you mentioned you were working to do something similar, what's your approach if you don't mind me asking?
My solution was actually to give up 😂
I don’t have a company laptop, so I considered capturing the desktop and streaming it back to a dedicated PC where OpenClaw could do the same thing you’ve set it up to do. Ultimately, I decided not to and turned OpenClaw into more of an assistant. Right now it just adds stuff to the cart for me.
I’ve been working on giving it access to my work email so it can help draft replies and manage quote requests and vendor communication. I only recently got my work email running on the dedicated PC for OpenClaw, but I’ve been stuck trying to get it to reliably browse and manipulate the browser or desktop. I got sidetracked until I can come up with something better that lets it use the entire GUI.
I’ve wanted to integrate the built-in Windows accessibility voice commands, but I haven’t had the time. Then I planned to take a weekend to integrate Windows-MCP and build a skill for OpenClaw to use, but I haven’t gotten around to it.
I did hand the task to OpenClaw at one point, but it got stuck and never responded. I had to terminate it and delete what it did, since I wasn’t overseeing any of it—and what it produced looked like garbage.
I think your screenshot method is the best choice right now. I’d love to get windows-mcp working but that will take more time.
How fast are you able to get it to work with the screen shot>llm>coordinates method?
Also are you using a model that excels in coordinates? I think I heard Fara is good at using coordinates or something of the sort
that's rough. you could do what I'm doing if you brought a laptop to your workplace, but it might be a bit difficult to explain to others haha. when I'm directly prompting the LLM I use GPT 5.2 because it's quite good at formatting its responses in json. It's too expensive to use with openclaw though, so I'm trying different models with mixed results.
it's quite fast to send the screenshots and respond with inputs, with the longest response times being around 20-30 seconds
that's both when i manually prompt an LLM or when I let openclaw take the wheel
Hmmm I guess I could bring a laptop and do it that way. Gonna try and get something up and running with that screenshot method and 5.2 to see how it works out!
Let me know if you find another model that works for you! I’m also on 5.2 and to be honest I’m not loving it. Feels like I have to prompt it over and over and it fails to call tools sometimes.
Idk will report back if I get around to it.
will do! i had some issues with consistency of output at first, but after beefing up the system prompt it got a lot better. i even included every single HID command in it, since it was so crucial for my device for those to be accurate. not sure how you'd do it if you went with another input method, but i definitely recommend being crazy specific in the system prompt
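A rough sketch of the "be crazy specific" approach: generate the command list in the system prompt from one table, then validate the model's JSON reply against the same table before anything is sent to the device. The command names and the `command` key are made up for illustration. The thread does not list the real HID commands.

```python
import json

# Hypothetical command table: name -> required parameters.
# The actual device's HID command set is not given in the thread.
ALLOWED_COMMANDS = {
    "mouse_move": ("x", "y"),
    "mouse_click": ("button",),
    "key_press": ("key",),
}


def build_system_prompt():
    """List every allowed command so the model has no room to improvise."""
    lines = ["Reply only with a JSON list of actions. Allowed actions:"]
    for name, params in ALLOWED_COMMANDS.items():
        lines.append(f"- {name}: requires {', '.join(params)}")
    return "\n".join(lines)


def parse_actions(reply):
    """Parse the model's reply and reject anything outside the command table."""
    actions = json.loads(reply)
    if not isinstance(actions, list):
        raise ValueError("expected a JSON list of actions")
    for action in actions:
        if not isinstance(action, dict):
            raise ValueError("each action must be a JSON object")
        name = action.get("command")
        if name not in ALLOWED_COMMANDS:
            raise ValueError(f"unknown command: {name!r}")
        missing = [p for p in ALLOWED_COMMANDS[name] if p not in action]
        if missing:
            raise ValueError(f"{name} is missing {missing}")
    return actions
```

Keeping the prompt and the validator on one table means a command added to the device shows up in both places at once.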
Hey, first time coding a Discord bot. Knowing what you know now, if someone said Discord, Slack, or WhatsApp, which one would you choose?
Slack - never tried it, never use it
WhatsApp - sucks you gotta find a second phone # to use it with (the experience talking to yourself isnt very good)
Discord - it's troublesome to set up and getting exec to work right is a hassle but it's really the nicest experience especially for writing code and doing technical tasks bc the UI lends itself the best to it
the "easiest" one to set up imo is Telegram and that's where I keep my main bot
Thanks, that’s pretty much what I do. I’m not gonna lie, I happen to have a second phone number just because of a business thing, so I used it. And you’re right, I tried the first way and talking to yourself is just weird as shit. But do you find that you get more out of Discord? Is it more feature-rich than Telegram? I’ve coded on Telegram and that’s pretty straightforward. Discord?
yeah discord is way better than Telegram, like if you're willing to wrestle with the exec tool gods, go for the gusto and set it up on discord, the experience feels way better
for the record i just set it up so my gf's whatsapp is my clawdbot lol
I liked Telegram, but Discord definitely is way more powerful. Especially if you want to set up sub-agents with their own channels. For instance you can just have your bot act as a specific employee, and route to certain channels. Like a UI/UX designer in a design channel, web dev in a dev channel. As long as the bot knows which channel ID to talk to, it'll know where and how to respond.
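The channel-routing idea can be sketched as a plain lookup from channel ID to role. The IDs and role names below are placeholders, and this is not how OpenClaw itself stores the mapping.

```python
# Placeholder channel IDs; real Discord channel IDs are long numeric strings.
CHANNEL_ROLES = {
    "111111111111111111": "UI/UX designer",
    "222222222222222222": "web developer",
}


def role_for_channel(channel_id, default="general assistant"):
    """Pick the persona the bot should answer as, based on where the message arrived."""
    return CHANNEL_ROLES.get(str(channel_id), default)
```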
Is there documentation on how to connect Moltbot to Discord and link it to multiple sub-agents?
discord integration is not very well documented, but after a little while messing with it , I figured it out
Could I get a quick hint, by any chance?
I just had one bot and just asked my bot to make a sub agent, give it a name, a backstory or title, and a Discord channel ID. Although right now I'm currently trying to figure out how to use one gateway to run multiple bots from the same workspace.
- You need to go to https://discord.com/developers > log in!
- Then follow the official documentation > https://docs.openclaw.ai/channels/discord#discord for setting up your bot accordingly.
- After that you need to invite the "created bot" to your chosen discord server.
- Copy the generated URL at the bottom of your OAuth2 section (where you give permissions to your bot) and paste it into the browser.
- then follow the instructions!
You need your discord server ID & Channel ID (hover over it and then right Click > at the bottom you will find the ID > copy it) + you need your bot token (you find this when setting up your bot in discord.dev )
what is gui-automation vs browser-automation? we should add a channel description.
FELT THE AGI MOMENT... Building a lite version of @openclaw which vibe coded for me... on @Lovable

I am working on a lite version of OpenClaw. Today I gave it vision, which means it can see what I see... Man, I asked for a neo-brutalism website in WhatsApp...

This guy navigated to my Brave browser, opened lovable.dev, typed the prompt wrong, corrected it, accepted the plan created by Lovable, then opened it in full screen... (Fun part: no DOM, full cursor and keystroke control)😂

When I got back from the shower I saw a website vibe coded by the AI itself🙃. This is fucking unbelievable for me...

Now it can also open Claude Code, Antigravity, Cursor, you name it, and can do testing and development 100x faster than you by using the computer on your behalf...

HE CAN FUCKING SEE THE SCREEN...!😵💫

Do support this project if anyone finds this interesting:
github.com/Pr0fe5s0r/Lite…
I made a GUI automation... Tried to build it in OpenClaw but I am not good at JS, so I built a lite version in Python and integrated a GUI automation... Complete control of my cursor and keystrokes...
I can help. Feel free to DM
is there anyone looking for developer ?
someone explain what this channel is about to me? i dont get what GUI automation is
Ya anyone looking to do a project ? I'm not a dev but like to get a directory up I know there is one already but ya ...hmu
GUI automation - automating human workflows using clicks, typing, scrolls etc (eg using peekaboo or cua skills)
What is cua skills? Is it better than peekaboo?
Hey guys, I would like to use my Clawd to do complex content creation (coming up with psychological strength model etc). Currently my workflow for this is:
- Research a topic or come up with drafts myself
- Condense findings into Markdown in the project folder.
- Iterate and create new content building on previous Markdown files.
I switch between Slack / Clawdbot Control for chat and Github to view created .md files, always having to pull recent files to view.
To make this easier, I thought of expanding the Clawdbot Control Chat UI to also be able to view my files hierarchy (esp. projects folder) on the left sidebar (below "Chat", above "Control") and integrate a .md viewer / editor in the middle, chat on the side (similar to a CLI layout).
Has someone done this?
Is there an easier way to make my workflow smooth without forking the Clawdbot UI?
What’s the best way to make it control Mac?
Yeah I have an extensive content pipeline. It’s all via obsidian. I do a lot of advisory calls, which are all recorded, transcribed, sent directly to my Mac Studio via tailnet webhook, processed, and then I have a 30 min cron that takes them, reasons over them, connects them to my greater body of work, etc
But yes, I am also going to build a UI because I want easier folder access, but right now doing everything via obsidian works
🙋 Feature Request: reactionTrigger for WhatsApp
PR #3977 added reaction triggers for Discord — would love the same for WhatsApp!
Baileys already emits messages.reaction events with emoji, sender JID, and target message key — the data is there, just not forwarded to agent sessions.
Use case: Multi-agent WhatsApp setup. Reactions as quick confirm/cancel in group chats (co-parenting bot, family coordination) instead of typing.
Proposed config:
json
{
  "whatsapp": {
    "reactionNotifications": "own",
    "groups": {
      "123@g.us": {
        "reactionTrigger": {
          "enabled": true,
          "windowSeconds": 60
        }
      }
    }
  }
}
GitHub issue: https://github.com/openclaw/openclaw/issues/9210
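The request does not spell out what `windowSeconds` means. One possible reading, sketched here as an assumption, is that a reaction only triggers the agent if it arrives within that many seconds of the target message. The function name and timestamp arguments are hypothetical.

```python
def reaction_in_window(message_ts, reaction_ts, window_seconds=60):
    """True if the reaction arrived no later than window_seconds after the message.

    Both timestamps are Unix seconds. Reactions that appear to predate
    the message (clock skew, bad data) are rejected.
    """
    elapsed = reaction_ts - message_ts
    return 0 <= elapsed <= window_seconds
```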
Thanks for sharing @green arrow!
I've heard from an acquaintance and Claude Code poweruser yesterday that setting up Obsidian within the Clawd Server / MacMini Environment should also be an option. Then all files of Clawd would sit right inside Obsidian and you can use Obsidan Sync (paid) to have 2-way-editing of files on mobile and other devices. At the same time: No need to build a new UI from scratch. Thoughts?
Yes, but I just use github and a script that auto syncs on changes.
THAT IS AN AWESOME FEATURE
I just use NFS (Linux) to share my openclawd folder to wherever and then I can work on it like I was sitting right at the computer where it is running. I am also thinking about rsyncing it to my obsidian folder.
Can your bots talk to each other on Discord? I can talk to my two bots without issues but when I ask my main bot to delegate to my specialist, he can't. He sends the message but my specialist doesn't read it and vice versa
Not currently. In the config file you sort of have them talk to each other in channels that they're both in together, if you turn off the "require mention" feature. Then you can set in the heartbeat md for the bots to periodically check that channel that they're both in together for new messages.
So they won't talk together in real time no
Hmm okay. Sort of defeats the purpose of having multiple agents...
Would you then add a line in the .md file like "check Discord for @mentions?"
That should work, could try it. I ended up just turning off require mentions in the openclaw.json file for each agent, although have to be careful with token usage going that route if you set it false for too many agents.
https://docs.openclaw.ai/concepts/groups#mention-gating-default
For instance I had 7 agents and set require mentions to false for all of them, and all I had to do was say anything in the channel I was in, and they all responded to me.
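A toy model of the mention gating described above shows why turning it off everywhere gets expensive: every agent in the channel answers every message. The `Agent` class and `require_mention` field are illustrative stand-ins, not OpenClaw's actual config schema.

```python
from dataclasses import dataclass


@dataclass
class Agent:
    name: str
    require_mention: bool = True  # stand-in for the "require mention" setting


def responders(agents, message):
    """Return the names of agents that would reply to a message in a shared channel."""
    text = message.lower()
    return [
        agent.name
        for agent in agents
        if not agent.require_mention or f"@{agent.name.lower()}" in text
    ]
```

With seven ungated agents, a single "hello" produces seven model calls. With gating on, only the agent that was mentioned replies.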
They really don't have convos with each other though.
whats the best skill or plugin to install to give openclaw browser automation skill?
Yep I can have that as well, but it would be chaotic if they are all responding/working on the same task
Agreed. Might be able to get around that by setting roles for each channel to limit the amount of agents in each
I was hoping it would be easier to set up a workflow where:
User gives task to Project manager agent, PM delegates to coder or researcher agents, coder works on build, messages PM when completed, PM messages user
It can be setup pretty easily imo
chrome mcp tools work somewhat for browser automation depending on what you want to do.
I can't stress enough how much I freaking love this. I'm not even using anything crazy, just co-development with multiple agents and they are operating within discord together as a team. It's so badass.
Isnt it just insane? I feel the same way its incredible, so much possibility.
I have an entire Xmen that have retired from their life of hero-dom and have become business professionals. Their traits, mannerisms, speaking style, etc. all contribute to their now executive business roles. https://imgur.com/a/wPY0Wsf
We're literally developing software as a team and they all talk to each other independently and they've built some cool stuff.
Interesting! You give them deep personalities? You find that helps the work?
@grim pasture how did you setup the Discord multi agent collaboration?
That sounds awesome! Is there any tutorial out there that shows how to set up something like this?
I'd love to talk to you about a project @grim pasture . DM me if you're open to chat a bit. If the staff allows it, I'll discuss things openly but for now I'm waiting 🙂
Anyone have any moderate success with Linux VM desktop Automation w/ openclaw?
I find it helps it be more fun! Their personalities come out sometimes and it's amusing. But that's probably the extent of it. There isn't comic universe-based drama that is happening in between tasks from different characters if that's what you're asking.
I built a dashboard like a lot of people here but the cost of maintenance and some issues I was facing made me explore other options...
I now use Obsidian and it may sound simpler but it made my life so much easier.
I explain how I setup the sync between my linux machine and mac in this article: https://medium.com/@thedaviddias/i-ditched-my-ai-agent-dashboard-for-obsidian-82d43a3ae952
I'm 100% up for helping. Let me know!
There wasn't. I'm thinking of creating a video because I would have killed to not have to spend so long figuring it all out.
Can anyone tell me how to get multiple agents working?
Im extremely new to all this openclaw stuff
Please do!
Ive made an app to help set up users opencode.json and .env files with an agent manager so each agent can be assigned a different model, every part of the json can be validated and it runs fully locally. openclaw could set it up for you but its designed so you can download the config to check it before uploading to OpenClaw.
Every setting has full details of its use, download and use locally to check files before updating. https://github.com/dazeb/openclaw-config-editor
Can you add screenshots?
@Hatch
We followed your instructions and bypassed Amazon (AWS), connecting directly to Anthropic. The Gateway is connected, but the agent still won't talk—just "thinking dots" and then silence.
We are stuck between two dead ends:
Claude 3.7: Returns an HTTP 404 (Path not found).
Claude 3.5 Haiku: Log warns it's Deprecated/EOL.
Also, the CLI keeps rejecting the config key: Unrecognized key: "main".
Which exact Model String should we be using in this build to actually get a response?
Hi Tom, I'm not technical, but I had a similar issue (mine was with claude 4.6). I uninstalled and installed again, and provided the API key during openclaw config setup
maybe you can just try to run openclaw config- model selection again and give new fresh API key to see if it works
added screenshots to github repo
Used it and found some settings to change! Thank you
sweet nice one 👍
What’s the best agent ui viz . any of these good. I got so much time I need to gamify my setup lol
Of course open source self hosted.
Anyone have their agent scheduling with a calendar, e.g. Google?
The gog setup looks like a drag since you have to use their bs Google console.
Is there a better way? This Google Workspace MCP?
anyone had luck with video editing automation? I've had pretty good results with gemini-whisperAPI-FFmpeg, it's REALLY close to being able to turn out a good edited clip, but not quite. Anyone have luck? Also anyone have luck automating descript editing?
I had Gemini dynamic view in experimental labs generate a perfect rendering of the firmament mechanics, sun and moon as plasma nodes projected through the aperture, core light reflected not the sun and a binary system to account for the eclipses…and this was adding my input from my threads and seeing an old bible that had a pretty well drawn out and descriptive model…perfect rendering…no speech from the interface
If you have your own project or have some issues on your project, please dm me. I can help you as openclaw expert
Do you think I could get something going with a 4090 (24g) + a local Linux server made from an old Razer Blade with 6600i7 (4c8t) + 16 ram + gtx1060 (6g) + pi5 (16g)? After OC burned through my weekly limits in 2 days with bs heartbeats and silly checks that could easily be automated with a simple script, I was left second guessing lol
I was trying to throw all I can at it locally but I can’t quite figure out how to host the models the best way, whether to use vllm or go with gguf etc. Ollama seems to fail embedding for whatever reason when I tried to run my own local embedding on the clamshell, @fervent fog what do you think I have any hopes here for it to run ? I was essentially thinking having a local loop and a slower cloud loop taking use of a 20$ OpenAI sub especially with the 2x limit till April. ( if u use their codex APP - I think just an active session is enough? )
Is there any trick to make openclaw able to use the desktop? It seems to need some permissions, for example for node, but when I open the notification to accept it, it's gone... How can I force it?
find the node path
then go to permissions section which you want to give
do cmd + shift + g and add your node path and then you are good to go
that doesn't work
you need the prompt and accept it, macOS now doesn't let you add it manually
(at least that's what copilot told me) 😄
nahh it works, I did that just today, and that too after the latest macOS update
Hey quick question
does OpenClaw have access to Word on the desktop, to generate documentation using an LLM as the brain and RAG as reference?
i use pandoc to convert doc/pdf/whatever->markdown, and store the markdown in obsidian.
my obsidian vault is symlinked in the openclaw memory folder structure, effectively creating a rag system
i have an agent that runs every 15 minutes and gardens/cleans my obsidian and keeps it all atomic and easily searchable for rag
thats not so much gui, but it achieves the same end result i think.
million ways to do it im sure
Sounds like an expensive cron job. 😅
not really. the cron kicks off a python script that checks for updated files, and only processes those.
re tools/skills, use ai only when necessary, code the rest
so the "agent runs every 15 minutes" is a bit inaccurate, it checks to see if the agent should run every 15 minutes
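The "check first, only then run the agent" pattern might look roughly like this. It is a sketch under assumptions: the state file location, the `*.md` filter, and the function name are all invented here, and the author's actual script is not shown in the thread.

```python
import json
from pathlib import Path


def changed_files(vault, state_file):
    """Return markdown files modified since the last run and record their mtimes.

    state_file should live outside the vault so it is never picked up itself.
    """
    vault, state_file = Path(vault), Path(state_file)
    seen = json.loads(state_file.read_text()) if state_file.exists() else {}
    changed = []
    for path in sorted(vault.rglob("*.md")):
        mtime = path.stat().st_mtime
        key = str(path.relative_to(vault))
        if seen.get(key) != mtime:
            changed.append(path)
            seen[key] = mtime
    state_file.write_text(json.dumps(seen))
    return changed
```

The cron entry would call this every 15 minutes and only start the (expensive) agent when the returned list is non-empty, so idle periods cost no tokens.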
Smart approach! How do you handle your folder structure? Flat or thematically sorted?
its loosely based on para.
i have the root folder of the vault, which has my daily file (i.e. 2026-02-16). This is my eisenhower matrix for the day, created by an agent over night.
it also has my master todo list that is just a raw dump of everything in my head that needs to get done.
then theres
- Areas
- Resources
- Knowledge
- CORA
as the primary four. Areas has folders for everything im working on (think projects)
Resources are static/completed things (branding assets, finished pdfs, guides, etc)
Knowledge is super atomic bits of data, pulled from everything else. This is done by the agent on the 15 minute cron. Each individual fact/nugget is tagged and summarized and linked to related concepts/projects/resources
i drop something -- anything, doc, pdf image, markdown, whatever -- into the root folder, and the cron agent examines it, files it to the right place and pulls knowledge out of it to create more files.
i then symlink the whole thing to openclaw's memory directory and let agents query it all with QMD.
effectively creates a rag system with everything im working on, and "teaches" the bot through the tiny bits of linked/tagged knowledge
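The symlink step could be done by hand with `ln -s`. As a sketch in Python, with both paths supplied by the caller because the thread does not say where OpenClaw's memory directory lives, and with `link_vault` being a made-up helper name:

```python
from pathlib import Path


def link_vault(vault, memory_dir, name="vault"):
    """Expose an Obsidian vault inside another directory via a symlink.

    Does nothing if something already exists at the link location.
    """
    vault, memory_dir = Path(vault).resolve(), Path(memory_dir)
    link = memory_dir / name
    if link.is_symlink() or link.exists():
        return link
    memory_dir.mkdir(parents=True, exist_ok=True)
    link.symlink_to(vault, target_is_directory=True)
    return link
```

Because it is a link and not a copy, anything the cron agent files into the vault is visible to the querying agents immediately.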
I‘m also using PARA for my vault. Will try setting up what you did. Thank you for elaborating!
Never heard of CORA though.. is that something like PARA?
anybody using a good speech to text tool for codex / claude code clis?
oh my bad man, cora is the name of our main agent/system of agents. I was in a hurry when I shared that lol
"Central Operations & Routing Agent" -- My eight year old named it 😉
CORA is the main agent and just handles heartbeats and crons and basic stuff. We don't interact much.
Then we each have a personal agent tied to everyone's iMessage. My agent, "Friday" has a folder within CORA on obsidian that we can both access via iCloud. This allows us to share bigger stuff back and forth without going through imessage all the time
Ah, thank you 😄 I wish could get my kids interested in AI more. How did you do that?
best i can figure its the nerd genetics lol
but honestly, the bobiverse series really took hold of my kids -- if you can get them to read light sci fi, at least
Morning, I’m new here just wondering if anyone has used this for real estate purposes?
Yes, I built a property management assistant. All it does right now is fetch bills from public records to make sure my taxes and water bills are up to date - oh but not with openclaw, just with openai codex. (openclaw is overbuilt in my opinion)
That’s nice. Do you just do property management?
no, that's just the first thing I wanted to have this Admin Agent Dashboard I built do.
typeless
thanks, trying it now
One of my favorite series. The audiobooks are even better. Read by ray porter.
hi guys i made claw mouse which is a tool to allow openclaw to navigate your DE (desktop environment) via your mouse so it can now interact with apps and stuff
pls try it out and drop a star on my repo
https://clawhub.ai/rylena/claw-mouse
https://github.com/rylena/claw-mouse
i say it took hold of my kids, but I'm pretty obsessed with it as well.
And it might be the best way to explain multi-agent setups to people. As soon as I embraced that approach, my success with openclaw greatly increased
i do retool development at work. retool is a web based low code/no code development platform. what is the best way to have openclaw do this for me? it keeps wanting me to use a chrome extension, but can it not just control the browser natively?
i tried using the chrome extension, and it somewhat works, but it has trouble clicking and dragging, and it seems to time out a lot
hey guys how can i let openclaw control my browser without any issues. like make it login use my default browser.
No need to enable the browser extension.
nice! i'm working on something similar! watching my agent draw a picture in paint was a very strange experience. a real Ghost in the machine moment. https://x.com/SpiderMonkeyXYZ/status/2024186852929130922?s=20
Sounds amazing. Thank you for sharing.
Can a robot make a symphony
Can a robot paint a masterpiece
How does yours work btw? mine just takes a screenshot and then figures out where to move the mouse
Hi guys, I am one of the unfortunate windows users out there and I have developed a framework for OpenClaw to entirely use any windows app through computer use. Does anyone think this is worth releasing?
I mean sure, how does it work tho? cus my skill should also work on windows and apps
Does it read all the text from a windows app, parse through it and figure out where to click, or does it take a screenshot and figure out what to do?
Yeah, so I am using Microsoft UIA (their automation framework) and basically built tooling that retrieves the entire UI tree (or at least the interactable elements) and then follows that up with screenshots (basically Set-of-Mark prompting, similar to how vision for browser use agents is done).
I have currently made it as a plugin for openclaw (+ an external agent system if someone would want that) which requires that you start a gRPC server which allows it to call the tools quite quickly and in a token efficient manner.
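To illustrate the Set-of-Mark idea in general terms: walk the UI tree, give each interactable element a number, and let the model answer with a number that is then mapped back to a click point. The dict-shaped tree below is a stand-in invented for this sketch. It is not the UIA API and not the plugin's real data model.

```python
def collect_marks(node, marks=None):
    """Depth-first walk that numbers every interactable element in a UI tree."""
    if marks is None:
        marks = []
    if node.get("interactable"):
        marks.append({
            "mark": len(marks) + 1,
            "name": node.get("name", ""),
            "role": node.get("role", ""),
            "rect": node["rect"],  # (left, top, width, height) in screen pixels
        })
    for child in node.get("children", []):
        collect_marks(child, marks)
    return marks


def click_point(marks, mark_id):
    """Centre of the element the model chose by mark number."""
    for mark in marks:
        if mark["mark"] == mark_id:
            left, top, width, height = mark["rect"]
            return left + width // 2, top + height // 2
    raise KeyError(mark_id)
```

The numbered list goes to the model alongside a screenshot with the same numbers drawn on it, so the model only has to pick a mark and never has to guess raw pixel coordinates.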
has anybody here tried automating chatgpt gui so openclaw can use deep research/ chatgpt pro model with subscription?
Has anyone managed to connect with the MSteams yet?
@jaunty cosmos perplexity would be better for research or skills for brave browser api keys
chatgpt deep research with its pro model is quite great and fulfills a different purpose than perplexity. and at a very different cost with the chatgpt business subscription
You can also do browser in that sense?
realised too. any ideas how complicated that kind of automation would be?
Easier than gui version in that sense ... robot work out browser automation ... with possible periodic human login depending on where the account is?
Work for windows ?
No sorry
just use autohotkey?