Hi Krill!
I want to understand something.
Rite now in my present openclaw setup i the model "GLM-4.7-Flash-REAP-23B-A3B" via a local server running through LM Studio.
My config is here: https://pastebin.com/GhNUNDAs
At this moment, only the main agent is active and utilizing GLM-4.7-Flash-REAP-23B-A3B.
My common sense tells me, it would be best if decentralize different tasks to different sub agents based on the nature of the task.
For example: A coding sub agent, A Document reading sub agent, A Personal Assistant sub agent, A Research sub agent.
Suppose i use the model:
Qwen3-Coder-Next-GGUF for coding
Qwen3-VL-8B-Instruct-GGUF for document reading
GLM-4.7-Flash-REAP-23B-A3B for personal assistance
GLM-4.7-Flash-REAP-23B-A3B for research.
My deployment is completely local. I have one pc with the specs RTX 3090 Ti with ~24GB VRAM, i9-12900K, 32GB RAM.
Now, my main question is, can i have my main and sub agents spin up these models through LM Studio automatically as and when required depending on the task i provide it and use models one after another so as to not overload the system resulting in openclaw tasks breaking in the process?
How can i go about doing this in terms of config rewrites, execution and testing if it works?