my goal is this:
- 3 agents - an orchestrator, a builder, and qa
- i can task orchestrator with a project name + spec - they will then build it out fully and store it on the filesystem
- i can then talk to the orchestrator to create tasks in that project - they will take those, full spec them out and store it on the filesystem
- followed by breaking the task up in to manageable steps, with clear goals and criterion, and each step also stored separately on filesystem
- when i tell orchestrator to start a task, it'll delegate it to the builder agent as a subagent session, starting with the first step and going through all steps sequentially with the following flow:
- builder takes the spec file for the step and builds it out with tests
- when the builder is done, it will tell the orchestrator
- the orchestrator will then spawn the qa agent as a subagent to ensure the work matches the pass criterion from the step spec file
- qa will then report back to orchestrator on whether it passes or fails, pass = repeat for the next step of the task and fail = orchestrator throws it back to the builder to correct
the files for each task + step must be kept updated at every step matching a required format
having an llm maintain the various files from my testing is always unreliable - formatting changes, some things missed, etc. - i was building some tools for openclaw to solve this, but ideally i'd use something that already exists/other people are using
any suggestions?