This is in response to @junior marlin's message here - #hermes-agent message.
I just wanna be really clear about this: "per-directory" is a genuinely bad default for any LLM CLI agent. That kind of thing should be opt-in, not opt-out. Most people's mental model of a CLI tool is that each invocation is independent. Sure, it makes sense for a persistent assistant you want to "pick up where you left off" with, but that only works if all you want is to ask, every once in a while, one really specific question of an agent with a REALLY tight scope. Most conversations with AIs simply don't fit that description, especially when you're setting things up and figuring things out.
With "per-directory", every new session started from the same directory calls session.context() on the same persistent Honcho session object, which returns the accumulated messages from all prior runs there. That history gets injected into our context window automatically, and it grows without bound the more we use the agent. That means if your context window was full or nearly full at the end of the last session, it will be in the new chat too!
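To make the mechanism above concrete, here's a toy model of it. This is NOT Honcho's API or the agent's actual code: `STORE`, `session_key`, `run_once`, and the "per-invocation" strategy name are all made up for illustration. The only thing it's meant to show is that a key derived from the directory always lands on the same stored history, while a fresh key per run starts empty.

```python
import hashlib
import uuid
from collections import defaultdict

# Hypothetical in-memory stand-in for a persistent session store.
# This is NOT Honcho's API; it only models "same key -> same history".
STORE = defaultdict(list)


def session_key(cwd, strategy):
    """Derive the session key for one CLI invocation (illustrative only)."""
    if strategy == "per-directory":
        # The same directory always maps to the same key.
        return hashlib.sha256(cwd.encode()).hexdigest()[:12]
    # Hypothetical "per-invocation" strategy: a fresh key on every run.
    return uuid.uuid4().hex[:12]


def run_once(cwd, strategy, new_messages):
    """Simulate one run; return how many old messages get injected at startup."""
    key = session_key(cwd, strategy)
    injected = len(STORE[key])  # history loaded before the user types anything
    STORE[key].extend(new_messages)
    return injected


def simulate(strategy, runs=4, msgs_per_run=10):
    """Run the agent several times from the same directory."""
    STORE.clear()
    return [
        run_once("/home/me/project", strategy, ["msg"] * msgs_per_run)
        for _ in range(runs)
    ]


per_directory = simulate("per-directory")
per_invocation = simulate("per-invocation")
print(per_directory)   # [0, 10, 20, 30]
print(per_invocation)  # [0, 0, 0, 0]
```

With the directory-derived key, each run starts with everything the earlier runs left behind. With a fresh key, every run starts clean, which is what most people expect from a CLI tool.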
We get no obvious signal this is happening until, at some point later, we notice our context window is filling up like crazy. It does so again and again no matter how many times we restart the conversation, and every single time, just starting the conversation uses up all those tokens again. I don't think I have to point out why THAT IS REALLY BAD for most people, especially people new to this. You are literally paying for the entire conversation all over again every time you start it. (Not exactly accurate - what you pay for is whatever comes back from Honcho from previous conversations - but the context window still fills up immediately upon starting a new conversation, and users pay tokens for that.)
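Rough arithmetic for the "paying again" point, as a sketch only. The numbers are made up, and it assumes every run adds the same amount of history and nothing gets trimmed or summarized before it's injected, which may not match what Honcho actually returns.

```python
def startup_tokens(history_tokens_per_run, runs):
    """Tokens injected at startup of each run, assuming every earlier run
    left behind the same amount of history and none of it is trimmed."""
    return [history_tokens_per_run * i for i in range(runs)]


# Hypothetical figure: each run leaves 2,000 tokens of history behind.
per_run = startup_tokens(2_000, 5)
total = sum(per_run)
print(per_run)  # [0, 2000, 4000, 6000, 8000]
print(total)    # 20000
```

Under those assumptions, the startup cost per run grows linearly and the total paid just for starting conversations grows quadratically with the number of runs.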
(cont'd...)
