#`write_file` overwrites existing files without confirm

1 messages · Page 1 of 1 (latest)

wary hedge
#

https://github.com/NousResearch/hermes-agent/issues/510#issuecomment-4303266210

Do you have the experience that your agent using write_file tool to overwriting existing files?

I have a suggestion to add a confirm warning to make the agent think twice when it doing such a tool call, what is your thoughts?

GitHub

Overview Hermes tool descriptions are typically 1-15 line strings embedded in Python tool definitions. Kilocode's tool descriptions are mini-manuals — 50-167 lines each — with worked examples, ...

#

@bright hedge

If you dont want it to be able to I'd suggest just using docker backend

But the problem is I'm requesting agent to write a worklog for what we done, I should definitely need to allowed it to able doing this

Using Docker backed won't help this, because it will doing absolutely same thing when inside sandboxed environments.

bright hedge
#

There is no foolproof way to prevent it from writing to the code

#

or the files

#

If you dont approve it it can just find a bunch of ways to write scripts to edit a file

wary hedge
#

Even I added the iron rules to its SOUL.md that "don't use write tool when there's a existing file, use patch tool instead"

But it will still forgot

#

I've setup restic backup every hour and git so I don't really need to worry about data loss, but it is still making me frustrating

wary hedge
wary hedge
bright hedge