#Long running dagger-engine experience
1 messages · Page 1 of 1 (latest)
We recently (1) upgraded from 0.16 to 0.18 and (2) stopped resetting the engine on dev-vms nightly
We have been seeing errors like "failed to find snapshot" or "failed to sync" show up randomly and unexpectedly
For now, the Windows treatment (docker restart dagger-engine) seems to resolve these issues when they come up
looking into this more, the errors are since we upgraded (1.5months ago) while we stopped restarting (6months ago)
Is there a way for us to get more information about what's actually going wrong when this comes up?
Hopefully this is enough of a snippet to get some leads? This happens during the loading of directories from the host at the beginning of a build
): Directory! 11.0s
! failed to get content hash: failed to get snapshot: failed to sync: failed to get content hash xattr: no data available
╰─▼ upload . from r2lqi0myibo0ut91zjxsuq1uv
(drive-by guess) How does this intersect with the built-in cache pruning behavior? Perhaps there's a cache address that's aged out, but something still refers to it?
yeah, we have some custom garbage collection config, but haven't changed that in a much longer time. Made another post about that, but didn't gain any insights from it https://discord.com/channels/707636530424053791/1422236640692404417