Hello,
Yesterday we experienced significant performance degradation during several Dagger executions.
At Betclic we use Dagger extensively across many CI pipelines, so this type of slowdown has a strong impact on our delivery workflows.
For example:
Use case 1
• Slow execution (46 min):
https://dagger.cloud/betclicgroup/traces/fe32031c8a78270343e46759fb7e3daa
• Usual execution time (~5 min):
https://dagger.cloud/betclicgroup/traces/8e159016a8ab80e0f9c8d3325f08ac8c
Use case 2
• Slow execution (31 min):
https://dagger.cloud/betclicgroup/traces/0ca1d1f58f275f78fdeacb214057db25
• Usual execution time (~4 min):
https://dagger.cloud/betclicgroup/traces/f8cf72b07f524094ba5f8a3d033d31be
From the traces it appears that the execution hangs when returning from a module function call.
We also observed the following warnings related to telemetry:
WARN - failures detected while emitting telemetry. trace information incomplete
(failed to upload metrics: context deadline exceeded)
Could these telemetry errors explain the slowdown we observed?
Could you help us understand what might be happening and whether this could be related to telemetry upload failures or another issue?
Thanks in advance for your help.