Hi, I'd like to discuss the general strategy of using "average" functions in most dashboards. Performance monitoring is (in my opinion) often about identifying bottlenecks and potential system overloads. Looking at averaged values often leads to the misleading conclusion that everything is fine, because the peaks are flattened out. This gets especially bad over longer time ranges due to PromQL's inner workings. The PromQL function "max_over_time", by contrast, preserves the peaks, especially when looking at longer time frames.
Can you give me some insight into why average was chosen, and whether you would consider switching to a max-based approach?
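
To make the flattening effect concrete, here is a small Python sketch. It does not run PromQL; it only mimics what an average-style and a max-style aggregation would do to the same samples when each plotted point covers a wider window. The data, the 15 s scrape interval, the 5-minute window, and the helper names `downsample` and `mean` are all made up for illustration.

```python
# Hypothetical data: utilization (%) sampled every 15 s for one hour,
# steady at 10% except for a one-minute spike to 100%.
samples = [10.0] * 240
samples[100:104] = [100.0] * 4


def mean(xs):
    """Arithmetic mean of a non-empty list."""
    return sum(xs) / len(xs)


def downsample(values, window, agg):
    """Aggregate consecutive, non-overlapping windows of `window` samples.

    This stands in for evaluating an *_over_time style function once per
    plotted point; it is a simplification, not how Prometheus is implemented.
    """
    return [agg(values[i:i + window]) for i in range(0, len(values), window)]


# Zoomed-out view: each plotted point covers 5 minutes (20 samples).
avg_points = downsample(samples, 20, mean)
max_points = downsample(samples, 20, max)

print("highest point on an average panel:", max(avg_points))
print("highest point on a max panel:     ", max(max_points))
```

With these made-up numbers, the one-minute spike to 100% shows up as only 28% on the averaged series, while the max-based series still shows 100%. The wider the window per point, the smaller the spike looks on the averaged panel.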