We are monitoring 38 clusters, mainly 2 nodes but have several 6 node clusters.
I have the Prometheus retention set to 180 days but I'm constantly running out of space. We are now at 600GB for the data drive and at 93% as I just increased it from 580GB.
I'm looking at the /data/prometheus/data directory consuming 538GB and running a du - h /data/prometheus/data I see there are roughly 30+ 01HXXXXX directories. Shouldn't prometheus be compressing the data out? WAL is only 3.8GB currently.
Just to give an example:
36G /data/prometheus/data/01HE6B4XWAWWEA41FHM1V4YF45/chunks
38G /data/prometheus/data/01HE6B4XWAWWEA41FHM1V4YF45
Any suggestions?