I recently deployed a new version of telegraf metric collector and I see that it is now reporting 2 new fields in the commandstats output, one being failed_calls. I am using redis-py and have a question regarding the reported failed CLIENT related calls:
cmdstat_client:calls=114604,usec=552723,usec_per_call=4.82,rejected_calls=0,failed_calls=114604
This is puzzling because my application is running fine and all redis calls are wrapped in exception handling and error logging and nothing is failing. Further I ran MONITOR and no CLIENT calls are being reported which makes this more puzzling. I am just using a fairly standard Redis connection.
redis_conn_decoded = StrictRedis(
password=password,
unix_socket_path=/tmp/redis.sock,
charset='utf-8', decode_responses=True)
Redis is bound to 127.0.0.1 on port 6379 and has socket enabled and the application only uses socket connections. The failed calls seem to coincide with the application use of Redis and they increment almost constantly.
Could someone perhaps enlighten me as to why or what could be responsible for reporting these failed calls?