Hi there,
I am seeking help with understanding an issue where partial replication sync always fails after a replica disconnects from master or after a failover. I am running a 3 node cluster managed by sentinel. I have tried increasing repl-backlog-size to 1gb but that doesn't help. Before failovers, all replicas are caught up and no writes are happening during failover. But I always see partial sync rejected with a message like:
Mar 7 20:06:56.432 redis-perm-repl-backlog-test-9ab11db5 redis-server[26068]: 26068:M 07 Mar 2024 20:06:56.432 * Partial resynchronization not accepted: Requested offset for second ID was 1524031215, but I can reply up to 1524030897
even if I do a brief disconnect of a replica by stopping and restarting redis-server this happens.
I'm not sure what is going on or where to look to get a better clue. Nothing in the logs jumps out at me. I can't find anything helpful online and I didn't see similar posts in this discord. Any insight is appreciated. If this isn't the right place to ask this question I would appreciate a pointer to the right place. Thanks in advance.
My redis version is 6.2.14. I can provide more detail as needed.