When you define a TopK, you give it a height and a width. The height is the number of hashing functions used and correspond to arrays of counts and hashed values. The arrays themselves are width long. When you update the count of a value in a TopK, you hash the value and modulus it with the width to get an index into all of the arrays. If any of those indices are occupied, that is a chance that the count will decrease. That chance is based on the count with larger counts being less likely to decrease.
My guess is that you're getting collisions on that particular word in all of your arrays and it's not adding the word. This is the sort of thing that can happen with probabilisitic data structures. To test this hypothesis, check the count of all the other keys before and after and see if they decrease. If so, it's working as desigend. To mitigate this problem, adjust the height and width of your TopK data structure.
I also have a talk where I explain how TopK works (and some other probabilisitic data structures). It's been recorded a few times and posted to YouTube. It's called Understanding Probabilistic Data Structures with 112,092 UFO Sightings. With a title like that, it should be easy to find. 😉