I just switched from kobold to exllamav2 via tabbyapi. I loaded a model with tabbyAPI-gradio-loader with 32768 context. It works, and fast now, but it's speaking for me. How can I fix this? It's referring to me as "Graw" and other strange token combinations. Example response:
Normal response with description
graw yes i see graw
normal response to yes i see
graw and now what graw
etc. It'll also sometimes not include the second "graw", or start hallucinating details on {{user}} that aren't in the user description.