#bumping this. we still have about 1-3%
1 messages · Page 1 of 1 (latest)
Hey, please add something like this to the system prompt for now " Never include <thinking>, </thinking>, or any reasoning tags in your responses."
It seems like these conversations are successful , it has failed status because of the evaluation criteria that you set. The <thinking> tags appearing in responses is a known issue with reasoning-enabled LLM models (Claude, Gemini 3, etc.).
You can also switch to Gemini 2.5 Flash and keep thinking budget toggle off. We will work on getting rid of these tags on our end though