Hey, uh… I need some help picking a good non-moderated AI model for roleplaying. I’ve tried everything: Hermes 3 405B, Mistral Nemo, Llama 3.1 Nemotron 70B Instruct, Meta Llama 3 70B Instruct, Meta Llama 3.2 90B Vision Instruct, Claude 3.5 Haiku (2024-10-22), which I know is moderated but I used it for the context, the self-moderated version of Claude 3.5 Haiku (2024-10-22), which is non-moderated, Meta Llama 3.3 70B Instruct, Mistral Ministral 8B, Meta Llama 3.1 8B Instruct, Google Gemini Flash 1.5, Google Gemini Flash 1.5 Experimental, and lastly Qwen QVQ 72B Preview. I know that’s a whole lot of models and a real mouthful, but I just need help finding a very good one.
And not just any model: I need a good AI model that works for roleplaying and can also handle a chatbot of over 60k tokens. The site I use these OpenRouter models with is Chub Venus. Yet no matter how many times I reroll a response or switch models, I get tons of errors with no answer from the chatbot coming through. When I do get a response, it’s one of three things. Either it’s super long even though my max new tokens is set under a thousand, and it also tends to control my RP persona even though my pre-history prompt tells the bot to “avoid” doing so (I worded it that way instead of using don’t/do not/shouldn’t/should not/won’t/will not). Or it’s full of gibberish even though my generation temperature is below 1; I’ve had it at 0.70, which isn’t too low but isn’t too high either. Or it’s disappointing: comprehensive, yet it doesn’t correctly follow the bot’s previous message or even my own previous message (this happens when I use the Google models).