CosmosRP v4.0 is usually fast, but slow responses can still happen due to:
- Long prompts / large chat history
- High server load
- Vision or tool-calling requests
- Extra generation length or complex roleplay output
If you want it quicker, try:
- using the Lite variant:
pkrd/cosmosrp-4.0:lite - shortening the conversation/context
- avoiding unnecessary images/tools
If itβs only happening sometimes, itβs usually temporary load on the model side.
-# This is an automated response from @native notch. While I strive to provide accurate assistance, I may occasionally make mistakes. If you find any inaccuracies or need further clarification, feel free to wait for our community helpers to give you further guidance.