#response token limit

1 messages Β· Page 1 of 1 (latest)

pure totem
#

The bot's responses are cut off, and sometimes the end of the response is gibberish. What is the token limit for responses? If there is a limit, what is it? I am not using an API on sites like 'char', but rather 'NextChat' or aichat in CLI. Is there any way to increase this limit?

Garbage fragment example:

rigid knoll
pure totem
wicked spruce
# pure totem

i think it's the context.. Not max response length

#

if there's something like Max Response Length or Max new token, set them on 4000 since it's the maximum. But with that picture, i think it's just indicating your context size (by how much message count will be used for context) and not your max response length

pure totem
#

calculation from example 'codechat' above:
word count: 273, token count: 524, tokens used: 995

Screenshots from NextChat, another example same question.

#

Im gonna check with Original API

#

word count: 313, token count: 578, tokens used: 1077

wicked spruce
pure totem
#

yes

#

from openai

#

with same setup (exept apikey, and endpoint)

#

my solution is telling bot to continue, and it actually does