#response token limit
1 messages Β· Page 1 of 1 (latest)
Please provide all the details when you ask a question.
How can I answer you if I don't know which model you're using? What are the generation parameters you used?
i think it's the context.. Not max response length
if there's something like Max Response Length or Max new token, set them on 4000 since it's the maximum. But with that picture, i think it's just indicating your context size (by how much message count will be used for context) and not your max response length
calculation from example 'codechat' above:
word count: 273, token count: 524, tokens used: 995
Screenshots from NextChat, another example same question.
Im gonna check with Original API
word count: 313, token count: 578, tokens used: 1077
seems good, this is original API?