Continue generating after hitting max output length. | LibreChat | Page 1

woven temple Jan 11, 2024, 2:27 AM

#

Hello, I see that there is a feature of continue generating if you accidentally cut off the message, but there is also the feature in OpenAI interface that once it hits the token output limit, it can keep on generating to continue the text since in most cases the context length is longer than the output length limit.

there-is-now-a-continue-generating-button-that-appears-if-v0-3rkh8bkhm50b1.png

#

Example:
the message is merged into the previous one and allows to continue generating even after cutoff.

chrome sierra Jan 11, 2024, 1:31 PM

#

woven temple Example: the message is merged into the previous one and allows to continue gene...

This is already implemented. The API will send a finish_reason field in the chat completion response which tells us if it got to this point. Except with LibreChat, you can also hit "continue generating" at any time but it only seems to work well if the previous message was cut off

#

I'm re-reading your message to make sure I understand

#

in OpenAI interface that once it hits the token output limit, it can keep on generating to continue the text since in most cases the context length is longer than the output length limit

do you mean it will continue generating by itself?

woven temple Jan 11, 2024, 8:41 PM

#

chrome sierra This is already implemented. The API will send a `finish_reason` field in the ch...

Yes I don’t know I think the model hit the output limit of 4K tokens for 1 message and stopped abruptly, there was no continue generating option.

chrome sierra Jan 11, 2024, 8:42 PM

#

It was likely a failure of the API to give that finish reason field but I can also double check. In short this functionality is already implemented and expected behavior

woven temple Jan 11, 2024, 8:43 PM

#

Ok thanks. I will check if I have the logs when I get home.

woven temple Jan 11, 2024, 11:59 PM

#

I went ahead and checked, there seems to be no finish_reason.

chrome sierra Jan 12, 2024, 12:22 AM

#

woven temple I went ahead and checked, there seems to be no finish_reason.

which endpoint are you using?

#

looks like Bing..

woven temple Jan 12, 2024, 12:24 AM

#

Yes, it is Bing.

chrome sierra Jan 12, 2024, 12:24 AM

#

Bing doesn't return a finish_reason like OpenAI. It's an unofficial API

#Continue generating after hitting max output length.