too much prose! | OpenRouter | Page 1

stuck shuttle Jul 27, 2024, 6:18 PM

#

Anyone know how get shorter responses I mean like I like prose to a certain extent but not like a whole ass essay. It just keeps going and going and going at least until it’s like 6 paragraphs and that’s too much for me

hallow turtle Jul 27, 2024, 9:57 PM

#

@stuck shuttle which model are you using?

There are different methods with different models

stuck shuttle Jul 27, 2024, 10:26 PM

#

Dolphin mixtral

hallow turtle Jul 28, 2024, 3:07 PM

#

Cool - with both models you can

Use prefilling ( pass a part of the assistant response and it will finish that for you )
Add details in a system message ( instruct it with words like: this is a concise conversation, only send short responses )
Give it some example history, as well as a good system message. This involves you editing its first few responses to something ideal - then it should follow the pattern later on

#

In my testing it seems Dolphin doesnt return too much - so I assume its part of learning from its chat history

stuck shuttle Jul 28, 2024, 3:23 PM

#

Thank you so much

#

Do you have a system prompt I could use

hallow turtle Jul 28, 2024, 3:34 PM

#

It really depends on what you are trying to do

If its a natural character chat something like

When you respond, respond in a natural tone, as if you are messaging somebody, or having a natural conversation with them.
Only return 1, maybe 2 sentences max, unless the user explicitly asks you to return more.
You are talking to {Your Name}, and your name is {Character Name Here}.
{Insert some more context and personality here}

#

heres an example of that across both models: https://modelbench.ai/shared/chat/15e692b9-1315-4483-9a74-f190d17a3cd8

stuck shuttle Jul 28, 2024, 3:43 PM

#

Can I put 1 or 2 paragraphs?

#

And I want it for a roleplay is that okay?

hallow turtle Jul 28, 2024, 4:37 PM

#

Absolutely - infact, the clearer the prompt, the better. Just be aware of token limits

#

Dolphin Mix 8x22b has 64k token context window which is pretty large, so you should be plenty fine. the 8b model is half so it depends how long your chat is - but OpenRouter manages that atm anyway I think

stuck shuttle Jul 28, 2024, 5:29 PM

#

hallow turtle Dolphin Mix 8x22b has 64k token context window which is pretty large, so you sho...

What’s the difference with 8b and 22b

#too much prose!