What are Instruction and Chat templates, and how do I use them? | Text Generation WebUI | Page 1

blazing knoll Mar 23, 2024, 2:27 AM

#

I think it may just be a weird model, because the huggingface page says it has no intruction template. My current go-to writing/roleplay model is Sao10K/Fimbulvetr-11B-v2, I highly recommend it. Also, you should experiment with using SillyTavern as an Oobabooga frontend, as it can properly load TavernAI cards, and has much better generation presets and instruction templates.

cunning geyser Mar 23, 2024, 2:29 AM

#

blazing knoll I think it may just be a weird model, because the huggingface page says it has n...

I've already heard of SillyTavern; frankly, I'm already swamped with enough settings with all the setup and testing I did for WebUI, I don't need even more of that just to get a fancier front end. I have no interest in image generation, text-to-speech, or speech-to-text. Does it have more benefits besides those?

#

And thanks for the model suggestion, I'll definitely take a look!

blazing knoll Mar 23, 2024, 2:30 AM

#

cunning geyser I've already heard of SillyTavern; frankly, I'm already swamped with enough sett...

The primary reason I use SillyTavern is actually to not get swamped with settings lol

#

SillyTavern has a bunch more presets for different settings, meaning I dont have to stare at number sliders all day

cunning geyser Mar 23, 2024, 2:31 AM

#

So it completely takes over WebUI, you don't need to change settings in both if you use it as a front end?

blazing knoll Mar 23, 2024, 2:31 AM

#

you need to load a model using Oobaboogas webui, and then SillyTavern can take the wheel from there

cunning geyser Mar 23, 2024, 2:32 AM

#

Also in my case, I don't think I need TavernAI cards either, since I'm making my character entirely from scratch and not basing it on any existing one

#

If it's that much simpler, I'll look into it further

blazing knoll Mar 23, 2024, 2:33 AM

#

I still believe that the issue with your model spewing gibberish is model-related, most models under the model info on their huggingface pages should tell you which instruction template to use, but for some reason Yarn Mistral 7B 128k AWQ doesnt

#

quite odd

cunning geyser Mar 23, 2024, 2:34 AM

#

blazing knoll I still believe that the issue with your model spewing gibberish is model-relate...

It isn't really gibberish; I haven't done enough testing, but it seemed to just not act as it should. 'Chat' mode works a lot more like expected. I don't particularily understand your suggested model's prompt formats either, though:

#

Which one of these would I use, and how would they be extended?

#

I have already written quite long definitions that go on for multiple lines

blazing knoll Mar 23, 2024, 2:36 AM

#

cunning geyser Which one of these would I use, and how would they be extended?

you can use either template, it understands both

cunning geyser Mar 23, 2024, 2:36 AM

#

blazing knoll you can use either template, it understands both

Alright that's cool, but I'm still not sure how to adapt my definitions into that format. What's the difference between Instruction and Input for example?

#

And under Vicuna, aren't System and Assistant the same?

blazing knoll Mar 23, 2024, 2:39 AM

#

Unfortunately I'm not an expert on the science behind LLMs and their prompt templates

cunning geyser Mar 23, 2024, 2:40 AM

#

When you were making your own character, didn't you have to write prompts in one of these formats?

blazing knoll Mar 23, 2024, 2:41 AM

#

No, the prompt template is just for the LLM to see. You can talk to the LLM in normal human type and it will understand.

cunning geyser Mar 23, 2024, 2:42 AM

#

I know it'll understand what I'm saying, but I mean if you want it to behave a certain way like I do you'd need to write prompts like those, right?

blazing knoll Mar 23, 2024, 2:43 AM

#

Depends on what you mean by getting it to respond in a certain way (because there are many different things that can directly change how the LLM generates), give me an example

cunning geyser Mar 23, 2024, 2:43 AM

#

Like adding example messages, so it knows how the character would respond to certain questions

#

And in what tone

#

Stuff like that

#

Like
{{user}}: Hello!
{{char}}: Greetings

blazing knoll Mar 23, 2024, 2:45 AM

#

Ohhhh, like that. That stuff is decided by the character card.

#

or if you dont have a character card, anything typed here

cunning geyser Mar 23, 2024, 2:45 AM

#

Yeah, that's what I've filled out currently and what makes my character work well on the 'chat' mode

#

So I dont need to mess with this at all?

blazing knoll Mar 23, 2024, 2:46 AM

#

nope

#

just set the instruction template to one the AI understands

cunning geyser Mar 23, 2024, 2:47 AM

#

I'm guessing that one came prepackaged with the model

#

I'm not sure if this particular model was built for roleplaying though, it might be made to be used like a sort of assistant or whatever, so I'll try yours too

blazing knoll Mar 23, 2024, 2:48 AM

#

Yeah Mistral 7B is more general task oriented

cunning geyser Mar 23, 2024, 2:48 AM

#

Aight I'll switch then, thanks for the advice!

#

Still very new to this stuff but very happy to learn that all this can be run locally now

#

I was hoping for stuff like this a couple years ago

blazing knoll Mar 23, 2024, 2:49 AM

#

I remember being really facinated with AI Dungeon and NovelAI back a couple years ago

#

amazing that I can run this all locally now

cunning geyser Mar 23, 2024, 2:50 AM

#

Yeah I was always paranoid using sites like character.ai, i didnt want to lose all the writing I did to get it working and just wanted it on my PC, especially since I was there in the early days and they frequently went down

#

now I don't have to worry about that or their restrictions

#

and don't have to worry about other site's usage limits

blazing knoll Mar 23, 2024, 2:52 AM

#

cunning geyser I'm not sure if this particular model was built for roleplaying though, it might...

for Fimbulvetr-v2, download the quantized versions here https://huggingface.co/collections/Lewdiculous/personal-favorites-65dcbe240e6ad245510519aa
I run the q8_0 version on my RTX 4070S, however depending on your GPU, you might want to opt for the lower quant versions

Personal Favorites - a Lewdiculous Collection

cunning geyser Mar 23, 2024, 2:52 AM

#

I've got a 3070

#

Any idea which I should choose for that?

blazing knoll Mar 23, 2024, 2:52 AM

#

I suggest q6_K

#

also, Fimbulvetr-v2 is a GGUF model, which has to be loaded with the llama.cpp loader in Oobabooga. llama.cpp is a bit different than some of the other loaders, as by default it will load the model into CPU memory, and you have to tell it how many layers to offload to the GPU

#

set the threads to the amount of physical cores your cpu has, enable tensorcores, and set n-gpu-layers to maybe like 20-30

cunning geyser Mar 23, 2024, 2:55 AM

#

blazing knoll also, Fimbulvetr-v2 is a GGUF model, which has to be loaded with the llama.cpp l...

Yep found out about this already from a tutorial vid

blazing knoll Mar 23, 2024, 2:55 AM

#

thats great

cunning geyser Mar 23, 2024, 2:55 AM

#

Thanks for all your help again! I'll try this out rn

blazing knoll Mar 23, 2024, 2:55 AM

#

when i first started playing with LLMs i downloaded oobabooga and went head first blind

cunning geyser Mar 23, 2024, 2:56 AM

#

Yep pretty much same, I also tried some other things first that required other complex stuff like GIT LFS

blazing knoll Mar 23, 2024, 2:56 AM

#

cunning geyser Yep found out about this already from a tutorial vid

also read the Oobabooga wiki on the github if you havent yet, super comprehensive and useful

cunning geyser Mar 23, 2024, 2:56 AM

#

I have not, I'll take a look

cunning geyser Mar 23, 2024, 2:57 AM

#

blazing knoll for Fimbulvetr-v2, download the quantized versions here https://huggingface.co/c...

I don't see Fimbulvetr-v2 in here though, I already have the link to it don't I?

blazing knoll Mar 23, 2024, 2:57 AM

#

oh wait i think i sent the wrong link

#

here https://huggingface.co/Sao10K/Fimbulvetr-11B-v2-GGUF/tree/main

Sao10K/Fimbulvetr-11B-v2-GGUF at main

#

I currently have 15 huggingface tabs open so ig i copied the wrong one

cunning geyser Mar 23, 2024, 2:58 AM

#

Oh lol

#

What're the test versions?

#

There's only 2 non-test ones

blazing knoll Mar 23, 2024, 2:58 AM

#

not sure, ask the creator

blazing knoll Mar 23, 2024, 2:59 AM

#

cunning geyser I have not, I'll take a look

One last thing before I go to bed, heres a screenshot from the oobabooga wiki as to which presets to use for different things

cunning geyser Mar 23, 2024, 2:59 AM

#

Oh that would have been helpful a few hours ago lol

blazing knoll Mar 23, 2024, 3:00 AM

#

Instruct ones are for like ChatGPT style stuff, while the Chat ones are more for roleplay and stuff

cunning geyser Mar 23, 2024, 3:00 AM

#

Thanks!

blazing knoll Mar 23, 2024, 3:00 AM

#

np

cunning geyser Mar 23, 2024, 3:00 AM

#

One last thing, on the main page for Fimbulvetr there's this link to the GGUF version

#

For what reason should I use that version?

#

Does the main page version not work as well?

blazing knoll Mar 23, 2024, 3:01 AM

#

the main page version is the full, unquantized (uncompressed) model

#

which is HUGE and uses like 24GB of memory

cunning geyser Mar 23, 2024, 3:01 AM

#

ah so thats why theres multiple large files

#

So there's no point of downloading the main one, I should download from the GGUF page instead, yeah?

blazing knoll Mar 23, 2024, 3:02 AM

#

yeah

#

i recommend using Oobaboogas built in model downloader

cunning geyser Mar 23, 2024, 3:02 AM

#

Yep already do

#

before I thought I had to use Git LFS so that was a pain to install for nothing

#

I'm not sure if I should use a test version and I don't want to bother the creator for something this small so I'll just use this version

blazing knoll Mar 23, 2024, 3:04 AM

#

should still be fine

cunning geyser Mar 23, 2024, 3:04 AM

#

Alright great that should be all then

#

Good night and thanks again!

blazing knoll Mar 23, 2024, 3:05 AM

#

no problem, night night

winged bluff Mar 23, 2024, 3:47 AM

#

General purpose / promptless LLM models are more for "notebook" mode; you type whatever you want at the start of the notebook, then the LLM starts filling in the rest.

LLMs are trained on all sorts of things; novels, wikipedia articles, roleplaying chat logs... it will try to reproduce whatever style you give it, based on what it learned from its training data.

Instruction-following models are fine tuned to follow a particular prompt format. That's what the templates are about; you need to match the templates that were used in training to have the best effect.

Character templates are just prompts that get added in front of (er... roughly) your actual prompt. If the character prompt is "Fontayne Natrerya, a bard" and you type "tell me a story" into the chat, what the LLM will see is something like:

A conversation between Fontayne Natrerya and "Loading".

Fontayne Natrerya is a bard.

Loading: tell me a story

Fontayne Natrerya:

An instruction template would add something like

System: A conversation between Fontayne Natrerya and Loading.

Fontayne is a bard.

Instruction: Continue the following conversation. Play the role of Fontayne Natrerya.

Loading: tell me a story

Response:

cunning geyser Mar 23, 2024, 11:56 AM

#

winged bluff General purpose / promptless LLM models are more for "notebook" mode; you type w...

That makes a lot more sense, but where would I put this? Would this replace the defintions and description I put in Parameters/Chat/Character?

#

It seems to work fine with my format too right now, which is just this:
Steve is a bartender at a small bar in an even smaller town.

{{user}}: Hey bartender!
{{char}}: What can I do ya for?

#

What's the benefit of transforming my definitions into that format? And are those infinitely expandable, or do they just work a line at a time

winged bluff Mar 23, 2024, 12:00 PM

#

If you aren't using an instruction following model, you do not need an instruction template.

#

If you are using an instruction following model, you need the template to match the format the model was trained on.

cunning geyser Mar 23, 2024, 12:01 PM

#

How can I check if I am using an instruction following model? Usually on the model page I just see a bunch of code and technical details I don't understand

#

I've switched to the model Lé Hönque suggested, Sao10K/Fimbulvetr-11B-v2-GGUF. Is it an instruction following model?

winged bluff Mar 23, 2024, 12:03 PM

#

If the model card lists a prompt, that's what the instruction template should spit out; the first one mentioned {none} so you don't need one.

cunning geyser Mar 23, 2024, 12:04 PM

#

I don't believe there is one on the Fimbulvetr page, but I do see this when I load it:

winged bluff Mar 23, 2024, 12:05 PM

#

Prompt Formats - Alpaca or Vicuna is what the model card for that one says

cunning geyser Mar 23, 2024, 12:05 PM

#

Right so I still don't need to worry about the Instruction template with this one?

#

Or should I replace the code that's in that box by default with something written in the Alpaca/Vicuna format

winged bluff Mar 23, 2024, 12:07 PM

#

The default that ooba set it to is probably fine

#

That's alpaca

cunning geyser Mar 23, 2024, 12:08 PM

#

Should I change anything in there?

winged bluff Mar 23, 2024, 12:08 PM

#

The default that ooba set it to is probably fine

cunning geyser Mar 23, 2024, 12:09 PM

#

Alright, so I don't need to fill any fields in there like this example you gave?
A conversation between Fontayne Natrerya and "Loading".

Fontayne Natrerya is a bard.

Loading: tell me a story

Fontayne Natrerya:

#

All that is is code that makes the AI generate responses correctly

#

So all I need to worry about are my definitions, and I should use it on 'chat' mode, not 'chat-instruct' mode

#What are Instruction and Chat templates, and how do I use them?