Silly tavern Api issue | Text Generation WebUI | Page 1

median dune Jan 19, 2024, 4:43 AM

#

I know this has been beaten to death, but I cannot connect oobabooga's api to Silly Tavern. I have api and open ai selected in the setting and I've pasted the 127.0.0.1:5000 address to silly tavern's api spot, but it still tells me it's not connected.

#

#

dark ruin Jan 19, 2024, 8:51 AM

#

do not enable api

#

thats how i helped fix the other person's issue

#

the boolean command-line flag for api is not needed

#

only openai api

median dune Jan 19, 2024, 2:56 PM

#

dark ruin only openai api

I was looking at that one you referred to too. I only had openai checked off, but it was still a no go. I am also doing this all off of linux too btw. I copied the blue address to the API area in Silly Tavern and it would still not connect.

dark ruin Jan 19, 2024, 3:05 PM

#

median dune I was looking at that one you referred to too. I only had openai checked off, bu...

a friend of mine on linux had outdated ST and when he updated ST it worked

#

simply cuz he had the ST version before openai api was introduced

#

make sure you dont have ST legacy api enabled i guess

#

i dont know why this would even be an issue

#

you could also try enabling bypass check and try to gen an output anyway

#

dunno if that will work

median dune Jan 19, 2024, 3:14 PM

#

dark ruin a friend of mine on linux had outdated ST and when he updated ST it worked

I just downloaded it yesterday and I even updated it. I think I may also have to edit the CMD_FLAGS.txt file too.

dark ruin Jan 19, 2024, 3:14 PM

#

Idk if that's used anymore but see how you go

median dune Jan 20, 2024, 6:19 AM

#

dark ruin a friend of mine on linux had outdated ST and when he updated ST it worked

I literally installed this a few days ago. But I'm pretty sure it updates through start.sh file when you open it up through the terminal. I am still having no luck with this.

dark ruin Jan 20, 2024, 6:20 AM

#

yeah ST does

#

not text-gen

#

that should be the easiest step lol

#

could it be a firewall thing? its all local so it shouldnt be

#

i would get in jlllll or someone who knows a lot to have a look

median dune Jan 20, 2024, 6:37 AM

#

dark ruin yeah ST does

I was talking about ST, there is an update file I've already ran on oobabooga.

dark ruin Jan 20, 2024, 6:43 AM

#

does chat completion work?

#

try a reverse proxy

median dune Jan 20, 2024, 6:43 AM

#

dark ruin does chat completion work?

on oobabooga?

dark ruin Jan 20, 2024, 6:43 AM

#

nah on ST

#

well that too

#

just to make sure

median dune Jan 20, 2024, 6:43 AM

#

dark ruin nah on ST

no, chat won't work.

#

I can get it to work on oobabooga, but no ST.

dark ruin Jan 20, 2024, 6:44 AM

#

ahh well thats sus

#

so ST wont work at all for even proxies?

median dune Jan 20, 2024, 6:46 AM

#

dark ruin ahh well thats sus

it doesn't work if you don't have an api running. You can connect it to openai, but I refuse to pay for it.

dark ruin Jan 20, 2024, 6:46 AM

#

alright see if this works

#

https://silent-laws-burn.loca.lt

#

thats a tunnel ive opened to my port 5000 for ooba

#

chuck that in your text completion on ST

#

thats what it looks like

#

may need to put /v1 on the end

#

shouldnt need to tho

#

this should test if its ST being annoying or if its the linux ooba for whatever reason

#

it works on my end

median dune Jan 20, 2024, 6:49 AM

#

I still struggling on what you are trying to have me do.

dark ruin Jan 20, 2024, 6:50 AM

#

thats what it should look like

#

so you see it worked

median dune Jan 20, 2024, 6:52 AM

#

dark ruin thats what it should look like

Ok, that worked, for you system I should say.

dark ruin Jan 20, 2024, 6:52 AM

#

alright so ST for you is working fine

median dune Jan 20, 2024, 6:53 AM

#

dark ruin alright so ST for you is working fine

I even messaged the default bot and it messaged back

dark ruin Jan 20, 2024, 6:53 AM

#

that means text-gen-ui is screwing around with its api

dark ruin Jan 20, 2024, 6:53 AM

#

median dune I even messaged the default bot and it messaged back

yeah it showed up here

#

are you using radeon GPU?

#

no reason to be on linux otherwise

median dune Jan 20, 2024, 6:54 AM

#

dark ruin are you using radeon GPU?

Yes, I am. a XFX Radeon RX 7900t with 20gbs of ram to be precise.

dark ruin Jan 20, 2024, 6:55 AM

#

yeah sounds like a pain

#

i have a friend who went radeon 7900xtx

#

hes had it for months, same time as me, and he still cant run stuiff

#

hes tried like 10 times lol

#

he hasnt tried the simple stuff recently tho

#

like koboldcpp and ollama

#

https://ollama.ai

Ollama

Get up and running with large language models, locally.

median dune Jan 20, 2024, 6:56 AM

#

dark ruin hes had it for months, same time as me, and he still cant run stuiff

on oobabooga I can chat on that with no issues. I just can't connect it to ST

dark ruin Jan 20, 2024, 6:56 AM

#

ok thats good

#

like full gpu offloading working good?

median dune Jan 20, 2024, 6:56 AM

#

AMD is generally better on Linux from what I've heard.

dark ruin Jan 20, 2024, 6:56 AM

#

oh it definitely is for now haha

median dune Jan 20, 2024, 6:58 AM

#

dark ruin like full gpu offloading working good?

I do I know if is? I got psyfighter 13b running faster than my windows 1080ti PC currently. I don't think that's the problem. I think it's my settings. Should I have it like this or is there something else I'm missing.

dark ruin Jan 20, 2024, 6:58 AM

#

dark ruin https://ollama.ai

https://github.com/LostRuins/koboldcpp

GitHub

GitHub - LostRuins/koboldcpp: A simple one-file way to run various ...

A simple one-file way to run various GGML and GGUF models with KoboldAI's UI - GitHub - LostRuins/koboldcpp: A simple one-file way to run various GGML and GGUF models with KoboldAI's UI

dark ruin Jan 20, 2024, 6:59 AM

#

median dune I do I know if is? I got psyfighter 13b running faster than my windows 1080ti PC...

that doesnt tell me anything, but you shouldnt need anymore from there

#

show me your loading model settings?

#

this stuff

median dune Jan 20, 2024, 7:01 AM

#

dark ruin this stuff

#

Wait, I think I got it fixed.

#

@dark ruin Holy fuck... I got it working. Another question, do you know how to get Eurake 1.3 working on oobabooga?

dark ruin Jan 20, 2024, 7:19 AM

#

median dune <@429907325063528450> Holy fuck... I got it working. Another question, do you kn...

nice one!

#

ill have to pop you over to my friend so you can help him lol

#

What is eurake? you mean eurayle or something?

#

if it's 70B, you will need a gguf quant file of it

#

with 20gb vram and 64gb ram you should be able to run quant 4 KM but slowly

median dune Jan 20, 2024, 7:21 AM

#

dark ruin ill have to pop you over to my friend so you can help him lol

https://www.youtube.com/watch?v=wMe4FY1EWb0

YouTube

AI in 5 Minutes

How to use Oobabooga WebUI with SillyTavern

In this tutorial, I show you how to use the Oobabooga WebUI with SillyTavern to run local models with SillyTavern.

▶ Play video

#

I followed this video. I also realized I had it connected, but it would go green and say none. The issue was, I didn't have a model loaded when on oobabooga.

dark ruin Jan 20, 2024, 7:22 AM

#

LOL

#

oh well

median dune Jan 20, 2024, 7:23 AM

#

dark ruin oh well

it looks like this when you put that info on ST. I didn't think it worked because it said none. It wasn't until the video showed how it looked on his end that I connect the dots.

#

Also, wow, the 13b model is so much faster on this than my windows machine.

median dune Jan 20, 2024, 7:26 AM

#

dark ruin if it's 70B, you will need a gguf quant file of it

I downloaded gptq version, so if I download the gguf version, it should work?

dark ruin Jan 20, 2024, 7:36 AM

#

median dune Also, wow, the 13b model is so much faster on this than my windows machine.

thats good news

dark ruin Jan 20, 2024, 7:36 AM

#

median dune I downloaded gptq version, so if I download the gguf version, it should work?

gptq and exl2 are gpu only

#

the whole model MUST fit on the vram

#

gguf is cpu and system ram, with the option to offload layers onto vram to speed things up

median dune Jan 20, 2024, 7:40 AM

#

dark ruin gguf is cpu and system ram, with the option to offload layers onto vram to speed...

I got a 32 × AMD Ryzen 9 5950X 16-Core Processor and 125gb of ram.

dark ruin Jan 20, 2024, 7:42 AM

#

lol thats good as

#

so the idea is to offload as much layers of the model noto vram as possible

#

without overflowing the vram

#

cuz you need some space for context

#

i would start with 20 layers and increase that until it fails to load

median dune Jan 20, 2024, 7:44 AM

#

dark ruin i would start with 20 layers and increase that until it fails to load

I'm downloading a gguf model now, but where would I adjust that, in parameters?

dark ruin Jan 20, 2024, 7:48 AM

#

have a look now at the lama.cpp loader

#

will have n_gpulayers

#

thats where you set it

median dune Jan 20, 2024, 7:52 AM

#

dark ruin have a look now at the lama.cpp loader

is this on the web ui settings or in the file directory?

dark ruin Jan 20, 2024, 7:53 AM

#

web ui

median dune Jan 20, 2024, 7:53 AM

#

dark ruin web ui

here?

#

or here?

dark ruin Jan 20, 2024, 9:37 AM

#

median dune or here?

that one

median dune Jan 20, 2024, 9:40 AM

#

dark ruin that one

how many layers would I set it too?

dark ruin Jan 20, 2024, 9:42 AM

#

as many as you can fit but not too many

#

if you can monitor vram usage that would help a lot

#

start with 20 layers and increase if that works

#

you want to aim for around 1.5-2 tokens/second

median dune Jan 20, 2024, 9:45 AM

#

dark ruin you want to aim for around 1.5-2 tokens/second

ok

dark ruin Jan 20, 2024, 9:54 AM

#

if its too many leyers on gpu, it will be very slow

#

if too little layers, it will be very slow

#Silly tavern Api issue