help-forum
Not sure how setup hosting
Gradio
IndexError: list index out of range
Oobabot installation help
Upload History Not Recognized By Model
Gradio
Prompts & Characters
Windows
Expose server to local LAN. Is there a flag to just allow local LAN access?
Windows
ExLlama not working as intended
GGML models are not using GPU at all
Oobabooga api and streaming url not found
Windows
Using Text-Gen GGML (CPU ONLY)
Setup
LLaMA
Function Calling
LLaMA
Getting error trying to use EX_Llama
Gradual slowdown as the chat gets longer, normal or not?
EleutherAI/pythia-1b-deduped wont load under any model loader.
Windows
Solved | RuntimeError: DefaultCPUAllocator: not enough memory: you tried to allocate 238551040 bytes
Setup
Windows
How to ask questions about custom data?
Not getting a link. No errors just... nothing
New installation, fails to load model
line 134, in download_model_wrapper downloader = downloader_module.ModelDownloader() TypeError: Mode
2 mismatched GPUs
What is the difference between chat, chat-instruct and instruct ?
TypeError: arange() received an invalid combination of arguments
Linux
LLaMA
RuntimeError: CUDA error: an illegal memory access was encountered
Linux
LLaMA
Hardware
How do i open the WebUI?
Selecting GPU over CPU?
Unable to run xformers
0.2 tokens/s when reaching context size limit
Ran Update Script, Now Broken (Windows)
please help to run 2 GPU with ooba
does text-generation-webiui has api?
compile llama on linux
Newest update failing
RuntimeError: data. DefaultCPUAllocator: not enough memory: you tried to allocate 238551040 bytes.
Setup
Windows
.BAT FILE FAIL
Setup
Windows
INCREASE RESPONSE LENGTH?
Gradio
Windows
Exception: Model couldn't generate your text, probably it's too long
API broken after Update - SOLVED
Disable default load model
Update process is outputting errors.
Windows
Different speeds in UI vs console
Model way too creative, inventing too much stuff.
Webui cpu to gpu
Setup
adapter_config.json
SuperHOT 8k Model for ExLlama as GPTQ or FP16?
Setup
LLaMA
Windows
LLamacpp GPU acceleration
Setup
Windows
Installing cuda extensions problem
I'm having problems with the "--api" flag
Setup
Downloading with zip
Linux
Setup
30B Model "out of memory" autogptq-for-llama "only support 2,3,4 and 8 bits"
Linux
Faster Contrastive search?
I cant load models
Setup
Windows