help-forum

805 threads · Page 13 of 17

Not sure how setup hosting 2 messages
Gradio
IndexError: list index out of range 9 messages
Oobabot installation help 2 messages
Upload History Not Recognized By Model 9 messages
Gradio Prompts & Characters Windows
Expose server to local LAN. Is there a flag to just allow local LAN access? 2 messages
Windows
ExLlama not working as intended 21 messages
GGML models are not using GPU at all 20 messages
Oobabooga api and streaming url not found 8 messages
Windows
Using Text-Gen GGML (CPU ONLY) 6 messages
Setup LLaMA
Function Calling 22 messages
LLaMA
Getting error trying to use EX_Llama 20 messages
Gradual slowdown as the chat gets longer, normal or not? 7 messages
EleutherAI/pythia-1b-deduped wont load under any model loader. 2 messages
Windows
Solved | RuntimeError: DefaultCPUAllocator: not enough memory: you tried to allocate 238551040 bytes 6 messages
Setup Windows
How to ask questions about custom data? 8 messages
Not getting a link. No errors just... nothing 2 messages
New installation, fails to load model 4 messages
line 134, in download_model_wrapper downloader = downloader_module.ModelDownloader() TypeError: Mode 12 messages
2 mismatched GPUs 17 messages
What is the difference between chat, chat-instruct and instruct ? 13 messages
TypeError: arange() received an invalid combination of arguments 11 messages
Linux LLaMA
RuntimeError: CUDA error: an illegal memory access was encountered 5 messages
Linux LLaMA Hardware
How do i open the WebUI? 18 messages
Selecting GPU over CPU? 7 messages
Unable to run xformers 4 messages
0.2 tokens/s when reaching context size limit 24 messages
Ran Update Script, Now Broken (Windows) 27 messages
please help to run 2 GPU with ooba 3 messages
does text-generation-webiui has api? 51 messages
compile llama on linux 5 messages
Newest update failing 40 messages
RuntimeError: data. DefaultCPUAllocator: not enough memory: you tried to allocate 238551040 bytes. 5 messages
Setup Windows
.BAT FILE FAIL 13 messages
Setup Windows
INCREASE RESPONSE LENGTH? 4 messages
Gradio Windows
Exception: Model couldn't generate your text, probably it's too long 5 messages
API broken after Update - SOLVED 4 messages
Disable default load model 2 messages
Update process is outputting errors. 2 messages
Windows
Different speeds in UI vs console 5 messages
Model way too creative, inventing too much stuff. 15 messages
Webui cpu to gpu 2 messages
Setup
adapter_config.json 2 messages
SuperHOT 8k Model for ExLlama as GPTQ or FP16? 3 messages
Setup LLaMA Windows
LLamacpp GPU acceleration 19 messages
Setup Windows
Installing cuda extensions problem 6 messages
I'm having problems with the "--api" flag 4 messages
Setup
Downloading with zip 2 messages
Linux Setup
30B Model "out of memory" autogptq-for-llama "only support 2,3,4 and 8 bits" 9 messages
Linux
Faster Contrastive search? 4 messages
I cant load models 10 messages
Setup Windows