help-forum | Text Generation WebUI | Page 12

2 tokens/second 17 messages

Jul 25, 2023, 8:51 AM

KeyError: ‘bos_token_id’ 2 messages

Jul 23, 2023, 9:42 PM Windows

VRAM & "CUDA out of memory." 40 messages

Jul 23, 2023, 8:06 PM Hardware Setup

Cuda error 2 5 messages

Jul 23, 2023, 7:13 PM

general performance-review 9 messages

Jul 23, 2023, 3:42 AM Windows

unsupported tensor dtype in loras LIMARP with Exllama 7 messages

Jul 22, 2023, 7:16 PM

How to convert ggml to ggml v3? 4 messages

Jul 22, 2023, 6:32 PM

Can not run llama2 7b parameters on windows webui (updated) 56 messages

Jul 22, 2023, 5:57 PM

Can't install models with WebUI 26 messages

Jul 21, 2023, 11:48 PM Hardware Setup Windows

Could not find the quantized model in .pt or .safetensors (Solved, wrong model loader.) 2 messages

Jul 21, 2023, 10:43 PM Windows

Loading TheBloke_Llama-2-70B-chat-GPTQ… 10 messages

Jul 21, 2023, 8:31 PM Windows

[RESOLVED] OSError: [WinError -1073741795] Windows Error 0xc000001d 38 messages

Jul 21, 2023, 2:53 AM Setup Windows

any recommendations? 22 messages

Jul 21, 2023, 1:27 AM

All of a sudden my screen now goes off randomly while running ooba 9 messages

Jul 21, 2023, 1:03 AM Hardware

I can't figure out what's wrong. It won't Load Models! 11 messages

Jul 21, 2023, 12:44 AM

In chat-instruct mode where do you add memory of important details about the user LLM is talking to? 7 messages

Jul 20, 2023, 12:54 PM

Can't get any models to work in Text Generation WebUI on Windows 11 laptop 3 messages

Jul 20, 2023, 12:55 AM Windows

How to install GPTQ-for-Llama with venv on Linux? 3 messages

Jul 19, 2023, 7:15 AM Linux Setup LLaMA

Failed building wheel for quant-cuda 6 messages

Jul 19, 2023, 1:06 AM Setup Windows

.safetensors models produce nonsense output. 16 messages

Jul 18, 2023, 9:53 PM

524 error with public api extension - any insight? 2 messages

Jul 18, 2023, 9:22 PM Windows

New to WebUI, Conda errors 2 messages

Jul 17, 2023, 5:33 PM Windows

Auto Installer - Listen on Lan 4 messages

Jul 17, 2023, 1:15 PM Linux Setup

Are there any existing functions to submit prompts in batches? 10 messages

Jul 17, 2023, 12:37 PM Linux Prompts & Characters

Question about Perplexity Results 42 messages

Jul 17, 2023, 10:44 AM

max_new_tokens increased length 6 messages

Jul 16, 2023, 8:46 PM

Gibrish/Empty responses when clicking "continue". 8 messages

Jul 16, 2023, 6:50 PM LLaMA Windows

Error when loading model (WizardLM Uncensored Falcon 40B) 54 messages

Jul 16, 2023, 4:12 PM Hardware Setup Windows

Issues using OpenAi api exstension 7 messages

Jul 16, 2023, 2:53 PM

silero_tts repeating voice clip bug 3 messages

Jul 15, 2023, 11:12 PM

models won't load 51 messages

Jul 15, 2023, 6:15 PM

Hi I enabled public_api in Text Generation WebUI from RunPod service and I need the URL 2 messages

Jul 15, 2023, 1:54 PM Hardware Setup

can't start web UI without loading a model, Bug? 4 messages

Jul 15, 2023, 4:01 AM Windows

llama.cpp not using GPU despite having BLAS = 1 (Linux, GGML) 121 messages

Jul 15, 2023, 2:35 AM Linux LLaMA AMD

What are Ooga's default preset values? 4 messages

Jul 14, 2023, 7:38 PM

What models can I use? 25 messages

Jul 14, 2023, 6:31 PM Hardware Windows

What prompts does Chat mode use instead of Instruct mode? 3 messages

Jul 14, 2023, 2:55 PM Linux Prompts & Characters

I fine-tuned the model and I got CUDA out of memory 4 messages

Jul 14, 2023, 9:00 AM Linux

When training Lora in alpaca format - how do i ensure the conversation history is maintained ? 5 messages

Jul 13, 2023, 4:09 PM

KeyError: 'lm_head.weight' when attempting to load Guanaco 33B with loader other than Transformers 4 messages

Jul 13, 2023, 1:04 PM Linux LLaMA

Show prompts in Console? 3 messages

Jul 13, 2023, 9:23 AM

Expected inference speeds with a 3090Ti / ExLlama setup 106 messages

Jul 13, 2023, 1:15 AM

Getting error while trying to create a LoRA based onTheBloke_WizardLM-33B-V1-0-Uncensored-SuperHOT-8 5 messages

Jul 12, 2023, 9:25 PM Linux Setup Windows

Errors when running SUPERHOT models using Exllama 7 messages

Jul 12, 2023, 6:47 AM

Cpu bottleneck 2 messages

Jul 11, 2023, 8:42 PM Hardware

How exactly does feature "start replay with" work? 21 messages

Jul 11, 2023, 7:27 AM

Runtime error while trying to boot alpaca 5 messages

Jul 11, 2023, 6:29 AM Setup Windows

CUDA out of memory 8 messages

Jul 10, 2023, 8:05 PM Windows

ERROR:Failed to load the model 3 messages

Jul 10, 2023, 9:24 AM

Conda environment is empty 30 messages

Jul 10, 2023, 12:55 AM Setup Windows