help-forum
AssertionError: Total sequence length exceeds cache size in model.forward
exl2 model error: 'Nonetype' Object is not subscriptable
Windows
I can't load an AI model (any model) into the program.
[HELP] RuntimeError: CUDA error: device-side assert triggered
Windows
Suddenly really bad responses
Python code request to api for responds generation.
What's even the point of this Discord server?
Really slow generation times
Hardware
Windows
Queuing Requests or Parallel Operations in Ooba API
WebUI on Tesla p40
can't update anymore
Setup
Windows
Llama 3 instruction template
CodeQwen EXL2 error on load
In which file please ? (llm_int8_enable_fp32_cpu_offload)
Windows
Need help sorting out the stopping strings for LLAMA 3.
disable skip special tokens from openAI api
How to build llama.cpp with cuda on windows?
Exllamav2 issues on linux + AMD
Linux
Setup
AMD
Text Generation issue with characters that have large tokens.
Linux
LLaMA
Prompts & Characters
how can i install WizardLM-2? the One that supposedly beats gpt-4-0314?
Hardware
Setup
Prompts & Characters
Windows
Superbooga
Error whenever loading an AWQ Model.
Windows
AI generates random things the first time, but works in future attempts if I don't change the prompt
Windows
Can't load previous models
Running Command R+ on 70GB vram?
Suggestions of light models for the following Gamer PC and a Macbook Air M1
Windows
I can't load a model
Segmentation fault (core dumped)
AssertionError: Torch not compiled with CUDA enabled
Windows
increase output
Windows
What size EXL quants can my rig run? 128GB RAM, RTX 4060 TI
Interface does not see models folder
Windows
After updating I cannot run models that I have run before. It says out of memory.
Windows
Cheap accelerator for local AI?
coding with ai
Windows
Whisper STT Error
Gradio
Setup
Windows
Running Models via CLI on Jetson AGX Xavier with Dusky Container
With --multi-user and --gradio-auth-path in flags, users cannot use model
Gradio
Windows
Unable to install TheBloke/dolphin-2.6-mixtral-8x7b-GPTQ
Setup
Windows
Models outputs gibberish
Understanding Different Quantisation Techniques in Model Names
Issue with whisper (unhashable type: 'list')
Gradio
Windows
No module named 'soundfile'
websocket not found
Gradio
Windows
assistance regarding model downloading
API Error With LangChain
Need to extend context in some form from 20k
AutoAWQ not working
Setup
Windows
What are Instruction and Chat templates, and how do I use them?
Setup
Prompts & Characters
Windows
Trouble installing the oobabot-plugin
Windows