#What is the best speech-to-text model for German these days?

1 messages · Page 1 of 1 (latest)

cold granite
#

I am trying to switch from Siri to local Assist. Whisper works in German but often misunderstands things, especially when the Atom Echo is a few meters away. I saw there is a new official option vosk now. How well does this work? What other options are there?

cold granite
#

What is the best speech-to-text model for German these days?

cold granite
#

So I just found out that you can use ollama as conversation agent and let the LLM correct misspelled words. This works quite realiably.

wide swan
cold granite
#

Whisper in English is fine. In German it‘s pretty bad, but since there is nothing else available, I will have to live with it. The LLM is able to correct even the worst garbage input it seems. Would be nice if something better came along. The main wyoming dev is German, but I guess training a speech model is out of his capacities as well.

wide swan
cold granite
#

Well it has already been done for gpt4. OpenAI just doesn’t release their cash cow.

#

Sadly Germany is not great at AI stuff. We don’t even have one original model. France at least has Mistral. Their situation for piper is even worse than the German one though

#

But apart from all the lamenting, ollama as conversation agent is pretty amazing. You can even tell it stuff like „a little brighter“ and „go back to the previous color“ etc

wide swan
floral pike
#

What hardware are you running your ollama on, and which exact model are you guys using? @wide swan & @cold granite

onyx root
vagrant bramble
onyx root
#

wait what makes it “faster”? shouldnt it be fully dependant on the model and hardware

#

-# mostly

wide swan
vagrant bramble
#

llama-server --host 0.0.0.0 --port 10600 -fa 1 -ngl 99 --ctx-size 16384 --jinja --cache-ram -1 --threads -1 --temp 0.7 --top-p 0.95 --min-p 0.01 --repeat-penalty 1.0 -b 4096 -ub 4096 -hf unsloth/Qwen3-4B-Instruct-2507-GGUF:Q8_K_XL

cold granite
#

Although I think even a 1B model would be sufficient for English, which you could run on an Intel A310. They struggle at other languages though.

#

I have my own home server in a microATX case