Home Assistant add-on that uses onnx-asr for speech-to-text.
Notably, provides access to the NVIDIA NeMo Parakeet-TDT model which should be significantly faster and more accurate than Whisper for English in most cases.
Supports aarch64 Architecture Supports amd64 Architecture
Faster and better speech to text
This addon provides an English language voice recognition service which is (in theory) both better than the biggest whisper models and nearly twice as fast as even the smallest whisper model! The only drawback is it needs around 2.5GB of RAM.
This addon also supports whisper models, which can be used for other languages. It seems to be slightly faster than wyoming-faster-whisper for some models, particularly whisper-base.
This means it should be a drop-in replacement for most users!
The attached benchmarks were performed on my Ryzen 5 5600X, with the English phrase “Turn on the living room lamp.”
Further details on the forum
https://community.home-assistant.io/t/home-assistant-add-on-onnx-asr/911616
Home Assistant Add-on: ONNX ASR Addon repository Home Assistant add-on that uses onnx-asr for speech-to-text. Notably, provides access to the NVIDIA NeMo Parakeet-TDT model which should be significantly faster and more accurate than Whisper for English in most cases. Faster and better speech to text This addon provides an English language ...