#Different models for tts/stt


pallid sage
#

I was wondering if it's possible to use anything besides piper and whisper.

brazen patrol
#

I suspect someone will do it, but it takes quite a bit of compute to run.

#

Actually a pretty cool idea if you have the VRAM to run it on something.

pallid sage
#

Makes sense. I'm running all my TTS/STT/LLM on my workstation ATM and connecting HA to it from the RPi.

#

Yeah, I'm running on a 7900 XTX.

pallid sage
#

Interesting, I figured I can run it with llama-server

brazen patrol
#

I am sure it's "possible" to run the model on non-CUDA cards with the standard tools. However, it might take some time before all the pieces are in place. Remember, it's only been a few days.

pallid sage
#

But I was wondering about any Wyoming proxy kinda thing

brazen patrol
#

Or if someone made an OpenAI-TTS wrapper you could probably connect it using THIS custom integration.

#

ooo, looks like someone has made an OpenAI-TTS wrapper HERE.
If you can get that running, I imagine you can use the above custom integration with it.
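
For anyone trying to fit those pieces together, here is a rough sketch of the kind of request an OpenAI-style TTS client sends. It assumes the wrapper mirrors OpenAI's `/v1/audio/speech` route; the address `http://localhost:8000`, the helper name `build_speech_request`, and the `model` and `voice` values are placeholders, not anything the wrapper or the integration is confirmed to use.

```python
import json
import urllib.request


def build_speech_request(base_url, text, voice="alloy", model="tts-1"):
    """Build (but do not send) a POST to an OpenAI-style /v1/audio/speech route."""
    payload = {
        "model": model,            # placeholder; a local wrapper may ignore or remap it
        "input": text,             # text to synthesize
        "voice": voice,            # placeholder voice name
        "response_format": "wav",  # audio container requested from the server
    }
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # Hypothetical address of the wrapper running on the workstation.
    req = build_speech_request("http://localhost:8000", "The kitchen light is on.")
    print(req.get_method(), req.full_url)
    # To actually fetch audio once a server is listening:
    # with urllib.request.urlopen(req) as resp, open("out.wav", "wb") as f:
    #     f.write(resp.read())
```

If a request like this returns playable audio from the workstation, pointing the custom integration at the same base URL should be the only remaining step, assuming it speaks the same API.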

brazen patrol
#

Just a matter of trying to fit the pieces together.

pallid sage
#

Man this is a rabbit hole lol
