I'm noticing a divergence in voice assistant wake word architecture. I can't be the first one to comment on this, but I also can't really find anyone else talking about it. Almost all the wake word training libraries output onnx, and all the software that's being developed wants to use tflite files because, I suppose, it's better for extremely low-power devices.
Is there a server somewhere that I can set up that integrates nicely into the HA voice pipeline that's based on OpenWakeWord or something else that uses .onnx files? It's possible to convert onnx -> tflite and I've written scripts to do it, but it's a pain and depending on what software generates the onnx file there's hidden settings (flipped axes are the big one) that need to be addressed, and no particularly easy way to convert. It's just a pain, right now.
Are there any projects addressing this? Are we gonna decide on one or the other at some point?
I realize this is an esoteric question - I just don't understand why training software / scripts output one and all the wake-word software wants another.