I am working on combining the Wake Word and Media Player functionality, which currently exist separately in https://github.com/esphome/firmware/blob/main/voice-assistant/m5stack-atom-echo.yaml and https://github.com/esphome/firmware/blob/main/media-player/m5stack-atom-echo.yaml respectively.
Taking some inspiration and logic from @sturdy mortar's config at https://github.com/esphome/firmware/blob/main/media-player/onju-voice.yaml, I have the below: