#Enabling response streaming from Ollama

1 messages · Page 1 of 1 (latest)

faint terrace
#

The HA 2025.3 blog post says that streaming responses from LLM is now supported (been waiting for this I saw chatter about it coming over on Reddit early last month). This doesn't seem to work for me however after updating. Does the Ollama integration itself need to be updated also? I'm interested in contributing to HA, but still trying to get a good understanding of the architecture.

Has anyone got this working?