Yes, even if data is only processed temporarily (e.g., transcribed in memory and not stored persistently), data classification and sensitivity remain important factors in determining whether ingress authentication is appropriate.
Deepgram’s documentation for self-hosted deployments states that, by default, no audio, transcripts, or identifying request content are sent outside your environment, and that request and response data are not stored beyond the duration of the original API request Self-Hosted Introduction. However, it also emphasizes that self-hosting is often chosen to meet strict security and data privacy requirements, especially for sensitive or regulated data.
The documentation recommends implementing authentication as a best practice to secure your self-hosted instance, regardless of whether the data is stored persistently or only processed in memory. This is particularly relevant if the data being processed is sensitive, as unauthorized access—even to transient data—could still pose a risk Ingress Authentication Best Practices.
In summary, the temporary nature of data processing does not eliminate the need for ingress authentication if the data is sensitive or regulated. The decision should be based on the classification and sensitivity of the data, not just on whether it is stored persistently.
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon: