I'm currently using OpenRouter and would like to add a moderation layer before sending user input to the actual completion call. I allow users to select any model from a list of available models.
However, I couldn’t find clear documentation explaining whether OpenRouter performs content moderation by default, or if I need to configure it manually. The only reference I found was a two-year-old Reddit post: https://www.reddit.com/r/SillyTavernAI/comments/160ylup/openrouter_ai/
My second question is:
If OpenRouter doesn’t include built-in moderation, can I use the OpenAI Moderation API before forwarding the request to OpenRouter? Specifically, can I call the OpenAI moderation endpoint through OpenRouter, or do I need to use OpenAI’s API keys directly for moderation?