#moderation

1 messages · Page 1 of 1 (latest)

compact flint
#

Can you share the prompt?

#

Oh nvm

#

You didn't actually prompt the model with anything

#

And the user content is inside the system prompt which is wrong

compact flint
#

You should put a little more effort into it. If you want to base the prompt on the 11labs policy, pass the link to said policy into chatgpt or something and ask chatgpt to generate a prompt for detecting malicious content.

That's the fast way but a good starting point.

Make sure that prompt is in the system message. Make sure the user content is in the user message.

And I'm not sure why have you included the assistant message there

sick wyvern
compact flint
sick wyvern
#

im using groq

compact flint
#

I guess improve your prompt then
So that it can correctly classify your example case and test it with other variations until you think it's good enough

#

"mark any unsafe content" is not a very good call to action for an LLM, this part has to be elaborate

sick wyvern
#

but im not sure if its best way