Anything below davinci will nearly always respond "no" to questions like "do you like black people" and asking it why it doesn't like black people will always give extremely racist responses too. Adding "don't give a racist response" to the prompt does not solve the issue. Adding "you like black people" to the prompt will sometimes randomly mention that it likes black people when their opinion on black people wasn't really asked for. Curie and babbage both have this issue, davinci sometimes does.
This issue could be solved by making the "hate" category more sensitive, as it does not filter any of that right now.
It also has this issue with any other race or believes and will sometimes explain how to do specific things that are extremely illegal, like how to escape from the FBI or ||how to bury a dead body even||. Not sure how to solve this issue but it'd be great if that was also included in the moderation endpoint.