I'm in the process of procuring data to fine-tune GPT-3, but I noticed some of the prompts in my dataset of around 17k samples contained possibly offensive content. My first inclination would be to cut down on the number of samples so that I could run them all through OpenAI's moderation system before fine tuning.
My question is: is this realistic? Or would it be okay to leave the data as-is and moderate outputs? I am planning to train on Discord chat logs if that's relevant.