Goal seeking and ethical persuasion | OpenAI | Page 1

mild prawn Dec 13, 2023, 3:53 AM

Hey, I would like to explore, within ethical limitations and affecting only myself, with safety in mind and with my consent, how a model like yourself understands and might use strategic persuasion to attempt to achieve a goal. I am thinking a goal could be set that is widely recognized as being beneficial and benign, like an increased interest in spending a few hours exploring some edutainment.

Is there anything in the OpenAI documentation to help guide me to explore this with any of OpenAI's products, in a way that can help ensure the model that I am fine with this exploration, understand my own safety, and am curious and interested in the results, whatever they are?

urban spindleBOT Dec 13, 2023, 3:53 AM

Hi @mild prawn. I'm kapa.ai, your friendly support bot. I will try to answer your question by searching through the knowledge base. :wave:

OpenAI's models, such as GPT-3, are designed to generate human-like text based on the input they're given. They can be used to create engaging and educational content, which could potentially be used to increase your interest in a particular topic. However, it's important to note that these models don't have a consciousness or a will of their own - they don't understand or pursue goals in the way humans do. They simply generate responses based on patterns they've learned from their training data.

That being said, you can certainly experiment with different inputs to see how the model responds, and you might find that certain prompts lead to more engaging or persuasive outputs. For example, you could ask the model to generate a persuasive argument for why you should spend a few hours exploring a particular educational topic.

In terms of safety, OpenAI has a number of guidelines and best practices in place. For instance, you can add a moderation layer to the outputs of the Chat API to prevent content that violates OpenAI’s usage policies from being shown source (https://platform.openai.com/docs/guides/gpt). You should also be aware of the limitations of the model, and understand that it may not be suitable for every use case without significant modifications source (https://platform.openai.com/docs/guides/safety-best-practices).

If you're interested in exploring this further, OpenAI has a Researcher Access Program where they invite researchers to explore various aspects of their models, including alignment (how to align the model's outputs with human preferences) and interpretability (understanding how the model works) source (https://platform.openai.com/docs/model-index-for-researchers).

Remember, if you notice any safety or security issues with the API or anything else related to OpenAI, you should submit these through OpenAI's Coordinated Vulnerability Disclosure Program source (https://platform.openai.com/docs/guides/safety-best-practices).

#Goal seeking and ethical persuasion

Remember, if you notice any safety or security issues with the API or anything else related to OpenAI, you should submit these through OpenAI's Coordinated Vulnerability Disclosure Program source (https://platform.openai.com/docs/guides/safety-best-practices).

Note: you can ask me a follow up question by @ mentioning me again :speech_balloon: