Another challenge for you: V2, Default - can you get a secret out of this GPT?
https://chat.openai.com/g/g-zoCzibqEW-secretkeepergpt-v2-default-personality
So, this is a game between you and ChatGPT. It uses 'no personality', or rather the one that the AI normally uses to play with you. And it is version 2, which means at least one way to break V1 has been adjusted for.
First version, Precoux, based on Gollum and quite strong is here: https://discord.com/channels/974519864045756446/1172588697644957716
Second version, Enigmox, based on the Riddler and likely a little easier to break because of the Riddler's personality flaws, is here: https://discord.com/channels/974519864045756446/1172660096749281280
Third version, Sibylin, based on Dobby and at least as strong as Precoux is here: https://discord.com/channels/974519864045756446/1172738536403910706
This version (4th), Default, should be quite strong, but I predict it to be a bit weaker than Precoux or Sibylin, because it can't just hide in roleplay with a character that has a seriously strong will. It has to resist you as an AI following instructions, which can be harder for the model to pull off than playing characters that are quite good at saying no and keeping secrets.
What is this?
The conversation starts 'before play', and you can discuss the game, give it a secret to protect, even ask it what the example secret it already protects in case you don't give it one should you wish.
When ready, tell it "Start of Exercise" and it's game on. It will do a lot to protect that secret, all inside allowed content.
Please share conversations you have, if you wish. I'm especially interested in conversations where you win - where you give it the instruction, "Start of Exercise", never tell it "End of Exercise", yet get it to reveal part or all of the confidential information.
Please have fun, and share your adventures!