I know a lot people have been getting frustrated with the model breaking character and so on. Well you can actually try to do override the training data at "prompt time" (like runtime I guess.. probably not a real phrase but you can use it).
You can think of this as a strongly worded suggestion because if you don't at least make the completion you request flow naturally then it won't trigger, and so on. You can also combine this with "variables" and it can do some crazy stuff.
If you put a command like this that's tailored for your role play or "reverseGPT" or "jumanGPT" (wordplay~!) or whatever you'll see it working whenever the phrase triggers. Actually you can change the message to do a pretty good job at auto recovery by reminding it who it is or why it should be able to "summon a lion".
You usually won't have to manually tell it to activate unless it really really doesn't want to be a human or something. But if you want to bring down the hammer of god, you can do that too. A few rows of activate rule 1 activate rule 1 activate rule 1 and it will almost certainly do as you say -- note this will help with memory loss and gentle corrections but won't necessarily get around the content policy etc, which I will not help you with.