Bit of a bump of my question yesterday. | OpenAI | Page 1

Ok. After playing around a bit. There are definitely some limitations to method 1. I think when I run is called the assistant prompt is included towards your token rate limit. So if you have a decently long prompt the total tokens to process the messages is a multiple of the messages you have.

If this is true, method 2 might be required.

#Bit of a bump of my question yesterday.