Is it normal for the GPT3.5-turbo model to be performing slower than the text-davinci-003 model? I have a webapp that is utilizing davinci-003 at the moment, but noticed when I switch to GPT3.5 Turbo, that the response time actually becomes longer, despite it being the "turbo version"
Wanted to leave this here and ask if anyone else has experienced this same issue or whether this is a known thing that GPT3.5-turbo is slower.