#GPT 3.5 became 10x slower than 2 weeks ago

mild pulsar
#

The GPT 3.5 API has become incredibly slow. I compared it with the Playground using the same prompt, and the API is about 10 times slower. Could this be related to the free credits from Microsoft Azure? Our account was paid, and then we received these credits.

mild pulsar
#

This request took 1 minute:

{"model":"gpt-3.5-turbo","messages":[...],"temperature":0.5,"max_tokens":3000}
#

completion_tokens: 201,
total_tokens: 349

Is 1 minute normal for these numbers?
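
For a rough sense of scale, the numbers above can be turned into a throughput figure. This is only a back-of-the-envelope sketch: `tokens_per_second` is a hypothetical helper, and the 60 seconds is the approximate "1 minute" from the message, not a measured value.

```python
def tokens_per_second(completion_tokens: int, elapsed_seconds: float) -> float:
    """Generated tokens divided by wall-clock time for the request."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return completion_tokens / elapsed_seconds


# Figures quoted in the thread.
completion_tokens = 201
total_tokens = 349
elapsed_seconds = 60.0  # assumption: "1 minute" taken as exactly 60 s

# The prompt size follows from the two reported counts.
prompt_tokens = total_tokens - completion_tokens

rate = tokens_per_second(completion_tokens, elapsed_seconds)
print(f"prompt_tokens={prompt_tokens}, rate={rate:.2f} tokens/s")
```

Under those assumptions the request generated only a few tokens per second, which gives a concrete number to compare against the Playground or another model.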

woeful orchid
#

Try lowering max_tokens to 1000.

teal gazelle
#

Lowering the tokens may work, but what is the point if you can't use the API as intended? These latency issues make it difficult to actually build products on OpenAI. I've also been testing over the last 4 months and the same issues keep popping up, mainly the gpt-3.5-turbo endpoint being overloaded, with an error response telling you to try again later or contact OpenAI. More hardware and resources to process more requests is the only real solution to this ongoing problem, imo. Anyone else have an opinion?
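
Until capacity improves, a common client-side workaround for "overloaded, try again later" responses is to retry with exponential backoff. The sketch below is generic and makes no real API call: `OverloadedError` and `call_with_backoff` are hypothetical names, and you would need to map your client library's actual overload error onto the exception caught here.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class OverloadedError(Exception):
    """Hypothetical stand-in for the error your client raises on overload."""


def call_with_backoff(
    fn: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    max_delay: float = 30.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn, retrying on OverloadedError with exponential backoff."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except OverloadedError:
            # Give up after the final attempt and surface the error.
            if attempt == max_attempts - 1:
                raise
            # Delay doubles each attempt (1 s, 2 s, 4 s, ...), capped at max_delay.
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Up to 10% random jitter so many clients do not retry in lockstep.
            sleep(delay + random.uniform(0, delay * 0.1))
    raise ValueError("max_attempts must be at least 1")
```

Usage would look like `call_with_backoff(lambda: send_request(payload))`, where `send_request` is whatever function of yours issues the request. This only smooths over transient overload; it does not make an individual slow response any faster.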

austere finch
#

@mild pulsar I'm having a possibly related problem.
The backend is using gpt-3.5-turbo-0301 instead, which is 10x more expensive, and maybe also slower.

mild pulsar
#

I was forced to switch to text-davinci-003 because the app is unusable at this speed. Davinci renders it in 15 seconds.

woeful orchid
#

Yeah, I agree they need more scalability. Way more.