Hello,
I am building an app that uses chatgpt (specifically gpt-3.5-turbo) currently I have it hosted on heroku 25/month shared tier. Locally the API call is snappy and responds quick. On the cloud it can take anywhere from 2-6 seconds for the api to kick off. I’ve identified that it is the initial api call itself that is slow, once streaming starts it is okay. My question is, where do you guys host your back ends for calling chatgpt?
Other info:
Running on node js using OpenAI official node client
Using websockets
Api is using stream=true