Hi Team,
I am experiencing delayed response from the Wizard 8x22b model for the prompt specified from last couple of weeks.
Prompt consists of a detailed structure for extracting relevant information from a raw text and convert the same into an asked JSON format.
But the API is giving response in 1min+ timeframe.
Can you share possible reasoning on the same to resolve such issues?