Anyone get prompt cacheing working with | OpenAI | Page 1

vital isle Sep 15, 2025, 4:44 PM

#

Just looking to cut cost for things that are not really changing (system prompts)

#

@grim aurora 👆 (feels like a docs issue)

jolly wing Sep 15, 2025, 5:00 PM

#

I guess you've tried using section headers:
--- LIKE THIS ---

vital isle Sep 15, 2025, 5:46 PM

#

I'm not following? Do you mean in the prompt?

I literally sent the same lorem ipsum request N times in a row:

when using the chat API it works
when using the reponses API it works ONLY with passing the previous_response_id

#

Also when I send the previous_response_id it continues to increase the tokens (its the same lorem ipsum). Which makes sense since its the "same" convo but I don't want the same convo per-say I just want the cache hits. I thought that would be via the prompt_cache_key

vital isle Sep 15, 2025, 6:17 PM

#

@jolly wing did you mean the prompt? I can also share the res/req object I see the data is being sent as a system role

#Anyone get prompt cacheing working with