#Chat guide?

1 messages · Page 1 of 1 (latest)

spring ore
#

Now that OpenRouter's chat has web search built-in, I see much less of a reason to use a separate client for general needs. (The localStorage thing is a bit scary but I'll take backups… Many others like big-AGI also do this, anyway.) That said, is there a user guide detailing each of these features somewhere? I couldn't find any in the Docs section and some things take a while to figure out.

For example, for all I know there might be a way to configure a prompt to enable (Anthropic) prompt caching, to dramatically reduce the cost of a big message + followups scenario, but I couldn't find anything. Are there / commands, perhaps?

tardy mango
#

Be warned that providers do not share prompt caches, so if DeepInfra caches your prompt (DeepInfra **doesn't **support prompt caching btw. This is just an example) and Hyperbolic serves your next request, HyperBolic won't have the prompt cache and so you'll be paying at a higher rate (HyperBolic also **doesn't **support prompt caching btw. This is just an example

spring ore
#

I had a test session with Claude 3.5 Sonnet (self-moderated) and my Activity confirms that caching was not used. (Both messages were sent to the same back-end.) Perhaps the web search plugin interferes. I'll experiment further…

tardy mango
hoary nacelle
#

@spring ore in my experience, the only way caching discounts will be applied in the openrouter chat interface is if the provider enables it automatically. im not sure web search interferes, ive never seen cache discounts being applied in any of my conversations with claude models in the activity section when using the openrouter chat room. the most frequent provider i see cache discounts applied to is the openai models, but this is anecdotal. just my experience.

spring ore
#

Makes sense. Okay so I guess my question is two-fold then:

  1. Is there a user's guide for https://openrouter.ai/chat which details all its features to make sure I'm not missing out?
  2. I guess I should file a feature request to get some kind of checkbox when sending a prompt if I want it to be a caching checkpoint. 🤔
    I realize I could just use another chat client, but OpenRouter's own is becoming good enough to not want to bother. 😉