Hello. I’m trying to get RAG to work. I use the llama3 405b as model with LibreChat as inference/chat GUI.
It doesn’t seem that openrouter API supports embeddings (which I think may be required for RAG to work?)
So if anyone got it working or know how to get it to work please tell me. I want to upload pdfs and have them summarized.