Hey guys, I am using RAG to retrieve information from a Pinecone DB and then generate a response with llama3. The vector DB contains information about certain tasks that need to be completed but also unrelevant chats. Each embedding contains metadata about the message, timestamp, userid, and username.
If I want to provide additional information about the project behind those tasks, so the LLM can better understand which tasks are linked to what, where should I store this information? Should I create another vector database and provide the content additionally as project background, or should I upload it to the existing vector database? The existing one contains not only tasks but also chat logs between people, so information that could be wrong.
For example, I would like the llm to know which employee positions there are, e.g. developer, supporter, and what their tasks in general are. When a task is written in the chat, the llm should understand the connection between the role and the task.