Has anyone tried building a RAG agent?
The Agent handles the orchestration you choose a model and connect to a tool or knowledge base.
The problem is if you connect to the tool you get control over parameters ie top k and semantic search settings at the agent level. This is helpful because you can control top k and control token usage but it uses it own semantic config which is annoying.
If you connect to a knowledge base instead you can use your custom semantic config in azure portal but you get no control over parameters specifically top k it automatically sets it to 10 which burns through tokens faster and hits request limits faster.
How should I go about handling this?