#RAG (File search) uses too many tokens
I did check, but I need it to use fewer tokens. I can't afford it if it uses that many (right now I've uploaded 2 of the 25 files we have).
If I upload all 25 files, it's going to take something like 20K tokens per message.
If I put the file data in the system prompt, it only takes 1108 tokens.
If I use vector storage, it takes 3.5K tokens.
I want to get it down to 300-500 tokens.
set max_num_results
I can't. I'm not using the Assistants API, I'm using the Responses API.
The Responses API has the same property.
The playground doesn't have it.
Then you need to wait for them to add it, or use the actual API.
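For reference, a minimal sketch of what setting max_num_results on the file search tool could look like when calling the Responses API directly from Python. The helper name build_file_search_request, the model name, and the vector store ID are placeholders for illustration, not values from this thread; the actual network call is left commented out because it needs the openai package and an API key.

```python
from typing import Any


def build_file_search_request(
    question: str,
    vector_store_id: str,
    max_num_results: int = 1,
    model: str = "gpt-4o-mini",  # assumed model name; use whichever you have access to
) -> dict[str, Any]:
    """Build keyword arguments for a Responses API call that uses file search."""
    return {
        "model": model,
        "input": question,
        "tools": [
            {
                "type": "file_search",
                "vector_store_ids": [vector_store_id],
                # Cap how many retrieved chunks are passed to the model.
                "max_num_results": max_num_results,
            }
        ],
    }


# "vs_example123" is a placeholder, not a real vector store ID.
request = build_file_search_request("What is the refund policy?", "vs_example123")

# Sending the request (left commented out: requires the openai package and an API key):
# from openai import OpenAI
# client = OpenAI()
# response = client.responses.create(**request)
# print(response.output_text)
# print(response.usage.input_tokens)  # compare token usage across settings
```

Lowering max_num_results only limits the number of chunks returned; how many tokens each chunk costs still depends on how the files were chunked.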
I found out that vector_store.search_results.page returns the whole txt file (not just part of it). Even when I limit the results to 1, it still uses the whole txt.
So do I have to break the content into multiple txt files? It doesn't work the way a DB does?
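One possible approach, sketched under assumptions: split each txt into smaller pieces before uploading, so that a single search result is a small passage rather than the whole file. As far as I know, file search already chunks files on upload, so a file shorter than one chunk would come back whole, which would match the behaviour described above. The function name split_into_chunks and the 1200-character limit are illustrative choices, not something confirmed in this thread.

```python
def split_into_chunks(text: str, max_chars: int = 1200) -> list[str]:
    """Group blank-line-separated paragraphs into chunks of at most max_chars.

    A single paragraph longer than max_chars is kept whole rather than cut
    in the middle of a sentence.
    """
    chunks: list[str] = []
    current = ""
    for paragraph in text.split("\n\n"):
        paragraph = paragraph.strip()
        if not paragraph:
            continue  # skip empty paragraphs caused by extra blank lines
        candidate = f"{current}\n\n{paragraph}" if current else paragraph
        if current and len(candidate) > max_chars:
            # Adding this paragraph would exceed the limit: close the chunk.
            chunks.append(current)
            current = paragraph
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


# Illustrative input: three 50-character paragraphs with a 110-character limit.
sample = "a" * 50 + "\n\n" + "b" * 50 + "\n\n" + "c" * 50
pieces = split_into_chunks(sample, max_chars=110)
for index, piece in enumerate(pieces):
    print(index, len(piece))
```

Each piece could then be written to its own txt file and uploaded separately. Characters are used here as a rough stand-in for tokens; a tokenizer would give a tighter bound.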