#High API Input cost #api

1 messages · Page 1 of 1 (latest)

distant ruin
#

Hello, can someone explain how my 500-word request ends up using 16,000 input tokens? From what I’ve researched, it seems that all previous answers in the conversation are sent back with each API call to maintain context. I’d like to know how I can reduce input tokens for smaller requests while still making the AI remember the conversation.