#> When I call the model with around 4000
1 messages · Page 1 of 1 (latest)
No, I don't use stream.
In short, my use case is like
- put some raw text data(~=3500 tokens)
- ask Gemini summarize it
As it's not a chat, I don't really make good use of streamed response as my users cannot do anything until it's done.