#Fine Tune and token limits

15 messages · Page 1 of 1 (latest)

sacred bobcat
#

You'll want to read up on token limits

inland fern
sacred bobcat
#

1 token is approximately 4 characters (~0.75 words).
84,000 words ~ 112,000 tokens.

#

Depending on what you're trying to do a fine tune model may not be necessary

#

If it is then you'll need to break up the prompts to fit the token limit

proven pumice
#

Fine tuning is what you want here, not the regular GPT3 usecase of sending a prompt with 4096 context

#

For your book/text, you can create 2048 token prompt/response pairs to fine tune the model with, but it will take some work to determine how to structure those prompt/response pairs

#

Currently, there's no way to just input a full text to train the model

sacred bobcat
#

thank you for the correction, Kaveen! That is really important to know.

proven pumice
inland fern
#

I'm going to try smaller texts,