Fine Tune and token limits | OpenAI | Page 1

sacred bobcat Dec 28, 2022, 6:35 PM

#

You'll want to read up on token limits

inland fern Dec 28, 2022, 6:36 PM

#

sacred bobcat You'll want to read up on token limits

4000 xd

sacred bobcat Dec 28, 2022, 6:41 PM

#

Depending on the model used, requests can use up to 4097 tokens shared between prompt and completion. If your prompt is 4000 tokens, your completion can be 97 tokens at most.

https://help.openai.com/en/articles/4936856-what-are-tokens-and-how-to-count-them#:~:text=Token Limits,be 97 tokens at most.

What are tokens and how to count them?

#

1 token is approximately 4 characters (~0.75 words).
84,000 words ~ 112,000 tokens.

#

Depending on what you're trying to do a fine tune model may not be necessary

#

If it is then you'll need to break up the prompts to fit the token limit

proven pumice Dec 28, 2022, 9:41 PM

#

sacred bobcat >Depending on the model used, requests can use up to 4097 tokens shared between ...

I don't think this is accurate I think that's for general GPT3 prompting, For fine tuning, each prompt and completion can't exceed 2048 tokens including the separator

#

https://beta.openai.com/docs/guides/fine-tuning
https://beta.openai.com/docs/guides/fine-tuning/preparing-your-dataset

OpenAI API

An API for accessing new AI models developed by OpenAI

OpenAI API

An API for accessing new AI models developed by OpenAI

#

Fine tuning is what you want here, not the regular GPT3 usecase of sending a prompt with 4096 context

#

For your book/text, you can create 2048 token prompt/response pairs to fine tune the model with, but it will take some work to determine how to structure those prompt/response pairs

#

Currently, there's no way to just input a full text to train the model

sacred bobcat Dec 28, 2022, 10:23 PM

#

thank you for the correction, Kaveen! That is really important to know.

proven pumice Dec 28, 2022, 10:24 PM

#

sacred bobcat thank you for the correction, Kaveen! That is really important to know.

No worries! 🙂

inland fern Dec 29, 2022, 3:03 AM

#

I'm going to try smaller texts,

#Fine Tune and token limits