#How does ChatGPT training work

1 messages · Page 1 of 1 (latest)

lost fern
#

Can you please explain me how the training of the model about certain subject works?
The team simply selects official information about it and feed to the model?

terse pulsarBOT
#

Hi @lost fern. I'm kapa.ai, your friendly support bot. I will try to answer your question by searching through the knowledge base. :wave:

#

ChatGPT is trained using a multi-stage process that involves several types of data sources and methodologies. Here's a detailed explanation based on the available knowledge sources:

  1. Data Sources: The training of ChatGPT involves three primary sources of information:

    • Publicly available information on the internet.
    • Information accessed through partnerships with third parties.
    • Data provided or generated by users, human trainers, and researchers. Source
  2. Training Process:

    • Initial Training: The model is initially trained on a large corpus of text data, which includes publicly available content, licensed content, and content generated by human reviewers. This helps the model learn the relationships between words and concepts. Source
    • Post-Training: After the initial training, the model undergoes further training using data from ChatGPT users and human trainers. This stage focuses on ensuring the outputs are safe, accurate, and improving the model's general capabilities. Source
  3. Model Learning: During training, the model learns by reviewing a large amount of information and understanding the relationships within that data. For example, it learns how words appear in context with other words and uses this understanding to predict the next word in a sentence. This process involves adjusting the model's parameters or "weights" based on the data it processes. Source

#
  1. Data Analysis Capability: For specific tasks like data analysis, the model is post-trained on a large volume of data analysis tasks, which includes exposure to example datasets and the code data analysts use. This enables the model to generate new code for novel analyses. Source

The training process is not simply about selecting official information and feeding it to the model. It involves a complex process of learning from diverse data sources and refining the model's capabilities through iterative training and feedback. For more detailed information, you can refer to the help center article.

Note: you can ask me a follow up question by @ mentioning me again :speech_balloon:

#

@lost fern Kapa.ai is still learning and improving, please let me know how I did by reacting below.

sterile schooner
#

in reality they label datasets with credibility factors to protect institutions thats support globalism regardless of factuality. dont believe me ask it about the booster. then after it responds call it out for lying then say how is it a societal norm if society doesnt take it.