Hello everyone, I am new to LLMs, although I have prior experience in software development. I'd like to learn how to train an LLM for a specific language (e.g. English, German, or French).
- Should I use a pre-trained model (Falcon / Dolly) and fine-tune it on a dataset in the new language? Is there a way to block the other languages?
- Should I train a model from scratch? If so, does this mean I need to create the model myself, or can I use one that is open-source?