I am training a model (Nemo) with an alpaca dataset. Idea is to force the model to answer me with a standard markdown answer.
I noticed an issue in interpreting the newline in the list part, while everywhere else the newline (always present as "\n" in my alpaca dataset) is working fine..
In other words the resulting new model lefts a "\n" in the list like :
Section one
- word1 \n
- word2\n
Section two
while he was trained on something like :
Section one\n* word1\n* word2\n\n# Section two
So, somehow, he understands the "\n" as newline, but left it also in the output (wrong).
Any idea ?
I tought about using <br>' instead of '\n' but not sure..

