The prompt format is the same for llama3.0 and llama3.1 right?
Prompt:
self.prompt = """<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n"{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>\n{input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>{output}<|eot_id|><|end_of_text|>"""
Because I get very strange outputs when run my fine tuned llama-3.1 model in interference on the same data set as I trained the llama-3.0 version on (which worked well). It produces random tokens for some reason. Did anybody also encounter this problem, thanks!
Output:
cleaned response: {
"Document Type": "Health Certificate"}<|reserved_special_token_226|>.**
<|reserved_special_token_33|>assistant<|reserved_special_token_48|>{
"Document Type": "Health Certificate"}<|reserved_special_token_231|><|reserved_special_token_147|>assistant<|reserved_special_token_40|>{
"Document Type": "Health Certificate"}<|reserved_special_token_212|>assistant<|reserved_special_token_143|>{
Where the previous output was:
{
"Document Type": "Health Certificate"
}