Hello i have a short question i take a look over the wiki and the Collabs and wanna try bring my Dataset to ChatML
i have 3 cols (instruction, input and output)
my code actually is
chatML_format = """
<|im_start|>user
{}
<|im_end|>
<|im_start|>assistant
{}
input: {}
"""
def chatml_format(examples):
instructions = examples["instruction"]
inputs = examples["input"]
outputs = examples["output"]
texts = []
for instruction, input, output in zip(instructions, inputs, outputs):
text = alpaca_prompt.format(instruction, input, output) + EOS_TOKEN
texts.append(text)
return {"text": texts, }
print("Load DataSet")
texts = dataset.map(chatml_format, batched=True)
but i have no idea how bring "input" into this has anyone an idea ?