Text classification using Mixtral 8x7b | Mistral AI | Page 1

tawdry frost Apr 17, 2024, 5:53 PM

#

Hi all. I'm working on a thesis evaluator and I want to train Mixtral to predict whether a thesis deserves an A or not. My dataset consists of theses that were graded A and others that were not. I would like to treat this as a binary classification problem, but I can't find any helpful online resources on using an LLM for this. My current implementation uses RAG to feed the model examples from the dataset, with the following prompt:

'''Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.

Instruction: Label the thesis based on this question: "Does this thesis deserve a grade A or not?" Below are example labeled theses. Read each thesis mentioned first. Learn from the examples and think step by step before responding. Start your response by printing an "A/Not A" statement first as the label. Then explain why you chose that label.

Thesis: Name of thesis
Label: A
.
.
.
Thesis: Name of thesis
Label: Not A
.
.
.

Input:

Thesis: Name of thesis
Label:

Response: '''

But the output seems to be too biased towards giving As. Any advice on how to go about this? I also have access to a basic marking scheme for a thesis and was wondering if I could add it to my current prompt so the model can better evaluate each thesis. Thoughts on this approach?
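A rough sketch of how the two ideas in this post (reducing the tilt towards A, and using the marking scheme) could be combined when building the prompt. It assumes the few-shot examples are available as `(text, label)` pairs, and it samples the same number of examples per class so the shots themselves don't lean towards one label. The names `build_prompt`, `parse_label`, `k_per_class` and the "Marking scheme:" section are hypothetical, not something from Mistral's docs, and nothing here has been checked against Mixtral's actual behaviour.

```python
import random
import re

LABELS = ("A", "Not A")


def build_prompt(examples, rubric, thesis_text, k_per_class=2, seed=0):
    """Build a few-shot prompt with an equal number of examples per label.

    examples: list of (text, label) pairs, label being "A" or "Not A".
    rubric:   the marking scheme as plain text (assumed to fit in the prompt).
    """
    rng = random.Random(seed)
    by_label = {label: [] for label in LABELS}
    for text, label in examples:
        by_label[label].append((text, label))

    # Same number of shots for each class; rng.sample raises ValueError
    # if a class has fewer than k_per_class examples.
    shots = []
    for label in LABELS:
        shots.extend(rng.sample(by_label[label], k_per_class))
    rng.shuffle(shots)  # avoid a fixed label order in the examples

    parts = [
        'Instruction: Label the thesis based on this question: '
        '"Does this thesis deserve a grade A or not?"',
        "Marking scheme:",
        rubric.strip(),
        "",
    ]
    for text, label in shots:
        parts += [f"Thesis: {text}", f"Label: {label}", ""]
    parts += ["Input:", f"Thesis: {thesis_text}", "Label:"]
    return "\n".join(parts)


def parse_label(response):
    """Return "A", "Not A", or None if the response does not start with a label."""
    match = re.match(r"\s*(?:label:\s*)?(not\s+a|a)\b", response, flags=re.IGNORECASE)
    if match is None:
        return None
    return "Not A" if match.group(1).lower().startswith("not") else "A"
```

Parsing the label separately makes it possible to count how often each label comes back on a held-out set of graded theses, which is one way to check whether the bias towards A actually changes.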

boreal mural Apr 17, 2024, 5:54 PM

#

I think Mistral has a repo with a classifier trained from Mistral 7B, hold on-

#

https://github.com/mistralai/mistral-src/blob/main/tutorials%2Fclassifier.ipynb

#

maybe this can help a bit

tawdry frost Apr 17, 2024, 6:08 PM

#

boreal mural https://github.com/mistralai/mistral-src/blob/main/tutorials%2Fclassifier.ipynb

Thank you, that's actually very helpful.

One question: in this notebook they're using a dataset of sentences, so they were able to label it with a CSV file. In my case I'm using theses, which are very long documents. Do you know of any other way I can label my dataset, please?

brave scaffold Apr 17, 2024, 7:19 PM

#

tawdry frost Thank you that's actually very helpful One question: in this notebook they're u...

Anything works, see: https://huggingface.co/docs/datasets/package_reference/loading_methods
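To make "anything works" a bit more concrete for long documents: one option is to keep each thesis in its own text file and label it through a small CSV manifest of file paths, so the CSV never has to hold the full text. A minimal sketch using only the standard library, assuming a hypothetical folder layout of `root/A/*.txt` and `root/not_A/*.txt`; the function names and the `path`/`label` columns are made up for illustration.

```python
import csv
from pathlib import Path


def write_manifest(root, manifest_path, folder_labels=None):
    """Write a CSV with one row per thesis file: its path and its label.

    Assumes (hypothetically) that theses are sorted into one folder per class.
    """
    folder_labels = folder_labels or {"A": 1, "not_A": 0}
    rows = []
    for folder, label in folder_labels.items():
        for path in sorted(Path(root, folder).glob("*.txt")):
            rows.append({"path": str(path), "label": label})

    with open(manifest_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=["path", "label"])
        writer.writeheader()
        writer.writerows(rows)
    return rows


def read_examples(manifest_path):
    """Yield {"text", "label"} dicts, reading each thesis only when needed."""
    with open(manifest_path, newline="", encoding="utf-8") as f:
        for row in csv.DictReader(f):
            text = Path(row["path"]).read_text(encoding="utf-8")
            yield {"text": text, "label": int(row["label"])}
```

The manifest is an ordinary CSV, so it should also be loadable with `load_dataset("csv", data_files="manifest.csv")` from the Hugging Face `datasets` library, with the thesis text read from `path` in a later step.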
