vernal mica Jun 25, 2024, 2:23 PM

#

I finetuned a model with mistral small, and the status of the finetuning job was SUCCESS, but when i tried inference, it did this error : raise MistralAPIStatusException.from_response(response, message=str(e)) from e
mistralai.exceptions.MistralAPIStatusException: Status: 500. Message: {"object":"error","message":"Service unavailable.","type":"internal_server_error","param":null,"code":"1000"}

sonic marten Jun 25, 2024, 2:30 PM

#

vernal mica I finetuned a model with mistral small, and the status of the finetuning job was...

Hi, thanks for the feedback, could you provide your user ID as well as the fine tunned model id? Thank you!

vernal mica Jun 25, 2024, 2:32 PM

#

Hello, thaks for your quick response.
Here is the user id : 51c0211a-bbad-4cce-b894-a931a051f920
Here is the job id : 40fcf147-77ee-4e0d-bd54-bfc748943f4c

sonic marten Jun 25, 2024, 2:33 PM

#

and the model id?

vernal mica Jun 25, 2024, 2:34 PM

#

data=[Job(id='40fcf147-77ee-4e0d-bd54-bfc748943f4c', hyperparameters=TrainingParameters(training_steps=32, learning_rate=0.01), fine_tuned_model='ft:mistral-small-latest:eeb309b4:20240625:40fcf147', model='mistral-small-latest', status='SUCCESS', job_type='FT', created_at=1719322222, modified_at=1719322668, training_files=['fdb1c250-ee36-4f75-9f22-3a6af57cc41c'], validation_files=['41783589-5971-4d22-9907-fe04ff187f7b'], object='job', integrations=[])

sonic marten Jun 25, 2024, 2:35 PM

#

and when you try to use ft:mistral-small-latest:eeb309b4:20240625:40fcf147 you get internal server error correct?

vernal mica Jun 25, 2024, 2:36 PM

#

I did that : from mistralai.models.chat_completion import ChatMessage

from mistralai.client import MistralClient

client = MistralClient(api_key="mykey")

Retrieve a jobs

retrieved_job = client.jobs.retrieve('40fcf147-77ee-4e0d-bd54-bfc748943f4c')

chat_response = client.chat(
model=retrieved_job.fine_tuned_model,
messages=[ChatMessage(role='user', content="""

sonic marten Jun 25, 2024, 2:39 PM

#

vernal mica I did that : from mistralai.models.chat_completion import ChatMessage from mist...

could you just in case test it directly with

from mistralai.models.chat_completion import ChatMessage
from mistralai.client import MistralClient

client = MistralClient(api_key="mykey")

chat_response = client.chat(
    model="ft:mistral-small-latest:eeb309b4:20240625:40fcf147",
    messages=[ChatMessage(role='user', content="hi there")]
)

vernal mica Jun 25, 2024, 2:42 PM

#

I tried but i get the exact same error about 30 seconds after i run the code

sonic marten Jun 25, 2024, 2:49 PM

#

We will investigate ! Thanks for the report

vernal mica Jun 25, 2024, 2:54 PM

#

Ok thanks !

sonic marten Jun 25, 2024, 3:16 PM

#

vernal mica Ok thanks !

could you try by setting max_tokens=64 for example?

#

from mistralai.models.chat_completion import ChatMessage
from mistralai.client import MistralClient

client = MistralClient(api_key="mykey")

chat_response = client.chat(
    model="ft:mistral-small-latest:eeb309b4:20240625:40fcf147",
    messages=[ChatMessage(role='user', content="hi there")],
    max_tokens=64
)

vernal mica Jun 25, 2024, 3:21 PM

#

I get the same error

#

Here is the full error i get :

📎 message.txt

sonic marten Jun 25, 2024, 3:22 PM

#

we believe have found the issue, but max_tokens should at least not raise an error, thats odd... one minute!

#

could you try with this:

curl --location 'https://api.mistral.ai/v1/chat/completions' \
--header 'x-api-key: XXXX' \
--header 'Content-Type: application/json' \
--data '{
    "model": "ft:mistral-small-latest:eeb309b4:20240625:40fcf147",
    "messages": [
        {
            "role": "user",
            "content": "hello how are you"
        }
    ],
    "temperature": 0,
    "max_tokens": 10
}'

#

you can just run this curl directly

vernal mica Jun 25, 2024, 3:28 PM

#

yes it seems to work but i don't know why i get infinite chineese characters 😅 Should i add an end token in the dataset or it is automatic ?

sonic marten Jun 25, 2024, 3:29 PM

#

Perfect, let me explain what was happening

#

basically because you didnt set a max_tokens, it was waiting for the model to finish, but, the model was like you say only outputting chinese characters in an infinite loop- hence the internal server error

#

the issue is very likely your learning rate

#

0.01 is too high

#

dont go higher than 0.001

#

anyways, when testing the model always set a max tokens to be sure it doesnt loop

vernal mica Jun 25, 2024, 3:35 PM

#

Thx, you are a life saver. And so setting max tokens with mistralai client is not working ?

sonic marten Jun 25, 2024, 3:37 PM

#

vernal mica Thx, you are a life saver. And so setting max tokens with mistralai client is no...

it should be working, im surprised it didnt

#

would you mind reinstalling mistralai and try again with max_tokens to see if it still happens? might be an issue we are not aware of

#MistralAPIStatusException when testing a finetuned model

Retrieve a jobs