#Struggling to fine-tune DaVinci for custom language output

16 messages · Page 1 of 1 (latest)

crystal sorrel
#

I'm trying to fine-tune DaVinci to output a custom language that I use for describing workflows. Here's an example input and desired output:

Input: "Make an apple pie. Include homemade dough. Low sugar. 8 persons"
Desired Output (simplified):
START "make a low sugar apple pie for 8 persons"; ACTOR1 "Cook"; STEP1 ACTOR1 "Gather ingredients"; <other commands> END;

I've created a fine-tuned model with 500 prompt/completion pairs, but its performance is worse than just using ChatGPT with prompt engineering (which still doesn't perform well enough for this task). All the models (raw DaVinci, fine-tuned DaVinci, and ChatGPT) can generate relevant workflow ideas based on the prompt, but they struggle with outputting the exact format I want.

Am I missing something, or is there a better approach to achieve my desired results?

brazen talon
crystal sorrel
#

do you know if there is some better way to achieve that kind of result ?

brazen talon
#

Which one would work best for you? Hard to say

crystal sorrel
#

I guess I have yet to try embeddings 😄

brazen talon
#

Could you describe your use-case a bit more?

crystal sorrel
#

I build a workflow engine for a BPM collaboration product https://lapala.io (kind of zapier like but which focuses more on human tasks). I want to use openai to generate workflows inside my tool from a user prompt, like in the example I provided in my question. I have found that gpt3.5 and 4 are very good to find good workflow ideas, with good logic etc. But I need the output to be structured so I can parse it into code and use them in my app.

La transformation numérique sur-mesure, simple et sans tout changer

#

I starting to think that I could just let GPT output the semi-valid outputs it manages to do, and find another way to correct them automatically

brazen talon
#

Yeah probably, GPT-4 is typically very good at following a template you tell it to follow

#

The way I personally do tasks like that is store the templates/instructions in embeddings and then call those as necessary

#

But that might be overkill

crystal sorrel
#

you mean you use embeddings as a "giant" prompt engineering ?

brazen talon
#

It's just a fancier way of doing prompt engineering