👋 I'm just looking to get this setup | Dagger | Page 1

nimble fable Feb 18, 2025, 1:40 PM

#

boreal pecan Feb 18, 2025, 3:21 PM

#

Hi! It's probably not user error.. I just pushed a change to how we handle LLM endpoint configuration, and probably broke something... sorry!

Can you confirm that the .env is in the current directory? At the moment it doesn't work if the .env is in a parent dir (even though it should)

#

Also: try setting these as actual env variables, rather than .env, and see if it resolves the issue?

nimble fable Feb 18, 2025, 3:26 PM

#

will do - do you have any advise on the format for the BASE_URL env var?

boreal pecan Feb 18, 2025, 3:28 PM

#

nimble fable will do - do you have any advise on the format for the BASE_URL env var?

it's passed directly to the official openai API client (their go sdk specifically) so whatever their official tools support

cc @kind oriole

nimble fable Feb 18, 2025, 3:28 PM

#

🤔 is there a way to trigger a one off call to the LLM?

❯ ~/bin/dagger-llm shell -c "llm | with-prompt 'hello' | history "
✔ connect 0.1s
✔ looking for module 0.2s
✔ loading type definitions 0.2s

✔ llm: Llm! 0.0s
✔ .withPrompt(prompt: "hello"): Llm! 0.0s
│🧑 hello
✔ .history: [String!]! 0.0s

🧑 💬hello

boreal pecan Feb 18, 2025, 3:28 PM

#

nimble fable 🤔 is there a way to trigger a one off call to the LLM? ``` ❯ ~/bin/dagger-llm...

yes, I made it excessively lazy, you have to explicitly call loop in between

nimble fable Feb 18, 2025, 3:29 PM

#

❯ ~/bin/dagger-llm shell -c "llm | with-prompt 'hello' | loop | history "
✔ connect 0.1s
✔ looking for module 0.2s
✔ loading type definitions 0.2s

✔ llm: Llm! 0.0s
✔ .withPrompt(prompt: "hello"): Llm! 0.0s
│🧑 hello
✔ .loop: Llm! 1.4s

I can't see the call to the LLM, or an error out

boreal pecan Feb 18, 2025, 3:29 PM

#

thought it looked cool, because you can see where the "agentic loop" takes place. But adds unnecessary verbosity so I will change it back

boreal pecan Feb 18, 2025, 3:30 PM

#

nimble fable ``` ❯ ~/bin/dagger-llm shell -c "llm | with-prompt 'hello' | loop | history " ✔ ...

@ivory moss I think this might be "shell doesn't print errors"

#

@nimble fable can you try running dagger shell then running the same dagger pipeline but interactively?

nimble fable Feb 18, 2025, 3:30 PM

#

ok - there is this in the trace now with the env var set

#

so that's picked up the correct vars, I think

#

ok - more of an error this time

#

❯ ~/bin/dagger-llm shell                                                                  [15:30:22]
Dagger interactive shell. Type ".help" for more information. Press Ctrl+D to exit.

✔ llm | model 0.0s
llama-3-70b(other)

✘ llm | with-prompt "can you say hello world back to me" | loop | history 1.3s
! input: llm.withPrompt.loop panic while resolving Llm.loop: runtime error: invalid memory address or nil pointer dereference

#

looking at https://github.com/shykes/dagger/commit/29d157845ab902d41e0edd42ff04aa77291a3eee

is it becuase I'm using an OpenAI compatiable endpoint, but with a llama model?

Maybe the API key isn't being picked up & sent in the request

#guess

#

this code

func (r *LlmRouter) isOpenAIModel(model string) bool {
    return strings.HasPrefix(model, "gpt-") || strings.HasPrefix(model, "openai/")
}

#

func (r *LlmRouter) isLlamaModel(model string) bool {
    return strings.HasPrefix(model, "llama-") || strings.HasPrefix(model, "meta/")
}

#

my env of Model = "llama-3-70b" will fall out to

func (r *LlmRouter) routeOtherModel() *LlmEndpoint {
    return &LlmEndpoint{
        BaseURL:  r.OPENAI_BASE_URL,
        Provider: Other,
    }
}

which now doesn't send the API key

#

☸ staging (civo-system) in melvin on  main [!] on ☁️  dinesh@civo.com
❯ env | grep OPENAI | grep -v KEY
OPENAI_BASE_URL=https://api.relax.ai
OPENAI_MODEL=openai/llama-3-70b
☸ staging (civo-system) in melvin on  main [!] on ☁️  dinesh@civo.com
❯ ~/bin/dagger-llm shell
Dagger interactive shell. Type ".help" for more information. Press Ctrl+D to exit.

✔ llm | model 0.0s
llama-3-70b(other)

#

I would expect the llm | model to output

lama-3-70b (openai)

boreal pecan Feb 18, 2025, 3:45 PM

#

yes that's the very new code (12 hours old) and clearly it's broken... will try to fix now

#

ah that's it, I didn't think I would need to send the API key. silly in retrospect

#

but that shouldn't cause a panic so there's probably something else

#

can you see the stack trace anywhere?

ivory moss Feb 18, 2025, 3:49 PM

#

boreal pecan <@108011715077091328> I think this might be "shell doesn't print errors"

noted - maybe has to do with -c, will check soon

fierce sail Feb 18, 2025, 3:53 PM

#

I noticed with the former code that I would get a "couldn't find .env file" error some percentage of the time, even though it was there, when running dagger shell and connecting to the llm dev engine.

nimble fable Feb 18, 2025, 3:56 PM

#

kind oriole Feb 18, 2025, 4:09 PM

#

fierce sail I noticed with the former code that I would get a "couldn't find .env file" erro...

that would previously come from not running llm before another pipeline using llm

fierce sail Feb 18, 2025, 4:12 PM

#

kind oriole that would previously come from not running `llm` before another pipeline using ...

ah, right, forgot that was a requirement with that setup. I'd been doing it unconciously but inconsistently

boreal pecan Feb 18, 2025, 4:14 PM

#

reminder guys we're in a support thread 😁

fierce sail Feb 18, 2025, 4:17 PM

#

nimble fable

Your error looks very familiar to me. I think I was getting that previously. Happy to jump on with you to troubleshoot in discord.

boreal pecan Feb 18, 2025, 4:26 PM

#

it's probably something silly that I introduced last night

nimble fable Feb 18, 2025, 5:02 PM

#

@fierce sail - I can hop on a call within the next hour

boreal pecan Feb 18, 2025, 5:07 PM

#

I'm pushing a first fix, to send the API key on your endpoint. That shouldn't be the cause for the panic, but it's needed either way

#

pushed

#

If you pull the latest version of melvin, and re-run dagger shell -c './dagger-llm | engine | up' (or re-run ./setup.sh) the fix should be applied

fierce sail Feb 18, 2025, 5:38 PM

#

@boreal pecan the v1 is getting stripped off the baseURL somehow when chat/completions is appended

kind oriole Feb 18, 2025, 5:40 PM

#

fierce sail <@488409085998530571> the `v1` is getting stripped off the `baseURL` somehow whe...

its working for me. I can try to help. Note if you change the OPENAI_BASE_URL (or any .env var) you have to restart the engine

nimble fable Feb 18, 2025, 5:40 PM

#

https://github.com/openai/openai-go/issues/134

GitHub

Dont trim url path postfix; BASE_URL option logic mismatch with pyt...

Now the client deletes the base url path. For example, if you use https://some-comparable-api.local/v1 as the base url, requests will be sent without /v1

fierce sail Feb 18, 2025, 5:41 PM

#

Going to add the trailing slash, restart engine, and try it

kind oriole Feb 18, 2025, 5:41 PM

#

Oh yep trailing slash is required

#

a quirk of the openai client

nimble fable Feb 18, 2025, 5:43 PM

#

restarted the engine, and this is now sending requests

A new shell only did not work

#

requests are now being sent to the LLM, I still get this error

fierce sail Feb 18, 2025, 5:46 PM

#

Anything else we could collect for diagnosis @kind oriole besides 👆

kind oriole Feb 18, 2025, 5:47 PM

#

This is on relax? Which model?

fierce sail Feb 18, 2025, 5:47 PM

#

If he drops loop the errors go away, but doesn't call the LLM

nimble fable Feb 18, 2025, 5:47 PM

#

llama 3, 70b

#

I've shared an API key for relax with Jeremy

#

https://tenor.com/view/money-money-money-gif-3641396535535975514

Tenor

fierce sail Feb 18, 2025, 5:50 PM

#

@kind oriole we can figure it out and follow up.

kind oriole Feb 18, 2025, 5:51 PM

#

the index out of range is from here https://github.com/shykes/dagger/blob/llm/core/llm.go#L582 just not sure why yet

#

Maybe because it's streaming? Not sure

#

fwiw I'm running vanilla ollama and not seeing this error

fierce sail Feb 18, 2025, 5:58 PM

#

kind oriole fwiw I'm running vanilla ollama and not seeing this error

They use this via tools for their web search, so they haven't seen this issue.

kind oriole Feb 18, 2025, 6:00 PM

#

fierce sail They use this via tools for their web search, so they haven't seen this issue.

sorry not sure what that means

boreal pecan Feb 18, 2025, 6:01 PM

#

kind oriole its working for me. I can try to help. Note if you change the `OPENAI_BASE_URL` ...

Working on fixing that right now, with the help of @sturdy python and @smoky valley 🙏

fierce sail Feb 18, 2025, 6:02 PM

#

kind oriole sorry not sure what that means

I was just asking if their model was verified as fully openai tools compatible and they use it via that interface in prod, so yes!

boreal pecan Feb 18, 2025, 6:13 PM

#

@nimble fable can you let me know when you have a chance to try again? I'm curious if my fix helps

nimble fable Feb 18, 2025, 6:27 PM

#

checking now

#

@boreal pecan, I can't see a recent change. The last one I have is

23cf032 (HEAD -> main, origin/main, origin/HEAD) update dagger dependency

this now get's a response from the LLM correctly (so passes the key in the request) but it still shows the stacktrace above

#

#1341401835818188800 message

boreal pecan Feb 18, 2025, 6:29 PM

#

nimble fable <@488409085998530571>, I can't see a recent change. The last one I have is * 23...

that's the one

#

Oh sorry I missed that update

#

I feel like I'm missing something obvious... Do you spot anything @kind oriole @ivory moss in that stack trace?

#

It seems related to streaming

#

Maybe it crashes if an endpoint doesn't support streaming, or handles it in a non-standard way?

#

This is in the stack trace:

    acc := new(openai.ChatCompletionAccumulator)
    for stream.Next() {
        if stream.Err() != nil {
            return nil, stream.Err()
        }

        res := stream.Current()
        acc.AddChunk(res)

                // 👇
        if content := res.Choices[0].Delta.Content; content != "" {
                // 👆
            if logsW == nil {
                // only show a message if we actually get a text response back
                // (as opposed to tool calls)
                ctx, span := Tracer(ctx).Start(ctx, "LLM response", telemetry.Reveal(), trace.WithAttributes(
                    attribute.String("dagger.io/ui.actor", "🤖"),
                    attribute.String("dagger.io/ui.message", "received"),
                ))
                defer telemetry.End(span, func() error { return rerr })

                stdio := telemetry.SpanStdio(ctx, "",
                    log.String("dagger.io/content.type", "text/markdown"))

                logsW = stdio.Stdout
            }

            fmt.Fprint(logsW, content)
        }
    }

ivory moss Feb 18, 2025, 6:32 PM

#

that's my guess yeah

kind oriole Feb 18, 2025, 6:32 PM

#

Yeah can you check res.Choices first before [0]?

ivory moss Feb 18, 2025, 6:32 PM

#

it's kind of weird, i'd expect stream.Err() != nil in that case

boreal pecan Feb 18, 2025, 6:44 PM

#

@kind oriole @ivory moss can I let you handle this one? I'm close to a fix for "bug 7" (clean llm config without having to re-run llm after setup & each engine restart)

kind oriole Feb 18, 2025, 6:45 PM

#

boreal pecan <@135620352201064448> <@108011715077091328> can I let you handle this one? I'm c...

I don't have a repro. @fierce sail were you able to repro with the relax key?

fierce sail Feb 18, 2025, 6:45 PM

#

kind oriole I don't have a repro. <@933501536624054272> were you able to repro with the rela...

just finished a meeting and spinning up repro now with latest llm branch

kind oriole Feb 18, 2025, 7:05 PM

#

Got the repro, thanks @fierce sail , working on a fix

kind oriole Feb 18, 2025, 7:24 PM

#

Pushed a fix. My repro is working now!

#👋 I'm just looking to get this setup