#agents | Dagger | Page 3

spring wave Mar 14, 2025, 6:52 PM

#

@shrewd ermine ok try now

shrewd ermine Mar 14, 2025, 6:56 PM

#

spring wave <@135620352201064448> ok try now

fixed, ty!

#

building now

#

ok i ran this one that worked on llm.8 and its the same response on llm latest

worn hill Mar 14, 2025, 7:06 PM

#

whoever made this was a strange person

#

incredible dedication to bear grills

storm gate Mar 14, 2025, 8:40 PM

#

It's now possible to use the dockerfile-optimizer standalone without the Github PR flow (with dagger-llm.8)

$ dagger shell -m github.com/samalba/agents/dockerfile-optimizer
⋈ src=$(git https://github.com/samalba/demo-app | head | tree)
⋈ optimize-dockerfile-from-directory $src | file Dockerfile | contents

Thanks for the suggestion @smoky ocean

spring wave Mar 14, 2025, 9:43 PM

#

dammit

quiet ether Mar 14, 2025, 9:44 PM

#

spring wave dammit

and this is how the apocalypse day began

spring wave Mar 14, 2025, 9:45 PM

#

context: adding a hint to the model when its context changes, trying to avoid this:

✔ container | from alpine | .llm 1.7s
│🤖 Thank you for letting me know. How can I assist you with the current container context?
│ ┃ 1.4s ◆ Input Tokens: 6,971 ◆ Output Tokens: 20
LLM@xxh3:7bf41186fcf1a6ee

spring wave Mar 14, 2025, 10:01 PM

#

instead of replying it ran a tool to reply -_-

#

@smoky ocean you can tag llm.9 whenever, brought back withPromptVar and made multi-object lazily enabled (when you set a var)

worn hill Mar 14, 2025, 10:16 PM

#

spring wave instead of replying it ran a tool to reply -_-

this is the exact thing that makes avante.nvim+claude completely useless lmao

#

it really really really wants to use tools to the point it just never responds

smoky ocean Mar 14, 2025, 10:18 PM

#

0.17.0-llm.9 🧵

smoky ocean Mar 14, 2025, 11:11 PM

#

🚨 🚨 🚨 new release: 0.17.0-llm.9. Now with revolutionary multi-object support 🙂 The design is not fully baked so we also left single-object, to avoid breaking existing modules and scripts.

#

@spring wave Ctrl-L doesn't work for me on that version

spring wave Mar 14, 2025, 11:22 PM

#

smoky ocean <@108011715077091328> `Ctrl-L` doesn't work for me on that version

try pressing it again?

#

(unironically - there's a sleep(100ms) in there to time the clearing for when the scrollback is flushed, quite unfortunate)

smoky ocean Mar 14, 2025, 11:34 PM

#

worked in ghostty. not in zed

smoky ocean Mar 15, 2025, 12:00 AM

#

@spring wave @shrewd ermine FYI I think llm.9 does break all modules, because of capitalization changes

spring wave Mar 15, 2025, 12:00 AM

#

smoky ocean worked in ghostty. not in zed

does zed also not work after pressing it again?

smoky ocean Mar 15, 2025, 12:01 AM

#

spring wave does zed also not work after pressing it again?

correct

spring wave Mar 15, 2025, 12:01 AM

#

Zed might be eating the Ctrl+L keybind entirely, I know Cursor defaults to using it for chat stuff

#

or does it work outside of dagger

smoky ocean Mar 15, 2025, 12:03 AM

#

OK in zed after re-starting the shell, it works on second try. can't repro the issue in zed. so I guess we're good 🙂

spring wave Mar 15, 2025, 12:04 AM

#

also, tiny thing: I made it so submitting an empty shell input starts and immediately cancels a span, instead of doing nothing, since that's a muscle memory (to add spacing). lemme know if it's weird

smoky ocean Mar 15, 2025, 12:05 AM

#

smoky ocean <@108011715077091328> <@135620352201064448> FYI I think llm.9 *does* break all m...

Actually I'm trying to figure out if it does break modules or not...

#

dagger -m github.com/samalba/agents/dockerfile-optimizer: ./main.go:66:76: undefined: dagger.Llm

#

dagger -m github.com/shykes/toy-programmer -> loads ok 🤷‍♂️

spring wave Mar 15, 2025, 12:06 AM

#

ah, the type was renamed but the constructor is still dag.Llm - but I think that's a TODO

shrewd ermine Mar 15, 2025, 12:06 AM

#

Nice!

smoky ocean Mar 15, 2025, 12:07 AM

#

Oh I think it's a typo in the first module

#

OK I understand now - not a typo, Sam's module just exposes the LLM type in its API, which is relatively rare - but llm.9 does break that

#

Actually the problem is not actually exposing it's just referencing the type anywhere in his code

#

(in his case it's an internal function)

#

this is tricky because if I open a PR to his module, it will break for pre-llm.9 users

#

which as of right now is 100% of our small testing pool 🙂

#

hey I reference that type also in my melvin module...

#

yup it's broken also 😭

spring wave Mar 15, 2025, 12:16 AM

#

bummer. ah well, good to get these out of the way sooner than later

smoky ocean Mar 15, 2025, 12:17 AM

#

Yeah basically it means we have to be aggressive in getting people to update engine and modules

#

or, slow-roll on both

#

@storm gate FYI https://github.com/samalba/agents/pull/2

GitHub

Update to dagger 0.17.0-llm.9 by shykes · Pull Request #2 · samalba...

The dagger llm pre-release of Dagger has a breaking change: the "Llm" type has been renamed to "LLM".

storm gate Mar 15, 2025, 12:33 AM

#

smoky ocean <@707661676056674346> FYI https://github.com/samalba/agents/pull/2

You did not add unit tests

spring wave Mar 15, 2025, 12:35 AM

#

tried out having it start with Query, pretty fun example: https://v3.dagger.cloud/dagger/traces/4cf133e60e6ba9d4bd29ed41690ca12b

Dagger Cloud

Browse and visualize Dagger traces.

#

complete with the classic running prompt as shell first

#

i think it would have figured things out easier _objects included all objects, not just variables, with the name of the call that created it (like Container@xxh3:abcdef.from(args...)). not sure if worth it, maybe there's another way

smoky ocean Mar 15, 2025, 12:43 AM

#

spring wave i think it would have figured things out easier `_objects` included all objects,...

all objects as in every ID received by the llm?

spring wave Mar 15, 2025, 12:43 AM

#

yea

#

it would be nicer if it would have just recognized it from the tool output though

smoky ocean Mar 15, 2025, 12:44 AM

#

I still think reusing the trick from single-object, with auto-saving each variable, would help overall a lot

spring wave Mar 15, 2025, 12:44 AM

#

feels like it just needs a small hint

spring wave Mar 15, 2025, 12:44 AM

#

smoky ocean I still think reusing the trick from single-object, with auto-saving each variab...

i think that would make this harder, because the variable would be irreversibly reassigned even after making a "mistake" (installing nano instead of vim)

#

but this is a very specific scenario, not sure if it'd be common

smoky ocean Mar 15, 2025, 12:45 AM

#

we can always give it "rewind KEY" and "history KEY" stuff like that

spring wave Mar 15, 2025, 12:45 AM

#

ah true

#

that feels like basically what it's trying to do now, though

#

just instead of keys it's object ids

storm gate Mar 15, 2025, 1:01 AM

#

smoky ocean <@707661676056674346> FYI https://github.com/samalba/agents/pull/2

life hack: write a dagger agent that monitors the latest dagger release and opens a PR against your module. It can make sure your module builds and watch for any breaking change in the changelog, and updates the code as needed.
And we could even make it a github app available to all...

shrewd ermine Mar 15, 2025, 2:06 AM

#

Ideally compat mode saves us from relying on something like that, but yes I'd use it 😄

shrewd ermine Mar 15, 2025, 2:11 AM

#

smoky ocean we can always give it "rewind KEY" and "history KEY" stuff like that

yeah I gave my workspace a reset() func, it makes sense to have that ability for objects in general. Sometimes models get a little carried away with their early attempts and destroy their workspace https://github.com/kpenfound/greetings-api/blob/main/.dagger/workspace/main.go#L60

spring wave Mar 15, 2025, 2:57 AM

#

why do we configure a system prompt for anthropic and gemini but not openai? thinkspin

shrewd ermine Mar 15, 2025, 3:02 AM

#

spring wave why do we configure a system prompt for anthropic and gemini but not openai? <a:...

I think @wraith remnant asked about that when he implemented anthropic. I haven't looked at the openai client much but in anthropic and Gemini the system prompt is a special thing

#

Special as in we might want to make it configurable or something more specific

spring wave Mar 15, 2025, 3:04 AM

#

maybe it nudges it to use tools?

#

i think there's a withSystemPrompt already

shrewd ermine Mar 15, 2025, 3:07 AM

#

I don't know if thats wired up? I thought for Gemini and anthropic it was passed to the constructor

#

oh yeah duh it adds a message as system. I don't know what happens if you do that to gemini. Let's see

#

oh wait no I had it the first time, it's not wired up ! no function "with-system-prompt" in type "LLM"

shrewd ermine Mar 15, 2025, 3:29 AM

#

spring wave maybe it nudges it to use tools?

this would be great tbh

river belfry Mar 17, 2025, 4:23 PM

#

I was looking at make the llm part working with llama.cpp Did anyone already tried?
My understanding is while both tools and streaming are supported, they are not supporting at the same time.
So I started to change a bit the code to handle that, but it's not calling the tools anymore. For instance I have this message in the output "You can write this code in the workspace and then build it by calling ToyWorkspace_build.".
I'll have a deeper look, but if anyone has some ideas 🙂

smoky ocean Mar 17, 2025, 4:39 PM

#

@spring wave want to bikeshed multi-object design later?

#

I could setup another eval maybe

river belfry Mar 17, 2025, 4:44 PM

#

river belfry I was looking at make the llm part working with `llama.cpp` Did anyone already t...

Answering myself, but https://github.com/ggml-org/llama.cpp/pull/12379 looks like to make it work. Sort of. At least I don't have anymore the server error about streams and tools. That's a first step in the right direction

smoky ocean Mar 17, 2025, 4:51 PM

#

river belfry Answering myself, but https://github.com/ggml-org/llama.cpp/pull/12379 looks lik...

cc @lilac crystal 😛

spring wave Mar 17, 2025, 4:51 PM

#

multi-object prompt/metaphor engineering

lilac crystal Mar 17, 2025, 4:51 PM

#

smoky ocean cc <@271799954866044928> 😛

🫡

river belfry Mar 17, 2025, 4:59 PM

#

I wonder if it isn't related, but with ollama and llama3.2 on my machine, I have the same behavior.
I'm running the version llm.9 and if I'm trying toy-programmer with the classical go-program "develop a curl clone" | terminal it does nothing.
By nothing I mean

✔ go-program "develop a curl clone" | terminal 4.7s
│🧑 You are an expert go programmer. You have access to a workspace
│ ┃ 0.0s
│
│🧑 Complete the assignment written at assignment.txt
│ ┃ 0.0s
│
│🧑 Don't stop until the code builds
│ ┃ 0.0s
Container@xxh3:f699019bc6b5b1b3

And nothing more.
So this looks like the behaviour I have with llama.cpp
(If useful https://dagger.cloud/eunomie/traces/9fbb1d84e75179ccf9687c71b06a9a2d)

merry scarab Mar 17, 2025, 5:45 PM

#

If I am getting an error like this

│🤖 0.8s
│ ! POST "https://api.anthropic.com/v1/messages": 400 Bad Request
│ ! {"type":"error","error":{"type":"invalid_request_error","message":"prompt is too long:
│ ! 211147 tokens > 200000 maximum"}}
! input: llm.withContainer.withPrompt.id select: POST
! "https://api.anthropic.com/v1/messages": 400 Bad Request
! {"type":"error","error":{"type":"invalid_request_error","message":"prompt is too long:
! 211147 tokens > 200000 maximum"}}

Is there any easy way for me to see what the prompt was in dagger?

shrewd ermine Mar 17, 2025, 6:20 PM

#

river belfry I wonder if it isn't related, but with `ollama` and `llama3.2` on my machine, I ...

llama3.2 is super small and might need a more helpful prompt compared to bigger models. Compare it to llama3.1 and qwen2.5-coder:7b that will probably at least try to use it's tools automatically. With llama3.2 maybe something like "you have tools to access a workspace where you can read, write, and build code. develop a curl clone". That might be enough

spring wave Mar 17, 2025, 6:22 PM

#

river belfry I wonder if it isn't related, but with `ollama` and `llama3.2` on my machine, I ...

ah sorry this regressed - should be fixed on head of llm

shrewd ermine Mar 17, 2025, 6:22 PM

#

oh i missed it, what was the regression on llm.9?

spring wave Mar 17, 2025, 6:32 PM

#

shrewd ermine oh i missed it, what was the regression on llm.9?

the old-style getters (Llm.workspace) weren't calling sync internally, so the LLM never actually ran

#

v0.17.0-llm.10

smoky ocean Mar 17, 2025, 7:10 PM

#

To everyone using Dagger's agent features: what do you think of the new website? Do you recognize the reasons you personally are excited about Dagger? https://dagger.io

Dagger.io

Dagger is an open-source runtime for composable workflows. It's perfect for systems with many moving parts and a strong need for repeatability, modularity, observability and cross-platform support.

abstract iron Mar 17, 2025, 8:02 PM

#

smoky ocean To everyone using Dagger's agent features: what do you think of the new website?...

What are the use cases of using llms through dagger ?

subtle surge Mar 17, 2025, 8:08 PM

#

abstract iron What are the use cases of using llms through dagger ?

The "Dagger in Action" section lists examples with links to the README. Are you looking for something else?

abstract iron Mar 17, 2025, 8:09 PM

#

subtle surge The "Dagger in Action" section lists examples with links to the README. Are you ...

Mhm do you think it would be possible to have an llm that judge commits on the repository of a project ?

#

I've seen a company selling a product like that but I guess it can be made using dagger aswell

subtle surge Mar 17, 2025, 8:15 PM

#

abstract iron Mhm do you think it would be possible to have an llm that judge commits on the r...

Yeah, I could see that as an expansion of @shrewd ermine 's AI Agents in CI demo - #1349235356184350770 message

abstract iron Mar 17, 2025, 8:16 PM

#

subtle surge Yeah, I could see that as an expansion of <@135620352201064448> 's AI Agents in ...

Ok thanks i'll check

shrewd ermine Mar 17, 2025, 8:16 PM

#

Also check out the examples here: https://docs.dagger.io/ai-agents#examples

smoky ocean Mar 17, 2025, 8:33 PM

#

abstract iron Mhm do you think it would be possible to have an llm that judge commits on the r...

yes seems very doable

shrewd ermine Mar 17, 2025, 11:17 PM

#

@spring wave trying the . | .llm thing on llm.10. What's the flow look like? I tried

. | .llm
> are my tests passing?

spring wave Mar 17, 2025, 11:17 PM

#

shrewd ermine <@108011715077091328> trying the `. | .llm` thing on llm.10. What's the flow loo...

what did it do? seems like it should work

shrewd ermine Mar 17, 2025, 11:20 PM

#

hmm i think it got confused about arguments. that makes sense I guess because I should give it variables for those

#

yup now we're good

spring wave Mar 17, 2025, 11:21 PM

#

if your module constructor takes args you'll need to pass them to .

#

was that it?

#

so like . arg1 arg2 | .llm

shrewd ermine Mar 17, 2025, 11:21 PM

#

I'm using the module from the quickstart and I forgot it doesn't use context directories. So I made a source=$(directory | with-directory / .) and it figured it out

#

multi object 🚀

spring wave Mar 17, 2025, 11:22 PM

#

ah ok nice

woeful quiver Mar 17, 2025, 11:55 PM

#

That new prompt for external access to modules is 🔥

smoky ocean Mar 18, 2025, 7:12 PM

#

@spring wave quick feedback from ongoing mini-workshop: multi-object without auto-save requires custom prompting every time (at least on gpt-4o): "save to the same variable when you're done"

subtle surge Mar 18, 2025, 7:16 PM

#

Hi new and old Daggernauts!

If you’re new here, we host a Dagger Community Call every other week to showcase what the community is building. You can check out past calls here: https://www.youtube.com/@dagger-io/streams.

Are you working on a Dagger Agent project? We’d love to highlight your work in an upcoming call...and yes, there will be Dagger swag involved! 😃

Your project doesn’t need to be finished. We love seeing work in progress and half-baked ideas.

If you’re interested, DM me and I’ll be happy to add you to the agenda or answer any questions. Looking forward to seeing what you’re building!

smoky ocean Mar 18, 2025, 7:25 PM

#

Workshop feedback 🧵

merry scarab Mar 18, 2025, 7:40 PM

#

Im stuck in a doom loop

│🤖 0.2s
│ ! POST "https://api.anthropic.com/v1/messages": 400 Bad Request
│ ! {"type":"error","error":{"type":"invalid_request_error","message":"messages.7:
│ ! `tool_use` ids were found without `tool_result` blocks immediately after:
│ ! toolu_016BtYBAoQfwMzn4P7CeBD2p. Each `tool_use` block must have a corresponding
│ ! `tool_result` block in the next message."}}
! input: llm.withPrompt.loop.withPrompt.loop.withPrompt.loop.setContainer.withPrompt.sync
! select: POST "https://api.anthropic.com/v1/messages": 400 Bad Request
! {"type":"error","error":{"type":"invalid_request_error","message":"messages.7: `tool_use`
! ids were found without `tool_result` blocks immediately after:
! toolu_016BtYBAoQfwMzn4P7CeBD2p. Each `tool_use` block must have a corresponding
! `tool_result` block in the next message."}}

Anyone seen this before?

merry scarab Mar 18, 2025, 7:40 PM

#

merry scarab Im stuck in a doom loop ``` │🤖 0.2s │ ! POST "https://api.anthropic.com/v1/me...

Just trying to have the llm use my container in vibe mode

spring wave Mar 18, 2025, 7:41 PM

#

@merry scarab that's fixed on llm tip

merry scarab Mar 18, 2025, 7:42 PM

#

spring wave <@920499459484418068> that's fixed on `llm` tip

Can you ELI5 how to get on that, I have been using this handy dandy command 🙂

curl -fsSL https://dl.dagger.io/dagger/install.sh | DAGGER_VERSION=0.17.0-llm.9 BIN_DIR=/usr/local/bin sh

spring wave Mar 18, 2025, 7:44 PM

#

v0.17.0-llm.11

merry scarab Mar 18, 2025, 8:45 PM

#

am i being dumb?

I expect this prompt to use the container I give it, but instead it tries to use ubuntu

● llm | with-container $(container | from alpine) | with-prompt "you have access to a container, us
│🧑 you have access to a container, use it to install chromium
│ ┃ 0.0s
│
│🤖 I'll help you install Chromium using a container. I'll use an Ubuntu base image and install
│ ┃ Chromium using apt.
│ ┃ 2.9s ◆ Input Tokens: 11,846 ◆ Output Tokens: 83
│
│ ✔ Container.from(address: "ubuntu:latest"): Container! 1.1s
│
│ ✔ remotes.docker.resolver.HTTPRequest 0.1s
│ ✔ remotes.docker.resolver.HTTPRequest 7.0s

shrewd ermine Mar 18, 2025, 8:47 PM

#

merry scarab am i being dumb? I expect this prompt to use the container I give it, but inste...

yeah, that'll happen 😂 In the future we'll be able to mask an object's functions so you could actually restrict it from using from, but for now you can add more prompting to say "dont use from. only use the withExec tool"

merry scarab Mar 18, 2025, 8:47 PM

#

shrewd ermine yeah, that'll happen 😂 In the future we'll be able to mask an object's function...

Thanks!

merry scarab Mar 18, 2025, 9:15 PM

#

shrewd ermine yeah, that'll happen 😂 In the future we'll be able to mask an object's function...

Hm sorry something still feels wrong, it tells me i didnt give it a container at atll. Feels like something silly

⋈ llm | with-container $ctr | with-prompt "i gave you a container, does it have a browser in

And then its like "sorry i dont have a container"

storm gate Mar 18, 2025, 9:54 PM

#

Question, since -llm.{10,11}, the cli fails to read my api keys from the env, it now fails reading from (an non-existing) .env file. Did anything change or do we have a regression?

shrewd ermine Mar 18, 2025, 10:15 PM

#

storm gate Question, since `-llm.{10,11}`, the cli fails to read my api keys from the env, ...

it currently still shows a bunch of errors for the stuff it doesn't find, but should still find the vars that are set. What env vars are you setting?

storm gate Mar 18, 2025, 10:16 PM

#

shrewd ermine it currently still shows a bunch of errors for the stuff it doesn't find, but sh...

The usual keys, and I get an empty answer from the llm

shrewd ermine Mar 18, 2025, 10:17 PM

#

oh but it does answer? 🤔 not like a 403 or something? What provider are you hitting? I'm mainly using ollama and gemini

spring wave Mar 18, 2025, 10:48 PM

#

shrewd ermine it currently still shows a bunch of errors for the stuff it doesn't find, but sh...

Showing those errors should be fixed, lmk if you're still seeing it

storm gate Mar 18, 2025, 10:55 PM

#

spring wave Showing those errors should be fixed, lmk if you're still seeing it

The empty response was my own mistake, but I can see these errors, only when I introspect the span from the LLM router config:

#

It's not bothering at all, they stay collapsed by default and the overall flow works without errors.

spring wave Mar 18, 2025, 10:57 PM

#

oh, yeah the collapsing was the solution 😛 - it should only expand if they all fail

#

would be nice to avoid them in the first place for sure

shrewd ermine Mar 18, 2025, 11:01 PM

#

spring wave Showing those errors should be fixed, lmk if you're still seeing it

yep you're right, it's just burned into my eyes lol

spring ocean Mar 19, 2025, 12:12 AM

#

I'm from the hack day @smoky ocean's. i was playing with the agent and Gemini and I was struggling to have it find tools:
I've attached the output from my interaction with the model. I have been asked to ping @spring wave

📎 message.txt

merry scarab Mar 19, 2025, 1:22 AM

#

Weird bug - prompt is completely missing here but its doing the right thing 🙂 https://v3.dagger.cloud/levlaz/traces/a8a6e230396e3a873190d50b9b896582?listen=7fbd4e97db816dab&listen=929acc8e9ca3db93&showHidden=52a5eb8074ffedd9#a340488547251a0c

Dagger Cloud

Browse and visualize Dagger traces.

smoky ocean Mar 19, 2025, 2:52 AM

#

@shrewd ermine I have function masks almost working 🙂

shrewd ermine Mar 19, 2025, 2:53 AM

#

https://tenor.com/view/come-here-come-on-sad-im-here-for-you-come-gif-17176157

Tenor

smoky ocean Mar 19, 2025, 2:53 AM

#

pushing the branch

shrewd ermine Mar 19, 2025, 2:54 AM

#

what's the UX? I pass a list of function names as opts?

smoky ocean Mar 19, 2025, 2:54 AM

#

https://github.com/dagger/dagger/pull/9899

GitHub

llm: function mask by shykes · Pull Request #9899 · dagger/dagger

smoky ocean Mar 19, 2025, 2:55 AM

#

shrewd ermine what's the UX? I pass a list of function names as opts?

llm | with-container $foo --function-mask=withExec,rootfs,directory
llm | set-container foo $bar --function-mask=withExec,rootfs,directory

(applies to all types, not just container)

shrewd ermine Mar 19, 2025, 2:56 AM

#

very cool. Building it now

#

still building 😅 I need @spring wave 's pc

smoky ocean Mar 19, 2025, 3:07 AM

#

https://tenor.com/view/the-simpsons-homer-simpson-suspicious-eating-watching-clock-gif-19258678

Tenor

shrewd ermine Mar 19, 2025, 3:15 AM

#

✘ llm | with-container $ctr --function-mask=withExec,rootfs,directory 0.0s
! input: llm.withContainer index 0 out of bounds
│ ✘ LLM.withContainer(
│ │ │ functionMask: ["withExec", "rootfs", "directory"]
│ │ │ value: ✔ Container.withWorkdir(path: "/app"): Container! 0.0s
│ │ ): LLM! 0.0s
│ ! index 0 out of bounds
! input: llm.withContainer index 0 out of bounds

smoky ocean Mar 19, 2025, 3:16 AM

#

😦

#

I wasn't able to test it, ran into unrelated LLM hang issues

#

probably something stupid. will look into it

shrewd ermine Mar 19, 2025, 3:17 AM

#

ah got it, wasn't sure if i was holding it wrong

smoky ocean Mar 19, 2025, 3:19 AM

#

@shrewd ermine is there a stack trace in the engine?

shrewd ermine Mar 19, 2025, 3:19 AM

#

guessing something's up here https://github.com/dagger/dagger/pull/9899/files#diff-73f0094219b4f5510b335d7f8ca08d679fb0c4b2014c3dbdcaafe6d84bd3f38fR873

GitHub

llm: function mask by shykes · Pull Request #9899 · dagger/dagger

shrewd ermine Mar 19, 2025, 3:57 AM

#

smoky ocean <@135620352201064448> is there a stack trace in the engine?

no but I added some debugging. elmts.Len() is 3, i is 0, and it's returning index out of bounds 🤔

#

and that's coming from here https://github.com/dagger/dagger/pull/9899/files#diff-73f0094219b4f5510b335d7f8ca08d679fb0c4b2014c3dbdcaafe6d84bd3f38fR946

smoky ocean Mar 19, 2025, 4:08 AM

#

ah...

#

I smell something stupid

shrewd ermine Mar 19, 2025, 4:09 AM

#

haha yeah...

steep onyx Mar 19, 2025, 4:09 AM

#

dagql arrays are 1-indexed, you probably want i+1

#

In the call to Nth

shrewd ermine Mar 19, 2025, 4:10 AM

#

that would do it! I'm sure there's a fun story behind that

#

trying that

#

confirmed that fixes it 🙏

next speedbump

✘ llm | with-container $(container | from golang:latest) --function-mask=withExec,rootfs,directory | with-prompt "write a curl clone" | container | terminal 0.0s
! input: llm.withContainer.withPrompt.container instantiate: cannot instantiate dagql.Class[*github.com/dagger/dagger/core.Container] with core.maskedValue

spring wave Mar 19, 2025, 4:43 AM

#

shrewd ermine that would do it! I'm sure there's a fun story behind that

someday...

#

it started out as a "i don't want to deal with *int everywhere", how did it end up like this...

smoky ocean Mar 19, 2025, 4:48 AM

#

shrewd ermine confirmed that fixes it 🙏 next speedbump ``` ✘ llm | with-container $(contain...

this one might be more serious..

#

I think my "clever" approach to add masking with minimal changes, might be a little too naive

#

will need to add a little more substance to it tomorrow

#

(tldr I wrap the actual value, of interface type dagql.Typed , with a simple wrapper type that keeps the original value embedded, and adds just the mask field:

type maskedValue struct {
 dagql.Typed
 mask []string
}

I was banking on the fact that my maskedValue still implements the Typed interface, with the original value - the perfect passthrough.

Except not at all, because callers try to cast it back to the original type, and I can't pass-through typecasts (I guess)

#

I think I'll need to define a new interface, and use that instead of dagql.Typed across the whole llm.Withxxx call chain

shrewd ermine Mar 19, 2025, 4:54 AM

#

Yeah makes sense!

smoky ocean Mar 19, 2025, 5:03 AM

#

@spring wave @shrewd ermine multi-object DX bikeshed. How do you feel about:

LLM.bindContainer("foo", ...)
LLM.bindings()
LLM.binding("foo").container()

LLM.bindToyWorkspace("bar", ...)
LLM.bindings()
LLM.binding("bar").toyWorkspace()

LLM.bindString("baz", ...)
LLM.bindings()
LLM.binding("baz").string()

Benefits:

Consistency around the word "binding". Everything related to bindings has the common root "bind".
bindings() allows for listing existing bindings -> that's a gap currently
binding() groups all the getters. So that cuts the volume in half right there
binding() and bindings() will always be listed immediately after bindFoo because of the uppercase/lowercase sorting

shrewd ermine Mar 19, 2025, 5:06 AM

#

Works for me! Would the value from .binding() have a GetType or something too?

smoky ocean Mar 19, 2025, 5:06 AM

#

well it has type()

#

could be getType() or asType()

#

went for the shortest as a baseline

shrewd ermine Mar 19, 2025, 5:07 AM

#

Yeah makes sense, was just trying to think about using values from bindings()

smoky ocean Mar 19, 2025, 5:07 AM

#

alternative: replace LLM.bindFoo with LLM.withFooBinding

smoky ocean Mar 19, 2025, 5:08 AM

#

shrewd ermine Yeah makes sense, was just trying to think about using values from bindings()

Oh bindings plural

shrewd ermine Mar 19, 2025, 5:09 AM

#

smoky ocean Oh `bindings` plural

No you had it right! Sorry replying from my phone... So I'd get keys from bindings() and check their type from binding(key)

smoky ocean Mar 19, 2025, 5:12 AM

#

shrewd ermine No you had it right! Sorry replying from my phone... So I'd get keys from bindin...

Yeah, but we can also add any metadata in the result of bindings(), if we want to introspect their type, or current id etc.

#

In practice callers are supposed to know the type they want

shrewd ermine Mar 19, 2025, 5:13 AM

#

Yeah I was trying to think of the case where the LLM was able to save to a new variable (if that's going to be allowed) and how to safely find that

#

And withFooBinding sounds good too

#

I do like the parallels to container.withServiceBinding

smoky ocean Mar 19, 2025, 5:16 AM

#

ctr, err := dag.LLM().WithPrompt("please save the container to $foo. don't mess up please").Binding("foo").Container()
if err != nil {
 panic("you had one job")
}

proper stratus Mar 19, 2025, 5:21 AM

#

Where I can find examples of using multi-object? Wanna try that out

smoky ocean Mar 19, 2025, 5:23 AM

#

@proper stratus docs update coming very soon!

#

@proper stratus in the meantime, you can try it straight from the shell:

Start dagger shell

$ dagger

Make sure it's v0.17.0-llm.11 (released today)

Set a few variables in the shell

ctr=$(container | from alpine | with-new-file hi.txt "Hi Bob")
dagger_repo=$(git https://github.com/dagger/dagger)

Switch to "prompt mode"

Start prompting

I gave you a container and a git repository. First, open the file hi.txt in the container, and tell me its contents. Then, fetch the last stable release of the git repo, get the subdirectory docs/, and copy them into the container I gave you. Save the result to new_container

Switch back to shell mode

Check that the new container was created

$new_container | terminal

#

(it's simpler that it seems in written form)

spring wave Mar 19, 2025, 5:29 AM

#

side note: been considering tab to swap between prompt/shell, assuming the input is empty (have to compete w/ tab completion)

smoky ocean Mar 19, 2025, 5:30 AM

#

@proper stratus in code, you can use LLM.Set<Foo>() and LLM.Get<Foo>() where <Foo> is the binding type. If you're familiar with the single-object API, it's the same, except you need to specify a key

smoky ocean Mar 19, 2025, 5:30 AM

#

spring wave side note: been considering `tab` to swap between prompt/shell, assuming the inp...

I vote Ctrl-/ 🙂

spring wave Mar 19, 2025, 5:30 AM

#

is that from something?

smoky ocean Mar 19, 2025, 5:30 AM

#

or Ctrl-<something>

smoky ocean Mar 19, 2025, 5:30 AM

#

spring wave is that from something?

no, just a throwback to the slash commands

#

conveniently placed. prime location

#

in an up-and-coming neighborhood

spring wave Mar 19, 2025, 5:31 AM

#

lol

proper stratus Mar 19, 2025, 5:42 AM

#

If I run the module in Dagger shell, does the agent know the TUI output? I want to try give the agent that output so it can help me improve my Dagger module performance.

smoky ocean Mar 19, 2025, 5:47 AM

#

proper stratus If I run the module in Dagger shell, does the agent know the TUI output? I want ...

Ah. No, the agent can only see the output of functions that it calls itself. One thing you can do, is give it a container with the dagger CLI installed (with dagger-in-dagger nesting enabled) then have it run dagger CLI commands in there.

#

I believe @merry scarab was working on something very similar just today

#

also, @spring wave is working on allowing agents to access your current module's dependencies. That would allow you to install the modules of your choice, then have the agent call them directly

proper stratus Mar 19, 2025, 5:55 AM

#

So currently if I give it a module, it just knows what I write in that module, not the dependencies I install in that module?

smoky ocean Mar 19, 2025, 6:00 AM

#

yeah 1) it can't access the dependencies and 2) it can't call the module constructor, you have to call it and bind the object instance to the llm

smoky ocean Mar 19, 2025, 6:35 AM

#

Today's workshop made me think about API integration.

@spring ocean and @wraith remnant worked on an agent that involves a lot of them. At the moment it's possible to write Dagger modules that wrap cloud APIs, and there are benefits to that - but it's labor-intensive. The DX is cumbersome and there are gaps, for example Dagger/Graphql types don't map directly to JSON and OpenAPI (eg. no maps). I believe @violet stump, @olive badge and @uneven depot brought this up in the in the past.

What if we added first class to external APIs somehow? Maybe as a special kind of dependency - imagine if your dagger.json could have remote APIs as a dependency, and the engine exposed that as a dagql module? The dependency source could be an OpenAPI/Swagger/graphql schema of some kind (I'm sure there are catalogs out there). They would be loaded by a special builtin SDK. Could be a big boost to our DX

uneven depot Mar 19, 2025, 12:58 PM

#

That's a neat idea! I wonder if that same idea can extend to CLI tools also? That's what I end up wrapping more than APIs. I usually try to get an official image for the tool, if not pull an Alpine container and install it. CLI tools don't have a common structure though, like openAPI so, it's probably impossible. There's no guarantee a rest API follows the oapi spec also, so consumers may end up with weird errors that they can't directly identify because the api is wrapped in another SDK.

woeful quiver Mar 19, 2025, 7:50 PM

#

I'm making a few changes and will polish and publish, but I thought it would be fun to write a database agent that can take a database connection and answer questions. I'm using an example database for dvd rentals.

shrewd ermine Mar 19, 2025, 8:15 PM

#

👀

woeful quiver Mar 19, 2025, 8:24 PM

#

I saw that yesterday, was asked for that very feature last night at the meetup 🔥

spring wave Mar 19, 2025, 8:26 PM

#

FrogeAlarm llm has been merged into main

shrewd ermine Mar 19, 2025, 8:27 PM

#

https://tenor.com/view/oh-my-god-its-happening-ok-stay-calm-the-office-gif-6091046

Tenor

smoky ocean Mar 19, 2025, 8:36 PM

#

smoky ocean <@108011715077091328> <@135620352201064448> multi-object DX bikeshed. How do you...

Follow-up to DX thread @spring wave @shrewd ermine. Should we consider spinning out a LLMEnvironment type, separate from LLM? The former would have the bindings & state management. The latter would have the prompting and endpoint routing. Soon there will be MCP that currently grafts onto LLM. But would now cleanly graft onto LLMEnvironment instead.

Maybe makes the modules code cleaner also? Clear delineation of the LLM vs. its environment?

shrewd ermine Mar 19, 2025, 8:36 PM

#

you have my attention 🙂

spring wave Mar 19, 2025, 8:37 PM

#

yeah, was thinking something similar

shrewd ermine Mar 19, 2025, 8:38 PM

#

"environment" is the accepted industry term for where llm's interact with their tools and state right?

#

or is it more specific

smoky ocean Mar 19, 2025, 8:57 PM

#

it will be 😇

#

i think the industry is stuck on "tools" and will soon realize that they need more. Environment in my opinion is the next logical evolution, and I think we should spearhead it.

#

An environment implies 1) objects 2) state 3) rules for how objects interact

#

all of which dagger can provide

#

cc @noble notch 👆

shrewd ermine Mar 19, 2025, 8:59 PM

#

ship it!

smoky ocean Mar 19, 2025, 9:00 PM

#

Environment API 🧵

noble notch Mar 19, 2025, 9:03 PM

#

smoky ocean An environment implies 1) objects 2) state 3) rules for how objects interact

Stuck on tools and frameworks!

smoky ocean Mar 19, 2025, 9:28 PM

#

loop() 🧵

storm gate Mar 19, 2025, 9:44 PM

#

I switched from the llm tag release, to main. @worn hill it'd be nice if we could set the --allow-llm from an env var for the CI. I thought setting DAGGER_LLM_ALLOW=all would override the cli arg but it does not work

worn hill Mar 19, 2025, 9:45 PM

#

storm gate I switched from the llm tag release, to main. <@430802613848506380> it'd be nice...

yeah i can add this, shouldn't be hard. i had another post-flagparse processing thing i wanted to do anyways

storm gate Mar 19, 2025, 10:43 PM

#

I finally got my end to end AI flow working reliably: https://github.com/samalba/demo-app/pull/19

This PR is generated by an agent, and reviewed by another, with a recommendation to merge or not, based on the diff and PR description.

It looks like I am discussing alone spidermanpointing because of the github token I use. It's pretty cool!

All the code is here: https://github.com/samalba/agents

smoky ocean Mar 19, 2025, 10:47 PM

#

https://tenor.com/view/meeseeks-mee6-roped-me-into-this-roped-me-into-this-gif-21405644

Tenor

smoky ocean Mar 19, 2025, 11:44 PM

#

smoky ocean Today's workshop made me think about API integration. <@426920645402689537> and...

https://github.com/dagger/dagger/issues/9914

quiet ether Mar 20, 2025, 1:12 AM

#

smoky ocean or `Ctrl-<something>`

asked ChatGPT about that but didn't take any hot takes. Mostly referencing vim's modes and python's ! special character

having said that C-/ maps as ^_ in my keyboard ( I think a bunch them do for some reason) which then bash in my case uses it to undo. FWIW C-[ and C-] are not currently remapped to anything and seems like bash doesn't use them

spring wave Mar 20, 2025, 1:17 AM

#

quick idea: LLM.interrogate - like Container.terminal but for debugging an LLM. Runs the .sync and then pops you into interactive prompt-mode shell so you can ask it why it messed up

#

or, a way to pipe a LLM to .llm so you can load it as your current session, then you can at least change your function to return *LLM

quiet ether Mar 20, 2025, 1:22 AM

#

spring wave or, a way to pipe a `LLM` to `.llm` so you can load it as your current session, ...

have we incorporated the concept of stop_sequences? I mean.. how can we know if the LLM effectively accomplished its task to interrogate it?

#

Ideally you'd want to interrupt it when you know it's just not going anywhere, right?

spring wave Mar 20, 2025, 1:23 AM

#

yeah that's another thing i've been wondering, if we do that we can get -i to do it automatically which is even better

#

for what i suggested you'd just splice it in after your last prompt before things go haywire, and hope it does it again. (same as splicing in .Terminal())

spring wave Mar 20, 2025, 1:24 AM

#

quiet ether have we incorporated the concept of `stop_sequences`? I mean.. how can we know i...

is there a way to do this? i didn't see it in the API when I looked, and feared the only option would be some sort of sentiment analysis thing lol

#

MVP could just be grepping for "sorry" haha

quiet ether Mar 20, 2025, 1:25 AM

#

spring wave is there a way to do this? i didn't see it in the API when I looked, and feared ...

anthropic API has it at least

#

woeful quiver Mar 20, 2025, 1:25 AM

#

woeful quiver I'm making a few changes and will polish and publish, but I thought it would be ...

Just pushed the code , short video, and an updated README to https://github.com/jasonmccallister/database-agent

Would love an extra set of eyes/feedback if there is any!

GitHub

GitHub - jasonmccallister/database-agent: Dagger module to give an ...

Dagger module to give an AI Agent access to a database - jasonmccallister/database-agent

woeful quiver Mar 20, 2025, 1:31 AM

#

woeful quiver Just pushed the code , short video, and an updated README to https://github.com/...

managed to get that to < 300 lines of code - but supports bot mysql and postgres, Might be able to trim that down even more if I tried

smoky ocean Mar 20, 2025, 2:17 AM

#

@spring wave probably safest and most portable to have _error builtin that llm can call to report an error

#

I love the idea of explicit LLM.terminal()

#

and I think prompt mode in the CLI should use it

#

separately, I think it would be SUPER powerful if you could just save variables of type LLM, and automatically the prompt mode shortcut can cycle through them. the default LLM would be a special case of this

I like this variable-based approach better than .llm which is too close to llm

spring wave Mar 20, 2025, 2:39 AM

#

https://tenor.com/ZeJP.gif (in response to _error)

Tenor

noble notch Mar 20, 2025, 3:23 AM

#

woeful quiver Just pushed the code , short video, and an updated README to https://github.com/...

check out this template with some tips, namely to make it easier to understand and more interesting to people who'll see this and won't be familiar with Dagger or "modules" yet.

📎 readme-template-markdown.md

noble notch Mar 20, 2025, 3:33 AM

#

noble notch check out this template with some tips, namely to make it easier to understand a...

would love to post it to HN as something like "Show HN: DBAgent - Talk to your database" [1]
in fact there was a recent similar post that did well: https://news.ycombinator.com/item?id=43356039
[1] DBA but the A is for Agent 🤪

todsacerdoti

Xata Agent: AI agent expert in PostgreSQL

river belfry Mar 20, 2025, 12:54 PM

#

I have hard time to use llama.cpp because of tools and streaming not supported at the same time.
So I'm running that on my machine: https://github.com/dagger/dagger/pull/9919
Basically if there's a tool it doesn't use streaming in the same call.
That looks like to work, but I don't know if that's the right way to do it or if it can have side effects

GitHub

do not use tools and streaming at the same time by eunomie · Pull R...

For llama.cpp that doesn't support both at the same time

woeful quiver Mar 20, 2025, 2:42 PM

#

noble notch check out this template with some tips, namely to make it easier to understand a...

updating the readme with those changes: https://github.com/jasonmccallister/database-agent/pull/2

GitHub

Update version and docs by jasonmccallister · Pull Request #2 · jas...

Updates the Dagger version to the mainline release
Updates the documentation from feedback

river belfry Mar 20, 2025, 3:52 PM

#

river belfry I have hard time to use `llama.cpp` because of tools and streaming not supported...

With that change, I finally have a fully local experience using llama.cpp that works. For instance the toy-programmer is working as expected. A bit slower of course, but that works.

shrewd ermine Mar 20, 2025, 3:59 PM

#

river belfry With that change, I finally have a fully local experience using llama.cpp that w...

I think the tricky part with this is that it seems like it's just llama.cpp that doesn't fully support the OpenAI compat API, since this currently works fine for other OpenAI compat APIs like openai itself, azure, ollama, etc. We had the same problem with Gemini because they advertise an OpenAI compat endpoint but it doesn't fully work, so we implemented the native Gemini client instead

polar loom Mar 20, 2025, 6:26 PM

#

Hi
I've written an example of how to use an agent with Dagger in Python:
https://github.com/azorej/dagger-agent-example

Nothing special: I've used kpenfound/dag/workspace as the base for my workspace module and wrote a simple function to fix Dockerfile.

The most interesting part: I'm using a devcontainer to simplify setup, so it will be easier for others to try out the example.
I haven't seen a lot of use for devcontainers in the Dagger examples, and it's not very practical to have different versions of Dagger installed on one machine.

Therefore, it would be great if we could normalize the use of devcontainers.

GitHub

GitHub - azorej/dagger-agent-example: Test Dagger ML agents

Test Dagger ML agents. Contribute to azorej/dagger-agent-example development by creating an account on GitHub.

workspace :: Daggerverse

A generated module for Workspace functions

#

btw, I am not sure how code generation works in Dagger.
Do I need to maintain a separate workspace module?
Or can I use the same module for both orchestration and the agent workspace?

smoky ocean Mar 20, 2025, 6:31 PM

#

polar loom btw, I am not sure how code generation works in Dagger. Do I need to maintain a ...

Ideally you would not need to maintain a separate module (while being free to, if you want)

There is a temporary limitation which prevents a module from calling itself via the Dagger API. We are working on allowing this. By extension, this also prevents a module from creating a binding to its own types, for a LLM to use.

This is why at the moment you have to separate the module being referenced by a LLM binding, and the module doing the binding.

--> hopefully this makes sense!

smoky ocean Mar 20, 2025, 6:56 PM

#

I'm giving a live demo tonight... Should I show multi-object or not? 🙂

shrewd ermine Mar 20, 2025, 6:58 PM

#

smoky ocean I'm giving a live demo tonight... Should I show multi-object or not? 🙂

✅ ❌

#

ok the ❌ isn't helpful lol. I would vote no just because the DX is still up for discussion (I think? unless the WithFooBinding is in) and the reliability is in question depending on your model and objects

woeful quiver Mar 20, 2025, 7:04 PM

#

Also, the deprecation underlines (at least in Zed) with the current with<> makes my eyes wander like I wrote broken code/syntax

shrewd ermine Mar 20, 2025, 7:04 PM

#

oh but isn't the deprecation warning for single object?

spring wave Mar 20, 2025, 7:44 PM

#

woeful quiver Also, the deprecation underlines (at least in Zed) with the current with<> makes...

ran into this too, thinking of un-deprecating them until we're 100% sure. We haven't been able to fully escape the idea of a "current state", and the pattern of exposing vars to the LLM and pulling a single value out still feels most natural to me

proud sigil Mar 20, 2025, 8:06 PM

#

Hi everyone, I've recently begun exploring Dagger, love the idea of building containers for AI agents. I'm curious if there are common patterns or best practices for picking which models to use. It could be because you want to try different models for the same task and compare. Or, you could be building something that benefits from multiple models each with a specialized task.

shrewd ermine Mar 20, 2025, 8:10 PM

#

proud sigil Hi everyone, I've recently begun exploring Dagger, love the idea of building con...

Welcome! Definitely checkout https://docs.dagger.io/ai-agents#faq , it's a bit bare right now but we've been working on adding best practices as we can. As far as model selection, claude 3.7 and gpt-4o seem to be pretty capable in general. I've been enjoying gemini-2.0-flash too but you need to get the prompting just right for it to be successful. For coding tasks, qwen2.5-coder of whatever size you can run has been good too, but also needs just the right prompting and configuration

proud sigil Mar 20, 2025, 8:12 PM

#

Thanks, I'll check out the FAQ. Do you think it would be worthwhile to build a module that could abstract away the model selection? As in, not have to fret about which is the current SOTA model for X and just have the module enable the current best? I know that sounds a bit abstract.

#

I imagine with more "vibe coding", you just forget about which model(s) and say "give me the best model that I can run on this machine right now for this task."

shrewd ermine Mar 20, 2025, 8:14 PM

#

Yeah it's an interesting problem. Most of the functions I've been writing specify a default model but allow one to be passed in. The hard part that I've seen is that the prompting is somewhat model-specific so it's hard to just swap out the model and keep everything else the same

proud sigil Mar 20, 2025, 8:16 PM

#

As it is, it's hard to keep track of which model identifier is right <model family><version>-<params>-<tuned for>-<quantized>. The naming conventions for these models is...rough.

#

Let alone the right prompt style/setup

shrewd ermine Mar 20, 2025, 8:21 PM

#

Yeah I totally agree. It would be nice to have that kind of thing handled at some model router level since at the agent/Dagger level you don't necessarily know what models are available

proud sigil Mar 20, 2025, 8:23 PM

#

It's something my team and I are looking into/building. I was looking at Dagger separately for workflow orchestration and then got nerd-swiped with agent containerization. Perhaps we can contribute.

shrewd ermine Mar 20, 2025, 8:23 PM

#

Basically it would be cool if the agent could say "give me a model that meets these criteria" and the model server gives you the best fit

proud sigil Mar 20, 2025, 8:24 PM

#

Which I think works great since containers may have the same functionality but access to different hardware resources or compute budget

#

So if the model registry could choose based on the resources available, it'd be a nice abstraction. Unless I'm misunderstanding the intent with containers.

shrewd ermine Mar 20, 2025, 8:26 PM

#

Yeah its a surprisingly similar problem to container orchestration/scheduling in platforms like kubernetes. The app doesn't say "put me on this node", it just says "give me a node with this cpu/memory and access to this volume"

#

I expect something like that to show up here sometime soon 😄 https://openrouter.ai/docs/features/model-routing

OpenRouter Documentation

Model Routing - Smart Model Selection and Fallback

Route requests dynamically between AI models. Learn how to use OpenRouter's Auto Router and model fallback features for optimal performance and reliability.

proud sigil Mar 20, 2025, 8:30 PM

#

Yes, I think this is a good starting point.

smoky ocean Mar 20, 2025, 9:28 PM

#

I was definitely thinking about adding models: [string] as an argument to LLM()

proud sigil Mar 20, 2025, 9:32 PM

#

How would that work?

#

It can choose from the set or have fall backs if the first doesn't work?

smoky ocean Mar 20, 2025, 9:32 PM

#

proud sigil It can choose from the set or have fall backs if the first doesn't work?

Either "any of these" or ordered by preference with fallback - wasn't sure

proud sigil Mar 20, 2025, 9:39 PM

#

I like that affordance. I'd be curious how to build logic around the set of models. How to make it easier for the developer or the workflow to choose among the models in the set.

#

But allowing for multiple models is a good starting point IMO

smoky ocean Mar 20, 2025, 9:41 PM

#

proud sigil I like that affordance. I'd be curious how to build logic around the set of mode...

Well it would be the developer of the workflow providing a list of models it doesn't mind using

#

that would be the choice

proud sigil Mar 20, 2025, 10:05 PM

#

Choice is good

smoky ocean Mar 20, 2025, 10:24 PM

#

My personal coding agent 🧵

smoky ocean Mar 20, 2025, 11:11 PM

#

as soon as 0.17 is out we can remove the custom install instructions from AI agent tutorial 🥳

steep onyx Mar 20, 2025, 11:13 PM

#

smoky ocean as soon as 0.17 is out we can remove the custom install instructions from AI age...

it's out

shrewd ermine Mar 20, 2025, 11:15 PM

#

smoky ocean as soon as 0.17 is out we can remove the custom install instructions from AI age...

And Jason removed it!

smoky ocean Mar 20, 2025, 11:18 PM

#

https://tenor.com/view/dream-team-basketball-jordan-gif-3213109727397617065

Tenor

#

https://tenor.com/view/late-to-the-party-party-late-walking-in-late-open-the-door-gif-13813203

Tenor

Late To The Party

▶ Play video

woeful quiver Mar 21, 2025, 12:31 AM

#

Is it possible to set a callback for the llm response? Meaning every single response from the llm I can capture and send somewhere?

smoky ocean Mar 21, 2025, 12:32 AM

#

woeful quiver Is it possible to set a callback for the llm response? Meaning every single resp...

You can query it after the fact with LLM.history()

#

That history API is a bit barebones, but we can beef it up to distinguish messages by sender (LLM, user, tool)

#

I think for now you could filter it by emoji in the contents 😛

woeful quiver Mar 21, 2025, 12:36 AM

#

smoky ocean That history API is a bit barebones, but we can beef it up to distinguish messag...

yeah, the demo I was thinking of would be to have a NATS publisher sending the request and a NATS consumer sending all of the responses to the stream - giving a decoupled kind of pub/sub agent

smoky ocean Mar 21, 2025, 12:38 AM

#

@steep onyx @spring wave should I be worried that after running dagger develop with 0.17, my IDE autocompletes to dag.Llm and not dag.LLM ?

woeful quiver Mar 21, 2025, 12:39 AM

#

woeful quiver yeah, the demo I was thinking of would be to have a NATS publisher sending the r...

spring wave Mar 21, 2025, 12:39 AM

#

smoky ocean <@949034677610643507> <@108011715077091328> should I be worried that after runni...

known issue

#

if that reduces the worry 😛

#

i think we need to teach strcase about that acronym

#

ah, looks like we do that in core/ but the codegen code probably (hopefully) isn't loading core/

smoky ocean Mar 21, 2025, 12:45 AM

#

Weird BBI error at the end of this demo prep session: https://v3.dagger.cloud/dagger/traces/09cd836f493f191fdb1ceb31de288a83

Dagger Cloud

Browse and visualize Dagger traces.

#

(see very last error)

spring wave Mar 21, 2025, 12:56 AM

#

looks like it tried to call a function with "app" in place of a FooID arg, essentially an unbound var

#

which didn't work because all the vars are app_*, and there's never just an app thinkspin wonder why it tried that

spring wave Mar 21, 2025, 12:58 AM

#

spring wave ah, looks like we do that in `core/` but the codegen code probably (hopefully) i...

oh geez the codegen has a whole separate case conversion system

spring wave Mar 21, 2025, 1:37 AM

#

smoky ocean <@949034677610643507> <@108011715077091328> should I be worried that after runni...

https://github.com/dagger/dagger/pull/9933

GitHub

sdk(go): Llm -> LLM by vito · Pull Request #9933 · dagger/dagger

This is gonna break a bunch of stuff, but better now than later...

spring wave Mar 21, 2025, 3:16 AM

#

bots make typos too 🤗

smoky ocean Mar 21, 2025, 6:48 AM

#

random question. of multiobj continues to cause problems. should we implement it as single object + shadowing?

#

the builtins system

smoky ocean Mar 21, 2025, 7:25 AM

#

👀👀👀 https://github.com/dagger/dagger/pull/9935

GitHub

Expose a Dagger module as an MCP server by tiborvass · Pull Request...

This introduces a new dagger mcp command that starts an MCP stdio server.
MCP clients can configure to exec: dagger mcp or dagger mcp -m path/to/module.
In this version, each dagger function corres...

smoky ocean Mar 21, 2025, 5:01 PM

#

smoky ocean random question. of multiobj continues to cause problems. should we implement it...

@spring wave wdyt?

#

@warped bramble @wraith remnant my guess is that your MCP pull request already works with multi-object... But only for models that don't need the crutch of a special system prompt

#

also @spring wave we could use the old "read the manual first" trick to inject the system prompt without making it a real system prompt - then it would work over mcp

spring wave Mar 21, 2025, 5:03 PM

#

smoky ocean also <@108011715077091328> we could use the old "read the manual first" trick to...

i think it wouldn't have the weighting of a real system prompt, but yeah could try it

woeful quiver Mar 21, 2025, 5:03 PM

#

This is the error I'm getting on Gemini returning a Go struct: https://v3.dagger.cloud/JasonDagger/traces/c8f50741d8745eb87147a4f4649fac71

Dagger Cloud

Browse and visualize Dagger traces.

spring wave Mar 21, 2025, 5:07 PM

#

smoky ocean <@108011715077091328> wdyt?

I do keep coming around to the idea that single-object is all we need for bootstrapping, and anything else can be implemented as a module that is able to maintain its own state (as the single object) . which i have done on a throwaway branch somewhere. Like I have a pretty strong feeling that different situations might call for different schemes, one of which being 100% control over the set of available tools to keep the model from jumping around and saving vars aimlessly.

smoky ocean Mar 21, 2025, 5:09 PM

#

spring wave I do keep coming around to the idea that single-object is all we need for bootst...

I still need the ability to give the llm not just constructor functions, but pre-configured objects, ie. with my secrets etc

#

@spring wave I'm going to get back to dev mode today, I'm loaded up with demo feedback and papercuts. How do I do this in a way that doesn't conflict?

spring wave Mar 21, 2025, 5:14 PM

#

smoky ocean <@108011715077091328> I'm going to get back to dev mode today, I'm loaded up wit...

a few options:

try my llm-evals branch if you want to start from my experimental changes + have a suite you can use to test your changes
just stick with main, maybe copy over those same evals since they should be compatible
or just monkey around on main
there will probably be conflicts either way but that's OK, experimentation is always messy/good.

smoky ocean Mar 21, 2025, 5:15 PM

#

spring wave a few options: * try my `llm-evals` branch if you want to start from my experime...

I'm thinking I'll start with Environment, which is mostly API changes to use the same underlying implementation. From there, might try the "single object with shadowing" idea

#

I'm thinking we should merge dagger mcp (hidden) asap to avoid conflict storm

warped bramble Mar 21, 2025, 5:16 PM

#

One drawback of single-object (that i'm sure you're already aware of) is that just by bringing in dagger.Container's functions, you get ~70 tools. Which could grow quickly to the 128 tools limit.

spring wave Mar 21, 2025, 5:18 PM

#

that's true for multi-object too

#

but, we never combine tools of multiple types, so that helps a bit

warped bramble Mar 21, 2025, 5:18 PM

#

Ah sorry, i thought there was an indirection (not super familiar with it yet)

spring wave Mar 21, 2025, 5:19 PM

#

i was only able to hit that limit by doing LLM.withLLM since it has a ton of getters/setters lol

smoky ocean Mar 21, 2025, 5:19 PM

#

no you're right @warped bramble . in multi-object you get at most the tools of container; but it doesn't add up as you "unlock" more yypes

spring wave Mar 21, 2025, 5:19 PM

#

(also, funny how a 128 limit is cropping up again, I remember that from the early Docker days with aufs :P)

smoky ocean Mar 21, 2025, 5:20 PM

#

god...

#

did you hear the story of what it turned out to be? Which we discovered much later...

spring wave Mar 21, 2025, 5:20 PM

#

hmmm was it the limit of the mount opts string length or something?

smoky ocean Mar 21, 2025, 5:21 PM

#

yeah exactly. It wasn't actually 128 of anything - it just roughly landed at that number by chance with typical opt strings

#

and we all saw what we needed to see, to make sense of the world

spring wave Mar 21, 2025, 5:22 PM

#

classic

#

the fix: mount everything under /d/

woeful quiver Mar 21, 2025, 6:06 PM

#

woeful quiver This is the error I'm getting on Gemini returning a Go struct: https://v3.dagger...

input: databaseAgent.ask google API error occurred: googleapi: got HTTP response code 400 with body: [{
  "error": {
    "code": 400,
    "message": "Please ensure that function call turn contains at least one function_call part which can not be mixed with function_response parts.",
    "status": "INVALID_ARGUMENT"
  }
}
]

spring wave Mar 21, 2025, 6:12 PM

#

woeful quiver ``` input: databaseAgent.ask google API error occurred: googleapi: got HTTP resp...

most likely unrealted to that Go struct issue, it's some sort of flakiness, haven't been able to pin it down yet

woeful quiver Mar 21, 2025, 6:15 PM

#

spring wave most likely unrealted to that Go struct issue, it's some sort of flakiness, have...

do you want a branch on my repo where I am seeing this to help?

spring wave Mar 21, 2025, 6:16 PM

#

woeful quiver do you want a branch on my repo where I am seeing this to help?

oh I'm having an easy enough time hitting it on my own, thanks 😂

woeful quiver Mar 21, 2025, 6:16 PM

#

spring wave oh I'm having an easy enough time hitting it on my own, thanks 😂

haha ok

somber vault Mar 21, 2025, 7:45 PM

#

Any trick to hide the progress bar so that it doesn't mess with python's input() reading from console?

spring wave Mar 21, 2025, 8:25 PM

#

buckle up, I just merged https://github.com/dagger/dagger/pull/9933 💥 /cc @smoky ocean (breaking change)

GitHub

sdk(go): Llm -> LLM by vito · Pull Request #9933 · dagger/dagger

This is gonna break a bunch of stuff, but better now than later...

smoky ocean Mar 21, 2025, 9:32 PM

#

spring wave buckle up, I just merged https://github.com/dagger/dagger/pull/9933 💥 /cc <@488...

OK installing!

worn hill Mar 21, 2025, 10:12 PM

#

spring wave buckle up, I just merged https://github.com/dagger/dagger/pull/9933 💥 /cc <@488...

how does one fix this exactly with an unreleased cli? like if i do dagger develop on my module that calls Llm (that's used by dagger/dagger tests) it puts a long dev version specifier in the dagger.json, i fix my compile errors and then should i just truncate the version specifier to 0.17.1 ?

spring wave Mar 21, 2025, 10:16 PM

#

worn hill how does one fix this exactly with an unreleased cli? like if i do dagger develo...

I would just fake it and say 0.17.0 tbh. The auto dagger.json version bumping does seem a bit too aggressive to me. Ideally we would only bump if to the minimum required version, and never to dev versions imo

worn hill Mar 21, 2025, 10:18 PM

#

spring wave I would just fake it and say 0.17.0 tbh. The auto `dagger.json` version bumping ...

thinkies but doesn't the 0.17.0 go sdk want Llm?

spring wave Mar 21, 2025, 10:18 PM

#

it does but v0.17.1 isn't a thing afaik

#

so i don't think dev versions would match it?

#

if you say v0.17.0 dev engines can at least still use it

#

you could tag a Llm version before the LLM bump i suppose

worn hill Mar 21, 2025, 10:20 PM

#

squint maybe i'm making bad assumptions about the failures im tryna fix

smoky ocean Mar 21, 2025, 10:22 PM

#

OK so: definitely don't publish llm modules targeting 0.17.0?

#

release 0.17.1 monday?

spring wave Mar 21, 2025, 10:55 PM

#

smoky ocean OK so: definitely don't publish llm modules targeting 0.17.0?

yeah I think it's better for folks to stray forward to dev compatibility vs. maintain compatibility with whatever iteration we happened to ship in v0.17.0

smoky ocean Mar 21, 2025, 10:56 PM

#

@spring wave got a crash trying to use query object in prompt mode, in the dagger module https://dagger.cloud/dagger/traces/318291796c3f542d7f3194a4294a7a4e#418845bd7580ade6

spring wave Mar 21, 2025, 10:58 PM

#

looks like it ended up on an array maybe? those are currently not handled, might need something special like "select the Nth item"

#

also, yeah, at the moment you have to mention "dagger" for it to realize you want to use that module - ran into that to. try "lint the Dagger docs"

#

maybe it should be further scoped

smoky ocean Mar 21, 2025, 11:00 PM

#

was going to try that next - but got that panic first

#

there's a subtlety here btw, sometimes you want an API endpoint to a module; and sometimes you want an API endpoint from the context of hte module. At the moment Dagger doesn't clearly delineate the two.

Maybe the distinction becomes more important when we throw LLM and their environments the mix?

smoky ocean Mar 21, 2025, 11:33 PM

#

New video drop 🙂 https://x.com/solomonstre/status/1903226073938268361

Solomon Hykes (@solomonstre) on X

This video by @kylepenfound kind of blew my mind. He demystifies the concept of a coding agent, and shows how to build your own from scratch. From zero to "robot ships a feature" in one video 🤯

If you're curious about coding agents, but not sure where to start... Watch it! 🧵

forest reef Mar 22, 2025, 7:51 AM

#

Hello, I'm trying to create an simple assistant for simple Kubernetes issues. As with many real-world problems, it would be ideal to find a perfect solution and finish, but it seems necessary to be able to instruct human intervention or interruption at each attempt (e.g. LLM Call). From what I can see, dagger/agent currently only adjusts loops through prompts, but would more programmatic control be possible? (e.g. MaxTry or confirmation on every call?)

storm gate Mar 22, 2025, 5:29 PM

#

forest reef Hello, I'm trying to create an simple assistant for simple Kubernetes issues. As...

You have a couple of ways to proceed, first of all you don't have to let the LLM handle the main loop. We built demos doing both, and I prefer to keep the LLM loop small, as well as its toolset.

Then you handle the main logic in a bigger surrounding loop that will do things beyond what an LLM can do. For example call containers, call an API, or anything that the Dagger API can do outside of LLMs, etc... You can then include extra information when you re-call the LLM, which increases the LLM accuracy (tried with both OpenAI and Anthropic models).

Also note that even if the LLM tries several times with its own loop, you can limit it to a specific number of attempts by making it explicit in the prompt.

shrewd ermine Mar 22, 2025, 10:15 PM

#

I made a module for working with firecrawl (firecrawl.dev) since it seems pretty hot for LLMs + web scraping https://daggerverse.dev/mod/github.com/kpenfound/dag/firecrawl

firecrawl-dag :: Daggerverse

A module for working with Firecrawl (firecrawl.dev)

shrewd ermine Mar 22, 2025, 10:52 PM

#

fleet fiber Mar 23, 2025, 2:03 AM

#

@shrewd ermine thanks for the YT videos on Agents, I'm going through them now.
https://www.youtube.com/watch?v=VHUi9ABdASA
https://www.youtube.com/watch?v=B7P04M9c1m0

YouTube

Dagger

AI Agents in CI

This demo shows how an AI Agent can operate in a CI environment to assist in resolving test failures.

Code: https://github.com/kpenfound/greetings-api

Have questions? Ask us in Discord: https://discord.com/invite/dagger-io

▶ Play video

YouTube

Dagger

A Simple SWE Agent with Dagger

This demo shows off a simple agent that automatically creates new features in a demo project. Features are designed and assigned as GitHub issues and the agent creates a pull request with the completed work.

Code: https://github.com/kpenfound/greetings-api/blob/main/SWE_AGENT.md

Have questions? Ask us in Discord: https://discord.com/invite/da...

▶ Play video

spring wave Mar 23, 2025, 5:19 PM

#

finally figured out those cryptic "mismatched function call/response" shaped errors - it's when an LLM tries to call a tool that doesn't exist, we were dropping that on the floor

smoky ocean Mar 23, 2025, 7:06 PM

#

local git awareness 🧵

quiet ether Mar 23, 2025, 7:28 PM

#

spring wave finally figured out those cryptic "mismatched function call/response" shaped err...

I'm also getting these kind of errors quite often using Anthropic:

! POST "https://api.anthropic.com/v1/messages": 400 Bad Request {"type":"error","error":{"type":"invalid_request_error","message":"messages.33: `tool_use` ids were found without
│ ! `tool_result` blocks immediately after: toolu_01T2iDjHMNTfRJWtGAgqidc6. Each `tool_use` block must have a corresponding `tool_result` block in the next message."}}
! input: llm.setK3S.withPrompt.loop.setK3S.withPrompt.loop.setK3S.withPrompt.loop.setK3S.withPrompt.sync select: POST "https://api.anthropic.com/v1/messages": 400 Bad Request
! {"type":"error","error":{"type":"invalid_request_error","message":"messages.33: `tool_use` ids were found without `tool_result` blocks immediately after:
! toolu_01T2iDjHMNTfRJWtGAgqidc6. Each `tool_use` block must have a corresponding `tool_result` block in the next message."}}

seems like a 🐛 ?

spring wave Mar 23, 2025, 7:29 PM

#

yep, same issue, have a fix on my llm-evals branch

#

it's two issues: 1. that the model tried to make that call (bad prompting), 2. that we dropped the bad call and ended up with garbled history

quiet ether Mar 23, 2025, 7:38 PM

#

spring wave it's two issues: 1. that the model tried to make that call (bad prompting), 2. t...

I think I'm seeing something odd with the token caches where if I ask the LLM to run another thing that it ran before, it replies that it's going to do it but the tool actually never gets called. I'll try to get a repro

#

even if the dagger function has cache buster

quiet ether Mar 23, 2025, 8:28 PM

#

quiet ether even if the dagger function has cache buster

Maybe the dagql cache? 🤔

spring wave Mar 23, 2025, 8:56 PM

#

quiet ether Maybe the dagql cache? 🤔

yeah if it makes the same DagQL-level query multiple times it'll only show up in the trace once. I ended up adding a cache buster for my evals, and that worked

quiet ether Mar 23, 2025, 9:02 PM

#

spring wave yeah if it makes the same DagQL-level query multiple times it'll only show up in...

But don't we have function caching within the same session? I think I'm hitting that?

#

I have a cache buster within my function but the trace doesn't even show the initial function call

spring wave Mar 23, 2025, 9:03 PM

#

yep

#

that's true, cache busters technically have to be propagated all the way out now

#

i mean if we do a dagql persistent cache

quiet ether Mar 23, 2025, 9:04 PM

#

spring wave that's true, cache busters technically have to be propagated all the way out now

Yep... Seems like it

spring wave Mar 23, 2025, 9:05 PM

#

this might change with @steep onyx's work - he had to do something special for intra-session dagql cache hit telemetry

quiet ether Mar 23, 2025, 9:05 PM

#

We need to find a way to set a pragma at the function level to hint the engine that the function should never be cached

#

Or just disable function caching altogether in prompt mode 🤔

#

There's many edge cases here I think

#

I'll open an issue tomorrow

smoky ocean Mar 24, 2025, 12:51 AM

#

This? https://github.com/dagger/dagger/issues/7428

GitHub

Cache function calls by default · Issue #7428 · dagger/dagger

Problem Dagger has great caching, but Dagger Functions don't fully benefit from it, because their runtime containers are not cached. This has several consequences: Functions that perform comput...

quiet ether Mar 24, 2025, 1:04 AM

#

smoky ocean This? https://github.com/dagger/dagger/issues/7428

not exactly. In this particular case because I'm running a long-living Dagger session, the LLM is not being able to execute the same function twice with the same arguments since it'll always return the initial cached response

#

not sure what's the best way to handle that though. I'll open an issue to start the discussion tomorrow 🙏

smoky ocean Mar 24, 2025, 1:26 AM

#

you mentioned a pragma to disable caching of a function. That's part of the proposal in 7428. Are you thinking of a pragma that would be llm-specific?

quiet ether Mar 24, 2025, 1:35 AM

#

smoky ocean you mentioned a pragma to disable caching of a function. That's part of the prop...

yes, my initial thought was to make it llm-specific but TBH I haven't really thought about it too much in detail. Would it make sense to differentiate the caching (and potentially other properties) behavior based if the function is being called by an LLM or not? 🤔

smoky ocean Mar 24, 2025, 1:36 AM

#

quiet ether yes, my initial thought was to make it llm-specific but TBH I haven't really tho...

I would prefer not to, if possible. But we can figure it out in your issue. Maybe we have no choice? 🤷‍♂️

steep onyx Mar 24, 2025, 4:27 PM

#

quiet ether yes, my initial thought was to make it llm-specific but TBH I haven't really tho...

what's an example of a function where you'd want different caching specifically when called by an LLM?

smoky ocean Mar 24, 2025, 4:46 PM

#

@spring wave @worn hill @wraith remnant @warped bramble just to point out a major unresolved point between MCP and main branch: if our tool bindings implementation requires injecting a system prompt, it won't work over MCP. I know it's a tricky tradeoff. Just want to clarify that it's a high-impact problem to solve..

warped bramble Mar 24, 2025, 4:56 PM

#

Other question possibly related: do we care or not about MCP clients that don't update their tools list as we make more tools available dynamically ? Because I wonder if (maybe just a stop-gap) the tools we expose in MCP would be a static list of loader/getter/setter tools that are essentially an indirection on top of what LLMEnv would provide. (Keep in mind i'm not up to date with what the new multi object API should look like).

river belfry Mar 24, 2025, 5:13 PM

#

Just to share some fun stuff (at least to me 😅 )
I built a small agent that allows me to start a dev environment based on (any?) codebase. It will install everything I need, without to worry about it. Depending on the model you use it will even build an run the tests before to give the container back.
That's just a demo, so probably a lot of stuff to improve, but that's nice to play with.
dagger -c "dev-environment path/to/some/code | terminal"
(there's also an other task that is summarizing a subreddit, nothing related to the first one but also nice to test)
If you want to try it be careful to select the model you want, by default it's tuned to use some of my local models for a fully local experience, including models)
https://github.com/eunomie/local-agent

spring wave Mar 24, 2025, 5:55 PM

#

system prompting

smoky ocean Mar 24, 2025, 7:21 PM

#

@wraith remnant @warped bramble do you know if Cursor supports dynamic tool registration? Also I noticed in a MCP+Cursor video that the client asks for manual confirmation before each tool call. I wonder if that will be annoying with Dagger MCP, since that implies more intermediary internal calls

warped bramble Mar 24, 2025, 7:23 PM

#

smoky ocean <@274903880343748619> <@707011193814122506> do you know if Cursor supports dynam...

It doesn't support dynamic tool registration atm. And yes it asks for manual confirmation for every tool call. #welcomeToTheMCPJungle

wraith remnant Mar 24, 2025, 7:25 PM

#

warped bramble It doesn't support dynamic tool registration atm. And yes it asks for manual con...

yeah, not only doesn't it support dynamic tool registration but it gets confused by it -- still unsure if it's because of us or not

shrewd ermine Mar 24, 2025, 7:27 PM

#

warped bramble It doesn't support dynamic tool registration atm. And yes it asks for manual con...

is the manual confirmation part just a cursor thing?

warped bramble Mar 24, 2025, 7:28 PM

#

shrewd ermine is the manual confirmation part just a cursor thing?

so far yes

worn hill Mar 24, 2025, 8:33 PM

#

river belfry Just to share some fun stuff (at least to me 😅 ) I built a small agent that all...

this is cool, have you thrown any lower level language (rust, c) stuff at it? https://github.com/redis/redis or https://github.com/tree-sitter/tree-sitter would be fun examples

GitHub

GitHub - redis/redis: Redis is an in-memory database that persists ...

Redis is an in-memory database that persists on disk. The data model is key-value, but many different kind of values are supported: Strings, Lists, Sets, Sorted Sets, Hashes, Streams, HyperLogLogs,...

GitHub

GitHub - tree-sitter/tree-sitter: An incremental parsing system for...

An incremental parsing system for programming tools - tree-sitter/tree-sitter

smoky ocean Mar 24, 2025, 10:04 PM

#

Release checklist 🧵

smoky ocean Mar 24, 2025, 11:02 PM

#

@quiet ether re: your discord agent. Can you split it into 1) a discord module and 2) an example agent using it? We're trying to apply that model to all examples going forward, to maximize composability

#

(ideally that discord module would be reusable enough to be a basis for stdlib)

quiet ether Mar 24, 2025, 11:10 PM

#

smoky ocean <@336241811179962368> re: your discord agent. Can you split it into 1) a `discor...

Roger!

quiet ether Mar 24, 2025, 11:36 PM

#

quiet ether Roger!

@smoky ocean one thing that I was wondering is if i should try to make it work with multi-object by default. It's not a big deal through because I can otherwise wrap all the tools that I need in a single workspace and use single-object as we currently showcase in multiple demos

shrewd ermine Mar 24, 2025, 11:38 PM

#

what would the multiple objects be? discord client and _?

quiet ether Mar 24, 2025, 11:39 PM

#

shrewd ermine what would the multiple objects be? discord client and _?

for my case. Discord and GIthub client

#

and potentially a third object to send notifications to somewhere else besides Discord?

smoky ocean Mar 24, 2025, 11:43 PM

#

yes multi object. specifically multi object from prompt mode...

quiet ether Mar 24, 2025, 11:44 PM

#

smoky ocean yes multi object. specifically multi object from prompt mode...

👍

smoky ocean Mar 24, 2025, 11:44 PM

#

by default the builtin agent has access to your module's dependencies

#

(right @spring wave )

quiet ether Mar 24, 2025, 11:44 PM

#

smoky ocean by default the builtin agent has access to your module's dependencies

really? I thought I needed to initialize them beforehand

#

i.e foo=$(my-module)

smoky ocean Mar 24, 2025, 11:44 PM

#

not anymore 🙂 (at least that's the UX we want to enable)

#

install; prompt; boom it works

quiet ether Mar 24, 2025, 11:45 PM

#

is that v0.17.1?

smoky ocean Mar 24, 2025, 11:45 PM

#

of course that leaves the question of injecting config

#

which is why we need to dogfood asap

quiet ether Mar 24, 2025, 11:46 PM

#

the only thing I'm missing is the ability to set multiple -m flags then. So I can make it work in the prompt without even creating a module at all

smoky ocean Mar 24, 2025, 11:46 PM

#

quiet ether the only thing I'm missing is the ability to set multiple `-m` flags then. So I ...

ha ha that is something I've wanted forever

#

maybe now I will finally get it 🙂

#

but init & install is a good start i think

#

maybe in the future, we will have a first class concept of environment, which you could initialize list load etc

quiet ether Mar 24, 2025, 11:47 PM

#

smoky ocean maybe now I will finally get it 🙂

doesn't seem super hard to add?? 🤔 . I can try checking out if I can make it work after I finish my module

shrewd ermine Mar 24, 2025, 11:48 PM

#

quiet ether doesn't seem super hard to add?? 🤔 . I can try checking out if I can make it wo...

i think it might not be so simple because multiple -m means each one is a dependency, and a single -m means you are that module

spring wave Mar 24, 2025, 11:48 PM

#

smoky ocean (right <@108011715077091328> )

yep, it starts scoped to the toplevel Query now so it can call your module's constructor, and I think dependencies too, but actually not 100% sure - I remember we do things to avoid leaking module dependencies

smoky ocean Mar 24, 2025, 11:48 PM

#

quiet ether doesn't seem super hard to add?? 🤔 . I can try checking out if I can make it wo...

my guess is it will be hard . single module is probably a baked in assumption everywhere in the cli code

#

but would love to be wrong

quiet ether Mar 24, 2025, 11:50 PM

#

shrewd ermine i think it might not be so simple because multiple `-m` means each one is a depe...

true.. will 👀 🙏

quiet ether Mar 24, 2025, 11:50 PM

#

spring wave yep, it starts scoped to the toplevel `Query` now so it can call your module's c...

is this v0.17.1 Alex?

spring wave Mar 24, 2025, 11:51 PM

#

yes i think so

#

https://github.com/dagger/dagger/pull/9931

GitHub

LLM: `Query` access, `$_` setting, un-deprecate APIs by vito · Pul...

A few experiments:

Add LLM.withQuery for making the whole Query schema available to the LLM

loadFooFromID fields are skipped

CLI prompt starts from Query so you can jump right into action, with...

#

oh also - $_ is a thing, it'll always be the last object that the LLM operated on / "returned"

#

there may be some bikeshedding to do there, but i think it's an important mechanic

quiet ether Mar 24, 2025, 11:55 PM

#

spring wave oh also - `$_` is a thing, it'll always be the last object that the LLM operated...

❤️ I've awlays wanted a $LAST kind of thing. Wondering if we thought about using bash's !!?

spring wave Mar 24, 2025, 11:56 PM

#

that's the one that gets substituted with the last input right? so you don't have to go back and edit?

#

(i'm a fishy kinda guy)

worn hill Mar 24, 2025, 11:56 PM

#

correct, but you can also use it in context, like i do with-dev go test ... and then when i rebuild i do dev && !!

#

(also im on zsh, and fairly certain that alias is POSIX, more ancient than even bash)

spring wave Mar 24, 2025, 11:58 PM

#

right right

quiet ether Mar 24, 2025, 11:58 PM

#

yep, exactly

worn hill Mar 24, 2025, 11:58 PM

#

!$ is another one that's probably very useful in a dagger shell context, just the last arg of the last command

spring wave Mar 24, 2025, 11:58 PM

#

what's the prevailing use case? !! | with-foo?

worn hill Mar 24, 2025, 11:59 PM

#

yeah, either append or prepend

spring wave Mar 24, 2025, 11:59 PM

#

i guess that can be $_ | with-foo once we support $_ in shell too (not just LLM response)

worn hill Mar 24, 2025, 11:59 PM

#

i generally use prepend more, though, because append can also be <up arrow> | with-foo

quiet ether Mar 24, 2025, 11:59 PM

#

in bash !! actually re-computes the last command, doesn't hold the actual value

worn hill Mar 25, 2025, 12:00 AM

#

oh yeah it's unevaluated

spring wave Mar 25, 2025, 12:00 AM

#

right it's more like a macro than a var

worn hill Mar 25, 2025, 12:00 AM

#

yeah

quiet ether Mar 25, 2025, 12:00 AM

#

so you'd do container | from alpine

and then something like

myfunc --ctr $(!!)

spring wave Mar 25, 2025, 12:00 AM

#

and then your history will contain myfunc --ctr $(container | from alpine) right?

worn hill Mar 25, 2025, 12:01 AM

#

lol that's a big "depends" i think

quiet ether Mar 25, 2025, 12:01 AM

#

spring wave and then your history will contain `myfunc --ctr $(container | from alpine)` rig...

yes, correct

worn hill Mar 25, 2025, 12:01 AM

#

at least on my config it saves the !! unless you tab-expand

quiet ether Mar 25, 2025, 12:01 AM

#

worn hill at least on my config it saves the `!!` unless you tab-expand

I don't get any !! in the history

worn hill Mar 25, 2025, 12:02 AM

#

oh you're right... huh

#

nm i'm making shit up

shrewd ermine Mar 25, 2025, 1:30 AM

#

@spring wave what does _scratch do and why does it get called so often in prompt mode?

spring wave Mar 25, 2025, 1:30 AM

#

it resets the current state to nil, so there aren't any per-object tools

#

i've gotten rid of it on llm-evals

shrewd ermine Mar 25, 2025, 1:31 AM

#

lol so basically a table flip, got it

spring wave Mar 25, 2025, 1:32 AM

#

lol, yea pretty much. curious how it does on the llm-evals tool scheme. is it easily reproducible?

shrewd ermine Mar 25, 2025, 1:32 AM

#

yeah i'm just setting source=$(directory | with-directory / .) and asking gemini to make changes to my project

spring wave Mar 25, 2025, 1:33 AM

#

ah gemini specifically is the model i've been fixing, lemme try. are you using a particular agent? or that's it?

shrewd ermine Mar 25, 2025, 1:34 AM

#

just prompt mode right now, trying to work on a no-code experience

woeful quiver Mar 25, 2025, 1:36 AM

#

Ok, worked a generic sql module using Go structs, I've essentially broken the "workspace" with the LLM. Its trying to use the SqlTableDetails@xxh3:0735eeb380a1522f as the table name?

Trace: https://v3.dagger.cloud/JasonDagger/traces/929aee5c6d2e1152969edcc861f2325f

Code: https://github.com/jasonmccallister/sql/blob/09b96e193a85a93826a532168119628b8efe8492/main.go#L100

wraith remnant Mar 25, 2025, 1:37 AM

#

spring wave lol, yea pretty much. curious how it does on the `llm-evals` tool scheme. is it ...

Quick question(s): On your llm-evals branch, after calling a tool, do you automatically get scoped to that tool or that tool's result or do you stay in the same original scope that gets updated with select + typeName?

spring wave Mar 25, 2025, 1:38 AM

#

wraith remnant Quick question(s): On your llm-evals branch, after calling a tool, do you autom...

when a tool returns an object, it auto-selects it yes

#

so theoretically selectFoo is only for 1) when you have no current selection or 2) when you want to go back to a different/older object

#

but, some models are keen to call it redundantly. those ones need a system prompt 😦

#

tried everything: putting something in the description => ignored. putting an explicit hint in the output => ignored. having selectFoo return an error if it was redundant => it just keeps doing it

wraith remnant Mar 25, 2025, 1:40 AM

#

spring wave when a tool returns an object, it auto-selects it yes

Ok, and this is the same behaviro on the single object, BUT, the user can choose to go back to the original env (in the shell) with llm | with-hello $(.) | with-prompt "spin up an alpine container" | hello where hello brings back to the original env scope?

woeful quiver Mar 25, 2025, 1:40 AM

#

woeful quiver Ok, worked a generic `sql` module using Go structs, I've essentially broken the ...

switching to JSON string was a little better: https://dagger.cloud/JasonDagger/traces/79a96b344ad7c040ffcd4c0e861ef0a1

shrewd ermine Mar 25, 2025, 1:40 AM

#

Ok, worked a generic sql module using

spring wave Mar 25, 2025, 1:41 AM

#

wraith remnant Ok, and this is the same behaviro on the single object, BUT, the user can choose...

the | hello at the end there will access the last value selected by the LLM, which must be a Hello object, otherwise it'll fail

woeful quiver Mar 25, 2025, 2:18 AM

#

Is there a consensus on the best model to use in ollama for tool calling? List looks long and forum/reddit are a little all over the place in terms of recommendations: https://ollama.com/search?c=tools

shrewd ermine Mar 25, 2025, 2:37 AM

#

woeful quiver Is there a consensus on the best model to use in ollama for tool calling? List l...

In that list I've mostly used llama and qwen2.5-coder

river belfry Mar 25, 2025, 8:18 AM

#

worn hill this is cool, have you thrown any lower level language (rust, c) stuff at it? h...

I tried, and the results are... not consistent.
I can use it to work on a rust project for instance, no problem.
But on complex codebases, the result will really depend on the model. With my local qwen2.5 it works well on small codebases.
But tree-sitter for instance (that also contains bindings to other languages) will not be good.
If I switch to openai/gpt-4o (I kept defaults) it works great, install cargo, npm, install some npm tools, build it before to open the terminal
I'd love to have the same thing fully locally, but I probably need a bigger machine and a bigger model for that 😅

somber vault Mar 25, 2025, 1:12 PM

#

Figuring out the deployment part, would appreciate any advice. Rest is working PERFECTLY!
So far the setup:

I want my app containerized to simplify running under k8s, ECS whatnot.
My app inside container calls dagger engine itself.
Ideally some images come cached within dagger engine inside the app container.

What I'm doing right now:
use dind as base and install Dagger + UV

LOAD_CACHE = """
import anyio
import dagger
from dagger import dag

async def main():
    async with dagger.connection():
        print("inside async with dagger")
        container = dag.container().from_("oven/bun:1.2.5-alpine")
        result = await container.with_exec(["bun"])
        print(await result.stdout())


if __name__ == "__main__":
    anyio.run(main)
""".strip()


base = (
    dag.container()
    .from_("docker:28-dind")
    .with_exec(["apk", "add", "curl"])
    .with_exec(["apk", "add", "python3"])
    .with_exec(["curl", "-fsSL", "https://dl.dagger.io/dagger/install.sh", "-o", "/tmp/install.sh"])
    .with_exec(["sh", "-c", "BIN_DIR=/usr/local/bin sh /tmp/install.sh"])
    .with_exec(["curl", "-fsSL", "https://astral.sh/uv/install.sh", "-o", "/tmp/install.sh"])
    .with_exec(["sh", "-c", "XDG_BIN_HOME=/usr/local/bin INSTALLER_NO_MODIFY_PATH=1 sh /tmp/install.sh"])
)


runtime = (
    base
    #.with_exec(["sh", "/usr/local/bin/dockerd-entrypoint.sh"], insecure_root_capabilities=True)
    .with_workdir("/app")
    .with_new_file("/app/load_cache.py", LOAD_CACHE)
    .with_exec(["uv", "run", "load_cache.py"], insecure_root_capabilities=True)
)

Whats the best course of action?

nova bronze Mar 25, 2025, 2:07 PM

#

Figuring out the deployment part, would

lean mural Mar 25, 2025, 6:02 PM

#

Is there a generalize workspaces that folks are using for agents yet or are most folks hand rolling each time? I tried @shrewd ermine 's module from the daggerverse, but also noticed that it's not used in his agents demos

shrewd ermine Mar 25, 2025, 6:04 PM

#

lean mural Is there a generalize workspaces that folks are using for agents yet or are most...

I've mostly made one for each demo that's tailored for the individual use cases. I suspect when multi-object + function masking is in it'll be less relevant.

lean mural Mar 25, 2025, 6:06 PM

#

yeah i keep finding that the agent's context grows huge and the task fails out before it finishes. suspect i'll need to hand roll a workspace module

shrewd ermine Mar 25, 2025, 6:08 PM

#

yeah exactly, the workspace pattern is perfect for that. I tried to make a generalized one with kpenfound/dag/workspace but I still ended up needing changes for each implementation. Maybe function masking on top of a very generalized workspace would be a solution

merry scarab Mar 25, 2025, 6:09 PM

#

I do think we should ship a default workspace with dagger init or something just to help people get off on the right foot

shrewd ermine Mar 25, 2025, 6:09 PM

#

another case for init templates 😛

#

Environments (#1352023893543747754 message) might also replace this

#

✨ new stuff https://blog.google/technology/google-deepmind/gemini-model-thinking-updates-march-2025

Google

Gemini 2.5: Our most intelligent AI model

Gemini 2.5 is our most intelligent AI model, now with thinking.

merry scarab Mar 25, 2025, 6:12 PM

#

shrewd ermine `Environments` (https://discord.com/channels/707636530424053791/1352023893543747...

Thanks! I missed this whole discussion lol, compeltely on board with the idea

fleet fiber Mar 25, 2025, 6:43 PM

#

Was talking Silicon Valley (HBO show) with a friend today and he told me about Windsurf's commercial that has Russ from the show on it. Hilarious. BUT, what Windsurf's founders describe is where I think Dagger might be headed "Both collaborative and independently powerful"

https://youtu.be/3xk2qG2QPdU?si=x5L63_3k2DDuSv7z&t=44

YouTube

Codeium - Windsurf

The Windsurf Editor (World’s First Agentic IDE)

Introducing the Windsurf Editor - the world’s first agentic IDE. 🏄

In Windsurf, we have given the AI a previously unseen combination of deep codebase understanding, access to a powerful set of tools, and real time access to your actions. The result? A magical experience we call Cascade, the evolution of chat that keeps you truly in the flo...

▶ Play video

warped bramble Mar 25, 2025, 7:05 PM

#

I'm getting 503 errors using Gemini, is it just me ?

#

nvm, came back

shrewd ermine Mar 25, 2025, 7:06 PM

#

yep I hit that too

uneven depot Mar 25, 2025, 8:51 PM

#

fleet fiber Was talking Silicon Valley (HBO show) with a friend today and he told me about ...

Path to tres commas!

smoky ocean Mar 25, 2025, 9:28 PM

#

Cloudflare agent stuff

smoky ocean Mar 26, 2025, 12:41 AM

#

@spring wave what does LLMEnv.intern() do exactly? Re-entrant ingestion into "ID system" + return ingested ID? So I have to call it at least once to ingest the value, but then I can safely call it several times and get the same result, without side effects on the env state?

#

(context: rebasing my Environment API branch on planet-eval 🙂 )

#

https://tenor.com/view/aladdin-jasmine-carpet-flying-moon-gif-22582378

Tenor

spring wave Mar 26, 2025, 12:44 AM

#

smoky ocean <@108011715077091328> what does `LLMEnv.intern()` do exactly? Re-entrant ingesti...

yep!

#

the PR is ready for approval now

smoky ocean Mar 26, 2025, 12:48 AM

#

@spring wave how do you use the expectedType argument in Get()?

spring wave Mar 26, 2025, 12:49 AM

#

smoky ocean <@108011715077091328> how do you use the `expectedType` argument in `Get()`?

it's to handle the potential case where we just get a number from the model - that type will get auto-prefixed onto it to normalize it. So if you have an expected type (because you're being called in the context of an ID arg, or you're being called from selectFoo, etc.) - be sure to populate it

#

it's hard to consistently convince a model to make that mistake, so it's kind of "best effort" atm, may need refinement as we test more (for example, to handle 1 vs. "1")

#

it would probably also make sense to assert that the value matches that type name, but that's already handled other places so I didn't bother

smoky ocean Mar 26, 2025, 12:51 AM

#

I see everything is locked to objects, I'm guessing it wont be too hard to expand to any dagql.Typed in the future? There use to be a check for objects in some parts

spring wave Mar 26, 2025, 12:52 AM

#

it might make sense to do that again yeah, but it might also make sense to still keep non-Object types in a separate spot since the mechanics are so different. I split it up at a time where string vars were moved out into the LLM (because they were reduced to just prompt vars), but now they're back in the LLMEnv, and I just gave them their own map to keep it tidy

#

make sure you pull, you might not have those changes

#

i was going to ask earlier: do we anticipate passing other types as variables, or only strings? other scalars are easy, but arrays are where things get complicated

smoky ocean Mar 26, 2025, 12:56 AM

#

At least other scalars yeah. Didn't think about arrays, might not be worth it

spring wave Mar 26, 2025, 12:57 AM

#

are scalar values preserved in shell? or are they all strings?

#

foo=1 is a string i'd imagine

#

(though there may be other ways to set these for sure)

smoky ocean Mar 26, 2025, 12:57 AM

#

My immediate concern for the env API, is splitting LLMEnv in two halves: a Dagger-facing backend called Environment, and a LLM-facing frontend called MCP. So trying to sort the implementation in those 2 buckets

#

Implementation as I understand is mostly unaffected (besides being split in two), except for the functionMask part, which will move to the individual binding instead of just the current selected object

spring wave Mar 26, 2025, 12:59 AM

#

oh right, I want to try functionMask again, I had it almost working but the model ended up just getting confused, so I shelved it :/ (but kept some of the code intact for when I get back to it)

smoky ocean Mar 26, 2025, 1:00 AM

#

The Type#number system will be in the backend Environment. But things like the concept of "current" object, the specific string replies, tool hints etc, move to MCP

spring wave Mar 26, 2025, 1:00 AM

#

I'm tempted to try another model where instead of having a "current object" you gradually increase your scope of available functions and explicitly pass a self argument

smoky ocean Mar 26, 2025, 1:01 AM

#

Hopefully Environment can stabilize while MCP keeps iterating like crazy on best LLM interface

spring wave Mar 26, 2025, 1:01 AM

#

spring wave I'm tempted to try another model where instead of having a "current object" you ...

my concern here is I'm not sure the "state machine" concept would survive the transition

smoky ocean Mar 26, 2025, 1:01 AM

#

(to be clear I'm using MCP in a loose sense. It would encapsulate both actual MCP protocol implementation, and BBI. I'm thinking we can just rebrand our BBI as "MCP", really a sort of "MCP+" 🙂

#

I feel like the longer we wait before we split, the harder it will get

spring wave Mar 26, 2025, 1:02 AM

#

looking forward to some code so this can start to become concrete in my head 😛

smoky ocean Mar 26, 2025, 1:02 AM

#

yeah sorry, the last week or so I haven't been able to look at the code at all

#

thanks for keeping that core going

quiet ether Mar 26, 2025, 3:43 AM

#

👋 how's the load secret from env var tools coming?

✔ call the Discord API and set the bot token using the DISCORD_TOKEN environment variable 5.0s
│🧑 call the Discord API and set the bot token using the DISCORD_TOKEN environment variable
│ ┃ 0.0s
│
│🤖 To work with the Discord API using the provided tools, I'll need to call  Query_discord  which requires a bot token parameter. The token should be provided through the environment, but I don't
│ ┃ have direct access to environment variables. Let me let you know what's needed:
│ ┃
│ ┃ The  Query_discord  function requires a bot token and this needs to be provided as a SecretID. You'll need to first create a secret with the Discord token and then use that secret with the
│ ┃ Discord API call.

quiet ether Mar 26, 2025, 3:44 AM

#

quiet ether 👋 how's the `load secret from env var` tools coming? ``` ✔ call the Discord ...

✔ load the secret from the DISCORD_TOKEN env variable 12.4s
│🧑 load the secret from the DISCORD_TOKEN env variable
│ ┃ 0.0s
│
│🤖 I can help you load a secret using the name "DISCORD_TOKEN". I can use the  Query_loadSecretFromName  function for this purpose.
│ ┃ 2.7s ◆ Input Tokens: 2,427 ◆ Output Tokens: 95
│
│ ✔ loadSecretFromName(name: "DISCORD_TOKEN"): Secret! 0.0s
│
│🤖 Now I can retrieve the information about this secret:
│ ┃ 4.0s ◆ Input Tokens: 628 ◆ Output Tokens: 48
│
│🤖 3.3s ◆ Input Tokens: 702 ◆ Output Tokens: 37
│
│ ✘ Secret.name: String! 0.0s
│ ! secret not found: xxh3:a87f0af2887099fc
│
│🤖 I apologize, but it seems there was an error retrieving the secret. This could mean that either:
│ ┃
│ ┃ 1. The secret named "DISCORD_TOKEN" doesn't exist in the environment

#

smoky ocean Mar 26, 2025, 6:04 PM

#

@spring wave @warped bramble @shrewd ermine @wraith remnant can we talk live about llm release in a little bit?

spring wave Mar 26, 2025, 6:09 PM

#

smoky ocean <@108011715077091328> <@707011193814122506> <@135620352201064448> <@274903880343...

@worn hill and i are pairing in team-audio, feel free to drop in

smoky ocean Mar 26, 2025, 6:12 PM

#

spring wave <@430802613848506380> and i are pairing in team-audio, feel free to drop in

cool, need another 30mn or so

quiet ether Mar 26, 2025, 6:21 PM

#

This actually gave an idea. How about a flag along with "-c" so the shell doesn't automatically exits after running the commands? Python's REPL supports this by adding the -i flag

warped bramble Mar 26, 2025, 6:25 PM

#

@quiet ether FYI i have an idea to make MCP work with multiple modules statically (not yet dynamically), could be in a follow up.

spring wave Mar 26, 2025, 7:00 PM

#

smoky ocean cool, need another 30mn or so

actually, sorry, i have to do an errand and will probably miss that timing

smoky ocean Mar 26, 2025, 7:00 PM

#

No problem, me too 😛

#

Want to ping us when you're back? The shell launch tornado will probably be over by then

#

Can you still talk today?

spring wave Mar 26, 2025, 7:03 PM

#

smoky ocean Can you still talk today?

yep! just have to dip out for a couple of hours

river belfry Mar 26, 2025, 7:21 PM

#

Are SetXxxx(name, value) and WithXxxx(value) equivalent?
I thought WithXxx was deprecated, but maybe not anymore?
What's the impact regarding tools? When I use WithXxxx I can see the tools (for instance using .LLM().Tools()) but not when doing the same thing with SetXxxx.
So I guess I'm missing something here, but not sure what 😉

smoky ocean Mar 26, 2025, 7:33 PM

#

river belfry Are `SetXxxx(name, value)` and `WithXxxx(value)` equivalent? I thought `WithXxx`...

it's work in progress... APIs not yet stabilized

#

The way it currently works is that both work, and are layered. There is a concept of "selected object", which you can set directly with WithXXX. The LLM can also change its own selection with internal tools. At a higher layer on top of that, there are named bindings (variables) which can be set in the LLM environment (SetXXX). The LLM can list them and select them

#

We are actively working to simplify this API. It's tricky because there are several variables in the equation:

Best DX for the developer (ie. you)
Best performing LLM interface (how the bindings are presented to the LLM, lots of tricks and iterations there)
MCP support. We want the same system to work over LLM and MCP protocols.
Keeping modules up-to-date with API changes (examples, early prototypes etc)

#

See #1352023893543747754 for proposed direction

#

@river belfry when you do WithXxx("foo", bar) the object bar is available at the named binding "foo". I think in the current implementation there is a special tool called _objects or _list or something like that, and it will list the bindings

quiet ether Mar 26, 2025, 8:16 PM

#

just to validate array types are not very well handled by the dagger llm yet, right? Getting some panics when try to call function which return those

shrewd ermine Mar 26, 2025, 8:17 PM

#

correct, I think arrays of basic types are fine but not arrays of objects

spring wave Mar 26, 2025, 8:36 PM

#

quiet ether just to validate array types are not very well handled by the dagger llm yet, ri...

This works on my llm-evals pr, which still needs a ✅

spring wave Mar 26, 2025, 10:07 PM

#

spring wave This works on my llm-evals pr, which still needs a ✅

Merged!

spring wave Mar 26, 2025, 10:45 PM

#

@smoky ocean half baked idea related to return values and whether there's an implicit current state / return value, and building on the idea of letting it know upfront what type of value we want back: something like withFileSlot("bin", "The compiled binary.") - the goal being for the model to fill all the "slots"(?) before returning. Those could then all be synced back to the shell. Dunno about the code DX, I have a feeling the functional model of "many inputs in, one input out" may still be more intuitive (I generally prefer schemes that don't require you to make up names for things if you only have one thing), but something to think about

smoky ocean Mar 26, 2025, 10:46 PM

#

spring wave <@488409085998530571> half baked idea related to return values and whether there...

Could be a property of the binding, set with an optional to withBinding

#

mm but "returning" a binding is weird

#

but so it "returning a slot" 😛

spring wave Mar 26, 2025, 10:47 PM

#

lol, i thought of it more like filling a slot but am not bound (heh) to the word at all

#

since there may be multiple of them

#

there's probably a better metaphor

#

what I really don't want is mutable bindings

smoky ocean Mar 26, 2025, 10:48 PM

#

I think if LLM familiarity matters, we should go for something very very present in the training data, like returning or exporting

#

Or printing 🙂 (we don't have to actually print it)

#

Or showing ?

spring wave Mar 26, 2025, 10:48 PM

#

yeah probably comes down to what the evals say

smoky ocean Mar 26, 2025, 10:48 PM

#

What would a human user do

#

It would probably enter the values in a form

#

You could almost say it would be prompted to enter a value 😛

#

oh god we crossed the streams

#

I mean that is how they win in the end

#

https://tenor.com/view/ghostbusters-dont-cross-the-streams-shoot-gif-15786226

Tenor

warped bramble Mar 27, 2025, 12:04 AM

#

FYI https://github.com/dagger/dagger/pull/9983, i managed to test it with .model claude / .model openai in the shell. cc @smoky ocean

woeful quiver Mar 27, 2025, 12:15 AM

#

Dumb idea, but a user with a form would be to submit or even.. enter?

#

submitBinding

#

We’re all going to submit to them in the end.. so might as well adapt now?

#

enterBinding 🤘

worn hill Mar 27, 2025, 1:09 AM

#

https://tenor.com/hQCe4i9FNeZ.gif

Tenor

#

https://github.com/dagger/dagger/pull/9978 @spring wave and i got a life alert prototype up and running today, so far pretty impressive at least with gemini, gonna try some other models tomorrow

GitHub

life alert: the _human_help tool so LLMs know they can ask clarifyi...

sorta working? if i ask gemini to use the tool, it does, prompts me, and reads my responses afaict. gemini also happens to ask pretty good clarifying questions, like maybe better than the ones it d...

spring wave Mar 27, 2025, 1:53 AM

#

knowing the desired return type

smoky ocean Mar 27, 2025, 2:43 AM

#

https://ai.google.dev/gemma/docs/capabilities/function-calling

Google AI for Developers

Function calling with Gemma | Google AI for Developers

shrewd ermine Mar 27, 2025, 2:56 AM

#

Definitely eager to try this one. I've seen lots of good things about gemma3 but in the few prompts I've thrown at it, I'm not 100% convinced yet. But I'm a gemini fan so I know they can do it 🙂

spring wave Mar 27, 2025, 3:19 AM

#

I'm tinkering with function masks again and it's working suspiciously well... tempted to sneak it in. It saves SO many tokens (2849 vs 13,470), and in turn makes things run much faster especially with Claude. Also cool to see the LLM planning ahead.

#

(that mask-less run ended with "overloaded"... I have been seeing that a lot)

lilac dagger Mar 27, 2025, 4:35 AM

#

Hi, first post here: I tried using o3-mini of openai but got the error in the pic which shows parallel_tool_calls is not supported by the model yet, and parallel_tool_calls is true by default per https://platform.openai.com/docs/api-reference/chat/create#chat-create-parallel_tool_calls. Actually there is a recent issue on openai forum claiming o3-min tool calling issue (https://community.openai.com/t/o3-mini-api-with-tools-only-ever-returns-1-tool-no-matter-prompt/1112390/3), so it seems we cannot expect to use parallel_tool_calls for o3-mini any time soon. Alternatively, thinking of turning the option off, as I tracked down the dagger implementation which doesn't seem to allow custom params for the api (https://github.com/dagger/dagger/blob/20e8a174fd9e45c7ae915d091167aa7ef18d822a/core/llm_openai.go#L117), so parallel_tool_calls is true by default and not configurable from client side. So it seems a deadend for o3-mini unless I missed anything ? Is there any workaround or is allowing custom parameters for llm api worth being supported in near term ?

GitHub

dagger/core/llm_openai.go at 20e8a174fd9e45c7ae915d091167aa7ef18d82...

An open-source runtime for composable workflows. Great for AI agents and CI/CD. - dagger/dagger

merry scarab Mar 27, 2025, 10:22 AM

#

Has anyone been able to get their agent to understand how to find a trace url for the work its doing?

I am trying to have this URL included in a markdown file that the my agent is creating, but sadly right now it just says something along the lines of - **Dagger Cloud Trace**: N/A (local testing)

river belfry Mar 27, 2025, 2:06 PM

#

While trying to use mistral-nemo I've got:
After the optional system message, conversation roles must alternate user/assistant/user/assistant/...
Is it something worth investigating (or is that something already known)?

merry scarab Mar 27, 2025, 3:35 PM

#

Has anyone seen this error from (I think anthropic)?

input: daggerverseQa.doQa received error while streaming: {"type":"error","error":{"details":null,"type":"overloaded_error","message":"Overloaded"}     }

shrewd ermine Mar 27, 2025, 3:36 PM

#

Yeah, I think they just need a break, same idea as google's 429

merry scarab Mar 27, 2025, 3:39 PM

#

I have a feeling my demo is not going to go well 😦

shrewd ermine Mar 27, 2025, 3:39 PM

#

usually they're not overloaded for long so 🤞

spring wave Mar 27, 2025, 3:40 PM

#

I hit that pretty frequently with Claude :/ @merry scarab have you tried your demo with Gemini? it'll be much better with v0.17.2, super fast and no throttling/overloaded errors

merry scarab Mar 27, 2025, 3:40 PM

#

spring wave I hit that pretty frequently with Claude :/ <@920499459484418068> have you tried...

No I have not yet but like 5 min to show time so....

#

I dont have a gemini account either, yet ..

smoky ocean Mar 27, 2025, 3:41 PM

#

switch to openai?

merry scarab Mar 27, 2025, 3:41 PM

#

openai has other errors - it tells me to f off because of a 30k token limit or something

#

Ugh........ I am so sad. This was working overall before but now its not writing my file again 😭

why does this always seem to happen right before a demo

#

nvm it works!

#

🤞

#

The overloaded thing gets cached lol -- this is where I really wish I could have a flag or somethign to tell shell to DOIT or something

shrewd ermine Mar 27, 2025, 3:54 PM

#

I've run into this too, we should not cache those errors

smoky ocean Mar 27, 2025, 3:59 PM

#

Masked functions 🧵

spring wave Mar 27, 2025, 4:00 PM

#

smoky ocean Masked functions 🧵

(possible early thread deduping: https://discord.com/channels/707636530424053791/1354656055925149716)

wraith remnant Mar 27, 2025, 4:38 PM

#

custom params

smoky ocean Mar 27, 2025, 5:21 PM

#

🚨🚨🚨 Dagger 0.17.2 is out, with many improvements to the LLM API. Make sure to upgrade and try running your agents again! Let us know if you see any issues or improvements!

woeful quiver Mar 27, 2025, 6:17 PM

#

Updated my agents to v0.17.2 - everything works as expected. Any specific APIs we should try/test out?

subtle surge Mar 27, 2025, 6:18 PM

#

woeful quiver Updated my agents to v0.17.2 - everything works as expected. Any specific APIs w...

@spring wave

spring wave Mar 27, 2025, 6:28 PM

#

woeful quiver Updated my agents to v0.17.2 - everything works as expected. Any specific APIs w...

not really! there are some additive APIs but they're mostly meta/fringe things, it's mostly just whether your existing agents are still working 😛

woeful quiver Mar 27, 2025, 6:29 PM

#

spring wave not really! there are some additive APIs but they're mostly meta/fringe things, ...

yeah working just fine, nice work 💪

worn hill Mar 27, 2025, 6:35 PM

#

soooo @quiet ether @spring wave @smoky ocean the "we want dagger install'd deps to be available to the LLM" thing: this is obviously desirable for local modules, should it also apply to remote modules? eg dagger -m toy-programmer shell should have an LLM env where toy-workspace is callable? remotes-depending-on-remotes, too? like dagger -m dagger/dagger should have gerhard/daggerverse/notify?

smoky ocean Mar 27, 2025, 6:41 PM

#

worn hill soooo <@336241811179962368> <@108011715077091328> <@488409085998530571> the "we ...

Great question. I mentioned in another thread (can't find it of course) that we've always had this ambiguity between "query a module from the outside" and "query from inside a module". We've never had to resolve it, but it started gently biting us in the shell, and now even more 🙂

worn hill Mar 27, 2025, 6:56 PM

#

imma just try to have the llm always feel like it's inside the module regardless of local/remote status

smoky ocean Mar 27, 2025, 7:15 PM

#

worn hill imma just try to have the llm always feel like it's inside the module regardless...

Makes sense to me

#

That way it will match the shell .cd

spring wave Mar 27, 2025, 7:26 PM

#

portalfrom #1354880578390065162 message - here's a delegateable task: retry logic in the LLM loop. Each provider implementation checks for certain retryable errors, annotates the error response as such (wrapping error type), outer loop checks for it and retries, backing off as appropriate

#

i'd wager that's pretty high priority

#

oh @shrewd ermine opened an issue 🙏 - https://github.com/dagger/dagger/issues/9970

shrewd ermine Mar 27, 2025, 7:38 PM

#

debugging a thing in the gemini client implementation https://github.com/dagger/dagger/blob/main/core/llm_google.go#L221

if candidate.Content == nil {
  return nil, fmt.Errorf("no content?")
}

I think maybe we need to continue in this case rather than error. Anyone else have more context on cases when the Content is nil (with streaming)

wraith remnant Mar 27, 2025, 7:44 PM

#

shrewd ermine debugging a thing in the gemini client implementation https://github.com/dagger/...

do you have a trace ? 👀🙏

shrewd ermine Mar 27, 2025, 7:45 PM

#

wraith remnant do you have a trace ? 👀🙏

I don't, I haven't hit it on my end yet but im trying to make a repro

spring wave Mar 27, 2025, 7:59 PM

#

shrewd ermine debugging a thing in the gemini client implementation https://github.com/dagger/...

I ran into this and saw we were getting FinishReason=10 which is something like "the model generated a malformed function call" - but couldn't find any more info, can't even see said malformed call in the dump, but may be able to see it on raw network traffic if that were inspectable

shrewd ermine Mar 27, 2025, 8:01 PM

#

ah got it, yeah I don't think we're handling FinishReason at all right now are we

river belfry Mar 27, 2025, 8:03 PM

#

I need to explore more, but on the same agent, same (local) model, I have worse results with the 0.17.2 than I had before. 🤔
I was running a version based on commit a2aaf08158a64bc47e4d3fe143701b9dbb88d885 that was pretty good. Not sure what changed, I'll have a look at the llm related commits between this commit and the release.

spring wave Mar 27, 2025, 8:19 PM

#

shrewd ermine ah got it, yeah I don't think we're handling `FinishReason` at all right now are...

right not at all. For extra fun, the genai package's consts don't even go up to 10 🙃 - I had to look up their API

spring wave Mar 27, 2025, 8:19 PM

#

river belfry I need to explore more, but on the same agent, same (local) model, I have worse ...

do you have code anywhere I can try out?

shrewd ermine Mar 27, 2025, 9:03 PM

#

once I write a PR review agent from my demo repo I'm going to have a review bake-off between the different big models 😎

subtle surge Mar 27, 2025, 9:09 PM

#

shrewd ermine once I write a PR review agent from my demo repo I'm going to have a review bake...

please record 😍

worn hill Mar 27, 2025, 9:15 PM

#

worn hill soooo <@336241811179962368> <@108011715077091328> <@488409085998530571> the "we ...

so i've been poking around for a couple hours now and i cannot for the life of me figure out where to rip open a seam to mix module dependencies into LLMEnv. i think i've found a couple of the relevant pieces, like LLMHook.InstallObject exposes objects in the LLM env, core/schema/modulesource.go has pieces that iterate through module dependencies... ModSource.lazilyLoadSchema even calls mod.Install on each module in a ModDeps (although that feels like maybe a different meaning of Install)... coming from the outside, shell_fs.go has maybeLoadModule for bringing in modules, but that's got all its own definitions of modules that don't map to core types, and there's a lot of indirection between those shell modules and the LLM install hook.

where would you start with this? @spring wave @hidden tartan it feels somewhat related to codegen activities, but the calling context is very different

smoky ocean Mar 27, 2025, 9:16 PM

#

worn hill so i've been poking around for a couple hours now and i cannot for the life of m...

cc @steep onyx @shrewd fern 🙏

spring wave Mar 27, 2025, 9:23 PM

#

worn hill so i've been poking around for a couple hours now and i cannot for the life of m...

I suspect you might be too far down in the stack - I would try doing it in the CLI from the outside, for example if we have an API for getting a module's dependencies and calling .Serve I think that should result in them being installed into Query

worn hill Mar 27, 2025, 9:24 PM

#

spring wave I suspect you might be too far down in the stack - I would try doing it in the C...

lol i was literally just looking at this

#

@spring wave there's a spooky comment here though: ```go
// Serve a module's API in the current session.
//
// Note: this can only be called once per session. In the future, it could return a stream or service to remove the side effect.
func (r *Module) Serve(ctx context.Context) error {
if r.serve != nil {
return nil
}
q := r.query.Select("serve")

return q.Execute(ctx)

}

lilac dagger Mar 27, 2025, 9:25 PM

#

A question about "direct host access" which is disabled for dagger function per https://docs.dagger.io/api/sdk/#differences: my case is to develop an AI agent in form of dagger module, and I'm trying to let it take an input source directory from host (where my sample app is at), and do something agentic in my dagger module/functions including read, write, modify codes in my sample app or execute some arbitrary commands. But I tried achieving with no luck, the best I reached is taking in the host dir, manipulating it with or without container but cannot export it via code. Is it because function context is only the module folder and exporting to host via code is not allowed ? The dagger demos of agent are mostly publishing the agentic results on the project to PR, but I just want to apply them to host. It seems related to this issue https://github.com/dagger/dagger/issues/8235 ?

GitHub

Dagger for generating code and docs · Issue #8235 · dagger/dagger

Problem In theory Dagger is perfect for generating code or docs. In practice, the logic for exporting files back to the client filesystem is simplistic and brittle, which makes the experience awkwa...

Dagger SDKs | Dagger

Dagger SDKs make it easy to call the Dagger API from your favorite programming language, by developing Dagger Functions or custom applications.

smoky ocean Mar 27, 2025, 9:26 PM

#

https://tenor.com/view/digging-thank-you-gif-8141656698417804254

Tenor

smoky ocean Mar 27, 2025, 9:27 PM

#

lilac dagger A question about "direct host access" which is disabled for dagger function per ...

I believe there is now an optional argument to llm() to give it "privileged" access to the caller's context. I think it's called withQuery

#

it's all or nothing though

hidden tartan Mar 27, 2025, 9:28 PM

#

worn hill <@108011715077091328> there's a spooky comment here though: ```go // Serve a mod...

It can be called multiple time per session from different modules, not from the same one (it will conflict with already installed dep)

worn hill Mar 27, 2025, 9:40 PM

#

hidden tartan It can be called multiple time per session from different modules, not from the ...

sick, i think that works fine in this context, building now

hidden tartan Mar 27, 2025, 9:40 PM

#

worn hill sick, i think that works fine in this context, building now

That's what I do with the client gen, I call serve for each dep that needs to be served

#

Something like that

dag.moduleSource("xxx").Serve(ctx)
dag.moduleSource("bbb").Serve(ctx)

Boom xxx and bbb are queriable 😄

worn hill Mar 27, 2025, 9:41 PM

#

in the shell context, i'm initially trying this one layer up in 2/3 callsites of maybeLoadModule (.cd and on startup, skipping the one in exec)

#

func (h *shellCallHandler) maybeLoadModuleAndDeps(ctx context.Context, path string) (*moduleDef, *configuredModule, error) {
    def, cfg, err := h.maybeLoadModule(ctx, path)
    if err != nil {
        return nil, nil, err
    }

    for _, dep := range def.Dependencies {
        digest, err := dep.Source.Digest(ctx)
        if err != nil {
            return nil, nil, err
        }
        _, err = h.getOrInitDef(digest, func() (*moduleDef, error) {
            return initializeModule(ctx, h.dag, dep.Source)
        })
        if err != nil {
            return nil, nil, err
        }
    }

    return def, cfg, nil
}

looks like this

hidden tartan Mar 27, 2025, 9:43 PM

#

worn hill ```go func (h *shellCallHandler) maybeLoadModuleAndDeps(ctx context.Context, pat...

And is it working as expected?

worn hill Mar 27, 2025, 9:43 PM

#

dunno yet still building lol

hidden tartan Mar 27, 2025, 9:43 PM

#

Okay let me know 😄

#

You could just call dep.Source.AsModule.Serve technically

#

If you want to serve that dep

#

I guess there's more to add it to the shell completion etc but that should be a one liner to only serve

lilac dagger Mar 27, 2025, 9:46 PM

#

smoky ocean I believe there is now an optional argument to `llm()` to give it "privileged" a...

Thanks for the llm hint. But a dumber question is that without using llm/agent, just normal operations, is it possible to use a module function to take input source and write like a hello.txt directly back to the source directory on host via code instead of dagger shell ? Here is a simple code that ran with no error but nothing is created on mysampleapp folder even with wipe as True, and file is created in container as verified via terminal(), not sure what I miss here.

worn hill Mar 27, 2025, 9:48 PM

#

hidden tartan Okay let me know 😄

it did actually work party_blob

#

imma try it with just serve now, because the way i did it the shell wiring is incomplete anyways

merry scarab Mar 27, 2025, 9:57 PM

#

lilac dagger Thanks for the llm hint. But a dumber question is that without using llm/agent, ...

Yeah this is possible I did a demo showing this exact scenario this morning. My example used LLM but it works the same in all scenarios.

Check out the code and video

https://github.com/levlaz/agent-playground/tree/main/daggerverse-qa

https://www.youtube.com/live/uOSmyFx7O7Q?feature=shared&t=2851

Main thing is you need to use ‘export’ to get the file or directory back out to your local machine

https://docs.dagger.io/api/chaining/#export-directories-files-and-containers

GitHub

agent-playground/daggerverse-qa at main · levlaz/agent-playground

Public Repo for Building AI Agents using Dagger. Contribute to levlaz/agent-playground development by creating an account on GitHub.

YouTube

Dagger

Dagger Community Call – AI in Action: Autonomous QA & Hybrid Clou...

Join the Dagger team and fellow Daggernauts for our bi-weekly Community Call! Stay up-to-date with the latest product enhancements, discover innovative use c...

▶ Play video

Chaining | Dagger

Function chaining is one of Dagger's most powerful features, as it allows you to dynamically compose complex pipelines by connecting one Dagger Function with another. The following sections demonstrate a few more examples of function chaining with the Dagger CLI.

worn hill Mar 27, 2025, 10:01 PM

#

https://github.com/dagger/dagger/pull/9992 @spring wave @hidden tartan for your perusal

GitHub

serve dependences for shell prompt mode by cwlbraa · Pull Request ...

this lets prompt users dagger install modules and manipulate them in prompt mode as though they were the core module.
there's a alternate approach where we do a bit more and call
def, err ...

smoky ocean Mar 27, 2025, 10:03 PM

#

Every time I see a PR by @worn hill, a little "victory trumpet" sound plays in my head because of that little trumpet-shaped avatar. Is it just me?

worn hill Mar 27, 2025, 10:05 PM

#

that's the idea

#

it is a muted horn fwiw, so quiet victory trumpet

smoky ocean Mar 27, 2025, 10:11 PM

#

I'll keep that in mind

lilac dagger Mar 27, 2025, 10:22 PM

#

merry scarab Yeah this is possible I did a demo showing this exact scenario this morning. My ...

Thanks @merry scarab ! I've watched the live video of you this morning but notice the difference that you use export function from dagger shell instead of writing it in code. I'm asking whether it's possible to achieve it using pure code. I guess this comment from dagger team is relevant ? https://github.com/dagger/dagger/issues/8226#issuecomment-2312479275 And I tried using your code, the shell way of exporting works for me even the target dir is arbitrary out of module root but the attached code doesn't work for neither under module root or arbitrary path (ran without error but nothing exported).

GitHub

🐞 Container Export to Host machine is not working · Issue #822...

What is the issue? We have created a container using go sdk and trying to export to host machine using Export function but it is not working. Same operation we are doing using cli "export path...

smoky ocean Mar 27, 2025, 11:19 PM

#

lilac dagger Thanks <@920499459484418068> ! I've watched the live video of you this morning b...

I see the issue.

You're right that modules can't export to their caller's context. One workaround is to assemble a single directory with the contents to export, and return that. Then the caller can export in one go.

lilac dagger Mar 27, 2025, 11:34 PM

#

smoky ocean I see the issue. You're right that modules can't export to their caller's conte...

It makes sense. Now I kind of get that why module isn't allowed to export to host since it's supposed to be the caller responsible for doing changes to host rather then module. The caller can be either dagger cli/shell or some custom app.

smoky ocean Mar 27, 2025, 11:42 PM

#

lilac dagger It makes sense. Now I kind of get that why module isn't allowed to export to hos...

Exactly. We sacrificed some short term convenience in exchange for a more robust composition system.

lean mural Mar 28, 2025, 2:02 AM

#

is there a way to see the full LLM query? i keep blowing up because the input grows too large as the system runs the tests on repeat, but i haven't figured out how to tell what exactly is getting appended to the LLM request

smoky ocean Mar 28, 2025, 2:06 AM

#

lean mural is there a way to see the full LLM query? i keep blowing up because the input gr...

yes you can call LLM.history()

lean mural Mar 28, 2025, 2:09 AM

#

smoky ocean yes you can call `LLM.history()`

if i were running in the shell?

smoky ocean Mar 28, 2025, 2:13 AM

#

lean mural if i were running in the shell?

ah, I think there's a .llm builtin that returns the builtin llm state. so .llm | history ?

lean mural Mar 28, 2025, 2:16 AM

#

smoky ocean ah, I think there's a `.llm` builtin that returns the builtin llm state. so `.ll...

ah you meant in the SDK with LLM.history()? i meant after a run / in traces for trying to understand the failure

smoky ocean Mar 28, 2025, 2:18 AM

#

lean mural ah you meant in the SDK with `LLM.history()?` i meant after a run / in traces fo...

ah I see. Did you setup tracing in Dagger Cloud? You can see the full history in the web view if the trace

lean mural Mar 28, 2025, 2:07 PM

#

Ok now the LLM is doing what I expect, but dagger keeps exiting after the LLM prompt cycle finishes:

const llmSpace = await dag.llm()
  .withWorkspace(ws)
  .withPromptFile(prompt)
  .sync();

return await llmSpace
  .workspace()
  .diff()

it nevers calls diff -- it exits at the llm call

lean mural Mar 28, 2025, 2:11 PM

#

lean mural Ok now the LLM is doing what I expect, but dagger keeps exiting after the LLM pr...

fwiw i had the same issue with the shorter,

return await dag.llm()
        .withWorkspace(ws)
        .withPromptFile(prompt)
        .workspace()
        .diff();

but i broke it up trying to debug

shrewd ermine Mar 28, 2025, 3:07 PM

#

That looks right, but there could be 2 things going on:

If you're looking in cloud, currently the LLM basically "takes over" the whole trace and hides everything before/after. We need to fix this
At one point the tracing output in the terminal would basically push the output of diff off screen. I don't remember what the current state of this is, I thought it was fixed... but when that was happening, running the same command again to get the cached run would show me the correct output.

spring wave Mar 28, 2025, 3:15 PM

#

could use a review on this if anyone's got time: https://github.com/dagger/dagger/pull/9993

GitHub

llm: avoid exposing the model to "variables" concept by vito · Pul...

This was a bit of a halfway measure and is currently detrimental to consistent model behavior.
For example, I've observed:

needless calls to readVariable
attemps to pass variables as funct...

lean mural Mar 28, 2025, 3:30 PM

#

shrewd ermine That looks right, but there could be 2 things going on: 1. If you're looking in ...

oh snap! maybe i'll write the diff as a file and export it instead of a string and see what i get

lean mural Mar 28, 2025, 3:55 PM

#

@shrewd ermine have you seen this in your greetings api?

POST https://api.github.com/repos/lamalex/greetings-api/pulls/3/comments: 422 Validation Failed [{Resource:PullRequestReviewComment Field:pull_request_review_thread.path Code:invalid Message:} {Resource:PullRequestReviewComment Field:pull_request_review_thread.diff_hunk Code:missing_field Message:}]

its failing to write suggestions, https://v3.dagger.cloud/lamalex/traces/e8fd16f89c8f5aaa89d2db32192d6de5?span=2deb6b8a5cac88a3

Dagger Cloud

Browse and visualize Dagger traces.

shrewd ermine Mar 28, 2025, 3:56 PM

#

lean mural <@135620352201064448> have you seen this in your greetings api? ``` POST https:/...

I haven't but I'm surprised I haven't 😅 There probably needs to be some kind of validation on the payload https://github.com/kpenfound/greetings-api/blob/main/.dagger/debugger.go#L93

eager fiber Mar 28, 2025, 3:57 PM

#

spent yesterday loading a bunch of NYC open data into postgres and then wrote an agent to analyze it.

gpt-4o seems to consistently generate working queries, but then quickly ends up in rate limit land.
gpt-4o-mini avoids rate limits but generates mostly useless queries and spends a bunch of time just looping on nonsense

trying out some alternative models this morning

shrewd ermine Mar 28, 2025, 3:58 PM

#

eager fiber spent yesterday loading a bunch of NYC open data into postgres and then wrote an...

Nice, is that on a free tier account or paid?

eager fiber Mar 28, 2025, 3:58 PM

#

shrewd ermine Nice, is that on a free tier account or paid?

open AI ive paid for but it says i have to spend more to get to higher rate limit tiers?

lean mural Mar 28, 2025, 3:58 PM

#

shrewd ermine I haven't but I'm surprised I haven't 😅 There probably needs to be some kind of...

```suggestion
            Click the button to see a greeting!

looks right but 🤷

shrewd ermine Mar 28, 2025, 4:01 PM

#

eager fiber open AI ive paid for but it says i have to spend more to get to higher rate limi...

ah ok, I haven't used OpenAI at all yet but that makes sense. FWIW gemini goes really far for free, but maybe that's a hot take 😅

eager fiber Mar 28, 2025, 4:02 PM

#

shrewd ermine ah ok, I haven't used OpenAI at all yet but that makes sense. FWIW gemini goes r...

which model?

shrewd ermine Mar 28, 2025, 4:03 PM

#

gemini-2.0-flash

spring wave Mar 28, 2025, 4:56 PM

#

🧵 to bikeshed prompt mode toggle

smoky ocean Mar 28, 2025, 5:34 PM

#

https://github.com/dagger/dagger/issues/10003

GitHub

Default function masks when binding core types to LLM? · Issue #10...

In addition to user-configurable function masks, maybe we should mask some functions from core types by default. The core types (Container, Directory) have a lot of functions, so the tool count goe...

quiet ether Mar 28, 2025, 6:07 PM

#

worn hill https://github.com/dagger/dagger/pull/9992 <@108011715077091328> <@2818744806518...

Connor, would really love to have this so I can continue working on my Discord module with this enabled. Just checking if I can help somehow to merge it

quiet ether Mar 28, 2025, 6:08 PM

#

quiet ether Connor, would really love to have this so I can continue working on my Discord m...

or..I can just build the engine off that PR and hold, that's fine either way 🙏

worn hill Mar 28, 2025, 6:14 PM

#

quiet ether or..I can just build the engine off that PR and hold, that's fine either way 🙏

feel free to cherry-pick, i've got some broken tests to track down and some tests to write before that will become mergable

eager fiber Mar 28, 2025, 7:02 PM

#

@shrewd ermine ya gemini limits seem better.

#

although you'll notice i asked about 2025 and got a 2024 response 😛

#

so have to play with the data and prompts a bit.

river belfry Mar 28, 2025, 7:23 PM

#

spring wave do you have code anywhere I can try out?

My code is there but not sure how easy it is to test it https://github.com/eunomie/local-agent
But I finally managed to find the first commit that changed the behavior:
https://github.com/dagger/dagger/commit/44370a44d
I'll check again with main to see.
Basically:

I'm running qwen2.5 as the (local) model
the app is a small tool containing an environment with functions like addpackages, tree, read, write
the goal is to create dev environments on the fly, by letting the LLM read the files and understand what to do.
before this commit, the llm will read the file tree, read the files, install packages
after, the llm will read the file tree, read the files, but stops there. It never install packages, in best case it prints what it should do
My guess would be in this upgrade of openai-go from 0.1.0-alpha.61 to 0.1.0-beta.2
I'm trying to upgrade to the 0.1.0-beta-3 to see if that's any better

river belfry Mar 28, 2025, 7:34 PM

#

river belfry My code is there but not sure how easy it is to test it https://github.com/eunom...

ok, so the 0.1.0-beta.3 is better. My feeling is it's still worse than the alpha.61, but it's better
I'll do some more tests, but I can open a PR to bump it

feral birch Mar 28, 2025, 7:41 PM

#

Hi all! I've been playing around with Dagger's agent use cases and am very excited to build an agent on top of it! Thinking ahead, I do have a question on how I can distribute my agent that's built on top of Dagger for others to run. I understand that I could ask my users to install Dagger and then do dagger install dagger run etc, with my own dagger module. But is there a way that I could package dagger runtime as part of my own executable and just ship one single binary to my users? Thanks for the help!

river belfry Mar 28, 2025, 7:41 PM

#

river belfry ok, so the `0.1.0-beta.3` is better. My feeling is it's still worse than the `al...

https://github.com/dagger/dagger/pull/10005

GitHub

deps: bump openai-go to v0.1.0-beta.3 by eunomie · Pull Request #1...

https://github.com/openai/openai-go/releases/tag/v0.1.0-beta.3
Full changelog: openai/openai-go@v0.1.0-beta.2...v0.1.0-beta.3

quiet ether Mar 28, 2025, 9:42 PM

#

worn hill https://github.com/dagger/dagger/pull/9992 <@108011715077091328> <@2818744806518...

just tested this and unblocks my use-case. Thx Connor ❤️

worn hill Mar 28, 2025, 10:46 PM

#

quiet ether just tested this and unblocks my use-case. Thx Connor ❤️

was chatting with @spring wave and there is one thing downstream of this that feels kinda necessary: the LLM needs a selectQuery tool. when its selected something else, in my use cases usually the type of the parameter of a query that im building up to put into the query, once you've got the param all constructed, it can no longer pass it in to the query-rooted function you're trying to call. curious if you've hit that same thing in your experimentation.

steep onyx Mar 28, 2025, 10:50 PM

#

Did a rebase of the engine-wide cache PR on main and picked up the new AllowLLM tests, which started failing. The main "problem" is that tests are getting cache hits/deduped-execution when calling the same modules with the same args, which results in only one of the test clients actually getting a prompt to ask if it's okay to use the LLM, causing others to fail randomly...

"problem" in quotes because I'm not sure yet if that's actually something to consider a bug. The point of the --allow-llm stuff is to not eat users tokens without their permission, right? So in this case, not prompting a user who's tokens wouldn't be consumed anyways makes sense, I think? Would others agree? cc @worn hill

#

I got the tests to pass by mixing in the client's AllowedLLMModules settings into the cache key for function calls, but that's a sad way to fix it since it means less function caching everywhere just based on those settings.

steep onyx Mar 28, 2025, 11:17 PM

#

steep onyx Did a rebase of the engine-wide cache PR on main and picked up the new `AllowLLM...

For now, going with just adjusting the tests to avoid duplicate function calls that trigger this. If anyone has an opinion on whether the behavior should be different in this situation lemme know.

worn hill Mar 28, 2025, 11:48 PM

#

steep onyx Did a rebase of the engine-wide cache PR on main and picked up the new `AllowLLM...

yeah, i'd agree. i was hitting this with the old cache algos too, but added this cache buster to try to avoid it... it seemed to work and cause the desired-for-test cache invalidations but i won't pretend i had a complete understanding of why, i was just trying to get hit all the cases i needed to hit and it seemed to unblock me

GitHub

dagger-test-modules/llm-dir-module-depender/llm-test-module/main.go...

Contribute to dagger/dagger-test-modules development by creating an account on GitHub.

#

@steep onyx that said the full caching on llm calls is gonna be really interesting... does it correctly factor in message history and whatnot? people definitely don't think of LLMs as being pure or "hermetic" (weird word to use here, i know, but i think you catch my meaning) and im curious how easy it's gonna be to get accidental cache hits that produce surprising behavior

#

those test examples are not what i'd call surprising fwiw, mostly cuz they each start from scratch so there's no collected message history

steep onyx Mar 28, 2025, 11:56 PM

#

worn hill <@949034677610643507> that said the full caching on llm calls is gonna be really...

For now I'm gonna toss in a CachePerSession on llm so that it just retains the behavior it currently has once we enable persistent dagql caching. Basically just delaying needing answers to those (tricky) questions.

I feel like not caching LLM calls is the right default, but we do definitely need the history to be cacheable (and transferable via remote caching). So there's some subtleties there to disentangle

worn hill Mar 28, 2025, 11:58 PM

#

idk i could see caching feeling nice provided that all the history and env context bits are treated as part of the cache key

smoky ocean Mar 29, 2025, 12:02 AM

#

🚨🚨🚨 Experimental MCP support merging soon! Thank you @wraith remnant @warped bramble 🙏

wraith remnant Mar 29, 2025, 1:48 AM

#

smoky ocean 🚨🚨🚨 Experimental MCP support merging soon! Thank you <@274903880343748619> <@...

In ✅

smoky ocean Mar 29, 2025, 2:04 AM

#

Yay! https://github.com/dagger/dagger/pull/9935

GitHub

Expose a Dagger module as an MCP server by tiborvass · Pull Reques...

This introduces a new dagger mcp command that starts an MCP stdio server
Usage
dagger mcp or dagger mcp -m ref, where the ref is a path to a local module or a remote one
Implementation
In this vers...

smoky ocean Mar 29, 2025, 2:50 AM

#

Work in progress: "Environment API" which cleans up LLM API by introducing an Environment type.

https://github.com/dagger/dagger/pull/10007

GitHub

Environment API: a new API for exposing a Dagger environment to hum...

LLM is now a regular type. It can receive an environment
Environment: an execution context within the Dagger Engine. Basically, a sandbox.
Binding: a named mapping to a value, scoped to an Environm...

river belfry Mar 30, 2025, 7:17 PM

#

Not sure who I should put in reviewers, but I have this one that bumps openai-go. This improves a bit the behavior when using llama.cpp and small models. Not entirely sure why exactly, I wasn't able to find the exact commit between the alpha.61 and beta.2 that degraded the results
https://github.com/dagger/dagger/pull/10005

GitHub

deps: bump openai-go to v0.1.0-beta.3 by eunomie · Pull Request #1...

https://github.com/openai/openai-go/releases/tag/v0.1.0-beta.3
Full changelog: openai/openai-go@v0.1.0-beta.2...v0.1.0-beta.3

quiet ether Mar 30, 2025, 8:47 PM

#

river belfry Not sure who I should put in reviewers, but I have this one that bumps `openai-g...

writing some sort of eval for this would be nice. ref: https://github.com/vito/daggerverse/blob/main/botsbuildingbots/evals/main.go

GitHub

daggerverse/botsbuildingbots/evals/main.go at main · vito/daggerverse

a monorepo of all my Dagger modules. Contribute to vito/daggerverse development by creating an account on GitHub.

river belfry Mar 31, 2025, 6:06 AM

#

quiet ether writing some sort of eval for this would be nice. ref: https://github.com/vito/d...

Based on your module I did that: https://github.com/lgtdio/llmeval
I added my .env to the git repo as there's no screts here. It's using Docker Model Runner but that should be similar if we run the model using llama.cpp.
Basically it generates the reports for my main test case, where I want the LLM to generate dev environment on the fly by inspecting the code base.

I run it with dagger based on the alpha.61 of openai-go (using this branch https://github.com/lgtdio/dagger/tree/llm-demo-2) and here is the result: https://github.com/lgtdio/llmeval/blob/main/reports/with-openai-alpha-61.txt
-> At the end of the report I added the history of the built container.
-> It works really well, found the tools, use thems

I also run the same thing based on beta.3 (main) and here is the result: https://github.com/lgtdio/llmeval/blob/main/reports/with-openai-beta-3.txt
-> It's failing because it doesn't use correctly the tools
-> Instead of finding a tool tree it will install the tree package
-> It never uses the addPackage tool
-> It install weird stuff in mode --force-broken-world
In the end that works, but way less efficient.

I still haven't found the change from the alpha61 that degraded the performances.
My prompt is also complex because the model wasn't able to find the tools at start, but it might depends on the models, especially when they are not so big. But at least it was working.

proper stratus Mar 31, 2025, 6:30 AM

#

Multi-objects is great. However, I have issues with Dagger Shell. When switching from navigate to input mode, the output stops where I was, and I can't see what I'm typing. Sometimes it shows input and output but only a few lines before stopping. Also, output from long prompts is difficult to follow, and it seems to not show the complete output.

river belfry Mar 31, 2025, 9:18 AM

#

Currently I'm mostly running local models (<14B for most of them, so not so big)
I'm seeing a lot of differences in behavior depending on the model. Would it be interesting to share a list of the models that works well/to recommend based on the kind of task to perform?

shrewd ermine Mar 31, 2025, 9:28 AM

#

<14b can be a bit rough, but I've mostly used qwen2.5-coder for code generation and it's been pretty good. Getting the prompting just right is the real challenge with the smaller models but once you get the constraints just right they're good

river belfry Mar 31, 2025, 9:35 AM

#

I'm also using qwen2.5-coder in 14B, I can't really go more with my actual laptop 😕

shrewd ermine Mar 31, 2025, 9:36 AM

#

yeah it might be worth the tradeoff to run 7b with a larger context length too, I haven't dug too deep on that config side

river belfry Mar 31, 2025, 9:36 AM

#

shrewd ermine <14b can be a bit rough, but I've mostly used qwen2.5-coder for code generation ...

Do you have some prompts available on GH? To compare with what I'm doing. I have for instance this one that works (depending on the openai-go version) but it's "big": https://github.com/eunomie/local-agent/blob/main/.dagger/qwen_dev_env.md

shrewd ermine Mar 31, 2025, 9:39 AM

#

That looks pretty good, I don't have anything as good as that 😛 I would add under contraints DO NOT USE THE CONTAINER TOOL. If it calls Container() from your dev-environment module to get the container object it will immediately overwhelm itself. Hopefully that won't be an issue in the next release

river belfry Mar 31, 2025, 10:10 AM

#

Regarding https://docs.dagger.io/api/llm#environments-and-tools I wonder if we shouldn't add a small example. Like if we only want the ability for the llm to read, a small module that only contains a read func. Or if we want to go a bit further, a read function and a tree function that runs tree in a container on a specified directory (I like this one because it's not just restricting the scope of the llm, it's also extending it with custom functions).
What I mean by that is I understand what is written, but also because I know what to expect. And while that sounds clear, I don't know how easy for someone to go from this description to the creation of a small module that can act as an environment.
I'll see to open a PR with a small example and we can discuss it if that makes sense.

shrewd ermine Mar 31, 2025, 10:30 AM

#

river belfry Regarding https://docs.dagger.io/api/llm#environments-and-tools I wonder if we s...

Thanks, iterating on docs now. That part will probably change a bunch with #1352023893543747754 in the upcoming release but that'll be covered

#

btw if there's more feedback on the current LLM docs, now is an excellent time to share 😄

hidden tartan Mar 31, 2025, 12:13 PM

#

Quick demo I made for my GH agent using the prompt mode, it's kinda cool

gloomy kindle Mar 31, 2025, 1:13 PM

#

🤔 for prompt mode, should the results be written back to the variable? sorry, i'm struggling a bit to get the actual result back out

shrewd ermine Mar 31, 2025, 1:14 PM

#

gloomy kindle 🤔 for prompt mode, should the results be written back to the variable? sorry, i...

No, see latest in #1352023893543747754 for relevant discussion

gloomy kindle Mar 31, 2025, 1:14 PM

#

aha 😦

smoky ocean Mar 31, 2025, 1:14 PM

#

welcome to the frontier 😁

gloomy kindle Mar 31, 2025, 1:15 PM

#

i'm using 0.17.2 without those env changes, still applies?

shrewd ermine Mar 31, 2025, 1:15 PM

#

ah, no, different answer there

#

( but I don't know off the top of my head )

smoky ocean Mar 31, 2025, 1:19 PM

#

~~in 0.17.2 I believe:

the llm can set variables
they get synced back to your shell
but you need to explicitly prompt the llm to do it~~

Nevermind I was completely wrong

gloomy kindle Mar 31, 2025, 1:20 PM

#

ahaha

spring wave Mar 31, 2025, 1:21 PM

#

the llm can't set variables in 0.17.2

#

@gloomy kindle i think you want $_

#

that'll be assigned as the last value returned by the LLM

gloomy kindle Mar 31, 2025, 1:22 PM

#

yes 😄

#

that's what i want ❤️

#

thank you!

shrewd ermine Mar 31, 2025, 1:23 PM

#

is there any way to get $_ in code??

gloomy kindle Mar 31, 2025, 1:23 PM

#

spring wave that'll be assigned as the last value returned by the LLM

i'm guessing it's not supported for getting the last result from a normal shell command?

smoky ocean Mar 31, 2025, 1:23 PM

#

spring wave that'll be assigned as the last value returned by the LLM

returned as in selected?

spring wave Mar 31, 2025, 1:23 PM

#

gloomy kindle i'm guessing it's not supported for getting the last result from a normal shell ...

not currently, but i want it to be that too, yeah

spring wave Mar 31, 2025, 1:24 PM

#

smoky ocean returned as in selected?

yeah

#

well

smoky ocean Mar 31, 2025, 1:24 PM

#

or result of last tool call?

spring wave Mar 31, 2025, 1:24 PM

#

both (they are the same)

smoky ocean Mar 31, 2025, 1:24 PM

#

regardless of selection

#

ah ok

spring wave Mar 31, 2025, 1:24 PM

#

since tool calls auto-select

gloomy kindle Mar 31, 2025, 1:24 PM

#

thank you thank you 😄

spring wave Mar 31, 2025, 1:25 PM

#

i suppose that also means $_ will always be an object, never a string, since LLMs never select non-objects

#

but now you can e.g. $agent | last-reply if that's what you want 😛

smoky ocean Mar 31, 2025, 1:33 PM

#

well at least now I'm caught up on what the API is in main..

smoky ocean Mar 31, 2025, 1:50 PM

#

Bug report by @bronze fern : _currentSelection tool is always sent to LLM even when environment is empty. It seems to confuse the LLM (it gives tainted responses that talk about selection)

merry scarab Mar 31, 2025, 2:53 PM

#

This page was generated and deployed with an llm 😄

https://daggerverse-qa.surge.sh/financialadvisor.html

river belfry Mar 31, 2025, 2:57 PM

#

shrewd ermine That looks pretty good, I don't have anything as good as that 😛 I would add und...

Just FYI but after a lot of different tries, it looks like I have better results by adding to a system prompts the list of available tools. This makes better results than to have them inside the prompt file.
With that I have really similar results than I had when we were using openai-go alpha.61 (I mean I have good results and I'll be able to demo with based on main, and really happy about that 🙂 )

river belfry Mar 31, 2025, 2:57 PM

#

river belfry Just FYI but after a lot of different tries, it looks like I have better results...

My system prompt: https://github.com/eunomie/local-agent/blob/main/.dagger/qwen_system_prompt.md

GitHub

local-agent/.dagger/qwen_system_prompt.md at main · eunomie/local-...

Demo of local agent using Dagger. Contribute to eunomie/local-agent development by creating an account on GitHub.

hidden tartan Mar 31, 2025, 3:00 PM

#

river belfry Just FYI but after a lot of different tries, it looks like I have better results...

What's the diff between a system prompt and a regular prompt in the Dagger API?

river belfry Mar 31, 2025, 3:03 PM

#

Based on ⬆️
We have a Tools function. I wonder if we shouldn't make available the list of tools directly. That way we can construct a kind of similar doc but specifically for the model used and the expected format, and send it to the (system) prompt. (It can be useful for small, local models)
Would that make sense? (Happy to try to do it, but wanted to validate the need first)

river belfry Mar 31, 2025, 3:04 PM

#

hidden tartan What's the diff between a system prompt and a regular prompt in the Dagger API?

It's sent as a specific type of message to the openai API https://github.com/dagger/dagger/blob/3f89ce13e1b4ffd435d143cf190d2b88c584353a/core/llm_openai.go#L113-L114

GitHub

dagger/core/llm_openai.go at 3f89ce13e1b4ffd435d143cf190d2b88c58435...

An open-source runtime for composable workflows. Great for AI agents and CI/CD. - dagger/dagger

merry scarab Mar 31, 2025, 3:07 PM

#

An even better version 😄 https://daggerverse-qa.surge.sh/

hidden tartan Mar 31, 2025, 3:10 PM

#

That's actually better to set the system prompt so you don't send the instruction on every query in prompt mode, just noticed that while trying

woeful quiver Mar 31, 2025, 3:24 PM

#

When LLMs are calling LLMs, it might be helpful to name them? cc @hidden tartan (e.g. LLM.WithName("bot1"))

smoky ocean Mar 31, 2025, 3:36 PM

#

@river belfry @hidden tartan I'm not sure I understand, tool calling already works this way - the LLM endpoint already injects the same information in the context. Doing this duplicates it

#

Are your descriptions in that doc the same as what's in the comments of your functions?

river belfry Mar 31, 2025, 3:39 PM

#

I'll try again to be sure, but what I saw is that works well with big models, like when you use gpt, but with small models if I don't add again the tools (sometimes in a different format) that doesn't work well, the LLM will for instance not find the tools to run. Especially with qwen model I'd say.

smoky ocean Mar 31, 2025, 3:45 PM

#

river belfry I'll try again to be sure, but what I saw is that works well with big models, li...

That's good to know! We should add it to the eval

smoky ocean Mar 31, 2025, 3:45 PM

#

river belfry I'll try again to be sure, but what I saw is that works well with big models, li...

It could be linked to other variables, for example our MCP/tool calling implementation changed a lot over the last week

river belfry Mar 31, 2025, 3:46 PM

#

Here is what chatgpt says to me when I ask it to improve my prompt:

Ah, you’re super close — but here’s the catch: LLaMA 3.2 1B is extremely small and may not reliably infer when to call a tool, even when told it can. Smaller models like this often need explicit prompting to take actions like calling tools.

smoky ocean Mar 31, 2025, 3:47 PM

#

We could add this to the LLM type

#

Like if model == "qen" { /* inject system prompt */ }

hidden tartan Mar 31, 2025, 3:50 PM

#

river belfry I'll try again to be sure, but what I saw is that works well with big models, li...

System prompt makes thing quite inaccurate though, I can clearly see the diff when I set it or not, it seems it fails to use the tool correctly :/

shrewd ermine Mar 31, 2025, 4:05 PM

#

river belfry I'll try again to be sure, but what I saw is that works well with big models, li...

yeah I'd make sure you're on at least 0.17.2, but if that system prompt makes a huge difference I'd try adjusting the documentation on the functions in the module instead. Especially with a really small model it might actually be worse to use that system prompt in addition to the tool descriptions because it's just more context

quiet ether Mar 31, 2025, 4:22 PM

#

@spring wave As I mentioned in the prod-dev sync, I'm getting into a situation where after my agent selects the firs tool it needs, it seems like it gets stuck within that context and doesn't know how to use the other tools it knew about before selecting the current one. I'm currently using Claude 3.5 as an LLM, not sure if that matters.

spring wave Mar 31, 2025, 4:23 PM

#

quiet ether <@108011715077091328> As I mentioned in the prod-dev sync, I'm getting into a si...

might be the known issue around losing track of Query - @worn hill has context

quiet ether Mar 31, 2025, 4:23 PM

#

spring wave might be the known issue around losing track of `Query` - <@430802613848506380> ...

seems to be related.

spring wave Mar 31, 2025, 4:23 PM

#

basically we might need another tool, analogous to selectQuery but maybe not named that because the model might not understand

worn hill Mar 31, 2025, 4:26 PM

#

spring wave might be the known issue around losing track of `Query` - <@430802613848506380> ...

#agents message link to previous mention, definitely related if not exactly the same thing @quiet ether

#

it's definitely the sort of UX bug where it makes you wonder if you're doing something wrong, but the fact that both of us hit the exact same thing trying to use the interactive-onramp UX strongly implies this is not user error at all

quiet ether Mar 31, 2025, 4:31 PM

#

worn hill it's definitely the sort of UX bug where it makes you wonder if you're doing som...

@worn hill wanna tackle this one together? seems important

smoky ocean Mar 31, 2025, 4:31 PM

#

Be careful of split brain guys

quiet ether Mar 31, 2025, 4:38 PM

#

@spring wave @worn hill thread about selectQuery tool

woeful quiver Mar 31, 2025, 4:41 PM

#

FYI, you cannot use Rancher Desktop with Claude to register/test MCP servers - has to be Docker Desktop

river belfry Mar 31, 2025, 5:00 PM

#

shrewd ermine yeah I'd make sure you're on at least 0.17.2, but if that system prompt makes a ...

Ok, so I did more tests.
And I removed both the system prompt and the tools list I was passing to the llm.
And that works fine!
Initially I added the tool list because they weren't correctly found, and now it's better to remove them to avoid confusion. (I also improved the function docs)
Let's call that learning 😅

#

But the (important) result is that works great 🙂

shrewd ermine Mar 31, 2025, 5:02 PM

#

Amazing!

river belfry Mar 31, 2025, 5:04 PM

#

(ok, it works better 😅 )

smoky ocean Mar 31, 2025, 5:31 PM

#

Masking fields 🧵

merry scarab Mar 31, 2025, 6:04 PM

#

I find myself constantly rate limited by 4o - are there any good patterns to either

consistently reduce my token size
get visibiliyt into what the input tokens acftually look like?

spring wave Mar 31, 2025, 6:16 PM

#

merry scarab I find myself constantly rate limited by 4o - are there any good patterns to eit...

are you using the company gpt-4o account? it's still severely rate limited, but if we're all using it, we'll burn enough money more quickly to get into the higher tiers 💸

#

the biggest problem is that it currently gets every single API exposed to it as a tool, depending on the currently selected object, which is where #1354656055925149716 came in, which showed promise but led to the model making more mistakes

merry scarab Mar 31, 2025, 6:17 PM

#

spring wave are you using the company gpt-4o account? it's still severely rate limited, but ...

Yeah for sure.

Getting the token count is dope but I wish I could see the whole injput, would help people debug easier IMO

#

Sorry if dumb querstion but can I use dall-e-3?

.model allows me to switch but then seeing this error when I try

│🤖 0.1s
│ ! POST "https://api.openai.com/v1/chat/completions": 403 Forbidden {
│ !         "message": "You are not allowed to sample from this model",
│ !         "type": "invalid_request_error",
│ !         "param": null,
│ !         "code": null
│ !     }
! input: llm.withQuery.withModel.withPrompt.sync select: POST "https://api.openai.com/v1/chat/completions": 403 Forbidden {
!         "message": "You are not allowed to sample from this model",
!         "type": "invalid_request_error",
!         "param": null,
!         "code": null
!     }

shrewd ermine Mar 31, 2025, 6:41 PM

#

merry scarab Sorry if dumb querstion but can I use dall-e-3? `.model` allows me to switch bu...

you are not allowed to sample from this model

#

But to answer more specifically, the model has to support chat generation. And if you want to give it objects, it has to support tool calling

smoky ocean Mar 31, 2025, 6:49 PM

#

Oh @steep onyx there's another papercut we could use help with...

The "EnvironmentHook" in core/env.go install all core types in Environment.with[TYPE]Input and Output.as[TYPE] but really, half of those types can be removed...

#

--> https://gist.github.com/shykes/228f2fb0b9485a959da61f1e073ab202

Gist

gist:228f2fb0b9485a959da61f1e073ab202

GitHub Gist: instantly share code, notes, and snippets.

merry scarab Mar 31, 2025, 6:51 PM

#

shrewd ermine you are not allowed to sample from this model

Not really sure what "sample" means here.

So im thinking of just wrapping their SDK then?

https://platform.openai.com/docs/guides/image-generation

I was thinking I could use it via text since it accepts a prompt and returns a URL.

smoky ocean Mar 31, 2025, 6:52 PM

#

I'm thinking we could remove the following:

current-module
*type-def
env
error
function-*
generated-code
llm-token-usage
sdk-config
source-map

shrewd ermine Mar 31, 2025, 7:04 PM

#

Not really sure what "sample" means here

steep onyx Mar 31, 2025, 7:11 PM

#

I'm thinking we could remove the

fleet fiber Mar 31, 2025, 9:37 PM

#

Oh good I know MCP now https://youtu.be/HyzlYwjoXOQ?si=-k0tnlMzN3IZDQbs

YouTube

Fireship

I gave Claude root access to my server... Model Context Protocol ex...

Deploy your app without complexity and $50 in free credits on Sevalla https://sevalla.com/fireship

Learn the fundamentals of Anthropic's Model Context Protocol by building an MCP server can give any AI model superpowers. In this tutorial, we build an TypeScript server that provides Claude with additional context and the ability to modify data o...

▶ Play video

smoky ocean Mar 31, 2025, 9:49 PM

#

lol

#

OK @steep onyx @wraith remnant @worn hill @spring wave we're in countdown to release... The idea is to get the new environment API out, so we can port all examples & docs to it by the hack night tomorrow

#

@spring wave what say you? 🙂

#

Night crew ready to do final testing here in London

spring wave Mar 31, 2025, 9:54 PM

#

fixing $_ is the last blocker i think? i can figure something out there

#

and maybe revive -i / life-alert since that was based on returning, which we have now (in an even more solid form)

steep onyx Mar 31, 2025, 9:55 PM

#

smoky ocean OK <@949034677610643507> <@274903880343748619> <@430802613848506380> <@108011715...

is this a v0.17.3 or the big v0.18?

spring wave Mar 31, 2025, 9:56 PM

#

@smoky ocean oh, and the exposing bindings as tools. that needs more testing i think

shrewd ermine Mar 31, 2025, 10:01 PM

#

eval it uuuuuup

smoky ocean Mar 31, 2025, 10:02 PM

#

steep onyx is this a v0.17.3 or the big v0.18?

I would suggest 0.17.3 since only experimental APIs break (with the caveat that there is no formal marker of experimental APIs... but we have communicated clearly that llm is experimental IMO)

#

Then we can cut 0.18 tomorrow when the stakes are less high 🙂 Probably safe to assume there's another (hopefully small) release to be had tomorrow for last minute fixes

#

But hopefully we can freeze API today

bronze fern Mar 31, 2025, 10:08 PM

#

https://github.com/dagger/dagger/issues/10027

GitHub

✨ Access state of container modified during `terminal` session wi...

What are you trying to do? Many users have requested the ability to capture the state of a Container that has been modified through an interactive session with terminal such as in ⋈ container | fro...

storm gate Mar 31, 2025, 10:11 PM

#

smoky ocean Then we can cut 0.18 tomorrow when the stakes are less high 🙂 Probably safe to ...

If we’re planning to cut 0.18 tomorrow on a stable API that ships in .17.3. I’d cut .18 today and .18.1 tomorrow. My 2p.

smoky ocean Mar 31, 2025, 10:13 PM

#

My 2p.

2 pounds? 🙂

storm gate Mar 31, 2025, 10:19 PM

#

smoky ocean > My 2p. 2 pounds? 🙂

I’ll go back to giving my 2c next week

smoky ocean Mar 31, 2025, 10:21 PM

#

@spring wave how can we ( @steep onyx @wraith remnant @worn hill ) help you get that PR merged?

spring wave Mar 31, 2025, 10:24 PM

#

smoky ocean <@108011715077091328> how can we ( <@949034677610643507> <@274903880343748619> <...

moar testing

#

particularly curious about the vars-as-tools bit, that's the biggest unknown at the moment, and there are other schemes we could try

smoky ocean Mar 31, 2025, 10:25 PM

#

@spring wave so you're positive you saw that working at least once?

spring wave Mar 31, 2025, 10:26 PM

#

i definitely see it call those tools, but what's worrying is that it seems to take their presence (or maybe the way they're described) as a sign that it can pass them by name in args to things, which if true could be a real wrench in the gears

#

so another thing we could try is to change/repurpose currentSelection to additionally list the known object IDs, MAYBE paired with a name, but that might have the same risks. ideally they'd have a description instead

#

that's the general area that needs de-risking atm

#

aside from that, just trying out existing agents and trying to find (not too cryptic) ways to break/confuse it

#

just got $_ working, will push soon. it's currently as-before, where it's only the last selected object, not arbitrary scalars, since that's by far the easiest to support and you can always just re-select whatever field you want from it

#

(pushed)

#

i'll add some telemetry to those env getters too so it's more obvious when it's using them

smoky ocean Mar 31, 2025, 10:34 PM

#

Let me tweak the description for inputs as tools

spring wave Mar 31, 2025, 10:45 PM

#

@smoky ocean here's a run where I tried to make it rely on the tools - it seemed hesitant to use them: https://v3.dagger.cloud/dagger/traces/4e0c9b0390848d1c687b3bca8a72812f
could be worth just tossing the IDs directly in the description to cut out the extra roundtrips

Dagger Cloud

Browse and visualize Dagger traces.

spring wave Mar 31, 2025, 10:46 PM

#

spring wave i'll add some telemetry to those env getters too so it's more obvious when it's ...

also, pushed this

smoky ocean Mar 31, 2025, 10:47 PM

#

@spring wave did you want to make description mandatory in inputs?

#

probably last call to do that if so 🙂

spring wave Mar 31, 2025, 10:48 PM

#

https://tenor.com/bRn74.gif

Tenor

#

...lemme dogfood it a bit

#

worth noting there isn't a way to add descriptions to shell vars. would be cool to use comments for that

ctr=$(container | from golang) # a Go image to use for building

shrewd ermine Mar 31, 2025, 10:51 PM

#

spring wave worth noting there isn't a way to add descriptions to shell vars. would be cool ...

use ai 😉

spring wave Mar 31, 2025, 10:51 PM

#

"A container for preserving broccoli"

shrewd ermine Mar 31, 2025, 10:51 PM

#

(but actually what if "container | from golang" was the description)

smoky ocean Mar 31, 2025, 10:52 PM

#

Oh right the shell... damn

spring wave Mar 31, 2025, 10:52 PM

#

well, we could always accept a "" description

#

it's mostly about making it hard to forget, and easy to consistently provide

smoky ocean Mar 31, 2025, 10:53 PM

#

yeah

#

Having descriptions everywhere also clarifies that inputs and outputs each have their own namespace...

spring wave Mar 31, 2025, 10:53 PM

#

spring wave worth noting there isn't a way to add descriptions to shell vars. would be cool ...

this might not actually be hard, either

#

probably easier in the REPL than it would be in a script, though

#

having said that, right now var syncing doesn't work at all in a script anyway

smoky ocean Mar 31, 2025, 10:57 PM

#

@spring wave this seems like dead code no? Should I remove?

spring wave Mar 31, 2025, 10:57 PM

#

yeah noticed that, you can rm it

smoky ocean Mar 31, 2025, 10:57 PM

#

could it have influenced some of the above?

shrewd ermine Mar 31, 2025, 10:58 PM

#

ok i have to sleep, cant wait to try 0.18 (or 1.0?) when i wake up 🙏

smoky ocean Mar 31, 2025, 10:58 PM

#

smoky ocean Bug report by <@933501536624054272> : `_currentSelection` tool is always sent to...

#

@spring wave we should fix that papercut also

@wraith remnant if you're available? 👆

spring wave Mar 31, 2025, 10:59 PM

#

smoky ocean could it have influenced some of the above?

possibly, though worth noting its value has been 0 1 etc. on your branch. but just the word "variable" could have done that

woeful quiver Mar 31, 2025, 10:59 PM

#

shrewd ermine ok i have to sleep, cant wait to try 0.18 (or 1.0?) when i wake up 🙏

trying to get to that 1.0

wraith remnant Mar 31, 2025, 11:00 PM

#

smoky ocean <@108011715077091328> we should fix that papercut also <@274903880343748619> if...

~~On the functionmask PR / main or environment-api ~~? Yes, we're available to help on that 🙏

On it

smoky ocean Mar 31, 2025, 11:07 PM

#

@spring wave trying to not get pulled into too many changes at once, to avoid conflict. pushing soon

spring wave Mar 31, 2025, 11:07 PM

#

smoky ocean <@108011715077091328> trying to not get pulled into too many changes at once, to...

👍 - i'm testing required descriptions now, feels pretty good to use actually

woeful quiver Mar 31, 2025, 11:19 PM

#

MCP Server issues

spring wave Mar 31, 2025, 11:20 PM

#

@smoky ocean rebased + pushed required input descriptions

wraith remnant Mar 31, 2025, 11:27 PM

#

smoky ocean <@108011715077091328> we should fix that papercut also <@274903880343748619> if...

Shall i make a pr against your environmnet-api branch ? I have it ready

smoky ocean Mar 31, 2025, 11:28 PM

#

@spring wave I don't see Env.outputs and Env.output in the llm schema, ok for me to add?

#

btw I just had a very successful run with the only explicit prompt being "do it" 😛

spring wave Mar 31, 2025, 11:29 PM

#

smoky ocean <@108011715077091328> I don't see `Env.outputs` and `Env.output` in the llm sch...

sure - right now it's still binding i believe? or did you just change that to inputs?

smoky ocean Mar 31, 2025, 11:29 PM

#

It really feels like with this pattern we're getting closer to agents being declarative reactive functions themselves, not just the outside code envelope, but the LLM itself 🙂

spring wave Mar 31, 2025, 11:29 PM

#

smoky ocean btw I just had a very successful run with the only explicit prompt being "do it"...

was this with described inputs + outputs?

smoky ocean Mar 31, 2025, 11:29 PM

#

spring wave sure - right now it's still `binding` i believe? or did you just change that to ...

Yeah just changed that to inputs. Didn't realize I had to split 😅

spring wave Mar 31, 2025, 11:29 PM

#

smoky ocean It really feels like with this pattern we're getting closer to agents being decl...

that's what i've been saaaaayiiiiiing! 😛

#

and why the idea of mutable bindings felt off

smoky ocean Mar 31, 2025, 11:29 PM

#

spring wave was this with described inputs + outputs?

sadly I was in a nested dagger so don't have a trace. but here's the snippet:

#

env=$(
  .core | env |
  with-container-input base-container $(container | from alpine) |
  with-git-repository-input dagger-source $(git https://github.com/dagger/dagger) |
  with-file-output dagger-binary "The dagger command-line binary, built in Go from the latest stable release of Dagger, in a containerized dev environment" |
  with-container-output go-env "The go environment used to build the dagger CLI, with everything setup such that 'go build' works on the first try when entering the
  container"
)

result=$(llm | with-env $env | with-prompt "do it")

(sparing you the middle part)

│🤖 1.2s ◆ Input Tokens: 2,355 ◆ Output Tokens: 26
│ ✔ return(
│ │ │ dagger-binary:🤖 Container.file(path: "/bin/dagger"): File! 0.0s
│ │ │ go-env:🤖 Container.withWorkdir(path: "/app"): Container! 0.0s
│ │ ): String! 0.0s
│🤖 I've successfully prepared everything:
│ ┃
│ ┃ • Dagger Binary: The Dagger CLI binary has been built and is available.
│ ┃ • Go Environment: The containerized Go environment is set up with the Dagger source code mounted at /app .
│ ┃
│ ┃ You can now proceed with your tasks using these resources.
│ ┃ 1.8s ◆ Input Tokens: 2,433 ◆ Output Tokens: 62

spring wave Mar 31, 2025, 11:33 PM

#

sweet

smoky ocean Mar 31, 2025, 11:33 PM

#

It did try to cheat and return File#1

#

(before actually getting a file)

#

so that might be a weakness of the numerical ID system

#

maybe we need to make it look a little more random

spring wave Mar 31, 2025, 11:33 PM

#

hmm yeah

#

what model?

smoky ocean Mar 31, 2025, 11:34 PM

#

gpt-4o

wraith remnant Mar 31, 2025, 11:34 PM

#

smoky ocean gpt-4o

https://github.com/shykes/dagger/pull/351. @smoky ocean feel free to just take the code and close the PR. Made this for an easy review. Didn't want to push directly without asking

spring wave Mar 31, 2025, 11:36 PM

#

env input chaining papercut

smoky ocean Mar 31, 2025, 11:44 PM

#

@spring wave I'm a little confused trying to add Env.output() and Env.outputs(): it looks like the output definitions are saved in place (outputsByName) but the actual values in another (objsByName) is that right?

spring wave Mar 31, 2025, 11:44 PM

#

smoky ocean <@108011715077091328> I'm a little confused trying to add `Env.output()` and `En...

yeah, we could dedupe them but I wasn't sure if a binding with a nil value was safe or if it would lead to panics somewhere

smoky ocean Mar 31, 2025, 11:44 PM

#

I would have expected objsByName to become inputsByName, with a mirror outputsByName of the same type: map[string]*Binding.

smoky ocean Mar 31, 2025, 11:44 PM

#

spring wave yeah, we could dedupe them but I wasn't sure if a binding with a `nil` value was...

Oh I see. ~~Yeah its perfectly fine~~ to clarify having a non-null binding with a null value is perfectly fine

spring wave Mar 31, 2025, 11:45 PM

#

or that yeah

smoky ocean Mar 31, 2025, 11:45 PM

#

OK I'll do that then?

#

Then need to go to sleep... Will you be able to carry the release today guys? I know it's getting late even for your timezeons

steep onyx Mar 31, 2025, 11:46 PM

#

smoky ocean Then need to go to sleep... Will you be able to carry the release today guys? I ...

I am still good with kicking it off tonight. Just need to know when and a final call on version number.

#

My assumptions atm are that "when" == "when that environment-api PR is merged" and "version number" == "v0.18"

#

so just tell me if that's wrong

smoky ocean Mar 31, 2025, 11:48 PM

#

steep onyx My assumptions atm are that "when" == "when that environment-api PR is merged" a...

When: 👍
Version: your call

wraith remnant Mar 31, 2025, 11:49 PM

#

smoky ocean - When: 👍 - Version: your call

just in case it was missed: there's this PR against the environment-api on his fork -- https://github.com/shykes/dagger/pull/351 ; if you guys want i can open it on dagger/dagger (or you can direclty copy-paste and push on the branch thinkies )

spring wave Mar 31, 2025, 11:52 PM

#

current status: nothing in progress, was just dogfooding the required-descriptions stuff for my evals, and then gonna run them, which I'll do after @smoky ocean pushes the new Env.output API since I need it now. 😛

but, have to go on a 40m-ish car ride so that's it from me for a bit. pushed my evals changes in case anyone wants to try them while i'm in the 🚗

#

also want to try adding input descriptions to the tool descriptions, not sure if that's done yet

steep onyx Mar 31, 2025, 11:56 PM

#

wraith remnant just in case it was missed: there's this PR against the environment-api on his f...

It LGTM, I'd just push that commit to the branch

smoky ocean Mar 31, 2025, 11:58 PM

#

spring wave current status: nothing in progress, was just dogfooding the required-descriptio...

https://tenor.com/view/pressure-sweating-nervous-gif-22339203

Tenor

wraith remnant Mar 31, 2025, 11:58 PM

#

steep onyx It LGTM, I'd just push that commit to the branch

thanks, doing it

smoky ocean Mar 31, 2025, 11:59 PM

#

@spring wave do you use the type_ argument of WithOutput anywhere ?

#

my guess is that type enforcement was left as a todo?

#

or I'm blind

spring wave Apr 1, 2025, 12:07 AM

#

smoky ocean <@108011715077091328> do you use the `type_` argument of `WithOutput` anywhere ?

I thought it was passed along

smoky ocean Apr 1, 2025, 12:10 AM

#

spring wave I thought it was passed along

Yeah but I didn't see it used anywhere. But, I since realized that I was blind 🙂

It's enforced in the return builtin at the dagql layer - not explicitly, doh!

#

OK I think I'm done, testing real quick

#

I apologize in advance if there are sleep-depravation bugs

#

@wraith remnant another papercut request... env doesn't work in the shell, you have to call .core | env...

#

mmm there's a Terminal type? 🤔 isn't that long deprecated?

steep onyx Apr 1, 2025, 12:19 AM

#

smoky ocean mmm there's a `Terminal` type? 🤔 isn't that long deprecated?

I think the object still shows up because we have legacy API support (pre-v0.12) for the field on it. But the view only hides the field, not the object type itself

#

I can append it to that list of things to hide from the env extensions

smoky ocean Apr 1, 2025, 12:23 AM

#

@spring wave pushed

smoky ocean Apr 1, 2025, 12:23 AM

#

steep onyx I can append it to that list of things to hide from the env extensions

I think that and SDK config would be great thanks. Obviously not a release blocker...

smoky ocean Apr 1, 2025, 12:24 AM

#

smoky ocean <@108011715077091328> pushed

One last tweak, the description of the input is not passed through to the llm. fixin gnow

#

oh also missing in output arguments

#

(I think)

#

shall I add them too?

spring wave Apr 1, 2025, 12:27 AM

#

smoky ocean <@108011715077091328> pushed

hitting a compile error but I think we can just nix the code causing it. seems like we can just get rid of currentSelection entirely

spring wave Apr 1, 2025, 12:27 AM

#

smoky ocean oh also missing in output arguments

those should be there thinkspin

smoky ocean Apr 1, 2025, 12:27 AM

#

oh you're right. Mmm then I guess the LLM ignored it in my eval

#

I was getting a little too cocky with the "do it" prompts 😛

spring wave Apr 1, 2025, 12:28 AM

#

lol

smoky ocean Apr 1, 2025, 12:28 AM

#

spring wave hitting a compile error but I think we can just nix the code causing it. seems l...

oh no, I rebased and didn't try rebuilding.

#

fixing it

spring wave Apr 1, 2025, 12:29 AM

#

so, the currentSelection tool was originally added as a hint to the model so it knows when it's been given an initial selection, but now that isn't a thing

#

so i'm not sure if it's even needed anymore? unless the hint is still helping it? not sure

smoky ocean Apr 1, 2025, 12:30 AM

#

Oh I see. Not needed the rest of the time?

spring wave Apr 1, 2025, 12:30 AM

#

my other idea was to repurpose it into a general purpose "your current context" tool, which lists the inputs + descriptions in its description, that way we don't need all the getter tools

smoky ocean Apr 1, 2025, 12:30 AM

#

spring wave my other idea was to repurpose it into a general purpose "your current context" ...

Honestly that seems like the getter tools but less native

spring wave Apr 1, 2025, 12:31 AM

#

well, the advantage is a) not having to run those tools ever, and b) not risking confusing tool names because of what people named their inputs

smoky ocean Apr 1, 2025, 12:32 AM

#

I mean it doesn't change the API so we can let the evals decide 🙂

spring wave Apr 1, 2025, 12:32 AM

#

In my testing simply putting stuff in tool descriptions is pretty high leverage

smoky ocean Apr 1, 2025, 12:32 AM

#

For now I'll just fix the code as is, feel free to remove the hint

spring wave Apr 1, 2025, 12:32 AM

#

smoky ocean I mean it doesn't change the API so we can let the evals decide 🙂

Yeah we can iterate

#

/me writes an eval that sets an input called "return" elmofire

smoky ocean Apr 1, 2025, 12:34 AM

#

(force) pushed

#

OK this is it for me...

spring wave Apr 1, 2025, 12:34 AM

#

Night!

smoky ocean Apr 1, 2025, 12:35 AM

#

Hopefully not too late for release?

steep onyx Apr 1, 2025, 12:35 AM

#

smoky ocean Hopefully not too late for release?

I'm still good, releases go pretty quick now so it's not a big deal

#

The LLM integ tests are very upset on that PR right now, I'll work on updating them

spring wave Apr 1, 2025, 12:43 AM

#

probably need a sdk all generate + docs generate too (I would but I'm on a laptop in a car and my legs are hot enough as it is)

#

hmm looks like we need a .sync before getting values out of the env - adding one to LLM.env. also just remembered I had a fix for .sync on another branch, gonna try pulling that in, and MAYBE -i support but that might be too much too late

wraith remnant Apr 1, 2025, 12:48 AM

#

smoky ocean <@274903880343748619> another papercut request... `env` doesn't work in the shel...

Done ✅

quiet ether Apr 1, 2025, 12:50 AM

#

steep onyx I'm still good, releases go pretty quick now so it's not a big deal

@steep onyx I'm around for approvals / support / whatever you need

#

feel free to ping / dm me

steep onyx Apr 1, 2025, 12:53 AM

#

steep onyx The LLM integ tests are very upset on that PR right now, I'll work on updating t...

hm to update these tests I need to regenerate the golden examples, which seems like it requires I have all of our API keys for various providers? And then to uncomment this code or similar?

#

@spring wave @quiet ether does that sound right?

spring wave Apr 1, 2025, 12:54 AM

#

steep onyx <@108011715077091328> <@336241811179962368> does that sound right?

I run this: dagger call test update --pkg=./core/integration --run="TestLLM" --env-file=file://$PWD/.env -o .

steep onyx Apr 1, 2025, 12:54 AM

#

or do I just need one provider?

spring wave Apr 1, 2025, 12:54 AM

#

one provider should be sufficient

#

running evals now, noticed claude-3-5-sonnet-latest sometimes doesn't call return 😬 maybe more prompting needed?

spring wave Apr 1, 2025, 1:18 AM

#

anyone know what causes this? my ./hack/dev is wedged

│ │ ✘ moduleSource(disableFindUp: true, refString: "/var/home/vito/src/dagger/docs"): ModuleSource! 30.0s
│ │ ! failed to resolve dep to source: failed to load local dep: select: failed to load sdk for local module source: failed to load local dep: select: local path "/var/home/vito/src/dagger/sdk/php/dev/php" does not exist: unknown builtin sdk
│ │ ! The "php" SDK does not exist. The available SDKs are:
│ │ ! - go
│ │ ! - python
│ │ ! - typescript
│ │ ! - php
│ │ ! - elixir
│ │ ! - java
│ │ ! - any non-bundled SDK from its git ref (e.g. github.com/dagger/dagger/sdk/elixir@main)

https://v3.dagger.cloud/dagger/traces/cdf57de1006c67015097cfffdfc7edaa

Dagger Cloud

Browse and visualize Dagger traces.

quiet ether Apr 1, 2025, 1:19 AM

#

maybe sdk gen busted your local sdk folder? @spring wave

#

trying here

spring wave Apr 1, 2025, 1:20 AM

#

hmm could be

❯ git clean -ffdnx
Would remove .dagger/dagger.gen.go
Would remove .dagger/internal/
Would remove .env
Would remove .jj/
Would remove .ropeproject/
Would remove bin/
Would remove sdk/dotnet/sdk/Dagger.SDK/introspection.json
Would remove sdk/php/vendor/
Would remove sdk/rust/target/

quiet ether Apr 1, 2025, 1:20 AM

#

seen that happening in the past but with a different error

spring wave Apr 1, 2025, 1:21 AM

#

now it's complaining about elixir - progress!

#

gonna try pruning the cache

quiet ether Apr 1, 2025, 1:24 AM

#

spring wave gonna try pruning the cache

oh yes

#

elixir builds fail also in CI due to caching

#

I think the elixir cache is racy somehow

#

seen that happening quite a few times in CI and re-running generally fixes it

#

@steep onyx where you able to get the golden files? I have them here

steep onyx Apr 1, 2025, 1:39 AM

#

quiet ether <@949034677610643507> where you able to get the golden files? I have them here

Yep gonna push fixes in a min, just had to update the modules in our dagger-test-modules repo too

spring wave Apr 1, 2025, 1:49 AM

#

(pruning cache worked but now a fresh build is taking ages...)

steep onyx Apr 1, 2025, 1:55 AM

#

spring wave anyone know what causes this? my `./hack/dev` is wedged ``` │ │ ✘ moduleSource(d...

It's quite buried but eventually in the error trace I found this: https://v3.dagger.cloud/dagger/traces/cdf57de1006c67015097cfffdfc7edaa?span=db3424adfa16f059

host github.com not found

Which probably explains sorta what's happening; it tried to get the php sdk from git but then probably hits a fallback case where it assumes it's not meant to be a git ref and just a local ref

Dagger Cloud

Browse and visualize Dagger traces.

#

So might be a DNS problem elmofire

spring wave Apr 1, 2025, 1:56 AM

#

ah nice find, saw that error later and decided to restart the engine, seems ok now

steep onyx Apr 1, 2025, 1:58 AM

#

I pushed TestLLM fixes and sdk/doc regen to the PR, so hopefully CI is happy now

spring wave Apr 1, 2025, 3:01 AM

#

@steep onyx sorry, got caught up in a rebase, trying to hoist over some fixes from the life-alert branch. are you getting close to a cut-off time? 😬

steep onyx Apr 1, 2025, 3:02 AM

#

spring wave <@949034677610643507> sorry, got caught up in a rebase, trying to hoist over som...

No problem, I'm good, I'd say 10pm is probably my cutoff time to start (so 2hrs)

spring wave Apr 1, 2025, 3:24 AM

#

@steep onyx ok, calling it for now - Claude 3.5 Sonnet still has some issues, but I don't think it's blocking

#

(just did a push -f)

steep onyx Apr 1, 2025, 3:24 AM

#

spring wave <@949034677610643507> sorry, got caught up in a rebase, trying to hoist over som...

Do these failures in TestLLM look legit or just something that needs the golden examples regen'd again? https://v3.dagger.cloud/dagger/traces/c497d1a248cf0410a4ea9f49fe50595c?listen=1a235b3cd96d0ccf&listen=9b8f517aaba53937&listen=d20cbf598d89915d

Dagger Cloud

Browse and visualize Dagger traces.

spring wave Apr 1, 2025, 3:25 AM

#

steep onyx Do these failures in TestLLM look legit or just something that needs the golden ...

ah this might be fixed by what I just push -f'd - went back to return

#

I temporarily renamed it to try to drop a stronger hint to Claude 3.5 but it didn't work well enough to be worthwhile

steep onyx Apr 1, 2025, 3:26 AM

#

spring wave ah this might be fixed by what I just push -f'd - went back to `return`

oh okay that was an even newer -f 😄 , cool sounds good

steep onyx Apr 1, 2025, 3:38 AM

#

spring wave <@949034677610643507> ok, calling it for now - Claude 3.5 Sonnet still has some ...

Okay merged it, will start the release after main looks nice and green ( portalto https://discord.com/channels/707636530424053791/1355235431746240794)

quiet ether Apr 1, 2025, 4:35 AM

#

going to bed folks, it's quite late here 🙏 😴

steep onyx Apr 1, 2025, 5:11 AM

#

v0.18.0 is published: https://github.com/dagger/dagger/releases/tag/v0.18.0

storm gate Apr 1, 2025, 7:39 AM

#

steep onyx v0.18.0 is published: https://github.com/dagger/dagger/releases/tag/v0.18.0

Thanks all! Resuming the shift from the old continent 🌐

gloomy kindle Apr 1, 2025, 9:40 AM

#

question around required descriptions - just on porting code to the new style, it feels a little clunky? i'm also not entirely sure what i should be describing? it feels like i'm just parroting the type comments from the type itself for simple examples

gloomy kindle Apr 1, 2025, 10:28 AM

#

little nit about the new API - I have to repeat the magic string for the output var a couple times? it feels weird that it's not statically typed, and I could just get it wrong and get runtime errors. feels unavoidable though, and it always present (although less obvious) with single object before.
I also don't have any suggestions as to how to avoid it 😭 but it does feel very magically dynamic, and different from the test of our type system

#

(also I'm so far out of the loop, so you've probably discussed it all before)

shrewd ermine Apr 1, 2025, 10:50 AM

#

here's the diff for a python agent upgrading to v0.18.0 if anyone needs a reference. Will have other language reference soon https://github.com/kpenfound/dagger-programmer/commit/df1ced09304c4a18a20bc226b7c783293e74be84

shrewd ermine Apr 1, 2025, 11:54 AM

#

getting API errors if I do llm | with-prompt (not supplying an env). Not supported?

bronze fern Apr 1, 2025, 12:19 PM

#

shrewd ermine getting API errors if I do `llm | with-prompt` (not supplying an env). Not suppo...

came here to say the same. Thought it was me on bad wifi, but same on good wifi. Prompt mode working fine.

#

Shell mode:

✘ llm 1.0s
│ ● llm: LLM! 11.1s
! Post "http://dagger/query": unexpected EOF

Prompt mode:

dagger
Dagger interactive shell. Type ".help" for more information. Press Ctrl+D to exit.loading type definitions 0.1s
│ ┃ 0.0s
│
│🤖 Hello! How can I assist you today?
│ ┃ 1.4s ◆ Input Tokens: 1,089 ◆ Output Tokens: 11

shrewd ermine Apr 1, 2025, 12:22 PM

#

bronze fern Shell mode: ``` ✘ llm 1.0s │ ● llm: LLM! 11.1s ! Post "http://dagger/query": une...

which provider were you using on shell mode? With gemini I got a nice google API error. That looks like maybe your engine got crashed

bronze fern Apr 1, 2025, 12:22 PM

#

shrewd ermine which provider were you using on shell mode? With gemini I got a nice google API...

that was OPENAI

#

Get similar with Gemini and Anthropic. Kills the engine when I run llm in shell mode.

proper stratus Apr 1, 2025, 12:32 PM

#

I just hit the same error in two session

bronze fern Apr 1, 2025, 12:42 PM

#

https://github.com/dagger/dagger/issues/10032

GitHub

🐞 `llm` in 0.18 shell mode causing engine to panic and restart ...

What is the issue? Engine which may handle prompt mode fine with LLMs like Gemini, GPT, Claude, panics when llm is run in shell mode under 0.18.0 Dagger version 0.18.0 Steps to reproduce dagger llm...

quiet ether Apr 1, 2025, 12:57 PM

#

bronze fern https://github.com/dagger/dagger/issues/10032

building the engine to check this

shrewd ermine Apr 1, 2025, 12:57 PM

#

quiet ether building the engine to check this

its on 0.18.0

quiet ether Apr 1, 2025, 12:59 PM

#

shrewd ermine its on 0.18.0

yep, building to make a bisect