multi-object 🧵 | Dagger | Page 1

acoustic radish Mar 7, 2025, 1:29 AM

#

@shrewd skiff getting closer... but I'm getting this weird "empty spans" effect:

#

#

not sure what those robot heads are doing

shrewd skiff Mar 7, 2025, 1:31 AM

#

most likely the 'thinking/replying' span corresponding to API requests

#

aka the one that has token usage attached on non-OpenAI

acoustic radish Mar 7, 2025, 1:34 AM

#

Yes but I've never seen more than one in a row with no tool call - normally it stops in that situation

rich marsh Mar 7, 2025, 1:37 AM

#

I feel like that started happening a lot for me with ollama in one of the recent updates too. Haven't been able to dig in to see why yet. I was wondering if there was a regression in our openai implementation

acoustic radish Mar 7, 2025, 1:38 AM

#

Also I hit this at the end:

│
│🤖 0.2s
│ ! POST "https://api.openai.com/v1/chat/completions": 400 Bad Request {
│ !   "error": {
│ !     "message": "An assistant message with 'tool_calls' must be followed by tool messages responding to each 'tool_call_id'. The fol
│ !     "type": "invalid_request_error",
│ !     "param": "messages",
│ !     "code": null
│ !   }
│ ! }

rich marsh Mar 7, 2025, 1:43 AM

#

Vikram was hitting this too #1346226656162877460 message

acoustic radish Mar 7, 2025, 1:46 AM

#

~~@shrewd skiff looking at the web trace, there are in fact function calls. They just don't show in the TUI~~

--> my bad actually this is a problem across TUI and Web traces

#

https://v3.dagger.cloud/dagger/traces/d94760087d33186ce02170740cdf28e6#7cb3e82498247fd3

Dagger Cloud

Browse and visualize Dagger traces.

#

shrewd skiff Mar 7, 2025, 1:47 AM

#

ah interesting, will look into it

acoustic radish Mar 7, 2025, 1:48 AM

#

weird that part of the TUI trace has the same info

#

but then later in the same trace, it starts to drop information

#

Sorry that was wrong - everything seems to be the same in TUI and web

#

so it's NOT a TUI-specific problem after all

#

shrewd skiff Mar 7, 2025, 2:07 AM

#

oh OK - you were just looking at a different region?

acoustic radish Mar 7, 2025, 2:08 AM

#

yeah got confused

#

so back to not knowing why several "thinking" spans in a row with no tool call

rich marsh Mar 7, 2025, 2:11 AM

#

😭

acoustic radish Mar 7, 2025, 2:12 AM

#

hey how come I don't get the token count?

rich marsh Mar 7, 2025, 2:12 AM

#

😎

acoustic radish Mar 7, 2025, 2:12 AM

#

is it supposed to work with openai?

shrewd skiff Mar 7, 2025, 2:12 AM

#

it is, but openai is returning 0 for the token usage

#

no idea why. maybe they don't support it for streaming, but total guess

acoustic radish Mar 7, 2025, 2:17 AM

#

@shrewd skiff if you want to try, it works - now have to fine tune the prompt engineering

#

(so that the llm intuitively figures out what to do)

#

might need a _help

#

I'm curious how this would plug into dagger llm

acoustic radish Mar 7, 2025, 2:44 AM

#

@shrewd skiff @rich marsh nice side effect, you don't need a separate "prompt var" system. All vars in the llm env can be used to templatize the prompt

rich marsh Mar 7, 2025, 2:50 AM

#

nice. Does set-string have abilities beyond what with-prompt-var had?

shrewd skiff Mar 7, 2025, 2:50 AM

#

gonna go hide all those Secret.plaintext errors

acoustic radish Mar 7, 2025, 3:02 AM

#

rich marsh nice. Does `set-string` have abilities beyond what `with-prompt-var` had?

it's like setContainer setToyWorkspace etc

#

i renamed "with" to "set" to try the DX

#

you can actually also expand objects 🙂

shrewd skiff Mar 9, 2025, 1:44 AM

#

@acoustic radish lol, this is an interesting consequence of marrying prompt vars + llm env vars + shell vars

#

which ended up not working, because of course the model is looking for 'repo' and 'ctr'

acoustic radish Mar 9, 2025, 1:45 AM

#

mmm oops 😁

shrewd skiff Mar 9, 2025, 1:45 AM

#

but...what if we built on that, and just made them addressable by digest?!?!?!

#

eh you lose readability of the prompt itself

#

but, cool that we could

acoustic radish Mar 9, 2025, 1:46 AM

#

was that in my multi obj branch?

shrewd skiff Mar 9, 2025, 1:46 AM

#

yeah, i'm integrating it with the shell vars now

#

mostly works, except for that

#

e.g. https://v3.dagger.cloud/dagger/traces/2532348070c455e3b39b8b2c68b62bfe#08424bca26e5eea6

Dagger Cloud

Browse and visualize Dagger traces.

#

tl;dr after every eval we diff the vars from the last vars and set any new/different ones in the llm env, too

#

so no /set foo bar or /with, you just foo=$(container | from alpine) and the llm sees it too

acoustic radish Mar 9, 2025, 2:01 AM

#

acoustic radish was that in my multi obj branch?

lol just realized you wrote this in a thread called "multi-obj" my bad

acoustic radish Mar 9, 2025, 2:02 AM

#

shrewd skiff so no `/set foo bar` or `/with`, you just `foo=$(container | from alpine)` and t...

that's perfect - so much smoother IMO

#

I started using it in my demos today, noticed you already allow setting vars in /shell then using them in /with

#

@shrewd skiff were you able to get the LLM to use the objects properly?

#

I didn't have time to get back to that

#

on a plane now, I have 45mn to try 🙂

shrewd skiff Mar 9, 2025, 2:03 AM

#

getting there - I think it just needs a bit of prompt engineering. i'm having it generate a prompt 😛

#

i can commit + push what i have

acoustic radish Mar 9, 2025, 2:05 AM

#

oh yes please! I'll start from what you have

#

Any direction you'd want to me try first?

#

(or avoid because you're already on it?)

shrewd skiff Mar 9, 2025, 2:08 AM

#

@acoustic radish pushed to my fork - vito/llm-multiobj (don't have perms on your branch)

shrewd skiff Mar 9, 2025, 2:09 AM

#

acoustic radish Any direction you'd want to me try first?

none in particular - it's all super fresh, just kicking the tires for the first time really

acoustic radish Mar 9, 2025, 2:09 AM

#

oh oops. will fix perms

#

actually I thought upstream maintainers could push to forks?

#

do you lack perms upstream?

shrewd skiff Mar 9, 2025, 2:10 AM

#

shrug usually this isn't an issue, not sure

#

oh wait

shrewd skiff Mar 9, 2025, 2:10 AM

#

acoustic radish do you lack perms upstream?

oh is the branch on dagger/dagger?

#

oops. well there's one there now

To github.com:dagger/dagger
 * [new branch]          llm-multiobj -> llm-multiobj

acoustic radish Mar 9, 2025, 2:11 AM

#

no it's on my fork, but I thought if you write permissions on the upstream, that gave you write permissions to the forks also

shrewd skiff Mar 9, 2025, 2:11 AM

#

can delete, not sure why i can't push to shykes/llm-multiobj

acoustic radish Mar 9, 2025, 2:11 AM

#

maybe only if there's a PR

#

@shrewd skiff just opened the pr. Try again?

shrewd skiff Mar 9, 2025, 2:15 AM

#

that worked 👍

acoustic radish Mar 9, 2025, 2:15 AM

#

nice ok

#

pulling now

#

@shrewd skiff I think getting multiobj to work might be a matter of choosing the "metaphor" so that it feels natural to the LLM with minimal explanation - just name & description of the tools

#

for example I realized - _undo for rolling back the selection, makes it unintuitive to do sub-pipelines eg. container | with-directory /src $(git ...)

#

actually I guess _scratch becomes super important

#

maybe _start

#

and bring back the concept of pipeline or chain in the descriptions

#

@shrewd skiff getting a build error on the CLI: cmd/dagger/llm.go:147:12: pattern llm.md: no matching files found

#

https://dagger.cloud/dagger/traces/b9a5013e0444f43eebc37e2b061c9630

shrewd skiff Mar 9, 2025, 2:29 AM

#

huh thought i fixed that, 1 sec

shrewd skiff Mar 9, 2025, 2:30 AM

#

acoustic radish <@108011715077091328> getting a build error on the CLI: `cmd/dagger/llm.go:147:1...

try now (force-pushed, sry)

acoustic radish Mar 9, 2025, 2:30 AM

#

np I'm building from git remote, faster from the plane 🙂

#

(remote engine on home server)

#

time's up I'm landing

#

will try again later tonight

#

thanks for pushing this!

shrewd skiff Mar 9, 2025, 2:39 AM

#

np

#

hmmm so far results are not that great, but i might know why

#

if i try to do something like build $repo using $ctr, whenever it swaps from one to the other it forgets how to work with the other one

#

maybe the tools aren't returning the full set, and only the current value's? i remember you saying before that sometimes it extends instead of replacing

#

e.g. https://asciinema.org/a/8kKNaxWRsYcUHpKr3Nhz78q0n

asciinema.org

untitled

Recorded by vito

acoustic radish Mar 9, 2025, 2:45 AM

#

extends instead of replacing

didn't understand that sorry

shrewd skiff Mar 9, 2025, 2:45 AM

#

you mentioned something about the set of tools including both Container and Directory if it chained from a Container to a Directory

acoustic radish Mar 9, 2025, 2:46 AM

#

shrewd skiff you mentioned something about the set of tools including both Container and Dire...

yeah that's in the current BBI implementation

#

but in multiobj it's only the current object's functions + _builtins

#

so it needs to understand how to juggle objects

#

I think we don't give the llm enough to connect the dots

shrewd skiff Mar 9, 2025, 2:47 AM

#

hmm yeah that seems like the hard part

acoustic radish Mar 9, 2025, 2:47 AM

#

yes. but easier than eg. teaching it dagger shell or gql imo

shrewd skiff Mar 9, 2025, 2:48 AM

#

i'm not sure about that - those are languages, which llms are good at

acoustic radish Mar 9, 2025, 2:48 AM

#

i think it's doable with good results but haven't had a chance to try

shrewd skiff Mar 9, 2025, 2:48 AM

#

this seems like it would rely more on reasoning capabilities

#

just a hunch

acoustic radish Mar 9, 2025, 2:49 AM

#

I'll try a few things

#

the syntax part was not the issue with shell at least.

#

it was understanding the object model, chaining etc

#

so very similar but with more ground to cover because there was no 1-1 mapping of tools ever

acoustic radish Mar 9, 2025, 3:17 AM

#

FYI still getting the build error

#

dagger -m github.com/shykes/dagger@llm-multiobj -c 'engine | service llm | up'

#

✘ .withExec(args: ["go", "list", "-f", "{{if eq .Name \"main\"}}{{.Dir}}{{end}}", "./cmd/dagger"]): Container! 0.4s
cmd/dagger/llm.go:155:12: pattern llm.md: no matching files found

#

@shrewd skiff in case the current multiobj strategy doesn't pan out, what's plan B? gql? shell? other?

shrewd skiff Mar 9, 2025, 3:20 AM

#

acoustic radish FYI still getting the build error

yeah i think this is globs somewhere actually, the file is committed

acoustic radish Mar 9, 2025, 3:20 AM

#

trying from a local checkout in case it's a cache issue

#

god that version string compute andf its full git fetch... can't wait to optimize that away

shrewd skiff Mar 9, 2025, 3:21 AM

#

@acoustic radish pushed

acoustic radish Mar 9, 2025, 3:21 AM

#

shrewd skiff Mar 9, 2025, 3:23 AM

#

yeah 😭 can't wait for function caching

shrewd skiff Mar 9, 2025, 3:24 AM

#

acoustic radish <@108011715077091328> in case the current multiobj strategy doesn't pan out, wha...

not sure - not giving up on it yet, still tinkering

#

even with GraphQL I ended up doing a stateful vars thing

#

though, maybe it could do one big tool call that both sets vars + uses them in later queries. (kinda like a SQL CTE)

shrewd skiff Mar 9, 2025, 5:52 AM

#

!! https://v3.dagger.cloud/dagger/traces/175627a60bde3e1e9fb8ffef7c225603

Dagger Cloud

Browse and visualize Dagger traces.

shrewd skiff Mar 9, 2025, 5:53 AM

#

shrewd skiff <@488409085998530571> lol, this is an interesting consequence of marrying prompt...

i think this might be a feature, not a bug

#

the idea here is: no more _load, instead we load the tools for all types currently set to vars, and you use the Type@hash value to refer to them everywhere

#

sweet, this works without any prompting

#

the main downside is loading all the tools at once, not sure if that's a dealbreaker. hmm maybe. it caches well, at least

acoustic radish Mar 9, 2025, 6:01 AM

#

But there is a hard limit on number of tools

#

And even before the limit, it might hurt performance

#

but, we could optimize this later maybe

#

you use the Type@hash value to refer to them everywhere

Don't understand this part

shrewd skiff Mar 9, 2025, 6:03 AM

#

lemme see if i can work backward from where I ended up and keep one-toolset-at-a-time, there were other changes along the way

acoustic radish Mar 9, 2025, 6:03 AM

#

Feels like we're reinventing swap memory 😛

shrewd skiff Mar 9, 2025, 6:03 AM

#

lol

shrewd skiff Mar 9, 2025, 6:05 AM

#

acoustic radish > you use the Type@hash value to refer to them everywhere Don't understand this...

basically being consistent in printing Type@hash everywhere and accepting it as argument values in place of IDs, etc.

#

the llm seems to pick up how it works pretty well without any prompting

acoustic radish Mar 9, 2025, 6:12 AM

#

acoustic radish for example I realized - `_undo` for rolling back the selection, makes it unintu...

ah I see, so could be a solution to this 👆

shrewd skiff Mar 9, 2025, 6:15 AM

#

yeah, assuming the model is able to keep track of the hash values. not sure if that's reliable enough

acoustic radish Mar 9, 2025, 6:17 AM

#

-       // +ignore=["*", ".*", "!/cmd/dagger/*", "!**/go.sum", "!**/go.mod", "!**/*.go", "!**.graphql"]
+       // +ignore=["*", ".*", "!cmd/dagger/*", "!**/go.sum", "!**/go.mod", "!**/*.go", "!**.graphql"]
        source *dagger.Directory,

Oh noooo - sorry you got bit by that

shrewd skiff Mar 9, 2025, 6:18 AM

#

perf is really variable, hard to tell why but the # of tools is prob a good guess
https://asciinema.org/a/VPSLzOVjc20n1CunBdHf0o7uo

asciinema

vito

all tools available at once

Recorded by vito

#

(also claude seems generally slower than gpt-4o)

#

gpt-4o was much more fun
https://asciinema.org/a/JJQxdHa8XLU3c0CSKeyn5rnOL
https://v3.dagger.cloud/dagger/traces/f32918df8f8d7d2277b40b4719e1b074

asciinema

vito

debugging failed go build

Recorded by vito

Dagger Cloud

Browse and visualize Dagger traces.

#

pushed to vito/llm-multiobj-all-tools for now

#

this ^ showed a concerning issue though where it treated the container as if it were mutable, instead of chaining

acoustic radish Mar 9, 2025, 6:38 AM

#

@shrewd skiff in latest multi-object branch, looks like you can't set variables to a module

$ dagger llm
* /shell
⋈ workspace=$(github.com/shykes/toy-programmer/toy-workspace) 3.2s

✘ workspace=$(github.com/shykes/toy-programmer/toy-workspace) 3.2s
! input: llm.withPrompt.setToyWorkspace load: Call: Query has no such field: "toyWorkspace"
│ ✔ load module 0.5s
│
│ ○ toyWorkspace: ToyWorkspace! 1.1s
│
│ ✘ Llm.setToyWorkspace(
│ │ │ name: "workspace"
│ │ │ value: ○ toyWorkspace: ToyWorkspace! 1.1s
│ │ ): Llm! 0.0s
│ ! load: Call: Query has no such field: "toyWorkspace"

shrewd skiff Mar 9, 2025, 6:39 AM

#

ah good catch, will look into it tomorrow!

shrewd skiff Mar 9, 2025, 7:07 AM

#

shrewd skiff this ^ showed a concerning issue though where it treated the container as if it ...

oh interesting - it ran those tools in parallel, that's why it didn't chain

shrewd skiff Mar 9, 2025, 9:10 PM

#

I seem to be getting decent results with gpt-4o after going back to the model of selecting one toolset at a time 👍

#

renaming _load to _selectTools seems to yield even better results, but n=2

#

also thinking of letting _selectTools take a regex of tool names to enable, and having one of the other tools provide a simple list of all the fields (without descriptions). Maybe not necessary, but seems fun to try

shrewd skiff Mar 9, 2025, 9:40 PM

#

│ ┃ 5.0s ◆ LLM Input Tokens: 63,957 ◆ LLM Output Tokens: 132

😱 holy moly. that came from swapping to toyWorkspace?!

🤖 1.5s ◆ LLM Input Tokens: 9,237 ◆ LLM Output Tokens: 31
│ ✔ 🤖💻 _selectTools map[name:ToyWorkspace@xxh3:d0f1e35037b98313] 0.0s
│
│🤖 The  ToyWorkspace@xxh3:d0f1e35037b98313  workspace has several functions that we can explore. Here's a brief overview of its capabilities:
│ ┃
│ ┃ 1. Read: It can read files with a specified path.
│ ┃ 2. Write: It can write to files by providing a path and content.
│ ┃ 3. Build: It can build the code at the current directory in the workspace.
│ ┃
│ ┃ Please let me know if there's anything specific you'd like to do with this workspace, such as reading a specific file, writing data, or building the
│ ┃ code!
│ ┃ 5.9s ◆ LLM Input Tokens: 62,358 ◆ LLM Output Tokens: 121

acoustic radish Mar 9, 2025, 9:44 PM

#

shrewd skiff ``` │ ┃ 5.0s ◆ LLM Input Tokens: 63,957 ◆ LLM Output Tokens: 132 ``` 😱 holy mol...

that's weird. I don't see how!

shrewd skiff Mar 9, 2025, 9:44 PM

#

yeah there's only like 3 tools in that one

#

probably something goofy

#

unless you have incredibly long descriptions

#

@acoustic radish ok if I force-push now that it's back to one toolset at a time?

#

(to llm-multiobj)

acoustic radish Mar 9, 2025, 9:47 PM

#

// "what is in a word?" I pondered. To read - such a crucial, important word. it all started when..
func read()

acoustic radish Mar 9, 2025, 9:48 PM

#

shrewd skiff <@488409085998530571> ok if I force-push now that it's back to one toolset at a ...

yeah I haven't touched that branch today

shrewd skiff Mar 9, 2025, 9:48 PM

#

ok done - modules should work now

#

would recommend rolling with the prompt var <-> shell var <-> llm var alignment for now, it seems to work pretty well, and I dig that it makes the prompt even more explicit about what it's working with (e.g. if a var gets re-assigned)

#

one potential risk is the model losing track of the fact that Directory@xxh3:... is "repo" but seems OK so far. not sure if the model sees the original name for prompt vars or if it's just a text substition

#

here's how I've been testing:

/shell ctr=$(container | from golang)
/shell repo=$(git https://github.com/vito/booklit | head | tree)
use $ctr to build $repo

#

ah I think _load/_selectTools is dumping the entire object into the tool response. will fix

shrewd skiff Mar 9, 2025, 10:37 PM

#

pushed, + some tweaks to tool descriptions that help out. now when it sees "run unit tests against Bass@xxh3:..." it'll jump right to selectTools[Bass@xxh3:...] without having to look anything up

acoustic radish Mar 9, 2025, 11:01 PM

#

will try tonight!

#

can it still set variables?

#

it might get confused juggling too many IDs

shrewd skiff Mar 9, 2025, 11:02 PM

#

i haven't removed it, but there's no llm => shell env syncing yet. haven't seen it mess up IDs yet, but i've only done micro tests. should probably start trying more complicated tasks next

#

also don't think i've ever seen it call _save

#

i'm starting to wonder if we can get away with just one builtin, _load/_selectTools (depending on which name it likes)

#

but, there may be somewhere that a name <-> hash mapping is useful

acoustic radish Mar 9, 2025, 11:07 PM

#

yeah I saw it use save yesterday

#

i think vars will be useful for "returning" values but also as lightweight memory. ie compact the context, might forget IDs but not vars

shrewd skiff Mar 9, 2025, 11:10 PM

#

ah yeah makes sense

shrewd skiff Mar 10, 2025, 5:14 PM

#

@acoustic radish I noticed it tends to do this: Container.withExec(["obviously", "doesnt", "work"]) => "I did it! 😊" (not realizing it's lazy) - I remember there being a sync call somewhere at one point, but can't find it now on either llm or llm-multiobj. Did ya have an idea for that? Maybe we should just always sync against syncable tool call results?

acoustic radish Mar 10, 2025, 5:29 PM

#

shrewd skiff <@488409085998530571> I noticed it tends to do this: `Container.withExec(["obvio...

I saw an explicit call to sync, in bbi/flat. assumed you added it?

shrewd skiff Mar 10, 2025, 5:30 PM

#

grr not sure why my grep didn't find that

#

it was there already, i just fixed it. in the process of adding it for multiobj now

acoustic radish Mar 10, 2025, 5:30 PM

#

@shrewd skiff depends if we can give it a concept of starting/ending a pipeline maybe. but yeah maybe too ambitious for now

shrewd skiff Mar 10, 2025, 5:30 PM

#

i have setting vars working, just all my demos are blowing up cause of that 😛

shrewd skiff Mar 10, 2025, 7:37 PM

#

houston, we have moo-off 🚀 https://asciinema.org/a/MaoDDLTfUY2HgK1G4wthTbpum

asciinema

vito

llm setting vars

Recorded by vito

acoustic radish Mar 10, 2025, 7:43 PM

#

ha ha nice 🙂

#

@shrewd skiff do I have to resolve the variable in the shell? Or can the llm figure out what variables are available and use them?

#

Since it knows how to set variables I guess it knows how to get them too

#

I get why you're making sure expanded vars work. But I would prefer not to have to teach every developer how to do that trick as a pre-requisite to basic prompting

shrewd skiff Mar 10, 2025, 7:44 PM

#

acoustic radish <@108011715077091328> do I *have* to resolve the variable in the shell? Or can t...

nah it figures it out either way https://asciinema.org/a/HmSrmWz5gpwa5x7UbgurgXHrG

asciinema.org

untitled

Recorded by vito

acoustic radish Mar 10, 2025, 7:44 PM

#

nice

shrewd skiff Mar 10, 2025, 7:44 PM

#

i feel like it's more natural to type $foo though

#

less risk of ambiguity in the prompt

#

it just so happens that it expands atm, and the expanded form also works

acoustic radish Mar 10, 2025, 7:45 PM

#

Yeah in cases where you want to designate a specific var, but there may be cases where you don't want to, or can't (because the vars were dynamically set for example)

shrewd skiff Mar 10, 2025, 8:16 PM

#

a few learnings in this one: https://asciinema.org/a/H0O0Yk9tL61oG7dJXvYLKXFWD

the model didn't intuit that "bass" referred to a variable without a symbol prefix
the model understood the expanded $bass (expected)
there's no way to say or escape $bass literally (users prob wouldn't want to do this anyway)
using @bass was just enough of a hint, but might as well just type $bass at that point
it grabbed the entire stdout of the tests which added 30k to the token cost... twice 😬

asciinema.org

untitled

Recorded by vito

#

i'll go ahead and push as-is

acoustic radish Mar 10, 2025, 8:21 PM

#

@shrewd skiff does this feel like it could be merged to llm this week?

#

(trying to feel out the sequence relative to merge)

shrewd skiff Mar 10, 2025, 8:22 PM

#

i think so, the most disruptive thing is probably /with going away. should we keep it?

#

pushed (to llm-multiobj)

acoustic radish Mar 10, 2025, 8:32 PM

#

If the alternative is better I don't have a problem with replacing it

#

Question @shrewd skiff : are slash commands generalizable somehow? How do we explain them vs. dot-builtins?

#

Option 1: they're very different because: (reason that makes sense)
Option 2: they're the same, move everything to slash commands
Option 3: they're the same, move everything to dot builtins

shrewd skiff Mar 10, 2025, 8:38 PM

#

at least at a technical level: dot builtins exist within the confines of valid shell syntax, slash commands don't; they're at a higher level of control, and / has to be the very first character in the prompt

shrewd skiff Mar 10, 2025, 8:39 PM

#

acoustic radish If the alternative is better I don't have a problem with replacing it

i'm not sure - the UX is just different, it might be nice to not have to name something and just say "operate in this context"

acoustic radish Mar 10, 2025, 8:45 PM

#

So would slash commands make sense for anything that is applicable in both modes?

#

that might apply to some (not all) dot builtins

shrewd skiff Mar 10, 2025, 8:48 PM

#

yeah

acoustic radish Mar 10, 2025, 8:52 PM

#

that might actually free up a lot of builtins

#

/login
~~/install~~
~~/uninstall~~

?

#

argh although install/uninstall is linked to filesystem navigation ("install in the current directory)

acoustic radish Mar 10, 2025, 10:12 PM

#

@shrewd skiff testing with latest llm-multiobj, vars dont seem to be accessible

#

https://v3.dagger.cloud/dagger/traces/d2597f736ed9d5646f5df6bea962c696#feda8a174d78796e

Dagger Cloud

Browse and visualize Dagger traces.

acoustic radish Mar 10, 2025, 10:37 PM

#

Oh wait, I might have been using the wrong CLI build

shrewd skiff Mar 10, 2025, 10:39 PM

#

(unrelated:) ah looks like we lost the stdout/stderr propagation, adding that back now

acoustic radish Mar 10, 2025, 10:40 PM

#

do you still want to add "ambient module access"?

#

if so, I have questions 🙂

#

oh wait. token count works with openai now partyparrot

shrewd skiff Mar 10, 2025, 10:44 PM

#

acoustic radish do you still want to add "ambient module access"?

what's this again?

acoustic radish Mar 10, 2025, 10:46 PM

#

(not sure what to call it 😅 ) the idea that /prompt and /shell have access to the same functions - including 1) core API, 2) functions in the current module, and 3) dependencies of the current module

#

cross-posting my multi-obj experiment: #p-agent-smith message

shrewd skiff Mar 10, 2025, 10:57 PM

#

ack, it seems like this method of passing in self doesn't work very well with chaining - note how these all were called against the same initial empty toyWorkspace

acoustic radish Mar 10, 2025, 10:58 PM

#

ah, also lack of stderr pass-through tainted the result:

🧑 summarize what happened and what confused you
│ ┃ 0.0s
│
│🤖 I attempted to build a Docker container using a specific version of a git repository. Here's a summary of what happened:
│ ┃
│ ┃ 1. Container Setup: I initiated a container with build tools like build-base , git , and later added go for building a Go-based CLI in
│ ┃ the project.
│ ┃ 2. Git Operations: I checked out the last stable release ( v0.16.2 ) of the repository.
│ ┃ 3. Build Attempts: I tried several times to build the main CLI using go build , targeting the ./cmd directory which typically houses
│ ┃ command entries. Each attempt resulted in a failure without substantial error details.
│ ┃ 4. Logging Issues: Attempts to capture build errors in log files ( stdout.log and stderr.log ) resulted in errors as the files weren't
│ ┃ created or found in the filesystem.
│ ┃ 5. Diagnosis: I examined directory contents and the go.mod file for clues but remained unable to identify the root cause of build failures.
│ ┃
│ ┃ Confusion: My main confusion arose from:
│ ┃
│ ┃ • The inability to redirect and capture build error logs, which could have been instrumental in diagnosing the issue.
│ ┃ • The absence of detailed error messages from the failed go build commands, leaving the exact reason for failure unclear.
│ ┃
│ ┃ To proceed effectively, insights on specific build steps, environment setup, or access to documentation would be helpful.
│ ┃ 9.6s ◆ LLM Input Tokens: 22,229 ◆ LLM Output Tokens: 280

#

I'm still not convinced we can make this work without "simulating" mutation

shrewd skiff Mar 10, 2025, 10:59 PM

#

acoustic radish ah, also lack of stderr pass-through tainted the result: > 🧑 summarize what ha...

pushed fix for that, it was goofy (needed to use errors.As, not directly cast)

shrewd skiff Mar 10, 2025, 10:59 PM

#

acoustic radish I'm still not convinced we can make this work without "simulating" mutation

yeah this self arg thingy was from back when it loaded all tools at once and didn't depend on switching contexts, lemme see if I can just undo it

shrewd skiff Mar 10, 2025, 11:34 PM

#

pushed the removal of self but getting pretty bleh results https://dagger.cloud/dagger/traces/358ed43bfe65e8889f17577c8acc77aa

acoustic radish Mar 10, 2025, 11:40 PM

#

where is does seem to get stuck?

#

time for some evals? 🙂

shrewd skiff Mar 11, 2025, 3:51 PM

#

acoustic radish time for some evals? 🙂

interested in this but no idea where to start

acoustic radish Mar 11, 2025, 3:53 PM

#

Will give it a try. Do you have a favorite scenario you run against it?

shrewd skiff Mar 11, 2025, 3:58 PM

#

not really, I've just been winging it, it's actually the 'favorite scenarios' that I'm most interested in seeing haha

but here's one I've been using just to get things started:

go=$(container | from golang)
booklit=$(git https://github.com/vito/booklit | head | tree)
/prompt use $go to build $booklit

It's very vague but it's interesting seeing what it does. The general theory is "here's a container with Go, here's a trivially buildable Go project, do your thing" but even more ideally it would build ./cmd/booklit and give me the binary or something (nothing has done that yet, they just do go build ./... which is perfectly understandable given how vague the prompt is)

#

lots of variables could be tweaked, like having it figure out how to install Go itself

acoustic radish Mar 11, 2025, 4:03 PM

#

👀

#

@shrewd skiff I find myself entangling discussions of 1) dagger llm and dagger shell, and 2) multi-object. Should I embrace the entanglement and discuss it all in this thread? Or use a separate thread? (I want to involve @vernal ruin and discuss it live with you guys if possible... As the shell launch deadline looms large)

acoustic radish Mar 11, 2025, 4:05 PM

#

acoustic radish So would slash commands make sense for anything that is applicable in both modes...

What about /model? Only specific to prompt mode...

shrewd skiff Mar 11, 2025, 4:08 PM

#

acoustic radish <@108011715077091328> I find myself entangling discussions of 1) `dagger llm` a...

i think this thread makes the most sense for now, since multi-obj feels like a major dependency of llm/shell merge (since it's what enables the magic var syncing)

shrewd skiff Mar 11, 2025, 4:09 PM

#

acoustic radish What about `/model`? Only specific to prompt mode...

eh, i could see wanting to run this while in shell mode just to get it out of the way, if you're in the middle of shell stuff and know you want to switch. but, if we want to, we can make the slash commands modal

acoustic radish Mar 11, 2025, 4:15 PM

#

I don't mind having /model there, I just want to make sure the overall architecture we have, makes sense to users, and scales nicely when we add more features. Ideally there are core principles that are self-evident and we don't start arguing over them in 6 months 🙂

#

One interesting stress question: if /shell and /prompt are "modes", will there perhaps be more modes in the future? If so, what will be the rule for what belongs in a "mode"? Will they map to the Dagger API, for example 1 mode = 1 type? or something like that?

shrewd skiff Mar 11, 2025, 4:31 PM

#

just to toss another option in the mix, Claude Code supports ! for running a shell command. typing it immediately does a UI cue to show the mode change
https://asciinema.org/a/cUB037ythDOmE8BaqKjUSN2Fw

asciinema.org

untitled

Recorded by vito

#

so it's not really a mode switch in the persistent sense

#

which honestly is kind of nice, swapping back and forth is annoying

#

but it does raise the question of the 'default mode' (prompt, or shell? we're pretty sure about shell, but for claude it's prompt, so they use ! i guess to indicate 'danger, you're running a command'?)

#

if the default is shell, what do we use?

acoustic radish Mar 11, 2025, 4:35 PM

#

it's not too late to change the default 🙂

shrewd skiff Mar 11, 2025, 4:36 PM

#

i'm not proposing that, just saying ! prob won't make sense for a prompt indicator 😛

#

maybe it should match the symbol shown in the prompt, but for us that's a fancy unicode asterisk

#

regular asterisk aint too bad:

go=$(container | from golang)
repo=$(git https://github.com/vito/booklit | head | tree)
* build $repo with $go

shrewd skiff Mar 11, 2025, 4:52 PM

#

gonna try implementing that and see how it feels

#

i'll do both sides, ! to run shell from prompt, * to run prompt from shell

acoustic radish Mar 11, 2025, 4:53 PM

#

I think having to type any character first will kill the magic

#

(could be wrong)

shrewd skiff Mar 11, 2025, 4:54 PM

#

i mean the alternative is typing /shell or /prompt no? and since it's interpreted immediately as a (one-off) mode switch it won't feel like you're literally typing "! foo"`

#

but yea i'll just take a swing at it and we can see how it feels

#

just tryin' things on

acoustic radish Mar 11, 2025, 5:32 PM

#

https://tenor.com/view/math-thinking-zach-galifianakis-formulas-numbers-gif-7715569

Tenor

#

Oh you mean ! is like a hotkey in claude code?

shrewd skiff Mar 11, 2025, 5:41 PM

#

yeah - it doesn't show up in the text input, it swaps the mode immediately for that single input

shrewd skiff Mar 11, 2025, 5:41 PM

#

shrewd skiff just to toss another option in the mix, Claude Code supports `!` for running a s...

here's a demo

acoustic radish Mar 11, 2025, 6:00 PM

#

nice

#

mmmm "bash mode"

#

that's what we should call it too

rich marsh Mar 11, 2025, 6:01 PM

#

https://tenor.com/view/mighty-ducks-fist-bump-hand-shake-hockey-sports-gif-5200420

Tenor

Mighty Ducks

▶ Play video

acoustic radish Mar 11, 2025, 6:05 PM

#

I'm still not sure about slash-commands vs. dot-builtins

#

but I do like the claude hotkey for mode switch, now that I understand it

shrewd skiff Mar 11, 2025, 6:26 PM

#

acoustic radish that's what we should call it too

i think that'd imply running stuff on the host, not running dagger shell (I was confused by that in Claude too despite knowing it's not sandboxed)

acoustic radish Mar 11, 2025, 6:39 PM

#

But "dagger shell" will be the name of the whole thing (including prompt mode & all other modes). Basically it's the name of our repl.

#

It can either be that, or the name of a mode. Confusing if it's both

shrewd skiff Mar 11, 2025, 7:04 PM

#

yeah but if you say "bash" people are gonna expect bash, which our scripting language isn't at all

acoustic radish Mar 11, 2025, 7:08 PM

#

true

shrewd skiff Mar 11, 2025, 7:08 PM

#

also have to keep in mind dagger shell scripts which sort of exist outside of the idea of a repl

shrewd skiff Mar 11, 2025, 7:09 PM

#

acoustic radish It can either be that, or the name of a mode. Confusing if it's both

agree with this though

acoustic radish Mar 11, 2025, 10:26 PM

#

shrewd skiff also have to keep in mind `dagger shell` scripts which sort of exist outside of ...

will we want to allow "prompt mode" in script files also? seems tempting!

shrewd skiff Mar 11, 2025, 11:12 PM

#

acoustic radish will we want to allow "prompt mode" in script files also? seems tempting!

yeah could be a flag hyperthinkspin

#

dagger [shell] --prompt/-p <- in interactive mode, defaults to prompt, or when used in a shebang, interprets whole file as a prompt?

acoustic radish Mar 11, 2025, 11:48 PM

#

what what if I want to go back and forth in a script?

#

Maybe nobody will ever need that - but seems potentially risky to make that bet

#

but then, adding slash-commands to a shell script feels weird too

#

or maybe it's fine?

shrewd skiff Mar 11, 2025, 11:50 PM

#

ok finally got the * key thing working - took a long detour so that it works nicely with history too (going back to a prompt history item swaps out to prompt mode, shell swaps to shell mode, etc)
https://asciinema.org/a/ERQSvuILuIH12RgDvS90H50Lm

asciinema.org

untitled

Recorded by vito

#

TODO: swap the prompt back after you hit enter

acoustic radish Mar 11, 2025, 11:51 PM

#

oh nice! yeah I noticed that "history mixup"

#

by the way another bug log for later: if you start editing a new command while a previous command is running, your line gets wiped when the command completes (or fails?)

shrewd skiff Mar 11, 2025, 11:52 PM

#

ha yea, fixed that too

#

so annoying

acoustic radish Mar 11, 2025, 11:53 PM

#

btw demo today went great

#

(single object)

shrewd skiff Mar 11, 2025, 11:53 PM

#

nice!

acoustic radish Mar 11, 2025, 11:53 PM

#

had agent write curl clone in Java. Then change it to use gradle instead of maven.

#

Got a request: "have you considered using a LLM to explain why my trace fails in Dagger Cloud? Could you do it better than Github Actions in their 'explain error' feature, because you have better data? Maybe even suggest actions to take?"

shrewd skiff Mar 12, 2025, 12:05 AM

#

i have always looked at that button in github and never clicked it lol. have you?

#

makes sense, but sounds tricky (what AI do we use? who pays for it?)

acoustic radish Mar 12, 2025, 12:08 AM

#

Yeah those questions will have answers eventually 😛 But for now just a seed that needs water and time to grow

shrewd skiff Mar 12, 2025, 12:19 AM

#

acoustic radish what what if I want to go back and forth in a script?

maybe this?

.prompt <<EOF
Do a thing.
EOF

i realize this resurfaces builtin vs slash commands, but I think slash commands never make sense in a script file

#

and maybe with this new model we don't need /prompt and /shell (MAYBE)

rich marsh Mar 12, 2025, 12:28 AM

#

acoustic radish Got a request: "have you considered using a LLM to explain why my trace fails in...

see my new video 😛

#

(not uploaded to yt yet)

shrewd skiff Mar 12, 2025, 12:50 AM

#

@acoustic radish pushed if you want to try it out. llm still defaults to prompt mode, so press ! for shell, and if in shell mode press * for prompt
TODO for tomorrow:

switch shell to this new implementation
get rid of llm command
update help text below prompt, current version is next to useless (M-? doesnt even work on Mac), should show !/* key at minimum
add -p/--prompt to dagger shell?

acoustic radish Mar 12, 2025, 1:08 AM

#

shrewd skiff <@488409085998530571> pushed if you want to try it out. `llm` still defaults to ...

nice nice!!

#

will try ~~tomorrow~~ tonight for sure

#

yeah in this model we may not need slash-commands (unless they replace dot-builtins 😛 but that's a stretch)

only one left would be /model?

shrewd skiff Mar 12, 2025, 1:10 AM

#

There's still /compact /clear etc if we want em

#

I think it's a useful and common enough pattern to keep in our back pocket

acoustic radish Mar 12, 2025, 1:43 AM

#

shrewd skiff There's still /compact /clear etc if we want em

ah right. those are cool

shrewd skiff Mar 12, 2025, 6:07 PM

#

update help text below prompt, current version is next to useless (M-? doesnt even work on Mac), should show !/* key at minimum
pushed this part + fixed shell completion

acoustic radish Mar 12, 2025, 7:14 PM

#

@shrewd skiff anything I can to help on the "model UX" part? Ie. getting the LLM to better pick up the multi-object tools?

shrewd skiff Mar 12, 2025, 7:20 PM

#

i think just a lot more tire-kicking - trying out more scenarios, seeing if it gets confused

#

i want to hack on script support to make that easier, like this:

#!/usr/bin/env -S dagger shell --no-mod

go=$(container | from golang)

repo=$(git https://github.com/vito/booklit)

bin=$(.prompt "Build the ./cmd/booklit binary in $repo using $go.")

$bin | terminal

but trying to resolve the merge situation with main first

acoustic radish Mar 12, 2025, 7:31 PM

#

shrewd skiff i want to hack on script support to make that easier, like this: ```sh #!/usr/bi...

btw @vernal ruin added -n I think

#

@shrewd skiff is shykes/dagger@llm-multiobj still the branch to build from?

shrewd skiff Mar 12, 2025, 7:40 PM

#

yep

shrewd skiff Mar 12, 2025, 7:52 PM

#

acoustic radish <@108011715077091328> is `shykes/dagger@llm-multiobj` still the branch to build ...

oops doesn't build atm, pushing fix + llm and main merge soon

acoustic radish Mar 12, 2025, 8:04 PM

#

OK I did a quick poll ( @devout niche @rough kiln @velvet garden ), we explored the question: "should dagger default to prompt or shell mode? And why?".

We reached consensus on:

AI companies want software to be inside their AI. So their UI defaults to prompting.
Dagger wants AI to be inside your software. So our UI defaults to shell.

And I really like that framing, because the UI difference actually reflects a profound philosophical difference.

#

cc @grizzled vine 👆 integration of marketing and UX 🙂

#

shrewd skiff Mar 12, 2025, 8:06 PM

#

makes sense to me shipit

acoustic radish Mar 12, 2025, 8:15 PM

#

Tibor mentioned "* is far on the keyboard, maybe >? Also it's a literal shell" 🙂

#

@shrewd skiff re use of the word "shell".

I am warming up to:

CLI: the entire command-line experience.
shell mode: one way to interact with the CLI
prompt mode: another way to interact with the CLI

acoustic radish Mar 12, 2025, 8:25 PM

#

shrewd skiff oops doesn't build atm, pushing fix + `llm` and `main` merge soon

FYI still doesn't build for me, just checking that it's normal

shrewd skiff Mar 12, 2025, 8:25 PM

#

yep

acoustic radish Mar 12, 2025, 8:26 PM

#

https://tenor.com/view/sales-black-friday-gif-5602611

Tenor

sales

▶ Play video

shrewd skiff Mar 12, 2025, 8:55 PM

#

@acoustic radish ok try now

shrewd skiff Mar 12, 2025, 9:27 PM

#

acoustic radish Tibor mentioned "* is far on the keyboard, maybe `>`? Also it's a literal shell"...

done

acoustic radish Mar 12, 2025, 10:20 PM

#

(sorry was on a call)

#

@shrewd skiff doesn't build for me

#

https://dagger.cloud/dagger/traces/42153a13e663520b5316e166937bbbb9

acoustic radish Mar 12, 2025, 11:04 PM

#

@shrewd skiff assuming you're in the zone, going to try and fix that build error myself (seems unrelated to your code, maybe a main merge side effect?)

acoustic radish Mar 12, 2025, 11:20 PM

#

update: still debugging that random elixir-dev error

#

kind of a PITA

shrewd skiff Mar 13, 2025, 12:19 AM

#

I'm out to dinner atm

acoustic radish Mar 13, 2025, 12:55 AM

#

--> moved to #1349529845436121112 since the issue doesn't seem specific to multiobj

shrewd skiff Mar 13, 2025, 4:31 AM

#

ok now that shell is the default and we have one-off mode switching i think i'm team Remove Slash Commands. anything that was a slash-command can just be a builtin. and to run a builtin from prompt mode, just press ! first: instead of /compact it's !.compact.

also for a persistent mode switch maybe it can accept the switcher twice, so >> to stay in prompt mode, !! to stay in shell mode

acoustic radish Mar 13, 2025, 4:33 AM

#

do you think claude code users might miss the slash commands?

shrewd skiff Mar 13, 2025, 4:48 AM

#

acoustic radish do you think claude code users might miss the slash commands?

hmm i don't have a read on that really, need input from people who use it more. but, it has to be super discoverable for sure. as long as that's the case, and the UX isn't worse, I'd probably just go "oh" and move on.

so, need to try it and see if the ux feels worth it

#

i think this is closer to what we would have ended up with naturally, at least, since we already have a whole language with builtins and default to shell - claude code doesn't have that, and defaults to prompt

shrewd skiff Mar 13, 2025, 6:05 AM

#

pushed: llm command gone, assimilated into shell, slash commands replaced with builtins.
currently working on: .prompt in repl swaps to prompt mode ✅, in a script it runs a prompt and returns the value
TODO: >>, !!, more polish

acoustic radish Mar 13, 2025, 4:42 PM

#

update: until the main build issue from hell is resolved ( #1349559353589497946 ) I a building llm-multiobj with dagger 0.6.3. Building now!

acoustic radish Mar 13, 2025, 5:14 PM

#

@shrewd skiff UX feedback: when I prompt the LLM, and it replies, it switches back to shell mode, so I often find myself typing my reply in the shell mode, getting an error; then have to fully re-type the same message because going back in history "switches" me to the shell mode

shrewd skiff Mar 13, 2025, 5:16 PM

#

acoustic radish <@108011715077091328> UX feedback: when I prompt the LLM, and it replies, it swi...

yeah that's why I want the persistent switching keybind

to fix the previous message, you can go to the beginning and type > after-the-fact

acoustic radish Mar 13, 2025, 5:17 PM

#

Yeah kind of feels like there should be a "regular" keybind (say ctrl-/) that can be triggered at any time (not just when editor is empty) and its only effect is to toggle the input mode (only visual effect is to change the prompt character). Everything else remains the same including my current line buffer.

shrewd skiff Mar 13, 2025, 5:29 PM

#

acoustic radish Yeah kind of feels like there should be a "regular" keybind (say `ctrl-/`) that ...

the >/! prefix also works when the editor has content, you just have to be at position=0 when you type it - maybe try that out? it's the same way Claude Code works and feels intuitive once you know it's a thing (Up-> Home/Ctrl-A -> >)

keybind makes sense too but a) it's especially difficult to find one that works across all platforms in prompt mode, and b) it'll compete with >/! for discoverability - e.g. which do we show in the keymap?

acoustic radish Mar 13, 2025, 5:46 PM

#

If we do a keybind it would be as a replacement for > > !, I wouldn't do both. Will try the "position 0" trick

acoustic radish Mar 13, 2025, 6:32 PM

#

@shrewd skiff OK with stream of consciousness feedback as it comes to me while using? Or do you prefer a thoughtful, digested summary at the end of the day (crazy I know)

shrewd skiff Mar 13, 2025, 6:36 PM

#

acoustic radish <@108011715077091328> OK with stream of consciousness feedback as it comes to me...

all good!

#

lots of stuff is changing so it's good to check in and build muscle memory on things like that sooner

acoustic radish Mar 13, 2025, 6:44 PM

#

@shrewd skiff I'm getting the hang of it now. But would prefer a toggle mechanic rather than having to memorize two different characters I think. I can see how two different characters is more idempotent though, you can mash that key 10 times and know exactly the end state

#

(but current behavior of having to carefully orchestrate the cursor position before pressing those two characters, is NOT idempotent)

shrewd skiff Mar 13, 2025, 7:06 PM

#

acoustic radish (but current behavior of having to carefully orchestrate the cursor position bef...

will think on it, but want to give a bit of time for muscle memory still; the mental model is that there's an invisible symbol at the start of the input, for example pressing backspace at position=0 undoes the mode change too. IME so far it's pretty smooth, so maybe familiarity with the old way (persisten mode switching) is still getting in the way

#

you might be dogfooding it even more than me at this point though

shrewd skiff Mar 13, 2025, 7:48 PM

#

sobblood

shrewd skiff Mar 13, 2025, 8:19 PM

#

we are so back

acoustic radish Mar 13, 2025, 9:00 PM

#

my session is going haywire..

#

https://dagger.cloud/dagger/traces/5ab4f7fdb8f82ba53aebc6bae56eef42#5c9855619d7dd876

shrewd skiff Mar 13, 2025, 9:26 PM

#

probably need newer engine? or new client?

#

guessing that's from Llm -> LLM

acoustic radish Mar 13, 2025, 9:42 PM

#

Ah maybe

#

Trying again

acoustic radish Mar 13, 2025, 10:10 PM

#

shrewd skiff will think on it, but want to give a bit of time for muscle memory still; the me...

backspace undoing the mod change bit me a few times. I tend to compulsively press backspace more than necessary, and it dropping down to shell is disruptive

#

But agree with giving it a little time (just not too much time 😛 )

#

My nickname for this model is "russian doll": there is shell mode, and then inside it there is prompt mode. There is a clear hierarchy

shrewd skiff Mar 13, 2025, 10:12 PM

#

acoustic radish backspace undoing the mod change bit me a few times. I tend to compulsively pres...

ah yeah that's fair

#

i'm a Ctrl+Wer

acoustic radish Mar 13, 2025, 10:20 PM

#

Bug report: when switching to prompt mode, with a 1password secret reference for llm config:

I press >
I get the 1password popup
Almost immediately, this error appears: context deadline exceeded
Pressing > again works

shrewd skiff Mar 13, 2025, 10:21 PM

#

oh fun

#

will fix, it's from the lazy initialization, tricky to work in to the UI loop

#

there's a 1 second timeout there which i thought would never get hit, forgot about password prompts 😛

acoustic radish Mar 13, 2025, 10:24 PM

#

yeah and don't forget - user consent prompts are coming from @knotty imp too

#

😭

#

latest multi-obj CLI & engine 👆

#

goddamn it 😛

#

Then:


✘ base_container=$(container | from alpine) 1.2s
│ ✔ container: Container! 0.0s
│ ○ .from(address: "alpine"): Container! 1.1s
! returned error 422: {"data":null,"errors":[{"message":"Cannot query field \"loadLlmFromID\" on type \"Query\

⋈```

#

So, same error as before @shrewd skiff . I don't think it's a version mismatch issue

shrewd skiff Mar 13, 2025, 10:29 PM

#

oh, yep, missed a spot

#

pushed 😅

acoustic radish Mar 13, 2025, 10:30 PM

#

btw it's a little addictive that I can build + run any version of dagger, bootstrapped from an older version of dagger, with no git checkout or other dependencies, in another command

#

as our build gets gradually faster, this flow becomes more and more like a superpower

shrewd skiff Mar 13, 2025, 10:40 PM

#

@acoustic radish for backwards compatibility I brought back the withFoo/foo setters/getters. worth it?

#

pushed - added the multi-object feature flag but kept its default as true for now since we're still on the multiobj branch

acoustic radish Mar 13, 2025, 10:41 PM

#

shrewd skiff <@488409085998530571> for backwards compatibility I brought back the withFoo/foo...

you mean as a temporary stopgap to make merging + feature flag easier?

#

or a more permanent reversal?

shrewd skiff Mar 13, 2025, 10:42 PM

#

just wondering how concerned we are with keeping existing LLM demos working after merging llm-multiobj ultimately into main

#

without those, they'll have to make up a variable name i suppose

acoustic radish Mar 13, 2025, 10:44 PM

#

I feel like it's OK to break everything that came before merging into main

#

as long as it feels like the best version of the API so far, at the time of merge

shrewd skiff Mar 14, 2025, 12:29 AM

#

acoustic radish as long as it feels like the best version of the API so far, at the time of merg...

i'm not sure tbh - there's something about the single-object API that's still compelling

#

i don't have strong feelings on it yet though

shrewd skiff Mar 14, 2025, 12:49 AM

#

re: mode switching, now that I'm using it, maybe the mode switch should just be persistent

velvet garden Mar 14, 2025, 1:03 AM

#

shrewd skiff re: mode switching, now that I'm using it, maybe the mode switch should just be ...

I got the same feeling by watching @acoustic radish use it

acoustic radish Mar 14, 2025, 1:04 AM

#

Yeah I got it wrong like 10 times in one demo

shrewd skiff Mar 14, 2025, 1:06 AM

#

lol

acoustic radish Mar 14, 2025, 1:07 AM

#

I managed to get it wrong once in the other direction - gave a shell command to the llm, and it actually executed it for me (for a small fee)

shrewd skiff Mar 14, 2025, 1:52 AM

#

pushed persistent modes

#

pushed a fix for var are autocompletion in prompt mode (that's a thing)

shrewd skiff Mar 14, 2025, 4:52 AM

#

pushed some more UI polish

builtin tool calls now look like function calls
actor emoji is now respected on non-message spans
syntax highlighting

shrewd skiff Mar 14, 2025, 1:52 PM

#

could do a custom "tool call" UI if we're worried about overloading the regular function call look

shrewd skiff Mar 14, 2025, 2:55 PM

#

update: I merged into llm, so maybe time to retire this thread

acoustic radish Mar 14, 2025, 3:21 PM

#

ok!

#

thank you 🙏

#

@shrewd skiff what's the feature flag?

#

(if there is one)

shrewd skiff Mar 14, 2025, 3:22 PM

#

acoustic radish <@108011715077091328> what's the feature flag?

I'm not 100% sure we actually need one - currently it's on by default, but I'm thinking of changing it to just turn on once you set a variable. Less fiddly

#

I merged a bit aggressively because I'm tired of conflicts 😛 - going to just keep working off of llm since the plan was already made to work forward from llm-multiobj

#

we might want an explicit opt-in for shell mode though, depends on what we want to demo from here on

#

since setting a variable in shell is obviously going to be common

acoustic radish Mar 14, 2025, 3:28 PM

#

If there's no FF then we need to prepare for breaking agent modules

shrewd skiff Mar 14, 2025, 3:29 PM

#

they shouldn't break - they'll just keep using the withFoo/foo APIs

acoustic radish Mar 14, 2025, 3:35 PM

#

ah so for now both APIs still coexist ?

shrewd skiff Mar 14, 2025, 3:35 PM

#

yep

#

the LLM type is a chonker at the moment, though it probably already crossed that threshold haha

#

i think there's a chance this is actually a desired state anyway

#

since the notion of a "current state" never really went away

#

and not having to name things when you don't need to is nice (for the programmatic API)

#

but, it's up in the air

acoustic radish Mar 14, 2025, 3:39 PM

#

I worry that it will be too much to grok...

#

but it was the right call to leave for now

#

we still have things to add before we start slimming it down:

1- modules
2- access to core API?
3- access to current module's functions and dependencies

no idea how to do 2 and 3

shrewd skiff Mar 14, 2025, 3:41 PM

#

for 2) do you mean conceptually swapping the tools to Query?

#

(maybe without all the loadFooFromIDs)

acoustic radish Mar 14, 2025, 3:44 PM

#

Not sure. Thinking about prompt mode and how to give your "copilot" access to the same environment as the shell (if you want it to)

#

or, maybe we explicitly don't want to

#

same for modules

#

one issue at the moment with passing modules: agent can never call the constructor

acoustic radish Mar 14, 2025, 6:34 PM

#

acoustic radish we still have things to add before we start slimming it down: 1- modules 2- acc...

@shrewd skiff mind if we talk live about this sometime today?

shrewd skiff Mar 14, 2025, 6:34 PM

#

sure

acoustic radish Mar 14, 2025, 6:36 PM

#

was talking to Guillaume and Tibor about how MCP plugs into multiobj, and 🤷‍♂️

knotty imp Mar 14, 2025, 6:45 PM

#

acoustic radish <@108011715077091328> mind if we talk live about this sometime today?

I wanna come too 🙂

acoustic radish Mar 14, 2025, 6:47 PM

#

all are welcome 🙂 on a call now but will propose a time after. maybe tell me what times work for you today?

shrewd skiff Mar 14, 2025, 7:01 PM

#

I'll be good in about an hour - grabbing lunch

shrewd skiff Mar 14, 2025, 7:44 PM

#

@knotty imp @acoustic radish I'm good whenever, no rush, just checking in in case I'm the bottleneck

acoustic radish Mar 14, 2025, 7:47 PM

#

ready in 5mn

knotty imp Mar 14, 2025, 7:48 PM

#

i am ready whenever, backfilling tests is like the most interruptible thing ever lol

rich marsh Mar 14, 2025, 7:49 PM

#

I wish I could thread a thread 😬 what's the substitute for WithPromptVar if I'm converting some code?

shrewd skiff Mar 14, 2025, 7:50 PM

#

rich marsh I wish I could thread a thread 😬 what's the substitute for `WithPromptVar` if I...

SetString

#

forgot about that one - could add it back, since I brought back the withFoo ones, it'd be a drop in the bucket

rich marsh Mar 14, 2025, 7:51 PM

#

no worries, I found the SetFoo too so I'm fully immersing myself 😛

acoustic radish Mar 14, 2025, 7:52 PM

#

I've been thinking about bind as a possible verb.

#

I find myself using the terminology "bind an object to the llm environment" and it seems to work (ie. people understand & like it 🙂

shrewd skiff Mar 14, 2025, 7:54 PM

#

I like it more than set which sounds like a mutation

#

do you have a new one for get? 😛

acoustic radish Mar 14, 2025, 7:55 PM

#

shrewd skiff do you have a new one for `get`? 😛

nope 😛

#

but I have a plan

shrewd skiff Mar 14, 2025, 7:57 PM

#

https://tenor.com/bcbVS.gif

Tenor

#

fwiw I still find with the clearest, aside from the clash with the current non-variable version

#

(which would be a dealbreaker if we're keeping it, unless we make name optional, but that would clutter up code usage)

#

give and take? 😛 (though 'take' sounds like the LLM no longer has it lol)

#

yeet and yoink

acoustic radish Mar 14, 2025, 8:01 PM

#

almost ready

#

(my plan is to ask you guys)

knotty imp Mar 14, 2025, 8:03 PM

#

if set is bind, get could ostensibly be bound

acoustic radish Mar 14, 2025, 8:03 PM

#

shrewd skiff `yeet` and `yoink`

need a new plan

knotty imp Mar 14, 2025, 8:03 PM

#

in the FP bind analogy i don't think there's a named operator to get the value of a bound variable, but when talking about it you do call the thing a bound variable lol

acoustic radish Mar 14, 2025, 8:05 PM

#

ok joining dev-audio!

shrewd skiff Mar 14, 2025, 8:09 PM

#

(cc @knotty imp if you wanted to join)

shrewd skiff Mar 14, 2025, 9:22 PM

#

another option for the pile:

withContainerVar("foo", ctr)
containerVar("foo")

(aligns with withPromptVar but I guess that could also be withStringVar)

acoustic radish Mar 14, 2025, 10:30 PM

#

haha makes me think of withServiceBinding() 🙂

knotty imp Mar 14, 2025, 10:57 PM

#

knotty imp if set is `bind`, get could ostensibly be `bound`

self-bump, i'd like feedback on this not because i think it's necessarily a good idea, but because i wanna understand if i have any clue what's going on in this API design convo XD

shrewd skiff Mar 14, 2025, 11:01 PM

#

knotty imp self-bump, i'd like feedback on this not because i think it's necessarily a good...

Didn't see that, yeah makes sense to me 😛 so like boundContainer

acoustic radish Mar 14, 2025, 11:02 PM

#

https://tenor.com/view/it-was-bound-to-happen-at-some-point-ollie-dixon-gonna-happen-anyway-destined-to-happen-might-as-well-let-it-happen-gif-24962868

Tenor

knotty imp Mar 14, 2025, 11:03 PM

#

usually verb tenses as APIs are a smell, but if bind is the right word it does kinda work

shrewd skiff Mar 14, 2025, 11:23 PM

#

withServiceBinding becomes kind of an awkward inconsistency though (different use of unique word, different tense)

acoustic radish Mar 14, 2025, 11:25 PM

#

Possible pairs:

with<Foo>Binding + get<Foo>Binding
with<Foo>Binding + <Foo>Binding
bindFoo + getFoo
bindFoo + boundFoo

#multi-object 🧵