‘dagger run’ & session security | Dagger | Page 1

junior totem Nov 29, 2022, 7:23 PM

#

In the low level API it would just be a matter of running a service that enables dagger-in-dagger and also forwards a container unix sock <-> host localhost. They can choose to include localdirs and env vars (using the same mechanisms we have for exposing those today).

We could then have a higher level "porcelain" around it in the dagger CLI. Something like:
dagger sandbox localhost:8080 --local-dir ~/src --env MY_TOKEN

which would result in a service being started that has dagger-in-dagger enabled, forwards the dagger api to the host's localhost:8080 and has the provided local dirs and env from the host exposed

#

Honestly, if we want an even shorter route to this we can skip the low-level API parts and the dagger-in-dagger aspect. It would be relatively easy today to add a dagger listen localhost:8080 --local-dir ~/src --env MY_TOKEN. The only difference between the previous version of dagger listen being that now host resources are exposed on an opt-in basis.

junior totem Nov 29, 2022, 7:54 PM

#

@thorny badger What do you think about doing that ^^ instead of relying on a random id in the URL?

Basically, we change dagger run such that host resources are exposed on an opt-in basis w/ --local-dir and --env. Honestly, I'd probably be okay with dagger listen if that's the case too.

Env is easy to selectively expose; local dirs actually are too thanks to our filesync session attachable @balmy oyster added.

#

I think it's doable in the next day if we want to go this route

thorny badger Nov 29, 2022, 8:00 PM

#

@junior totem under the above scenario, what happens if you run dagger run without any local-dirs or env? The engine will be pretty much useless?

junior totem Nov 29, 2022, 8:01 PM

#

thorny badger <@949034677610643507> under the above scenario, what happens if you run `dagger ...

No it just won't have access to host resources, which still covers tons of use cases

#

But then when you want those host resources, you just add a flag

thorny badger Nov 29, 2022, 8:02 PM

#

got it. In that case that SGTM. I'm all in for least privilege principle. Are those features already in main so I can implement them in the CLI?

#

if we decide to go this route, we need to update the docs with these considerations cc @polar palm

junior totem Nov 29, 2022, 8:05 PM

#

thorny badger got it. In that case that SGTM. I'm all in for least privilege principle. Are th...

Yeah the local dir part will just require updating this code to optionally accept a list/map of allowed directories: https://github.com/sipsma/dagger/blob/3642f7fb8f03df3b403652f8454adfebad57bf5a/engine/engine.go#L278-L278

That's pretty much just plumbing though from cli arg -> engine.Config field -> that AnyDirSource.

Env will be pretty much the same, but just plumb the allowed vars to here instead: https://github.com/sipsma/dagger/blob/3642f7fb8f03df3b403652f8454adfebad57bf5a/core/schema/host.go#L59-L59

#

We'll want the SDKs to have the same behavior as before (everything is allowed) but that shouldn't change much

thorny badger Nov 29, 2022, 8:09 PM

#

I'm a bit lost in the dagger-in-dagger part, how would dagger run work in that mode?

#

since it's currently execing the cmd sent as an argument

junior totem Nov 29, 2022, 8:11 PM

#

thorny badger I'm a bit lost in the `dagger-in-dagger` part, how would `dagger run` work in th...

Oh okay, I'm saying we could skip that for now. Makes sense in longer run but not needed atm. I'm talking about the shorter route I mentioned here: #1047230108773142528 message

Basically just continue running engine in the same way but for these CLI commands that expose the API on localhost, make the host resources not exposed by default

#

And just to keep it minimal, we can still drop dagger listen for now. We could just make dagger run work this way and then consider adding dagger listen back with the same mechanism.

I like this approach of selectively exposing the host more than the random URL one because I think there's a higher chance it is something we'd actually want to stick with in the long run

thorny badger Nov 29, 2022, 8:16 PM

#

junior totem And just to keep it minimal, we can still drop `dagger listen` for now. We could...

dagger listen will be hidden (only used for the playground), so no worries about that in the meantime. Going back to dagger run , are you saying to add the --local-dirs and --envnow? or as a near future improvement?

junior totem Nov 29, 2022, 8:16 PM

#

thorny badger `dagger listen` will be hidden (only used for the playground), so no worries abo...

I'm suggesting that we do that right now for dagger run instead of using the random url approach

thorny badger Nov 29, 2022, 8:24 PM

#

junior totem I'm suggesting that we do that right now for `dagger run` instead of using the r...

gotcha. One question about --local-dir though. AFAIK host.directory currently allows referencing any path in the system. IIUC your suggestion is to use this this local-dirs as an "allowlist only" option for dagger run, correct?

junior totem Nov 29, 2022, 8:26 PM

#

thorny badger gotcha. One question about `--local-dir` though. AFAIK `host.directory` currentl...

Yeah, should be fairly easy to implement as we already have a session attachable for localdir access: #1047230108773142528 message

It's just now we would optionally check local dirs against an allowlist

#

And actually, we totally can also do the random url too, it would work perfectly well with this. I forgot that it's really easy to implement the random url because of the fact that we're wrapping the go sdk

thorny badger Nov 29, 2022, 8:29 PM

#

it, perfect. Yeah, the random URL is easy to implement. In that case I'm "mostly" sold with the caveat that we might want to make the CWD of the dagger run command allowed by default. Mostly for DX purposes since having to specifcy --local-dir every time even for CWD, might be annoying. WDYT?

junior totem Nov 29, 2022, 8:31 PM

#

thorny badger it, perfect. Yeah, the random URL is easy to implement. In that case I'm "mostly...

My inclination would be to start with none allowed by default. My overall motivation for wanting these flags is that it results in the least one way doors. Exposing the workdir in the first release would be a one-way door, so by that I'd prefer to avoid it.

#

Of course we can always change it to be exposed by default in subsequent releases, the reverse direction is just much more painful

thorny badger Nov 29, 2022, 8:35 PM

#

I see your point. I guess it's ok to avoid it initially.

So, @random ice TL;DR

Add a --local-dir ([]string) flag to dagger run which will serve as an allow list of local directories which can be used from the engine.
Add a --env ([]string) flag that sets the env variables that you want to pass to the dagger run wrapped command. By default only $DAGGER_SESSION_URL will be present.

#

@junior totem am I missing something?

junior totem Nov 29, 2022, 8:37 PM

#

I'd also add that SDKs will continue to work the same; this "opt-in" exposure of host resources will be exclusive to when the api is being served over tcp.

thorny badger Nov 29, 2022, 8:43 PM

#

junior totem I'd also add that SDKs will continue to work the same; this "opt-in" exposure of...

yes, correct. SDKs are untouched

random ice Nov 29, 2022, 8:52 PM

#

guys let’s talk about this live… this is a big last minute change

thorny badger Nov 29, 2022, 8:52 PM

#

👍

random ice Nov 29, 2022, 8:55 PM

#

I don’t see how we can change dagger run to have no host access by default without at least thinking through the second order implications . For example tools using an official SDK with auto-install will not be sandboxed; but if wrapped with dagger run they will be? How do I figure out which directories to allow access to, trial and error? Pretty big change for end users

#

I like the idea of that feature (possibly with interactive prompt ios-style too?) but requires careful UX design

#

I think somewhere along the way we stepped over from “how to avoid security flaw in exposing the session” to “let’s improve the trust & permission system like we discussed in the past, regardless of how session is exposed”.

#

both are good things to do, but IMO we should either decouple them, or if that’s not possible then push back the launch and get it right

#

One protects from malicious local users, the other from malicious code

thorny badger Nov 29, 2022, 9:00 PM

#

random ice both are good things to do, but IMO we should either decouple them, or if that’s...

I'm in #team-audio if you can/want talk about it

thorny badger Nov 29, 2022, 9:03 PM

#

random ice I think somewhere along the way we stepped over from “how to avoid security flaw...

I agree with this sentiment. OFC I want to make the system more secure, but it feels wierd if the SDK's have one model and the CLI a different one

junior totem Nov 29, 2022, 9:39 PM

#

Summary so I don't forget:

Leave dagger run as is (no flags) but document the security considerations (treat url as secret, especially if you have a long-running process)
Can re-add dagger listen in very near future (not needed by launch though) but with the localdir+env allowlist feature.
- I started working on the localdir part of this, will send out as draft in a bit

thorny badger Nov 29, 2022, 9:52 PM

#

junior totem Summary so I don't forget: 1. Leave dagger run as is (no flags) but document the...

cc @polar palm I can help with the docs

random ice Nov 30, 2022, 12:18 AM

#

@thorny badger quick DX question: what is the most practical for client developers:

A single $DAGGER_SESSION_URL variable?
Two variables $DAGGER_SESSION_PORT and $DAGGER_SESSION_PATH?

Strictly from the perspective of DX convenience: which one is the least work to pass to vanilla graphql clients in various languages?

thorny badger Nov 30, 2022, 12:27 AM

#

random ice <@336241811179962368> quick DX question: what is the most practical for client d...

Generally a URL is better. i.e: https://www.apollographql.com/docs/react/get-started#step-3-initialize-apolloclient

I've just went through some client initializations and they all expect a URL targeting the graphql endpoint which is what we set in our env variable

Apollo Docs

Get started with Apollo Client

random ice Nov 30, 2022, 12:29 AM

#

Ok 👍

green mist Nov 30, 2022, 1:23 AM

#

Hey, I’m sick today, been in bed until now. Catching up

#

@random ice @junior totem @thorny badger To me this feel rushed.

I think treating the URL as a secret is unconventional — no other API does this

I think in a non-Unix world we should have taken the time to design the security model — auth header, basic auth; anything

#

Also: the “DSI” conversation started with the goal of making connection simpler / standardized / documented (e.g. let’s not have different schemes, methods).

With this we’re going to have two different methods (unix and tcp)?

random ice Nov 30, 2022, 1:49 AM

#

No unix method is the idea

#

tcp only

#

I agree it's unconventional but that doesn't make it the wrong option. We are deep in "unconventional" territory no matter what, between session helper, local API endpoint etc. So need to find a pragmatic solution that matches the constraints we have

#

The constraints basically boil down to:

Reduce DX friction as much as possible. Especially important since we're starting with an unfamiliar model, so adding friction only makes it worse
Don't create security flaws that we'll regret later.

Balancing those two things is what makes this design discussion tricky.

#

Things that add DX friction:

Requiring client developers to use a unix socket
Making it hard for client developers to use curl

#

Possible security flaws:

User accidentally exposes a privileged API on the internet
Local session can be used for local privilege escalation, via tcp port scanning

junior totem Nov 30, 2022, 2:06 AM

#

Yeah if we are starting w/ dagger run over tcp then I agree we may as well just leave the SDKs mostly as is (except w/ an update to read the random url from the session bin), the docs we write about security implications of dagger run will just be applicable to use of an SDK too.

#

And if someone complains we can choose between A) switching everything to unix/npipe or B) supporting both protocols

#

Among other options

green mist Nov 30, 2022, 2:20 AM

#

Path as a secret feels wrong. It can leak everywhere in logs, and is generally not considered to be secret

“Making it hard for client developers to use curl” —> Stripe has the same constraint yet they pass the token as a curl flag rather than URL

junior totem Nov 30, 2022, 2:25 AM

#

From my perspective, it's a pretty small lift from secret URL->auth token. The big thing was just tcp vs unix for me.

Only tiny downside I can think of is we'll have two env vars from dagger run: DAGGER_SESSION_URL and DAGGER_SESSION_TOKEN or similar. But I'm personally fine with that in the tcp side of the decision tree.

random ice Nov 30, 2022, 2:43 AM

#

I’m fine with making it an auth header instead if it’s not too much extra friction (I guess auth basic support is ubiquitous)

#

I think leaking the path in the url is a non-issue in the context of an ephemeral session that will last a few seconds. Different story with listen

#

But yeah if we can remove that weirdness without adding DX friction, fine with me

random ice Nov 30, 2022, 2:47 AM

#

green mist Path as a secret feels wrong. It can leak everywhere in logs, and is generally n...

I don’t understand how stripe has the same constraint. Do they also require a helper to run curl?

junior totem Nov 30, 2022, 3:54 AM

#

It's pretty late for Marcos so here's a PR in case we want to get going quicker on that: https://github.com/dagger/dagger/pull/4030

junior totem Nov 30, 2022, 3:07 PM

#

@thorny badger ^^

#

Any thoughts?

thorny badger Nov 30, 2022, 3:16 PM

#

👋 missed this. I'll reply in a sec. Talking to Gerhard and Vikram

junior totem Nov 30, 2022, 3:18 PM

#

thorny badger 👋 missed this. I'll reply in a sec. Talking to Gerhard and Vikram

(Sorry I felt torn between pinging you at you 1:30 AM your time and trying to loop you in asap)

thorny badger Nov 30, 2022, 3:59 PM

#

junior totem Any thoughts?

np! just jumping into this. I don't think we'll release today given the time and the fact that we're still going back and forth with this decisions

random ice Nov 30, 2022, 4:00 PM

#

I commented there

thorny badger Nov 30, 2022, 4:00 PM

#

now, regarding the basic auth, I personally don't think it hurts the UX since all graphql clients have a way to specify auth headers. From the docs and examples perspective, the change is not disruptive also since it's basically s/random path/auth header in the messaging

random ice Nov 30, 2022, 4:02 PM

#

‘dagger run’ & session security

thorny badger Nov 30, 2022, 4:02 PM

#

@random ice from the UX perspective this is quite common amongst amongst graphql clients. Pretty much all "getting started" docs highlight how to send additional headers specifically for auth purposes. So I don't see any red flags there

#

I'd merge Erik's PR and wrap listen and run as they are. listen being hidden

random ice Nov 30, 2022, 4:03 PM

#

Can you reply in the PR for the record? 😁

thorny badger Nov 30, 2022, 4:03 PM

#

sure

junior totem Nov 30, 2022, 4:03 PM

#

thorny badger I'd merge Erik's PR and wrap `listen` and `run` as they are. `listen` being hidd...

I'll rebase it on latest main and take out of draft

random ice Nov 30, 2022, 4:04 PM

#

And yes I agree with all this

#

Just want to make sure the many people involved can see the context later

thorny badger Nov 30, 2022, 4:04 PM

#

thx 🙏 I saw your message in #1037754887720681482

#

@junior totem mind updating the dagger run help example as well to add the header please 🙏

random ice Nov 30, 2022, 4:07 PM

#

does this conflict with the docs main PR?

junior totem Nov 30, 2022, 4:08 PM

#

thorny badger <@949034677610643507> mind updating the `dagger run` help example as well to add...

Will do. Also since playground is the only user of dagger listen, what do you need in order to make that use case work? Simplest possible thing is to print out the session ID alongside the server listening on ... message

thorny badger Nov 30, 2022, 4:08 PM

#

yes, docs need to be updated to highlight this header thing

junior totem Nov 30, 2022, 4:09 PM

#

junior totem Will do. Also since playground is the only user of `dagger listen`, what do you ...

Also, we could just disable this auth from dagger listen for now since you are the only user and the command is hidden

thorny badger Nov 30, 2022, 4:09 PM

#

junior totem Also, we could just disable this auth from dagger listen for now since you are t...

this. Since otherwise it'll be more painful to make the playground work

random ice Nov 30, 2022, 4:09 PM

#

would the clients support embedding the token in the url?

thorny badger Nov 30, 2022, 4:09 PM

#

random ice would the clients support embedding the token in the url?

since it's basic auth you can do it either way, through the URL or the auth header

random ice Nov 30, 2022, 4:10 PM

#

ie http://<TOKEN>:@localhost:<port>

thorny badger Nov 30, 2022, 4:10 PM

#

usually header is better since URL parsing depends what URL parsing logic the client is doing

#

it "generally" works.. but it could break in some cases.

random ice Nov 30, 2022, 4:11 PM

#

ok then separate env var is fine

#

I feel obligated to bring this up since we’re still making changes @junior totem @thorny badger : at this point should we consider biting the bullet and just going to DAGGER_SESSION_PORT now?

#

to avoid a future change for devs, wdyt

#

would remove the confusion about whether it’s local or not; is this ehere I plug a remote engine etc

#

just observing that we actually resolved the DSI bikeshedding as a byproduct of this discussion (I think)

#

so if we wanted, we could just do it now and be done with it

thorny badger Nov 30, 2022, 4:14 PM

#

random ice to avoid a future change for devs, wdyt

but do we know how future will look like? Particularly with the work around unix sockets which AFAIK the UX is still undefined

#

I mean.. we know we want to support TCP, but it's not clear to me how that and UDS will remain playing together

junior totem Nov 30, 2022, 4:15 PM

#

I was under the impression that DAGGER_SESSION_PORT resulted in slightly more work relative to DAGGER_SESSION_URL

random ice Nov 30, 2022, 4:16 PM

#

I think if we have addressed @junior totem ‘s security concerns then we should kill the unix socket transport, simpler to just have one thing

random ice Nov 30, 2022, 4:16 PM

#

junior totem I was under the impression that `DAGGER_SESSION_PORT` resulted in slightly more ...

you mean interpolation in the url?

junior totem Nov 30, 2022, 4:17 PM

#

random ice you mean interpolation in the url?

I mean what we are currently doing (DAGGER_SESSION_URL=localhost:12345)

#

if that's what you mean by interpolation

random ice Nov 30, 2022, 4:18 PM

#

yes 👍 I think that’s fine and is slightly better actually because it reduces opportunities for footgun and confusion

#

“oh this is always local”

#

“ah this is not where I plug a remote engine”

#

I’m still ok to make this change later but observing that we could do it now

junior totem Nov 30, 2022, 4:20 PM

#

random ice I think if we have addressed <@949034677610643507> ‘s security concerns then we ...

Yeah the SDKs will need to do a little more work to obtain the session token from the engine session binary but it's not that bad. The security considerations are the same: not quite as ideal as uds/npipe but good enough w/ proper docs (which we need anyways since we doing this in dagger run).

random ice Nov 30, 2022, 4:21 PM

#

Right. Also when I say “we could do it now” I’m of course referring to the dagger run part, SDKs we should change soon but can be post-launch

junior totem Nov 30, 2022, 4:21 PM

#

thorny badger this. Since otherwise it'll be more painful to make the playground work

updated the PR, just needs an approval https://github.com/dagger/dagger/pull/4030

thorny badger Nov 30, 2022, 4:21 PM

#

🤔 I'm not sure how DAGGER_SESSION_PORT bring us closer to DSI, but I'd prefer to have $DAGGER_SESSION_URL since it's easier to make less mistakes when setting up clients

junior totem Nov 30, 2022, 4:22 PM

#

Yeah I think URL works for now and decent chance it will in the long run too

random ice Nov 30, 2022, 4:22 PM

#

~~ok let’s keep it then~~. @thorny badger the tradeoff is that some users will want to point that URL to something remote and be confused that they can’t - or worse find a way to do it

#

just preparing you for those support conversations 😉

thorny badger Nov 30, 2022, 4:23 PM

#

random ice just preparing you for those support conversations 😉

302 -> docs.dagger.io/dagger-run#local-engine 😛

random ice Nov 30, 2022, 4:24 PM

#

someone will run dagger listen on a public port and manually set the url to that, and complain if it doesn’t work (or will get hacked)

#

actually this reminds me of “docker is insecure” memes

thorny badger Nov 30, 2022, 4:25 PM

#

random ice someone will run dagger listen on a public port and manually set the url to that...

well ... dagger listen "officially" doesn't exist.. so if it's not part of the public API, it's not supported 😛

random ice Nov 30, 2022, 4:25 PM

#

it will exist soon though

#

I think this actually may be a “daggerbleed” scenario

#

https://www.google.com/search?hl=en&q=docker socket insecure

junior totem Nov 30, 2022, 4:26 PM

#

(on that note: PR for configurable sandboxing of localdirs for dagger listen is here when people have time to look: https://github.com/dagger/dagger/pull/4029)

thorny badger Nov 30, 2022, 4:27 PM

#

random ice https://www.google.com/search?hl=en&q=docker%20socket%20insecure