secrets | Dagger | Page 1

heavy egret Nov 20, 2024, 6:44 PM

#

🧵

#

secrets

#

I've been thinking about your idea that maybe we can get rid of setSecret altogether. I think it's right

#

I also think instead of replacing it with mapSecret, we should put all builtin secrets provider under dag.Host

#

eg.

dag.Host().Secret("op://foo/bar")
dag.Host().Secret("file://foo/bar")
dag.Host().Secret("env://foo")

#

Or perhaps (open for bikeshedding)

dag.Host().OnePassword().Read("op://foo/bar")
dag.Host().HashicorpVault().Read("foo/bar")
dag.Host().File("foo/bar").AsSecret()
dag.Host().Env("foo").AsSecret()

#

Benefits:

No more concerns around caching. The host controls every avenue for populating a secret, so it can ensure it's not cached
No more concerns about sandbox abstraction leak. I don't want a function 5 layers down fetching op://foo/bar in my 1password account.
Still possible to support dynamic secrets: your function just has to write to a file in the local filesystem (making sure it's in tmpfs, to avoid caching) and call dag.Host().File("foo/bar").AsSecret(). This can be done dynamically.

#

cc @rose star

mossy shuttle Nov 20, 2024, 7:09 PM

#

The concern is finding the right migration path, we'll probably have to keep setSecret around for a few releases alongside the new API to give folks a chance to migrate.

Migration depends on the use case for setSecret. I'm trying to figure out which one those are. I can think of:

"Secret Provider Modules" (e.g. Vault module) which can just be replaced by native secret providers
[can't think of anything else yet]

@rose star ?

Also /cc @azure grove: AFAIK setSecret is the main thing preventing function call caching

azure grove Nov 20, 2024, 7:19 PM

#

Yep this is what I said we should do: https://github.com/dagger/dagger/pull/8730#issuecomment-2421378726

Mark's modules have a lot of usage of SetSecret: https://github.com/search?q=repo%3Asagikazarmark%2Fdaggerverse SetSecret&type=code

Worth looking through the use cases there to see the migration path

#

Still possible to support dynamic secrets: your function just has to write to a file in the local filesystem (making sure it's in tmpfs, to avoid caching) and call dag.Host().File("foo/bar").AsSecret(). This can be done dynamically.
I don't think we could do this in any reasonable way any time soon. If the secret is written to a tmpfs, the tmpfs can't be persistently cached (either locally across engine start/stop or especially for remote cache use cases). You'd need to somehow identify that the secret is being accessed from a tmpfs and then go back to the previous operation and say "this operation can never be cached and must always re-run", but that itself is a game of finding which previous op was the one that touched that file most recently, etc. etc.

Never say never but it would be complicated to the point that I'd say it's non viable for now.

#

I think dynamic secrets can probably fit into the secret provider model most likely, especially if we add support for function calls serving as secret providers

mossy shuttle Nov 20, 2024, 7:26 PM

#

mossy shuttle The concern is finding the right migration path, we'll probably have to keep `se...

Quickly skimming through:

Another use case seems to be "Cast To Secret". The plaintext secret is hardcoded, user needs to "cast" it to a dagger.Secret. In this scenario, we could provide an "unsafe caster" that just embeds the plaintext secret in the Secret (since it's not secure in any case). Or maybe a simple plaintext:// provider?

mossy shuttle Nov 20, 2024, 7:29 PM

#

mossy shuttle Quickly skimming through: - Another use case seems to be "Cast To Secret". The ...

e.g.

    // User defaults to "postgres".
    if user == nil {
        user = dag.SetSecret("postgres-default-user", "postgres")
    }

this could be something like plaintext://postgres

azure grove Nov 20, 2024, 7:31 PM

#

Yeah, we would have to actually cache it on disk for it to work and not have the problems of SetSecret, but like you said that's fine since it's just a dummy testing secret anyways. Probably worth a name like "insecure-plaintext" or something

#

just to emphasize that it's not actually secret

mossy shuttle Nov 20, 2024, 7:33 PM

#

Another one is secret "manipulation". e.g. here it looks like he's converting a secret into a different data structure containing the original secret. I don't have a solution for that: https://github.com/sagikazarmark/daggerverse/blob/7fc77a4f8dd54d4d4c56e40517f7109e8585bcdd/registry-config/config.go

azure grove Nov 20, 2024, 7:35 PM

#

The SSH keygen use case there is interesting too, basically using a module to generate a cert or similar

mossy shuttle Nov 20, 2024, 7:35 PM

#

There's another use case that I don't understand how it deals with cache in the first place:

ssh keygen: https://github.com/sagikazarmark/daggerverse/blob/7fc77a4f8dd54d4d4c56e40517f7109e8585bcdd/ssh-keygen/main.go#L58
random secret: https://github.com/sagikazarmark/daggerverse/blob/7fc77a4f8dd54d4d4c56e40517f7109e8585bcdd/svix/utils.go#L26

#

first thing that comes to mind is, for those, he'd probably want to disable function caching yeah? Not even considering the secrets aspect of that

azure grove Nov 20, 2024, 7:37 PM

#

Those use cases are tough when the cert is meant to be cached+persistent (as opposed to throw away and regenerated every time the functions run).

The only thing I can think of is to tell users to persist those certs to disk as normal Files but use encryption, with the password that decrypts being a secret obtained from a provider

rose star Nov 20, 2024, 7:37 PM

#

I'm not clear on this part

No more concerns about sandbox abstraction leak. I don't want a function 5 layers down fetching op://foo/bar in my 1password account.

Would only the original module (the one I dagger call) have access to dag.Host()?

Separate question - with the current thinking would it be possible for me to use a different provider based on the client's configuration? or is the code explicitly pointing at a specific provider?

mossy shuttle Nov 20, 2024, 7:39 PM

#

azure grove Those use cases are tough when the cert is meant to be cached+persistent (as opp...

Yeah ... I'm wondering about the caching aspect because I'm wondering what the use case is

e.g. if it's generating ssh keypairs for throwaway testing purposes (since if they're not persisted, what's the point anyway?) then the plaintext route would work

azure grove Nov 20, 2024, 7:39 PM

#

The sandbox leak problem is already solved by isolating secrets per client and only granting access based on explicit providing of secrets to a client based on function args/returns. The implementation would carry over to this new model by just applying to access to secret providers, so I don't think there's any difference before or after in that respect

azure grove Nov 20, 2024, 7:41 PM

#

mossy shuttle Yeah ... I'm wondering about the caching aspect because I'm wondering what the u...

Yeah exactly, not sure what Mark's exact use case was there. But either way I can imagine someone wanting to create a dagger module for generating certs (it's a super annoying problem once you get to a certain level of complexity, would be nice to modularize) but in such a way that the certs are actually cached. It seems like a legit use case (though that doesn't mean it's priority 0 to support it immediately necessarily)

mossy shuttle Nov 20, 2024, 7:44 PM

#

azure grove Yeah exactly, not sure what Mark's exact use case was there. But either way I ca...

The caching aspect is key in that though

no caching: this module is meant to return a a brand new certificate whenever invoked
caching: I'm using this module for persisting certificates [which is not really a valid use case as caching is not good for persistency?]

#

(e.g. for the latter, your best bet would be to export that file locally? in which case you don't really want caching and are back to the "no caching" use case?)

azure grove Nov 20, 2024, 7:46 PM

#

mossy shuttle The caching aspect is key in that though - no caching: this module is meant to ...

Right, that's true it's not good to rely on caching for this anyways, so we can probably just rule it out for now. If someone wanted to do it, they could probably coerce it to working with the right engine gc config, but that's getting way too esoteric for now.

mossy shuttle Nov 20, 2024, 7:46 PM

#

basically wondering if there's yet another use case for something like an "ephemeral://" secret or something. Only works on non-cached functions or something

azure grove Nov 20, 2024, 7:48 PM

#

mossy shuttle basically wondering if there's yet another use case for something like an "ephem...

Yeah part of function cache control is that you will be able to mark a function as "never cached". So we'd be able to know in the engine if a function call client is making a ephemeral secret from a function call that's cached and error out if so.

mossy shuttle Nov 20, 2024, 7:49 PM

#

e.g. (don't mind the syntax)

"hardcoded secrets use case":

    // User defaults to "postgres".
    if user == nil {
        // user = dag.SetSecret("postgres-default-user", "postgres")
        user = dag.NewSecret("insecure-plaintext://postgres")
    }

"ssh key use case":

        // +no-cache
        // ...
    return &KeyPair{
        // PrivateKey: dag.SetSecret(name, string(pem.EncodeToMemory(sshPrivateKey))),
        PrivateKey: dag.NewSecret("ephemeral://"+string(pem.EncodeToMemory(sshPrivateKey)))
    }, nil

#

(the latter is basically today's implementation)

azure grove Nov 20, 2024, 7:56 PM

#

Oh wait, I just remembered why this is extra hard... It's a pain to explain but it's a real non-obscure use case.

Function A creates an ephemeral secret, it's correctly marked as "never cached"
Function B is a cached function call. It calls out to Function A and gets a return value that contains the secret (either the return is the secret or the secret is embedded in a returned Container as secret env/file, etc.)
Function C is a cached function call. It calls to Function B. Say Function B was cached from a previous run. If Function B returns a value that has the secret in it (either direct or embedded), the secret will not be found.

Basically, you'd need to cascade function call cache invalidation, but only when an ephemeral secret is involved.

There's a zillion variations on the above idea too. It would actually matter in the real world

mossy shuttle Nov 20, 2024, 7:57 PM

#

right

azure grove Nov 20, 2024, 7:58 PM

#

The only world I can imagine that working is one where we've 100% taken over the entire cache logic from buildkit; it just departs completely from the model. And like yeah we probably should do that for many reasons but that's an enormous pre-req to take on for this work 😄

And even in that scenario, it would still be some incredibly gnarly logic. So it may be worth thinking through ways of avoiding ephemeral

mossy shuttle Nov 20, 2024, 7:58 PM

#

and even in that scenario, it would be incredibly confusing for the user

#

even if it works 100% correctly

azure grove Nov 20, 2024, 7:59 PM

#

Yes 100%

mossy shuttle Nov 20, 2024, 7:59 PM

#

"why is my function not cached?" "well, see, 20 layers deep in the stack, someone used this"

azure grove Nov 20, 2024, 8:00 PM

#

The only thing I can think of to avoid it is to support a secret provider that's backed by either:

#

a Function Call ( which would be never cached when used as a secret provider)
a Service (which is already never cached in execution)

#

I think that would create the same end effect as ephemeral but avoid those issues

#

Basically just ways of getting secrets on-demand every time they are needed, but in this case sourced from other dagger-native things (functions and/or services) rather than from the host. But it would all be in the same on-demand model, so to speak

mossy shuttle Nov 20, 2024, 8:03 PM

#

Yeah. Kinda beefy though

azure grove Nov 20, 2024, 8:05 PM

#

It might not be as bad as it seems, especially the Function call approach. I don't think the engine would need a ton of new features, or possibly any. We can already call out to arbitrary functions based on a given call.

#

The dagql call for a secret provider backed by a Function call would just have the metadata on what function to call (which is not much). Then all the plumbing needed to actually dynamically make that call exists today.

#

Services would be harder because you need to decide on some sort of protocol. But function calls avoid that problem entirely

heavy egret Nov 20, 2024, 9:02 PM

#

ok I'm lost on the part about dynamic secrets and implications for caching (will catchup on all the messages above).

But also would love feedback on the other part - moving secret providers to Host()

heavy egret Nov 20, 2024, 9:04 PM

#

azure grove The sandbox leak problem is already solved by isolating secrets per client and o...

The difference is that if a function calls mapSecret("op://Solomon/OpenAI/token") there's only 2 possible results that make sense: 1) giving my credential to random functions or 2) that function gets an error and doesn't work. Both are bad, so IMP the API for asking this should not be available to the function in the first place

azure grove Nov 20, 2024, 9:09 PM

#

heavy egret The difference is that if a function calls `mapSecret("op://Solomon/OpenAI/token...

Oh okay I misunderstood what you were saying, I agree that functions shouldn't be able to make those calls and that putting the API on Host thus makes sense since that's not available to functions already.

I was referring to the fact that when the CLI/shell invoke a function, it will be making those calls to Host and then passing the secret providers to functions as args (which can then pass the secret provider around to other function calls if needed). That's where the pre-existing logic around ensuring functions only have access to secret providers they were explicitly passed will kick in and ensure there's no weird leaks possible.

So we're on the same page there I think.

#

Putting the API on Host SGTM, the wrinkle is figuring out whether to/how to support the ephemeral use case mentioned above. If we go with my suggestion to support that by allowing function calls to serve as secret providers, then there would be an additional way to create a secret that doesn't involve the Host API. Namely, if an object implements a SecretProvider interface (that we'd add as part of this), then you can turn that object into a secret provider too.

#secrets