#kubernetes | Dagger | Page 1

proud oyster Jun 29, 2023, 7:00 PM

#

This channel could get interesting, because there are three ways to use Dagger and Kubernetes together:

1. Dagger on Kubernetes. This is the most common, since many CI runners run on Kubernetes these days, and in that case we recommend running the Dagger Engine as a "sidecar" to the CI runner, using a Kubernetes DaemonSet.
2. Kubernetes on Dagger. This is less common, but very cool. You can run an ephemeral Kubernetes (or k3s) cluster for testing purposes, on Dagger itself.
3. kubectl on Dagger. Regardless of where your Dagger Engine is running, sometimes your pipeline may need to deploy to a remote Kubernetes cluster.

This channel is a good place for discussing all of those things 🙂

half canyon Jun 29, 2023, 7:46 PM

#

also Kubernetes in Dagger pipelines! for testing Helm charts, etc
https://discord.com/channels/707636530424053791/1120935751069208699

cerulean mountain Jun 29, 2023, 7:58 PM

#

ref: https://github.com/dagger/dagger/issues/5292

GitHub

How to get kuberentes to work in a Dagger pipeline? · Issue #5292 ·...

What is the issue? Some users in discord (https://discord.com/channels/707636530424053791/1114570469958484008) have requested the ability to test kubernetes pipelines in Dagger. With the help of @v...

orchid blade Jun 30, 2023, 9:47 AM

#

Testing Helm charts is the exact use-case I have right now! 💥

The Helm chart is in the same git repository as the application. We use Traefik as a reverse proxy so the application can provide different UIs according to the target audience. Helm templates and proxies are confusing enough to those familiar with them, now imagine the complexity it adds to front-end engineers who do not have prior experience with the topics. We want to enable them to change and add new routes, and the only way that will happen is if we simplify testing! I have picked Dagger for that, and I have been checking out how much I can use it to achieve the goal.

I need to build the app, send the image somewhere local (preferably), install the Helm chart in a Dagger-provided Kubernetes cluster, and run the Playwright tests against this deployment

ionic hare Jun 30, 2023, 1:26 PM

#

Following @orchid blade thread above, we have a very similar use-case in our builds that I'll be working on next quarter so I'm quite curious to see what discoveries and pitfalls you run into

lavish summit Jul 1, 2023, 9:12 PM

#

@proud oyster hi and thanks for all the work so far! I have a use case for a CI system with limited options for where it can be deployed and run. I'm looking into running Dagger as a Deployment + ReplicaSet exposed via a TCP port. The Dagger pods would run as privileged. Then I would point the SDK to the K8s Service via the experimental flag. Has anyone tried this yet, or is it on the roadmap? Thanks 🙂

proud oyster Jul 1, 2023, 9:16 PM

#

lavish summit @proud oyster hi and thanks for all the work so far! I have a use case f...

Hello! Yes this has been done, but with a daemonset for optimal resource use (one shared engine + local cache per node on the kub cluster). We are happy to help you set this up. We haven’t yet published official resources (tools & documentation) for this scenario, but will soon.

#

(cc @shell turret @low gorge for when they are back from weekend)

lavish summit Jul 1, 2023, 9:19 PM

#

That’s great! What’s the best way to connect? Europe/CEST time zone here. I am planning to solve the storage issue using EFS (AWS’s super fast type of NFS) which is attachable to multiple pods, and run dagger in EKS. Therefore I can have round robin balancing to dagger pods behind a Service without thinking about cache miss or hit; as soon as the build ran and cached something the first time, the centralised EFS mount (mounted as rw on multiple pods) will be able to use it in subsequent builds

proud oyster Jul 1, 2023, 9:36 PM

#

lavish summit That’s great! What’s the best way to connect? Europe/CEST time zone here. I am p...

I think you might run into performance and concurrency issues with that approach to distributing the cache data. I will wait for the infra experts to weigh in.

Context is that we explored this architecture for our own Dagger Cloud service, and decided against it. Instead we are building a managed control plane that can orchestrate storage and distribution of cache data between all your engines and object storage services

#

But it’s possible that I’m misremembering, and both approaches are equally valid. Either way we’re happy to help you in finding the ideal setup

proud oyster Jul 1, 2023, 10:02 PM

#

lavish summit That’s great! What’s the best way to connect? Europe/CEST time zone here. I am p...

Best way to connect is either A) keep talking here 🙂 or B) fill the “request a demo” form on our website, which leads to a zoom call with the team to discuss your use case and figure out the best stack together.

lavish summit Jul 2, 2023, 12:03 AM

#

There was a Dagger Community call on June 1st (https://github.com/dagger/dagger/issues/3140#issuecomment-1552470864). Related to my question above: is there any update on running without privileged mode?

GitHub

🖐️ How to run dagger inside a container ? · Issue #3140 · dagger/d...

Dagger Cloud run URL https://dagger.cloud/runs What happened? What did you expect to happen? There are multiple examples in the docs about running dagger using multiple CI's, but I want to run ...

errant panther Jul 3, 2023, 11:06 AM

#

I did a small POC for our Kubernetes cluster. I deployed the Dagger Engine as a Deployment (only 1 replica, no Service), and then connected from my local machine with _EXPERIMENTAL_DAGGER_RUNNER_HOST=kube-pod://buildkit-podname (prerequisite: set the right kube context beforehand).
But as Solomon already pointed out, I think you will run into concurrency issues if you try to share the cache with RWX volumes.
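
As a rough sketch of the connection step described above, the Go program below prepares an environment in which `_EXPERIMENTAL_DAGGER_RUNNER_HOST` points at an engine pod. The variable name, the `kube-pod://` scheme and the pod name `buildkit-podname` come from the message; the helpers `runnerHost` and `envWithRunnerHost`, and the idea of handing the environment to a child process, are illustrative assumptions. The right kube context still has to be selected beforehand.

```go
package main

import (
	"fmt"
	"os"
	"strings"
)

// runnerHostVar is the experimental variable named in the message above.
const runnerHostVar = "_EXPERIMENTAL_DAGGER_RUNNER_HOST"

// runnerHost builds the kube-pod:// address for an engine pod.
// Hypothetical helper; only the URL scheme comes from the thread.
func runnerHost(podName string) string {
	return "kube-pod://" + podName
}

// envWithRunnerHost returns a copy of env with the runner host set,
// dropping any previous value so a child process sees exactly one entry.
func envWithRunnerHost(env []string, podName string) []string {
	out := make([]string, 0, len(env)+1)
	for _, kv := range env {
		if !strings.HasPrefix(kv, runnerHostVar+"=") {
			out = append(out, kv)
		}
	}
	return append(out, runnerHostVar+"="+runnerHost(podName))
}

func main() {
	env := envWithRunnerHost(os.Environ(), "buildkit-podname")
	// Show the entry a pipeline process would inherit (for example via exec.Cmd.Env).
	fmt.Println(env[len(env)-1])
}
```

The slice returned by `envWithRunnerHost` could be assigned to the `Env` field of an `exec.Cmd` that runs the pipeline, so the setting does not leak into the parent shell.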

low gorge Jul 3, 2023, 6:55 PM

#

This might help: https://www.youtube.com/watch?v=c93_EsedP1s . To browse the code: `docker run -it --entrypoint nvim registry.dagger.io/equinix-demo-day-2023`

As for unprivileged & rootless, it's complex:

YouTube

Equinix Developers

Dagger On Equinix Metal | Demo Day 2023 | Dagger

Watch the full Demo Day! https://www.youtube.com/watch?v=-siv1ga0l_o

In this segment, Fen Aldrich and Gerhard Lazu show us a Dagger Demo on Equinix Metal.

Fen Aldrich, Developer Advocate - Equinix
Gerhard Lazu, Software Engineer - Dagger
Kyle Penfound, Solutions Engineer - Dagger

Read Gerhard Lazu's blog post https://gerhard.io/talk/dagger-on-...

GitHub

✨ Support privileged Exec · Issue #3874 · dagger/dagger

What are you trying to do? The goal is to support setting a Exec privileged so that processes like docker-in-docker can be run. Why is this important to you? Some of our projects are Kubernetes con...

GitHub

fix: Default to a rootless Buildkit container by gerhard · Pull Req...

Instead of using --privileged with the default buildkit container, use a rootless one which makes it safer: https://github.com/moby/buildkit/blob/master/docs/rootless.md#docker
One reason to not re...

cerulean mountain Jul 7, 2023, 12:42 AM

#

lavish summit There was a Dagger Community call on June 1st, (https://github.com/dagger/dagger...

👋 there's some users playing with running Dagger in sysbox which doesn't require privileged containers in case you're interested: https://discord.com/channels/707636530424053791/1121964610493362238

proud oyster Jul 7, 2023, 8:29 PM

#

https://www.youtube.com/watch?v=u1Q6RNaQHTY

YouTube

Dagger

Community Call Demo: Exploring Kubernetes in Dagger

In this demo, Marcos shares his experiences with running Kubernetes in Dagger and explores different possibilities for testing pipelines.

Want to ask the presenter a question about the demo? Join us on the Demo Discord Forum here to discuss this specific demo:
https://discord.com/channels/707636530424053791/1120935751069208699

south thorn Jul 12, 2023, 6:25 PM

#

If you want to use Dagger on Kubernetes, I recommend joining our community call tomorrow to see how with a demo from @glad locust and @low gorge. You can register here: https://dagger-io.zoom.us/webinar/register/9716685521713/WN_USQjVBGXT0SWhNMvqYVvCA

languid badge Jul 13, 2023, 5:14 PM

#

I think I missed the call... were yall talking about hosting buildkit in k8s by chance

#

Trying to sort out if I can do non-privileged dagger engine. Getting some errors related to dns if I run unprivileged

buildkitd: install resolv.conf: remount /etc/resolv.conf to upstream alias: operation not permitted

proud oyster Jul 13, 2023, 5:18 PM

#

@languid badge yes it was about hosting Dagger engine (including its embedded buildkit) on kubernetes. Recording will be up soon, and we're happy to discuss the specifics here!

languid badge Jul 13, 2023, 5:19 PM

#

great timing lol

proud oyster Jul 13, 2023, 5:24 PM

#

there’s a PR in progress by @shell turret

languid badge Jul 13, 2023, 5:28 PM

#

🔥 literally my exact setup. I've got the dagger engine remote though - maybe that's not required...

proud oyster Jul 13, 2023, 5:29 PM

#

Without knowing your specific constraints, I would recommend sticking to the setup as documented as much as possible

#

Remote is possible but it's not the most common, if you hit issues there will be less pooled knowledge

languid badge Jul 13, 2023, 5:30 PM

#

👍 - no constraints exactly, was just thinking to persist cache a little more. I think there's S3 options available as well though if I have individual engines like this right?

low gorge Jul 13, 2023, 5:32 PM

#

Hi @languid badge

There will be a blog post which ties everything together: pull requests, issues (re unprivileged & rootless), previous community call video, Equinix Demo Day 2023 talk, etc. Will drop the link to the blog post here as soon as it goes live. For now, this is a good one to follow: https://github.com/dagger/dagger/pull/5446

FWIW: cc @shell turret

languid badge Jul 13, 2023, 5:33 PM

#

doooope. ty!

south thorn Jul 15, 2023, 3:47 PM

#

If anyone missed the community call this week, you can find the Dagger on Kubernetes demo here! #1129800557801001031 message

rose oak Jul 31, 2023, 3:28 PM

#

I just watched that video and several times they talk about a blog post. I can't find that blog post.

proud oyster Jul 31, 2023, 3:41 PM

#

rose oak I just watched that video and several times they talk about a blog post. I can't...

the blog post is not published yet, because the best practices are still work in progress, and there’s a higher bar for publishing in our official blog or docs than for sharing on a community call or discord. Next step is an article in the docs; at some point later, a blog post about how Dagger uses Dagger in production for our own CI.

rose oak Jul 31, 2023, 3:43 PM

#

The way they talked in the community meeting, it sounded like it was going to be published that day. That is where my confusion comes from.

proud oyster Jul 31, 2023, 3:48 PM

#

Yeah I understand the confusion. We caught the problem late in the process, in other words: we shelved the blog post after it was mentioned in the demo.

As a team we like to ship fast which sometimes comes at the cost of imperfect synchronization (a delicate balance). So in exchange for the occasional confusion, you get a lot more features 🙂

#

Here’s an unfinished documentation PR that still gives a good idea of the general direction: https://github.com/dagger/dagger/pull/5446

GitHub

Dagger on Kubernetes by jlongtine · Pull Request #5446 · dagger/dag...

Signed-off-by: Joel Longtine joel@dagger.io

#

Note that the PR is not merged, so it’s not authoritative: what is documented there may not be exactly what is supported in the future. But it’s directionally correct and comes from our own infra, so it's worth looking at

rose oak Jul 31, 2023, 3:51 PM

#

yeah, this is very GitHub Actions-centric. I am hoping to run dagger in a k8s pod created by argo-workflows.

#

Seems that isn't a setup you have all really looked at yet.

proud oyster Aug 1, 2023, 7:36 AM

#

the kub part, yes. quite common

#

the argo workflow part, not as much

#

but the argo-specific part shouldn’t matter in your case

fathom carbon Aug 3, 2023, 7:43 PM

#

will it be possible to use something like a container registry or cloud bucket for caching?

cerulean mountain Aug 3, 2023, 9:02 PM

#

So, you can use buildkit's default cache exporters (https://github.com/moby/buildkit/#export-cache) with the _EXPERIMENTAL_DAGGER_CACHE_CONFIG env variable, which gets passed directly to buildkit in your runs. However, we know there are gotchas and open issues with using the basic buildkit cache. Because we want something solid that also performs better through enhancements like caching of volumes (e.g. for the pip cache), we've been investing in the Dagger cache service. Happy to show that to you if you're interested 🙂. cc @half canyon

GitHub

GitHub - moby/buildkit: concurrent, cache-efficient, and Dockerfile...

concurrent, cache-efficient, and Dockerfile-agnostic builder toolkit - GitHub - moby/buildkit: concurrent, cache-efficient, and Dockerfile-agnostic builder toolkit
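
For the `_EXPERIMENTAL_DAGGER_CACHE_CONFIG` variable mentioned above, here is a minimal sketch of assembling a value. It assumes the variable accepts the same comma-separated `key=value` form that BuildKit's cache export options use (`type`, `ref`, `mode`); the thread does not spell out the syntax, so check it against your engine version. The registry reference is a placeholder and `cacheConfig` is a hypothetical helper.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
)

// cacheConfig joins BuildKit-style cache attributes into "type=...,k=v" form.
// The type comes first; the remaining keys are sorted for a stable result.
// Assumption: Dagger accepts this form unchanged.
func cacheConfig(cacheType string, attrs map[string]string) string {
	keys := make([]string, 0, len(attrs))
	for k := range attrs {
		keys = append(keys, k)
	}
	sort.Strings(keys)

	parts := []string{"type=" + cacheType}
	for _, k := range keys {
		parts = append(parts, k+"="+attrs[k])
	}
	return strings.Join(parts, ",")
}

func main() {
	// registry.example.com/ci-cache is a placeholder reference.
	value := cacheConfig("registry", map[string]string{
		"ref":  "registry.example.com/ci-cache",
		"mode": "max",
	})
	fmt.Println("_EXPERIMENTAL_DAGGER_CACHE_CONFIG=" + value)
}
```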

fathom carbon Aug 14, 2023, 7:11 AM

#

Daggerfile to generate CUE schemas for Kubernetes API objects

https://github.com/hofstadter-io/cuelm/blob/main/schema/dagger.go

GitHub

cuelm/schema/dagger.go at main · hofstadter-io/cuelm

Pure CUE implementation of Helm Kubernetes package manager - hofstadter-io/cuelm

proud oyster Aug 30, 2023, 6:05 AM

#

@dire oasis I am very curious to learn more about how you’ve leveraged kubernetes for per-job and per-repo resource quotas in your builds. I’m confident we can address your concerns (how to avoid losing those benefits) but I’m sure there are tradeoffs involved, I’d like to understand them better.

#

cc @cerulean mountain I’m moving the party here 😁

dire oasis Aug 30, 2023, 3:19 PM

#

Thanks @proud oyster. I'm trying to digest/understand the information about DAG and how it will help, I still feel apprehensive about Dagger Engine's ability to scale and the reliance on Docker. Everything else though makes me excited about Dagger. Though I think I need to stop asking and start trying Dagger. I want to try and put Dagger through the paces, maybe throw quite a few concurrent builds at it to see how Dagger Engine handles them.

proud oyster Aug 30, 2023, 3:55 PM

#

That sounds good 👍 Some apprehension is appropriate for a relatively young product

#

Note that there is no hard dependency on Docker. It’s only used as a convenience default to bootstrap the engine. In a kubernetes daemonset configuration, no docker needed

dire oasis Sep 5, 2023, 1:37 PM

#

proud oyster Note that there is no hard dependency on Docker. It’s only used as a convenience...

Why is Docker not needed? Or is it just Buildkit needed?

low gorge Sep 5, 2023, 2:15 PM

#

dire oasis Why is Docker not needed? Or is it just Buildkit needed?

Because the Engine has already been provisioned on K8s using the container image.

BuildKit is an internal dependency.

FWIW: https://github.com/dagger/dagger/blob/6ac5cf6a345400f7aea0e7bddc9f304efb5917f4/core/docs/d7yxc-operator_manual.md

Also: https://docs.dagger.io/541047/alternative-runtimes/

dire oasis Sep 5, 2023, 2:19 PM

#

low gorge Because the Engine has been already provisioned on K8s using the container image...

Thanks @low gorge

sacred osprey Sep 16, 2023, 4:52 AM

#

proud oyster Note that there is no hard dependency on Docker. It’s only used as a convenience...

Is there a recommended yaml manifest for a k8s daemonset?

low gorge Sep 16, 2023, 7:32 AM

#

sacred osprey Is there a recommended yaml manifest for a k8s daemonset?

Still WIP, but you will find it in this PR cc @shell turret https://github.com/dagger/dagger/pull/5505

GitHub

Dagger Engine Helm chart by jlongtine · Pull Request #5505 · dagger...

Signed-off-by: Joel Longtine joel@dagger.io

glossy egret Sep 18, 2023, 1:13 PM

#

is there a good way to share cache between engine instances?

#

i see buildkit has a few options for external cache, perhaps could use the registry or s3

#

I see "magicache" mentioned in the Helm PR?

half canyon Sep 18, 2023, 5:17 PM

#

glossy egret is there a good way to share cache between engine instances?

Hey @glossy egret 👋 Indeed there are some buildkit approaches, but yes, we have a cloud caching service (sometimes called "magicache" 🙂 ) that is in early access that makes sharing cache between engines and CI runs super easy and includes extras like the ability to use cache volumes (like for go, node, python deps). We have customers running that setup in k8s.

proud oyster Sep 18, 2023, 5:20 PM

#

@glossy egret we originally hoped that buildkit's cache export features would be enough, but it turns out there are many limitations, some of them fundamental, so we built a distributed cache service (which @half canyon mentioned)

glossy egret Sep 18, 2023, 5:21 PM

#

Cool, didn't know Dagger was doing anything commercial

proud oyster Sep 18, 2023, 5:22 PM

#

It's been mostly under wraps, we have customers in early access but haven't announced anything yet

proud oyster Sep 22, 2023, 4:01 PM

#

@low gorge @daring wraith 👋 for future Dagger+Kubernetes discussions 🙂

sacred osprey Sep 22, 2023, 5:22 PM

#

and @sacred osprey (the old man running the homelab on k3s)

#

Hi @low gorge , I'm installing argocd (used arkade install argocd) and will expose it similar to https://tekton.inlets.tutes.ai and https://gitea.inlets.tutes.ai .

low gorge Sep 22, 2023, 5:58 PM

#

sacred osprey Hi @low gorge , I'm installing argocd (used `arkade install argocd`) ...

OK! Let us know how it goes.

sacred osprey Sep 22, 2023, 6:22 PM

#

Dude! Where's my gitea !?

south thorn Sep 26, 2023, 4:15 AM

#

New Dagger on Kubernetes Guide! https://docs.dagger.io/194031/kubernetes/

Run Dagger on Kubernetes | Dagger

Introduction

sacred osprey Sep 27, 2023, 1:16 PM

#

Future-proofing dagger

amber narwhal Oct 5, 2023, 11:18 AM

#

I'm just gonna leave this idea here: I've been using Garden (https://garden.io/) lately and it's awesome....but Dagger is awesomer and it's capable of most of the things Garden does. It just needs a reusable module....

Garden - The DevOps automation tool for K8s

Accelerate the DevOps workflow. Build, deploy, and test in production-like environments with one platform.

cerulean mountain Oct 11, 2023, 4:54 PM

#

amber narwhal I'm just gonna leave this idea here: I've been using Garden (https://garden.io/)...

indeed. Are you thinking of packaging garden as a Dagger module? or you'd like to see some of the Garden features provided by Dagger out of the box?

amber narwhal Oct 11, 2023, 5:52 PM

#

cerulean mountain indeed. Are you thinking of packaging garden as a Dagger module? or you'd like t...

I’d rather rebuild some of the features in native dagger code

#

Image builds already exist in native Dagger

#

So it might make more sense to rebuild the rest

half canyon Oct 17, 2023, 10:17 AM

#

amber narwhal Image builds already exist in native Dagger

I'd love to know which Garden features 🙂

amber narwhal Oct 17, 2023, 12:23 PM

#

Well, the basic features are build, deploy, test and run.

Build is kinda given, but build also "loads" a container image into the local Kubernetes cache (no idea how they do it).

Deploy is basically a nice wrapper around Helm, Kustomize and all the rest.

The thing that makes Garden stand out is that you can define dependencies between the different steps (which is also given in Dagger).

I'm not very familiar with their testing capabilities, but I'm pretty sure it's also just some high level API around running containers in a cluster.

So I think it's mostly just supporting the different deployment strategies and creating some glue around the different steps to make it easier for people to use. (For example: Garden is configured through YAML, can be organized into modules, etc.)

#

I understand YAML is not really for Dagger, but the nice thing about Garden is that it's easy to use and the API it provides is just enough and super easy at the same time.

I'm not saying we need YAML, but need the simplicity that it provides for Garden.

#

Here is a demo I did with Garden: https://github.com/sagikazarmark/demo-bank-vaults

It was super easy to create and run.

GitHub

GitHub - sagikazarmark/demo-bank-vaults: Demonstrate Bank-Vaults fe...

Demonstrate Bank-Vaults features. Contribute to sagikazarmark/demo-bank-vaults development by creating an account on GitHub.

#

Another project I used Garden: https://github.com/bank-vaults/vault-secrets-webhook

Interestingly, I used Garden here so that I can avoid running the entire CI pipeline which is slow at the moment. Just running a local Kind cluster, deploying and building everything with Garden ended up being super easy.

GitHub

GitHub - bank-vaults/vault-secrets-webhook: A Kubernetes mutating w...

A Kubernetes mutating webhook that makes direct secret injection into Pods possible. - GitHub - bank-vaults/vault-secrets-webhook: A Kubernetes mutating webhook that makes direct secret injection ...

#

Happy to show you how I use Garden from a user perspective @half canyon

amber narwhal Nov 8, 2023, 10:37 PM

#

Here is a module idea: FluxCD image automation, but instead of Flux doing the work from within the cluster, Dagger makes the image reflections in the FluxCD repo (as part of a CI pipeline): https://fluxcd.io/flux/components/image/

Image reflector and automation controllers

The GitOps Toolkit Image Automation Controllers documentation.

#

Looks like it basically needs a wrapper around this: https://github.com/fluxcd/image-automation-controller/blob/main/pkg/update/setters.go#L56

GitHub

image-automation-controller/pkg/update/setters.go at main · fluxcd/...

GitOps Toolkit controller that patches container image tags in Git - fluxcd/image-automation-controller

#

Anybody working on that by any chance?

sharp hedge Nov 8, 2023, 10:47 PM

#

I was looking at https://github.com/stefanprodan/flux-local-dev to convert it into a Dagger-based module.

GitHub

GitHub - stefanprodan/flux-local-dev: Flux local dev environment wi...

Flux local dev environment with Docker and Kubernetes KIND - GitHub - stefanprodan/flux-local-dev: Flux local dev environment with Docker and Kubernetes KIND

#

but agree they have multiple places for dagger to leverage their existing CI

amber narwhal Nov 8, 2023, 10:59 PM

#

I'm not talking about developing Flux itself. I'm talking about their image automation feature that automatically deploys new images based on certain policies. In order to achieve that they continuously poll container registries and then update the gitops repo with the new image.

Instead of that, I want dagger to update the gitops repo with the new image tags automatically after a new image is pushed.

It solves a whole lot of issues with Flux's own automation, namely write access to the gitops repo, multi-tenancy and a bunch of other issues.
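
The core of that idea, rewriting an image tag in a manifest from the gitops repo, can be sketched in a few lines of Go. This is a simplified stand-in and not Flux's marker-based setter implementation: it matches plain `image: <repo>:<tag>` lines with a regular expression. The function name `setImageTag`, the manifest and the image names are all hypothetical.

```go
package main

import (
	"fmt"
	"regexp"
)

// setImageTag rewrites the tag on every "image: <repo>:<tag>" line for repo.
// Other images and anything after the tag on the same line are left untouched.
func setImageTag(manifest, repo, newTag string) string {
	re := regexp.MustCompile(`(?m)^([ \t]*(?:- )?image:[ \t]*` + regexp.QuoteMeta(repo) + `):\S+`)
	return re.ReplaceAllString(manifest, "${1}:"+newTag)
}

func main() {
	// Hypothetical manifest fragment from a gitops repository.
	manifest := "containers:\n" +
		"  - name: app\n" +
		"    image: ghcr.io/acme/app:1.0.0\n" +
		"  - name: proxy\n" +
		"    image: ghcr.io/acme/proxy:2.3.1\n"

	fmt.Print(setImageTag(manifest, "ghcr.io/acme/app", "1.1.0"))
}
```

In a pipeline, a step like this would run right after the image push, followed by a commit to the gitops repo, so no in-cluster controller needs to poll the registry or hold write access to the repo.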

sharp hedge Nov 8, 2023, 11:04 PM

#

oh, that makes sense

sacred osprey Nov 16, 2023, 3:25 AM

#

Build OKD using Dagger?

sacred osprey Nov 17, 2023, 3:20 PM

#

Awesome k8s/nodejs troubleshooting session with @violet hatch and @cerulean mountain ! I think I have a new lab buddy with Noe because he knows nodeJS and I know k8s.

rocky scarab Nov 22, 2023, 11:08 AM

#

Hello everyone,
I have been trying to run dagger inside a container in k8s for two days, without success so far.

level=fatal msg="failed to mount {Type:overlay Source:overlay Target: Options:[index=off lowerdir=/var/lib/containerd/io.containerd.snapshotter.v1.overlayfs/snapshots/310/fs:/var/lib/containerd/[................] on \"/tmp/initialC1941586119\": operation not permitted"
: exit status 1

Kubernetes cluster on EKS version 1.24.
My image is built with ghcr.io/containerd/nerdctl as the base (FROM ghcr.io/containerd/nerdctl), and we have a symlink: ln -s `which nerdctl` /usr/local/bin/docker

the volumes mounted (docker is not needed but we have another cluster on 1.22 that will need docker.sock):

volumes:
  - name: docker-sock
    hostPath:
      path: "/var/run/docker.sock"
  - name: containerd-lib
    hostPath:
      path: "/var/lib/containerd"
  - name: tmp-host
    hostPath:
      path: "/tmp"
  - name: containerd-dir
    hostPath:
      path: "/run/containerd"

security on template:

securityContext:
  runAsUser: 0
  runAsGroup: 1001
  fsGroup: 1001
  fsGroupChangePolicy: "OnRootMismatch"

on container:

  securityContext:
    privileged: true
    capabilities:
      add:
        - ALL
    runAsUser: 0

Everything works fine locally with docker when binding the docker.sock in docker-compose.

I have tried a lot of things. If you have any idea how to make it work, thank you!

sharp hedge Nov 22, 2023, 11:45 AM

#

Running Dagger in K8S

sacred osprey Dec 6, 2023, 4:37 PM

#

#general message

timber solstice Jan 1, 2024, 10:22 AM

#

Has anyone run dagger with EKS fargate?

cerulean mountain Jan 11, 2024, 3:18 PM

#

timber solstice Has anyone run dagger with EKS fargate?

hey! unfortunately it's not currently possible, since Fargate doesn't support privileged containers, which Dagger requires to run at the moment

worldly gate Feb 10, 2024, 7:02 PM

#

Is there a reason the socket must be hostpath mounted? I see one example that doesn't do this. The Tekton example just uses emptyDir? https://docs.dagger.io/213240/tekton Is that example wrong?

    - name: dagger-socket
      emptyDir: {}
    - name: dagger-storage
      emptyDir: {}
...
      volumeMounts:
        - mountPath: /var/run/buildkit
          name: dagger-socket
        - mountPath: /var/lib/dagger
          name: dagger-storage

Use Dagger with Tekton | Dagger

Introduction

low gorge Feb 12, 2024, 11:34 AM

#

Use Dagger with Tekton | Dagger

shadow dragon Mar 1, 2024, 12:48 AM

#

Hey, I'm looking into developer tools for working in k8s. I found quite a few (okteto, devspace, tilt, skaffold, and I may be missing some)... It is kind of tangential to dagger, but if I get it right, most of these tools provide some alternative way to watch resources, then build a Docker image "just in time" and push it to a local k8s cluster with minimum delay.

I was wondering if someone here has used any of these tools and combined it with dagger by any chance? I'm thinking the step of building the Docker image may be better implemented through dagger, but I'm not sure just yet. In particular, I'm not sure if dagger has any support for watching resources... sounds like I may just have to use some golang file watcher and implement it myself.

Thx!

shadow dragon Mar 1, 2024, 3:47 AM

#

Hey, I'm looking into developer tools

shrewd hazel Mar 2, 2024, 9:28 AM

#

I noticed the new dagger docs no longer show a path to using Dagger in k8s. Is this an oversight? Or does the new version no longer work with k8s? (Yeah, I have yet to actually use Dagger and wanted to get started with Dagger in k8s).

proud oyster Mar 2, 2024, 5:15 PM

#

shrewd hazel I noticed the new dagger docs no longer show a path to using Dagger in k8s. Is t...

Yes it's an oversight we're working on fixing. That kubernetes guide is still valid.

#

By the way that guide is for production optimization. On k8s the Dagger CLI will work out of the box, as long as you support DinD

shrewd hazel Mar 2, 2024, 5:57 PM

#

> That kubernetes guide is still valid.
Great to know. I'll work on getting a Dagger Engine and my first workflows going in the next few days.

shrewd hazel Mar 3, 2024, 2:28 PM

#

What would we be missing without using the Dagger Cloud distributed cache?

#

Or asked maybe in a better way, what is cached?

versed holly Mar 4, 2024, 12:04 PM

#

shrewd hazel Or asked maybe in a better way, what is cached?

Hey Scott!!

Caching in Dagger is the main feature that can potentially speed up your pipelines. When you write a Dagger function such as:

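// Echo runs a shell command in an alpine container and returns its stdout.
// It lives in a Dagger Go module, where `dag` is the generated client and
// the file imports "context".
func (m *Stdout) Echo(ctx context.Context) (string, error) {
    return dag.Container().
        From("alpine:latest").
        WithExec([]string{"sh", "-c", `(echo "This is stdout" ; echo "This is stderr" 1>&2)`}).
        Stdout(ctx)
}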

If you run this function twice you will see that the second time is much faster. This is because each layer required to execute this function has already been computed and can thus be re-used. In this case, the image alpine:latest has already been fetched, and the command has already been executed and we know its output. So there is no need to re-execute any step. If I were to modify the contents of the WithExec step, then that step would need to be executed on the next run and then cached.
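
The behaviour described above can be illustrated with a toy model in Go. This is a simplified sketch, not how the Dagger Engine or BuildKit is actually implemented: each step's cache key is a hash of its parent's key plus the operation, so an unchanged prefix is re-used and an edited step (and anything after it) runs again. The `engine` and `step` types are hypothetical.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
)

// step is one pipeline operation, e.g. "from alpine:latest" or an exec command.
type step struct{ op string }

// engine is a toy stand-in for a layer cache.
type engine struct {
	cache    map[string]string // cache key -> stored result
	executed int               // how many steps actually ran
}

// run walks the steps in order. Each key hashes the parent key plus the
// operation, so changing a step invalidates it and everything after it.
func (e *engine) run(steps []step) string {
	parent, result := "", ""
	for _, s := range steps {
		sum := sha256.Sum256([]byte(parent + "\n" + s.op))
		key := hex.EncodeToString(sum[:])
		cached, ok := e.cache[key]
		if !ok {
			e.executed++
			cached = "result of " + s.op // pretend to do the work
			e.cache[key] = cached
		}
		parent, result = key, cached
	}
	return result
}

func main() {
	e := &engine{cache: map[string]string{}}
	pipeline := []step{{"from alpine:latest"}, {"exec echo hello"}}

	e.run(pipeline)
	fmt.Println("steps executed after first run:", e.executed)
	e.run(pipeline)
	fmt.Println("steps executed after second run:", e.executed)

	pipeline[1] = step{"exec echo changed"}
	e.run(pipeline)
	fmt.Println("steps executed after editing the exec:", e.executed)
}
```

The counter stays the same across the second run, and grows by one after the exec step is edited, because the `from` step still hits the cache.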

The other quite relevant aspect of caching is CacheVolume, there is a guide that does a better job at explaining this: https://docs.dagger.io/user-guide/cloud/572923/get-started/#step-4-use-cache-volumes-with-the-experimental-dagger-cloud-cache.

Related to your first question, there is an issue where we are debating "How to scale Dagger in production". Caching is one of the topics when we talk about scaling. There is a comment that @low gorge wrote that does a great job at explaining the different ways of scaling Dagger in prod and how that affects Dagger's caching capabilities: https://github.com/dagger/dagger/issues/6486#issuecomment-1910551524

GitHub

How to scale Dagger in production? · Issue #6486 · dagger/dagger

Problem We haven't conclusively answered the question: "what is the best way to scale Dagger in production?". This is in part because there is a wide variety of requirements and prefe...

Get Started with Dagger Cloud | Dagger

Introduction

shrewd hazel Mar 4, 2024, 2:30 PM

#

So, if I understand correctly, the Dagger Cloud distributed cache is the docker build cache being stored externally from the Dagger engine? And if I understand what Gerhard wrote (nice write-up btw), we could run Dagger engine as a never-dying service and the build cache would be available locally nonetheless? If yes, that's the answer I'm looking for, as we'd want a full time dagger engine (or multiple engines depending on load) running in our clusters. We don't intend to sell the CI service as its own thing, but the process of development will entail Dagger for CI.

Also, it would be a welcome option for us in the future, if we could self-host our own distributed Dagger cache, especially if we find the ephemeral usage of Dagger to be more cost effective. It could be a paid option for sure. 🙂 Thing is, we are purposely avoiding any 3rd party out-of-cluster management solutions. We feel our platform must be 100% independent of 3rd parties. This is, however, not to say that the users of the platform could and in fact should buy into any 3rd party solutions they may need. We just don't want lock-in on our side (whereby, we know anyone buying into our platform is being locked in. It's the nature of a platform. 🙂 ).

low gorge Mar 4, 2024, 3:23 PM

#

shrewd hazel So, if I understand correctly, the Dagger Cloud distributed cache is the docker ...

Thanks!

There is a bit more to the Dagger Cloud Cache. @eternal kraken puts it best in a few minutes here: https://pod.gerhard.io/2#t=17m5s

If you can make the same volume available to Dagger Engine, state from previous operations (a.k.a. cache) can be re-used and builds will be more efficient.

I personally don't bother stopping the Dagger Engine in my K8s setup. Dagger is always ready to service requests, the only wait time is the CI runners (ARC in my case: https://github.com/actions/actions-runner-controller).

If you run the Dagger Engine as a daemonset on a dagger-runner node type, and only schedule CI runners on those nodes, then you don't need to worry about provisioning it. In this scenario, everything stays on your cluster.

Upgrades require some more thought, but nothing too involved.

We are actively working in this area, I expect us to have a bunch of production-related improvements over the coming months. Also relevant to this discussion: https://github.com/dagger/dagger/issues/5583

Other video resources:

https://www.youtube.com/watch?v=sogSICwyg0Y (hot off the press)
https://www.youtube.com/watch?v=c93_EsedP1s (last spring, but still relevant)

shrewd hazel Mar 4, 2024, 4:31 PM

#

Wow! Thanks Gerhard. Are you German, by any chance? 😄
This is all very interesting and in places way over my head, but I'll keep doggy paddling away. 🙂

#

Never mind. Swiss, not German. 🙂

stuck oracle Mar 4, 2024, 8:23 PM

#

Has anyone spent time getting dagger engine configured / spun up and managed by ArgoCD inside of k8s? About to embark down that path and figured I'd ask if there are any demons there compared to the typical setup path for applications managed by argo.

shrewd hazel Mar 5, 2024, 5:52 AM

#

@stuck oracle - I'm also embarking on this path - sort of. Not with ArgoCD at first, but in running the Dagger engine persistently in a k8s cluster. The answers to the questions I posed above, if you didn't happen to read them, will probably interest you too. Especially the point made in Gerhard's post about vertical and horizontal scaling of Dagger engines and the fact that the upgrade process needs to be carefully designed when using the Dagger Engine long-running/ persistently, more than likely needing a blue-green deployment process (that is my recollection of my comprehension of his article. I may have misunderstood or mixed things up).

So, TL;DR and AFAIK, using ArgoCD alone won't completely work for long-running/non-ephemeral Dagger Engines. Well, the auto-updating won't. As I see it, you'd need something like Argo Rollouts on top of ArgoCD too.

#

Anyone with real knowledge, please do correct me, if I am wrong.

proud oyster Mar 5, 2024, 5:59 AM

#

The only thing I will add is that although we support long-running the engine container today, we are discussing moving away from that architecture, because it introduces versioning headaches (CLI and container are tightly coupled).

So it's best to keep that in mind before investing too much custom configuration or tooling.

shrewd hazel Mar 5, 2024, 6:43 AM

#

@proud oyster - If you all decide to not support the long-running engine, then my comment above about desiring a self-hosted distributed cache would be even more significant. 🙂

After reading the stuff given to me above, I don't see the coupling between the CLI and the Engine as a terribly hard to solve challenge, is it? In k8s?

The version of the Engine running depends on the CLI used (so sort of looking at the dependency backwards) and because the pods running the CLI will be much more short-lived, the number of Engine versions that need to run is two at most. The hardest part would be the trigger to allow the newer versions of the CLI to be used in the CI workflow pods (for lack of a better name), indicating the newest Engine is up and running and ready for work. And also knowing when to kill the older version of the Engine. But, actually, I don't see those as big issues either. 🤔 Of course, my experience in this is limited. So, I'd love to hear why it's a more difficult challenge than I can imagine. 🙂

Could it be that supporting the two paths in the Engine development is the harder part? 🤔

Or that k8s usage with long-running Engines avoids the commercial side of your enterprise to a point? 🤔 (Which it doesn't have to be.) 😉

proud oyster Mar 5, 2024, 6:52 AM

#

A few points:

This is unrelated to commercialization strategy. We don't make product design decisions based on monetization.
We plan on supporting self-hosted distributed cache regardless.
The issue of CLI/engine coupling is complicated. One aspect is that CI configuration (where CLI version is managed) and Kubernetes configuration (where long-running container version is managed) have different lifecycles; often they are not even owned by the same people. And there is not always 1-1 mapping between them. An organization may have multiple CI configurations, and multiple kubernetes configurations, and possibly multiple permutations of them. In a simple stack, it's mildly tricky; in a more complex enterprise it's a nightmare.
On top of this, the engine container is simply not designed to be a long-running service. It's not safely multi-tenant; its remote communication protocol is private at the moment (unlike the GraphQL API exposed by the CLI)
Lastly, the operational model is currently split between 2 very different architectures: long-running container (on some kub installations) and CLI-managed (everyone else). Having 2 operational models makes everything more complicated.

Some further reading:

Stateless or stateful drivers? https://github.com/dagger/dagger/issues/5484
Compute Drivers (already mentioned by @low gorge ): https://github.com/dagger/dagger/issues/5583

GitHub

Stateless engine or stateful engine? · Issue #5484 · dagger/dagger

Question I want to start a discussion on an important aspect of Dagger's architecture: whether to make the engine stateful or stateless. This is a complex topic with important ramifications, th...

GitHub

Compute Drivers · Issue #5583 · dagger/dagger

This issue was previously named "engine drivers", but "compute drivers" is proving more clear. Problem The Dagger CLI has a builtin “compute driver”: a software interface which ...

shrewd hazel Mar 5, 2024, 7:26 AM

#

> We plan on supporting self-hosted distributed cache regardless.

Ok. This makes me very happy. 🙂

And in fact, this comment now pushes me to think about what we will want to achieve with Dagger differently. Thanks for that!!!! 🙂

> This is unrelated to commercialization strategy. We don't make product design decisions based on monetization.

Ah, a breath of fresh air! OMG! 🙂 Hard to believe, but awesome if you can continue to achieve it. And this makes me much happier to take on Dagger as a solution and as a partner and to support it once we can. Very nice! 😊

low gorge Mar 5, 2024, 12:49 PM

#

stuck oracle Has anyone spent time getting dagger engine configured / spun up and managed by ...

Yes. That's how I manage my own Dagger Engines. I haven't encountered any demons. Works both as a Helm chart, or plain YAML. FTR:

Let us know how it goes!

low gorge Mar 5, 2024, 1:49 PM

#

shrewd hazel @stuck oracle - I'm also embarking on this path - sort of. Not with Argo...

Actually Argo CD should be sufficient to manage one or more Engines on Kubernetes. I keep my Dagger Engines pinned to a specific minor version, e.g. 0.6.x, 0.9.x, etc. and only apply patch upgrades, which worked fine for the last 12 months.

When a new patch bump goes out, the running Engine will be stopped gracefully, which introduces minimal disruption.

So what are the hard parts of long-running Engines? It all ties back to the pre-provisioned concept, meaning that the CLI is given an existing Engine to work with, and this might cause issues. For example, running Dagger CLI 0.6.x against a 0.9.x Engine will not work.

Compute Drivers (linked to above) is what we are currently exploring as a potential solution to the above problem.

As for the stateful vs stateless, I have to spend more time considering this approach. I have been in the stateful camp for as long as I can remember, so that feels more natural. Same for bare metal, immutable infrastructure and declarative outer shells.

shrewd hazel Mar 5, 2024, 2:57 PM

#

@low gorge - What's the reason for needing the older version along side the newer version?

low gorge Mar 5, 2024, 4:24 PM

#

shrewd hazel @low gorge - What's the reason for needing the older version along si...

In my case, some workflows are still using 0.6.x, while others are on 0.9.x.

From experience, rather than updating an Engine in place, especially as you are bumping minors, it's usually better to do it alongside (like a blue/green). It introduces less disruption for everyone, and if you hit issues, it's easy to back out of the upgrade, rather than being committed to it.

shrewd hazel Mar 5, 2024, 5:01 PM

#

So, the workflow code is also "tied" to the versions?

#

I can see how that could be a real problem....

low gorge Mar 5, 2024, 7:23 PM

#

It's the CLI version, specifically `dagger`. There weren't many breaking changes - a few per year - but to make sure that everything works well together, the recommendation is to always use the same version for both the CLI and the Engine.

For example, we often use different patch versions of the CLI vs the Engine in our own dagger/dagger workflows, but I don't remember us ever using different minor versions. Behaviour is undefined, so keeping the minors in sync is strongly recommended.

proud oyster Mar 5, 2024, 9:44 PM

#

low gorge Actually Argo CD should be sufficient to manage one or more Engines on Kubernete...

I'm also in the "stateful compute" camp, but that doesn't necessarily mean the engine container itself should be stateful. For example I love to ssh+rsync files into a long-running server. sshd is long-running, but the rsync server is ephemeral and short-lived. They work great together.

In this analogy, half of our community is deploying the engine like sshd, and the other half like rsync. Eventually we will need to pick one, and I think the rsync model is a better north star to aim for.

low gorge Mar 6, 2024, 7:41 AM

#

proud oyster I'm also on "stateful compute" camp, but that doesn't mean necessarily the engin...

Got it. I will go over https://github.com/dagger/dagger/issues/5484 and continue this conversation there so that it's easier to reference in the future. 👍

shrewd hazel Mar 6, 2024, 8:28 AM

#

> I'm also on "stateful compute" camp
Me too. But, only because I've been using VMs and bare metal servers in the past. Kubernetes (and indirectly Docker) offers that paradigm shift to more ephemeral/ stateless usage of applications and it is more "cloud-like" in the end. 🙂 The only thing making this difficult is the fact that practically every application has some files or persisted data they work on and need config to run. This all needs to be "hooked up" and that then becomes the complication - the challenge... 🙂

proud oyster Mar 6, 2024, 3:37 PM

#

Yes that is one complication. The other is that Dagger itself is a container orchestrator, although one with very different goals and design from kubernetes. It's possible for Kubernetes to run a container that itself runs more containers, but it can be awkward sometimes.

proud oyster Mar 6, 2024, 3:38 PM

#

low gorge Got it. I will go over https://github.com/dagger/dagger/issues/5484 and continue...

While reflecting on this conversation, you and @daring wraith just gave me an idea 😁

shrewd hazel Mar 9, 2024, 6:59 AM

#

Is there any ETA on the self-hosted Dagger distributed cache mentioned above? 😊 I'm just looking for a ball-park like, "it's actually around the corner" aka a few weeks away or "it's still in planning stage with no ETA yet" aka it will come soon™. 😛

proud oyster Mar 9, 2024, 7:43 AM

#

shrewd hazel Is there any ETA on the self-hosted Dagger distributed cache, [mentioned above](...

still planning stage. but there are workarounds available. It depends what your architectural constraints are for production.

shrewd hazel Mar 9, 2024, 8:03 AM

#

proud oyster still planning stage. but there are workarounds available. It depends what your ...

Thanks for the quick reply. What would you consider to be examples of architectural constraints, say in k8s? This is exactly the open question currently in my mind. I'm asking, because I just watched Kyle's Argo Workflows with Dagger video (again) and read the guide, but I'm uncertain about how to get the "CACHED" results he got, without Dagger Cloud. He didn't mention using Dagger Cloud in the video (which was a miss to sell it 😛 ), and I highly doubt he had a workaround going. But, this scenario of using Argo Workflows to trigger CI runs is where I'm heading. Caching is secondary for now for sure, but it brought me back to remembering what you said about the plans on the self-hosted cache and my question. 🙂

shrewd hazel Mar 9, 2024, 8:41 AM

#

Hmm.. I just watched the video again, and I didn't realize he said "cache persisted within my runs within kubernetes". So, I think I'm still out to lunch about what caches what and how. Sorry. But, any explanation would be greatly appreciated. 🙂

proud oyster Mar 9, 2024, 8:47 AM

#

No worries. I promise it will get better over the next few months. With Functions launched, production readiness is the new top priority.

#

Caching architecture is actually quite simple:

Each engine always has one local cache. It's stored in the engine container's local filesystem.
Optionally, an engine can sync its local cache to a remote storage service. This requires a centralized orchestration service. Dagger Cloud is the only such orchestration service today. It combines orchestration & storage for ease of use (at the expense of flexibility).

So the two main parameters of your production architecture are:

can you use Dagger Cloud, and
how persistent is your dagger engine's local storage?

The more persistent your local storage, the less you need distributed caching (which today means Dagger Cloud). If your local storage is very ephemeral, and you can't or won't use Dagger Cloud for distributed caching, then today your best bet is to find ways to make your local storage less ephemeral.

In the future we will decentralize cache orchestration - removing the need for a centralized service altogether. The engines will have configurable storage drivers for plugging directly into commodity object storage. A decentralized design is more challenging to build, but the reward is that you will have more options to distribute cache, relieving the pressure to make local storage more persistent.

How to make your local storage more persistent depends on your compute architecture, and how much flexibility you have in changing it. Those are the constraints I was referring to earlier.

I hope this helps!

shrewd hazel Mar 9, 2024, 9:51 AM

#

Oh yes. Very much. Thank you so much for taking the time.

Correct me if I am wrong, Kyle mentioned setting up a volume for Dagger. I thought this was only for the gomod cache, but it is for all caching used by Dagger? 🤔

If the volume is not for "all caching" (which I don't think it is), what path would I need to create a volume on to get a persisted cache for the Dagger Engine? If I know that, I think I might have what I need for an ephemeral setup, which might be a naive take on all this, but I also think would be an awesome start. 🙂

On a side note: I rarely get super enthusiastic about any OSS project I tackle. Some, the rare but very important ones, I've gotten very close to and have become a sort of an evangelist and some even a member of the team. Dagger is a project that is giving me this very good vibe and feeling of wanting to give back as soon as I can. Not sure how I will do that or when, but I thought I'd mention it (and I hope it doesn't come off as blowing smoke up your you-know-what 🙂 It's heartfelt and true. 😊 ).

stuck oracle Mar 10, 2024, 5:30 PM

#

When running the dagger engine in K8s, how would I access environment variables established through helm? I.e. I want to set up various secrets for our functions to leverage when they run. It's not clear to me whether that's possible, or whether all values need to be passed in.

proud oyster Mar 10, 2024, 7:42 PM

#

Yes when calling a Dagger Function you need to pass everything explicitly as arguments. There is a native secret type which you can use to securely pass those secrets.

#

This works exactly the same regardless of where your dagger engine runs, by design. It ensures your functions are as portable and reproducible as possible.
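
A rough sketch of what that can look like for a CI pod in the cluster, not an official recipe. It assumes a Kubernetes Secret named `ci-secrets`, a container image that already ships the dagger CLI, and a hypothetical Dagger Function `deploy` taking a `token` argument of the secret type. The `env:` prefix for handing a secret over from an environment variable appears to be how the CLI accepts secrets as of 0.10, but treat that as an assumption and check the docs for your version:

```yaml
# Sketch only: Secret, image, and function names are hypothetical.
containers:
  - name: ci
    image: registry.example.com/ci-with-dagger-cli:latest
    env:
      # Kubernetes injects the value into the CLI's environment...
      - name: API_TOKEN
        valueFrom:
          secretKeyRef:
            name: ci-secrets
            key: api-token
    # ...and the CLI passes it on explicitly as a secret argument.
    command: ["dagger", "call", "deploy", "--token=env:API_TOKEN"]
```

The function itself never reads the pod environment; it only sees the argument, which is what keeps it portable across engines.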

stuck oracle Mar 11, 2024, 12:30 AM

#

I'm going to start the work over here to get dagger running in my K8s cluster. I need to set up a new node pool in GKE and I know that Dagger wants local SSDs to perform well. There are some docs on Google about creating node pools with "local SSDs with block storage", and I assume this is the route that I want to be going. https://cloud.google.com/kubernetes-engine/docs/how-to/persistent-volumes/local-ssd-raw#node-pool

I wanted to reach out and double check that this is the right route to go down. Assuming I can get this node pool spun up properly, I was curious about how I should configure Dagger through helm to leverage these SSDs properly for its cache. Does anyone have any example helm chart setups that might be a good guide for me to at least reference in getting that set up properly?

Google Cloud

Provision and use Local SSD-backed raw block storage | Google Kub...

This page explains how to provision Local SSD storage on clusters and provides examples of how workloads can consume data from Local SSD-backed raw block storage.

low gorge Mar 11, 2024, 12:27 PM

#

stuck oracle Im going to start the work over here to get dagger running in my K8s cluster. I ...

As long as you make those volumes available on the node as /var/lib/dagger - XFS works very well - then Dagger will use that for its state.

This is the simplest user data script that will do the trick:

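# Format the local NVMe disk as XFS (this wipes any data on the device)
sudo mkfs -t xfs /dev/nvme1n1
# Create the mount point; -p makes this safe to re-run
sudo mkdir -p /var/lib/dagger
sudo mount /dev/nvme1n1 /var/lib/dagger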

low gorge Mar 11, 2024, 12:35 PM

#

stuck oracle Im going to start the work over here to get dagger running in my K8s cluster. I ...

I had a look at that documentation, and adapted this - didn't test:

apiVersion: v1
kind: PersistentVolume
metadata:
  name: "dagger-engine-pv"
spec:
  capacity:
    storage: 375Gi
  accessModes:
  - "ReadWriteOnce"
  persistentVolumeReclaimPolicy: "Retain"
  storageClassName: "local-storage"
  local:
    path: "/var/lib/dagger"
  nodeAffinity:
    required:
      nodeSelectorTerms:
      - matchExpressions:
        - key: "kubernetes.io/hostname"
          operator: "In"
          values:
          - "gke-test-cluster-default-pool-926ddf80-f166"

#

Not sure if a filesystem is created automatically.

shrewd hazel Mar 11, 2024, 12:46 PM

#

@low gorge - could it be you answered my question too above?

> what path would I need to create a volume on to get a persisted cache for the Dagger Engine?

So `/var/lib/dagger`?

low gorge Mar 11, 2024, 2:30 PM

#

shrewd hazel @low gorge - could it be you answered my question too above? > what p...

Sorry, missed your question. If you are using the Helm chart to install, then yes, that's where the Dagger Engine stores its state.

shrewd hazel Mar 11, 2024, 3:34 PM

#

low gorge Sorry, missed your question. If you are using the Helm chart to install, then ye...

Hmmm... I'm looking to go the ephemeral route... (I think).

#

But, hoping to keep hooking up the volume (by making sure the engine containers are started on the same nodes)

#

How are you triggering the workflows to start?

proud oyster Mar 11, 2024, 4:17 PM

#

@shrewd hazel to be clear you cannot share the same state directory between engines.

shrewd hazel Mar 11, 2024, 5:03 PM

#

Ok. So, back to stateful. This is the understanding that is missing because Dagger is like a black-box in my mind. 😊
So, if I create a daemon set with node affinity, I can run say two engines on two nodes and have a local volume attached and all is good in terms of cache (hopefully).

Then each CLI instance would need to be "guided" to one engine or the other. If the same workflows are called over and over, they should be guided to the same engines. By guided, I mean setting _EXPERIMENTAL_DAGGER_RUNNER_HOST. Am I getting closer to a correct understanding? 😛

proud oyster Mar 11, 2024, 5:32 PM

#

shrewd hazel Ok. So, back to stateful. This is the understanding that is missing because Dagg...

Yes that's right 👍

#

Note that using a daemonset vs. regular deployment is up to you, it boils down to the kind of persistence you want. We document daemonset + tight coupling to a CI runner configuration, because it's a reasonable default for the most common use of dagger-on-kub.

But there's nothing magical about dagger that somehow requires a daemonset no matter what.

#

Generally we're shifting towards decoupling your dagger compute architecture from CI compute architecture when you can. Keeps your options open.

low gorge Mar 11, 2024, 8:05 PM

#

shrewd hazel Ok. So, back to stateful. This is the understanding that is missing because Dagg...

The simplest & most reliable setup that I am aware of - and is available today - is 1 Dagger Engine per K8s node with a dedicated local disk.

Store the Dagger Engine unix socket on hostPath, and mount it into any pod that needs it. Not putting unnecessary pressure on the container network stack will help.

New CI runners will spin up around the Engine, mount the Dagger Engine unix socket, and get to work. I wouldn't worry too much about jobs picking a specific Engine. As jobs run, cache will spread naturally across all Engines. If one K8s node becomes unavailable for any reason, you should have at least one more to run the workloads.

As you become more comfortable with this setup, you will add new K8s clusters, and then you will have multiple redundancies.

For KubeCon EU, I am preparing a talk which covers this very setup. Dagger Engines are spread across UK, France, Germany & Poland, one per K8s node. GitHub runners spin around them on demand. This is what that looks like:

If you're up for it, I'm happy to talk more after KubeCon, and even set up a live pairing session. Perhaps others will be interested too.

shrewd hazel Mar 11, 2024, 8:08 PM

#

@low gorge - Thanks so much. Ok. Let me run with this understanding now. I'm getting closer... 🙂

proud oyster Mar 11, 2024, 8:13 PM

#

One best practice in this setup (tell me if you agree @low gorge) is to configure client pods at the infra level, rather than leak the config into the app/CI logic, i.e. make sure to set _EXPERIMENTAL_DAGGER_RUNNER_HOST in the kub config running your CI job, rather than in the CI configuration itself. That way the CI configuration remains portable.

low gorge Mar 11, 2024, 8:14 PM

#

Our Helm chart will get you 50% there - specifically https://github.com/dagger/dagger/blob/0188846a62af20c66d59fa82bcb7f72a24c42dac/helm/dagger/templates/engine-daemonset.yaml#L83-L89

low gorge Mar 11, 2024, 8:14 PM

#

proud oyster One best practice in this setup (tell me if you agree @low gorge) is ...

Definitely!

low gorge Mar 11, 2024, 8:23 PM

#

shrewd hazel @low gorge - Thanks so much. Ok. Let me run with this understanding n...

If you want to look at some code - it goes back to June 2023 & Dagger v0.6 - I have it all in a container image (including the slides!). This is how you can access:

shrewd hazel Mar 11, 2024, 8:26 PM

#

Much appreciated @low gorge. I'll have a look tomorrow. 🙂

#

@low gorge - You noted you're using Github runners. We don't wish to be dependent on Github to trigger CI workflows. Our goal is to allow any git merge that can send off a webhook to be the CI trigger. Do you see any issues with that direction?

proud oyster Mar 11, 2024, 8:33 PM

#

that sounds even better to me 😁

#

how do you plan on processing the webhooks?

#

I'll let @low gorge answer on operational caveats today (I can't think of any). But in terms of where we want to go: the goal is full self-hosting for your dagger-based workflows. Meaning that the event handlers that call dagger functions (for example a webhook server, or ci runner) should themselves be runnable as dagger functions.

#

Today: run Github Actions or any other event trigger service alongside the dagger engine
Tomorrow: run event trigger service on (not alongside) the dagger engine.

shrewd hazel Mar 11, 2024, 8:46 PM

#

> how do you plan on processing the webhooks?
We'll be using Argo Workflows. 🙂

#

Which is probably overkill. Theoretically, we could just simply run our own webhook servers to kick off Dagger functions. 🙂

proud oyster Mar 11, 2024, 8:51 PM

#

shrewd hazel Which is probably overkill. Theoretically, we could just simply run our own webh...

Whether it's overkill depends on your constraints. The important thing is that we will support you whether you call Dagger Functions from GHA, Argo Workflows, a custom webhooks server, or anything else.

And in every case, we will make it easier and easier to self-host the whole thing on dagger.

candid verge Mar 11, 2024, 11:04 PM

#

I have a question about this setup. I set it up at the end of 2023, but I ran into an issue with Dagger Cloud cache sync: after the dagger run, the cache was not synced to Dagger Cloud.
Support told me I had to stop the engine as a workaround for this issue.
So in the end I had to:

migrate my daemonset to a sidecar docker engine
start the dagger engine in my pod calling the dagger cli
afterwards, stop the dagger container with a timeout to leave some time for the cache to sync.

Is this issue fixed, or is it still a limitation?

shrewd hazel Mar 14, 2024, 7:09 AM

#

I think I found a very minor issue with the helm chart, in particular with the `values.yaml` file. The node tolerations and affinity entries, which are commented out, need to be at the level of engine. If someone less in the know (like me) just uncomments them where they are and assumes the entries should be under image, then tolerations and/or affinity won't work. Should I put in a PR to fix it? Or just an issue?

GitHub

dagger/helm/dagger/values.yaml at 8c6752d513a58d9d717f695886831afe5...

Application Delivery as Code that Runs Anywhere. Contribute to dagger/dagger development by creating an account on GitHub.

shrewd hazel Mar 14, 2024, 9:07 AM

#

Btw, I have successfully launched my first Dagger engines. 😊

shrewd hazel Mar 14, 2024, 10:26 AM

#

The k8s guide shows connection to the dagger engine via a local machine with kubectl connectivity. Is there another way to get communication going with the engine pods, but internal to the cluster?

Gerhard noted this above:

> Store the Dagger Engine unix socket on hostPath, and mount it into any pod that needs it. Not putting unnecessary pressure on the container network stack will help.

But my k8s-fu isn't the greatest here. Looking at the Argo Workflows guide, I'd need something like this env to add to the engine pod.

env:
- name: "_EXPERIMENTAL_DAGGER_RUNNER_HOST"
  value: "unix:///var/run/dagger/buildkitd.sock"

Will that do it? And if yes, how to get it into the helm chart values?

versed holly Mar 14, 2024, 12:09 PM

#

shrewd hazel The k8s guide shows connection to the dagger engine via a local machine with kub...

Correct. You have to share the volumes between the dagger engine and the container pods so that the socket is available. For example, in our case we are setting up github runners that will communicate to the engine pod. In the runner pod spec we have:
Volumes:

volumes:
- name: varrundagger
  hostPath:
    path: /var/run/dagger

Volume mounts:

volumeMounts:
- name: varrundagger
  mountPath: /var/run/buildkit

And finally, like you share above, the env variable:

env:
- name: _EXPERIMENTAL_DAGGER_RUNNER_HOST
  value: unix:///var/run/buildkit/buildkitd.sock
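
Put together, the three fragments above could sit in a single runner pod like this. A sketch only: the pod name and image are placeholders, and it assumes the engine on the same node exposes its socket under /var/run/dagger on the host:

```yaml
# Sketch only: pod name and image are hypothetical.
apiVersion: v1
kind: Pod
metadata:
  name: ci-runner
spec:
  containers:
    - name: runner
      image: registry.example.com/ci-runner:latest
      env:
        # Point the dagger CLI at the engine socket mounted below.
        - name: _EXPERIMENTAL_DAGGER_RUNNER_HOST
          value: unix:///var/run/buildkit/buildkitd.sock
      volumeMounts:
        - name: varrundagger
          mountPath: /var/run/buildkit
  volumes:
    # Host directory where the engine on this node keeps its socket.
    - name: varrundagger
      hostPath:
        path: /var/run/dagger
```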

versed holly Mar 14, 2024, 12:09 PM

#

shrewd hazel I think I found a very minor issue with the helm chart, in particular with the `...

Great find! If you want to send a PR in that would be great. Leaving this here for reference: https://github.com/dagger/dagger/blob/main/CONTRIBUTING.md

GitHub

dagger/CONTRIBUTING.md at main · dagger/dagger

Application Delivery as Code that Runs Anywhere. Contribute to dagger/dagger development by creating an account on GitHub.

stuck oracle Mar 15, 2024, 7:09 PM

#

Maybe I've missed this along the way, but where do y'all host your charts? I.e. I'm working on my helm setup right now and need to know the dagger helm chart name / repo / version. Would also love to see what can be configured with values files.

stuck oracle Mar 15, 2024, 8:57 PM

#

Hmmm. Getting dagger spun up in a cluster today. I have Dagger running, but I'm not sure what I'm doing wrong here on the node selection to get the daemonset to only attach to a certain node pool.

  affinity:
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
          - matchExpressions:
              - key: pool
                operator: In
                values:
                  - ci-runners
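
One thing that may be worth checking, going by the values.yaml note earlier in this channel: the block might need to sit under `engine` in the values file. Also, GKE labels nodes with `cloud.google.com/gke-nodepool` by default, so a plain `pool` key only matches if that label was added to the node pool by hand. An untested sketch, assuming the chart reads `engine.affinity` and the pool is named ci-runners:

```yaml
# Sketch only: assumes the chart reads affinity from under `engine`.
engine:
  affinity:
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
          - matchExpressions:
              # Default node pool label on GKE; the value is the pool name.
              - key: cloud.google.com/gke-nodepool
                operator: In
                values:
                  - ci-runners
```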

stuck oracle Mar 15, 2024, 11:22 PM

#

For what it's worth, I spent the whole of today working on getting Dagger running inside of a K8s cluster and ran into a couple of roadblocks. The big one was that I couldn't get the dagger CLI running in a self-hosted gitlab runner, either through the typical way outlined in the docs or by attempting to publish my own docker image which already had it installed. I kept running into this issue with the runner (https://gitlab.com/gitlab-org/charts/gitlab-runner/-/issues/477).

I also opened up this issue on the Dagger github about getting an official CLI docker image. (https://github.com/dagger/dagger/issues/6887)

GitHub

✨ Official Dockage image for the CLI? · Issue #6887 · dagger/dagger

What are you trying to do? The dagger docs for gitlab / github / etc have each job installing curl followed by installing the dagger CLI to use for a given CI run. It would be much nicer DX, and pr...

shrewd hazel Mar 16, 2024, 9:28 AM

#

versed holly Great find! If you want to send a PR in that would be great. Leaving this here f...

Done. https://github.com/dagger/dagger/pull/6892

GitHub

Update values.yaml fixes #6891 by smolinari · Pull Request #6892 · ...

Fixes #6891
Scott

shrewd hazel Mar 16, 2024, 9:53 AM

#

The output in the terminal via my Coder pod looks a bit off, but the dagger engine daemons + cli in another "coder pod" is working. Yay! 😄

shrewd hazel Mar 16, 2024, 10:11 AM

#

#

Seems all the dagger output doesn't get shown?

stuck oracle Mar 18, 2024, 5:15 PM

#

On to the next step of getting Dagger running properly in K8s. Do y'all have any examples of how to follow https://archive.docs.dagger.io/0.9/194031/kubernetes#step-2-connect-dagger-cli-to-dagger-engine-pod as it relates to self-hosted runners? Do my runners need the ability to use kubectl to attach themselves to the daemonset running on the node?

Run Dagger on Kubernetes | Dagger

Introduction

stuck oracle Mar 18, 2024, 8:27 PM

#

Is there a dagger engine version 0.10 on the helm registry? It seems like the engine in chart version 0.1.1 is 0.9.10.

stuck oracle Mar 19, 2024, 3:26 AM

#

I'm realizing that I have my persistent volume set up properly on my cluster, but I haven't set up a PVC for the volume to be used by the dagger engine. Is there any particular setup that I should be using for that claim?
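
For reference, a claim matching the PersistentVolume sketched earlier in this channel might look like the following. Untested; the `dagger` namespace and the claim name are assumptions, while the storage class, size, and volume name are copied from that PV. Keep in mind that a local volume is tied to one node and that engines cannot share a state directory, so this would be one claim per engine:

```yaml
# Sketch only: claim name and namespace are hypothetical.
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: dagger-engine-pvc
  namespace: dagger
spec:
  accessModes:
    - ReadWriteOnce
  storageClassName: local-storage
  # Bind to the specific pre-provisioned local volume.
  volumeName: dagger-engine-pv
  resources:
    requests:
      storage: 375Gi
```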

stuck oracle Mar 20, 2024, 9:38 PM

#

Here's maybe an interesting one.... Has anyone gotten the dagger engine to run within a Tailscale network inside of k8s?

proud oyster Mar 20, 2024, 10:31 PM

#

Not to my knowledge, but whatever the best practices are for tailscale on k8s, they should apply to dagger as well

#

I did run tailscale inside a Dagger function 🙂 https://daggerverse.dev/mod/github.com/shykes/daggerverse/tailscale

Probably not relevant but I thought I'd mention it just in case

tailscale :: Daggerverse

Use tailscale as a Dagger module.

shrewd hazel Mar 28, 2024, 11:10 AM

#

Does anyone have an open source example of a CI pipeline using Dagger in k8s by chance? Something more than the argo-workflows example/ guide? I'm looking for some inspiration. 😊

versed holly Mar 28, 2024, 12:52 PM

#

Hey Scott! Are you looking for example setups or specific dagger functions?

heady dagger Apr 16, 2024, 1:39 AM

#

proud oyster I *did* run tailscale inside a Dagger function 🙂 https://daggerverse.dev/mod/gi...

That is wildly cool. Wish I could think of a use case for this.

heady dagger Apr 17, 2024, 2:54 PM

#

Is the source code for this available? Marcos made it. I would like to refactor it to use Talos Linux. I can copy most of the source code from the video frames, but as it is a year old I was wondering if there were improvements. https://www.youtube.com/watch?v=u1Q6RNaQHTY

YouTube

Dagger

Exploring Kubernetes in Dagger

In this demo, Marcos shares his experiences with running Kubernetes in Dagger and explores different possibilities for testing pipelines.

Want to ask the presenter a question about the demo? Join us on the Demo Discord Forum here to discuss this specific demo:
https://discord.com/channels/707636530424053791/1120935751069208699

heady dagger Apr 17, 2024, 3:31 PM

#

when you guys fix your helm chart for dagger you will DEMOLISH the need for GHA https://docs.dagger.io/integrations/104820/kubernetes/

Kubernetes | Dagger

Deployment with Helm

proud oyster Apr 17, 2024, 4:32 PM

#

heady dagger when you guys fix your helm chart for dagger you will DEMOLISH the need for GHA ...

👋 We are actively working on improving it. It should already work fine though, are you encountering specific issues?

heady dagger Apr 17, 2024, 4:33 PM

#

referring to this issue https://github.com/dagger/dagger/issues/7105

GitHub

🐞 Kubernetes integration docs + helm chart need to be updated to th...

What is the issue? Coming from here: https://discord.com/channels/707636530424053791/1229798907362414673 Looks like our helm chart doesn't really work with the latest version of Dagger and it...

south thorn Apr 22, 2024, 8:52 PM

#

@ebon perch will be talking about how to use Dagger and Kubernetes at the Kubernetes Atlanta meetup. If you are in the area, you can join in-person, but there is a virtual option too!

https://www.meetup.com/kubernetes-atlanta-meetup/events/300519463/

Meetup

Kubernetes Atlanta April 2024 Meetup, Thu, Apr 25, 2024, 6:30 PM ...

This is a hybrid meetup. In person and virtual via Zoom Webinar

Zoom Webinar Link: Will be posted closer to the meetup time

**7:00 - General announcements and new

weak cypress May 3, 2024, 9:58 AM

#

👋 I'm evaluating options on where to run a whole CI system and I have a very basic question...
I suppose there is no workaround for the Dagger Engine requiring root capabilities? That prevents the use of GKE Autopilot, which would have been a great way to reduce the burden of maintaining a k8s cluster for our team.

daring wraith May 3, 2024, 10:09 AM

#

Hey @weak cypress! 👋 nice to meet you!

Yeah (unfortunately) we don't support doing rootless mode - there's a bunch of permissions that the dagger engine needs to be able to create isolation between the containers that it starts and manages. Without those permissions it can't do that very easily, or do all the fancy networking stuff it does.

There's some more info in https://docs.dagger.io/faq/#can-i-run-the-dagger-engine-as-a-rootless-container

south thorn May 17, 2024, 5:42 PM

#

New Kubernetes related demo here - https://discord.com/channels/707636530424053791/1240809235345047552

south thorn May 28, 2024, 5:56 PM

#

hey folks! A user has a question about using Dagger with Kubernetes here - https://discord.com/channels/707636530424053791/1245072471481385051

tulip wedge Jun 4, 2024, 5:33 PM

#

Hey so I was just reading this post: https://dagger.io/blog/argo-cd-kubernetes and that's pretty similar to what I've been thinking for our dagger setup once we get a little further in. One question I've been mulling over is whether or not to be concerned about pre-warming the node's local cache when a fresh node comes up. Does anyone do anything like that?

On-Demand Dagger Engines with Argo CD, EKS, and Karpenter - Dagger

Powerful, programmable CI/CD engine that runs your pipelines in
containers — pre-push on your local machine and/or post-push in CI

versed holly Jun 4, 2024, 6:01 PM

#

tulip wedge Hey so I was just reading this post: https://dagger.io/blog/argo-cd-kubernetes a...

Are you referring to Dagger's own cache?

south thorn Jul 1, 2024, 7:29 PM

#

Thanks to Koray and @lunar halo for creating a "how to use Dagger in a Kubernetes environment to manage the deployment of applications" demo repo. If anyone wants to check it out, you can do so here: https://github.com/developer-guy/kcd-munich-2024-demo

GitHub

GitHub - developer-guy/kcd-munich-2024-demo: No More YAML Soup: Tak...

No More YAML Soup: Taking Control with Dagger's Pipeline-as-Code Philosophy - developer-guy/kcd-munich-2024-demo

worn olive Jul 5, 2024, 12:34 AM

#

tulip wedge Hey so I was just reading this post: https://dagger.io/blog/argo-cd-kubernetes a...

Curious what Dagger folks are using for their architectural diagrams?

versed holly Jul 5, 2024, 10:21 AM

#

worn olive Curious what Dagger folks are using for their architectural diagrams?

I'm personally using Excalidraw for this kind of diagram! I know @gerhard uses Mermaid, which also allows you to add links and such

worn olive Jul 5, 2024, 2:17 PM

#

versed holly I'm personally using Excalidraw for this kind of diagrams! I know @gerhard uses ...

Nice! Clearly need to play w/ Mermaid more. We've just started migrating some of our diagrams to it recently. Thanks!

elder bone Jul 9, 2024, 1:51 AM

#

Has anyone tried Dagger on a K8s Windows node?

daring wraith Jul 9, 2024, 10:32 AM

#

answered here: #1242180731548209293 message
we read all the channels here! it's fine to just ask in one place, we'll get back to you ❤️

weak cypress Jul 15, 2024, 12:37 PM

#

👋 I'm trying to upgrade the engine to v0.12 today and I'm getting an odd error when templating the helm chart. We use Argo CD, which internally runs helm template ...

metadata.labels: Invalid value: "v\"0.12.0\"": a valid label must be an empty string or consist of alphanumeric characters, '-', '_' or '.', and must start and end with an alphanumeric character (e.g. 'MyValue', or 'my_value', or '12345', regex used for validation is '(([A-Za-z0-9][-A-Za-z0-9_.]*)?[A-Za-z0-9])?'

Seems like the PR fixed the double vv$appVersion but introduced an incorrect label: v"$appVersion" (note the quotes after the v)

GitHub

fix(helm): ensure appVersion is always semver (#7917) · dagger/dagg...

And then add the "v" prefix in front of everywhere it needs to be.

Signed-off-by: Justin Chadwell
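
For anyone who wants to catch this kind of label problem before Argo CD does, here is a minimal sketch that checks a candidate label value against the regex quoted in the error message above. The function name validLabelValue is made up for illustration, and the 63-character limit comes from the Kubernetes label rules rather than from this thread.

```go
package main

import (
	"fmt"
	"regexp"
)

// labelValueRe is the pattern quoted in the error message,
// anchored so that it must match the whole value.
var labelValueRe = regexp.MustCompile(`^(([A-Za-z0-9][-A-Za-z0-9_.]*)?[A-Za-z0-9])?$`)

// maxLabelLen is the Kubernetes limit on label value length.
const maxLabelLen = 63

// validLabelValue (hypothetical helper) reports whether s is
// acceptable as a Kubernetes label value.
func validLabelValue(s string) bool {
	return len(s) <= maxLabelLen && labelValueRe.MatchString(s)
}

func main() {
	// The first value is the broken label from the error; the second is
	// the intended one; the third is the earlier double-"v" form.
	for _, v := range []string{`v"0.12.0"`, "v0.12.0", "vv0.12.0"} {
		fmt.Printf("%q valid=%v\n", v, validLabelValue(v))
	}
}
```

Only the value with embedded quotes is rejected; the double-"v" form is a legal label, which is probably why that earlier bug did not surface as a validation error.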

daring wraith Jul 15, 2024, 12:42 PM

#

Bleh

#

You're entirely right

#

fix in https://github.com/dagger/dagger/pull/7923

#

i absolutely love templated yaml 😢

#

sorry about that

weak cypress Jul 15, 2024, 12:53 PM

#

I was in the middle of filing an issue and submitting the PR 😄 Thanks to you 🙂

proud oyster Jul 15, 2024, 1:03 PM

#

Would be fun to have an official method to generate the Kubernetes configuration with a Dagger pipeline, instead of helm

#

I honestly don't see much value in supporting helm in the first place

#

makes me want to write the module to replace it 😛

weak cypress Jul 17, 2024, 12:44 PM

#

Sorry to bother again about this... any chance the fixed chart can be released on the oci://registry.dagger.io/dagger-helm?

low gorge Jul 17, 2024, 7:31 PM

#

weak cypress Sorry to bother again about this... any chance the fixed chart can be released o...

Hi! We did as soon as we merged the fix. We basically updated the 0.12.0 tag to point to the fixed image: https://github.com/dagger/dagger/actions/runs/9937771665

weak cypress Jul 17, 2024, 8:35 PM

#

Mmm, I think that's the previous fix (which introduced the above).
This is the latest PR with the fix that's not published yet
https://github.com/dagger/dagger/pull/7930/files

#

Seems like the helm/chart/v0.12.0 tag needs to be moved to f7944784dab9a8b5fc76190bad6502e374ed5f4c?

low gorge Jul 18, 2024, 11:27 AM

#

weak cypress Seems like the `helm/chart/v0.12.0` tag needs to be moved to `f7944784dab9a8b5fc...

Got it! Fixing this now.

#

Done: https://github.com/dagger/dagger/actions/runs/9990665443 cc @versed holly @daring wraith

Can you please confirm @weak cypress that this fixes the issue for you?

weak cypress Jul 18, 2024, 11:36 AM

#

low gorge Got it! Fixing this now.

Yup, amazing! It's published 🚀

#

Thanks

low gorge Jul 18, 2024, 12:02 PM

#

Great to know that this unblocks you 💪

south thorn Jul 24, 2024, 10:50 PM

#

We'll cover a Kubernetes use case in our community call tomorrow. Come to watch the demo and ask all your questions 🙂 #general message

cerulean mountain Jul 25, 2024, 4:46 PM

#

cc @ionic hare we can continue chatting about ideas here 🙌

ionic hare Jul 25, 2024, 4:47 PM

#

Tilt-like workflows thread, let's roll

solemn storm Jul 25, 2024, 4:53 PM

#

proud oyster makes me want to write the module to replace it 😛

YES YES YES PLEASE write it and make it available soon...one of the most awaited things for us.

versed holly Jul 25, 2024, 6:01 PM

#

solemn storm YES YES YES PLEASE write it and make it available soon...one of the most awaited...

Is installing Dagger on kubernetes using Helm not an option for you at the moment?

solemn storm Jul 25, 2024, 6:02 PM

#

versed holly Is installing Dagger on kubernetes using Helm not an option for you at the momen...

Actually we want to remove yml from our end to end workflow and want to make everything powered by dagger.

torpid crescent Jul 29, 2024, 4:36 PM

#

@cerulean mountain I don't know if this is already known or if there is a possible solution, but I noticed that if I try to get logs by running kubectl from the host against the k3s running inside dagger, I get:

❯ kubectl logs test
Error from server: Get "https://10.87.0.25:10250/containerLogs/default/test/test": proxy error from 10.87.0.25:6443 while dialing 10.87.0.25:10250, code 502: 502 Bad Gateway

can it be related to the egress selector in k3s?

#

(to be clear, not an issue for any crucial test I want to do, but just noticed it)

cerulean mountain Jul 29, 2024, 4:54 PM

#

torpid crescent <@336241811179962368> I don't know if this is something already known or somehow...

think I know why. 1 sec

torpid crescent Jul 29, 2024, 4:58 PM

#

Last time I saw this in a similar situation, it was fixed by editing --egress-selector-mode... don't want to give you wrong directions though

cerulean mountain Jul 29, 2024, 4:58 PM

#

can it be related to the egress selector in k3s?
that's what I was thinking about

torpid crescent Jul 29, 2024, 4:59 PM

#

Ok

#

If so, should be easy

cerulean mountain Jul 29, 2024, 5:00 PM

#

torpid crescent If so, should be easy

yep, that fixes it @torpid crescent

#

pushing a new version of the module with that fix 🙏

torpid crescent Jul 29, 2024, 5:01 PM

#

Awesome🔥🔥

cerulean mountain Jul 29, 2024, 7:11 PM

#

torpid crescent Awesome🔥🔥

new version is available, apologies for the delay. Got pulled into a meeting 🙌 https://daggerverse.dev/mod/github.com/marcosnils/daggerverse/k3s@c3ebb4b4d3ddb0dcfe1e6a47da5d2d0c4ef8784d

k3s :: Daggerverse

Runs a k3s server that can be accessed both locally and in your pipelines

torpid crescent Jul 29, 2024, 8:18 PM

#

cerulean mountain new version is available, apologies for the delay. Got pulled into a meeting 🙌 ...

not sure why the pod always fails on the first try when using the internal registry; regardless, I confirm the log call is fixed now!

south thorn Jul 30, 2024, 9:03 PM

#

https://www.youtube.com/watch?v=F6-44Je5HvE

YouTube

Dagger

Kubernetes in Dagger

In this demo, Marcos Lilljedahl explains how to run Kubernetes clusters using Dagger. He addresses the challenges and solutions in integrating Kubernetes within Dagger and shares a practical setup using the k3s Dagger module below.

You’ll learn how to start a Kubernetes cluster, execute kubectl commands, and integrate Helm for deploying applic...

torpid crescent Aug 1, 2024, 8:03 AM

#

@cerulean mountain the latest version of rancher/k3s image is breaking the k3s module at the level of the cgroup fix script... I think it would be a good idea to pass the name and the tag of the image as an argument, what do you think? I can open a PR in case

digital night Aug 1, 2024, 12:08 PM

#

torpid crescent <@336241811179962368> the latest version of rancher/k3s image is breaking the k3...

Also just ran into this. Do you perhaps already know which image was the last working one?

torpid crescent Aug 1, 2024, 12:13 PM

#

nope, just randomly picked an old-enough one... so I can only tell you that it works with rancher/k3s:v1.28.1-k3s1

digital night Aug 1, 2024, 12:21 PM

#

torpid crescent nope, just randomly picked an old-enough one... so I can only tell that is worki...

Fair enough 😄 I will never remember why some tags are k3s1 and some k3s2 😄

cerulean mountain Aug 1, 2024, 2:14 PM

#

torpid crescent <@336241811179962368> the latest version of rancher/k3s image is breaking the k3...

yes! that SGTM! I guess receiving an optional argument in the module constructor might be the best

#

I'll check out why the module stopped working now

#

@torpid crescent found the cause of the issue, fixing now so it works in the latest version

digital night Aug 1, 2024, 2:38 PM

#

cerulean mountain <@364765295782526977> found the cause of the issue, fixing now so it works in t...

Thank you so much! ❤️

cerulean mountain Aug 1, 2024, 2:43 PM

#

fixed! https://daggerverse.dev/mod/github.com/marcosnils/daggerverse/k3s@e0bd6b9f5519c49db4b6eb0689927214720976f9

k3s :: Daggerverse

Runs a k3s server that can be accessed both locally and in your pipelines

#

also added a way to optionally specify the image via dagger call --name foo --image rancher/k3s:latest server up

south thorn Aug 1, 2024, 7:00 PM

#

Seeing y'all use this module is awesome ❤️ Thanks for reporting the issue @torpid crescent !

solemn storm Aug 4, 2024, 5:02 PM

#

@cerulean mountain here is a little request https://github.com/marcosnils/daggerverse/issues/3

GitHub

Add daggerverse module for `microK8s` same as like K3S · Issue #3 ·...

Hello It would be nice if you can release one more daggerverse module for microK8s. It's really widely used and officially supported by canonical also. Thank You.

cerulean mountain Aug 4, 2024, 5:06 PM

#

solemn storm <@336241811179962368> here is little request https://github.com/marcosnils/dagge...

👍 @solemn storm how about if you give it a try and publish the module? I can provide async assistance if you get stuck

#

I don't have a personal need for that module now, so it's not likely that I'll create it any time soon

solemn storm Aug 4, 2024, 5:09 PM

#

cerulean mountain 👍 <@751480071591559318> how about if you give it a try and publish the module?...

OK, my team and I will try, and we'll ask you if help is required. Thanks

amber narwhal Aug 6, 2024, 7:56 AM

#

I just wanted to say @cerulean mountain I so want to try your module, I just don't have a lot of time these days. But it's on my list. Expect me to test the sh*t out of it. 😄

amber narwhal Aug 6, 2024, 9:18 AM

#

Something is happening here: https://github.com/sagikazarmark/daggerverse/pull/143

GitHub

Add Helm install by sagikazarmark · Pull Request #143 · sagikazarma...

cerulean mountain Aug 6, 2024, 6:09 PM

#

amber narwhal I just wanted to say <@336241811179962368> I so want to try your module, I just ...

you just reminded me that my module needs better docs and examples. Will add some later today / tomorrow 🙌

amber narwhal Aug 6, 2024, 6:15 PM

#

Thanks! It took me a minute to figure out how to use it. (needed to start the service manually)

daring wraith Aug 6, 2024, 6:19 PM

#

Aha wonderful, I had the same question lol 😆 Unfortunately, starting the service seems to trigger some very strange DNS issues in the engine for me that I need to debug

candid verge Aug 6, 2024, 9:32 PM

#

Hi, I was trying to use the k3s module with a service binding + fetching the config in a container, but it was not working because the file is not yet generated.
Do you think there is a way to use it without starting it outside of my module?
My goal is to execute tests on a kubernetes operator, so as a first step I would like to start a cluster.
Actually I did something, but I don't think it's a very clean method: https://gist.github.com/Dudesons/a75b8f9d12b389837a0cebd01181b024

Gist

poc_operator_with_k3.go

poc_operator_with_k3.go. GitHub Gist: instantly share code, notes, and snippets.

amber narwhal Aug 6, 2024, 10:11 PM

#

candid verge Hi, I was trying to use the k3s module with a service binding + fetching the con...

Seems like you have a race condition there due to the goroutine?

Why not just start the service first?

Also, if all you need is kubectl, the k3s module already supports that.

candid verge Aug 6, 2024, 10:25 PM

#

amber narwhal Seems like you have a race condition there due to the goroutine? Why not just s...

I didn't have a race condition
Do you mean starting the service as a first call from the dagger CLI? If so, the service won't be running in the background, will it?
The kubectl is for a quick test, but in the end it will execute another tool for running tests

amber narwhal Aug 6, 2024, 10:26 PM

#

candid verge I didn't have a race condition You mean starting the service a first call from d...

If service initialization doesn't complete before you call Config, wouldn't that result in the same error? Isn't that a race condition?

#

No, I mean simply remove the goroutine and just wait for the service to start.

candid verge Aug 6, 2024, 10:46 PM

#

OK, I just tried it and yes, it's working. I forgot I can call the Start method, thank you

#

During my tests I left in a time.Sleep(30 * time.Second). When I remove it, k3s is up but coredns and another service are not yet started, so the cluster is not totally ready

torpid crescent Aug 7, 2024, 9:06 AM

#

@cerulean mountain, thanks for all the help and effort... here's the sneak peek for tomorrow https://daggerverse.dev/mod/github.com/interTwin-eu/interLink/ci@147b59e1dddf8a8c7d6f5d645cf5e814c6f03517 daggerfire

interlink :: Daggerverse

A module to instantiate and tests interLink components

#

We integrated this already in our gh CI, and is all green 🙂 a lot to improve in caching and co, so I might have questions, be prepared 😅

cerulean mountain Aug 7, 2024, 4:09 PM

#

torpid crescent <@336241811179962368> , thanks for all the help and effort... here the sneak pea...

🙏 happy to help! LMK if there's anything else I can help with daggerfire

south thorn Aug 7, 2024, 7:57 PM

#

Dagger users - @torpid crescent is bringing a great use case to the community call tomorrow. I look forward to seeing what y'all think! https://www.linkedin.com/posts/diego-ciangottini-8a03b98a_kubernetes-activity-7226881079016607744-HzO1?utm_source=share&utm_medium=member_desktop

south thorn Aug 9, 2024, 7:50 PM

#

Upcoming Kubernetes/Dagger livestream - https://www.youtube.com/live/BYUbm7ISrIY?list=PLnm27H265Le7Zz447qn1PS2OtPaQw38Sh

south thorn Aug 13, 2024, 8:45 PM

#

south thorn Upcoming Kubernetes/Dagger livestream - https://www.youtube.com/live/BYUbm7ISrIY...

Happening now 👆

south thorn Aug 14, 2024, 4:24 PM

#

@hybrid pecan is livestreaming the daggerizing of OpenUnison, which uses Helm Charts.

If you are getting started with Dagger and Kubernetes, this stream might be helpful for you! https://www.youtube.com/watch?v=ZDm8e4cS8ek

YouTube

Rawkode Academy

Developing & Building Open Source with Dagger | OpenUnison

This live stream series will demonstrate how open source projects can leverage Dagger to enhance their build automation and development environments. Dagger enables developers to define their pipelines and environments using general-purpose programming languages, increasing flexibility and maintainability compared to traditional configuration-ba...

torpid crescent Aug 14, 2024, 4:35 PM

#

south thorn Happening now 👆

https://youtu.be/9FGOATYtpBM "refined" recording here

YouTube

SciGeeks universe

Mastering Kubernetes-based Testing with Dagger & CI/CD: A Step-by-S...

With this tutorial we test a dummy Kubernetes tool using @Dagger and CI/CD pipelines, we dive deep into modern DevOps practices era, learning how to streamline your workflow, and ensure robust, scalable deployments.

We start from scratch to equip you with the knowledge and skills to help you enhancing your Kubernetes testing experience.

00:00 ...

south thorn Aug 14, 2024, 4:50 PM

#

torpid crescent https://youtu.be/9FGOATYtpBM "refined" recording here

Thanks for sharing! Adding to the docs resources, etc.

digital walrus Aug 18, 2024, 7:31 AM

#

Any idea when the “Bring-your-own storage for distributed caching” feature will be available? Java mvn builds on short-lived containers are really slow

proud oyster Aug 18, 2024, 2:18 PM

#

digital walrus any idea when this feature will be available “Bring-your-own storage for distrib...

No date yet but it's in progress. So definitely this year 🙂 We'll refine the target date as soon as we can.

In the meantime there are options on the infrastructure side. For example if you deploy the Daemonset from our docs, and point your dagger clients to that, you will benefit from the local storage of your kubernetes nodes. It won't be perfect caching, but it should be a noticeable improvement. And the longer-lived your kubernetes nodes, the more noticeable the improvements.

digital walrus Aug 19, 2024, 11:32 AM

#

proud oyster No date yet but it's in progress. So definitely this year 🙂 We'll refine the ta...

thanks, will give it a go. Most of our nodes are spot nodes, so it's not going to be great, but it's at least something until we can specify a distributed location

rigid wing Aug 22, 2024, 9:54 PM

#

dagger operator on k8s - my manager is concerned that it is not safe to run the dagger operator as a DaemonSet. How can I convince him? Note: we are already running the Jenkins operator, but I would like to offer alternatives like dagger. Thanks in advance

versed holly Aug 23, 2024, 11:36 AM

#

Hi @rigid wing! What is the concern that your team is expressing?

If you are using Jenkins, you can think of dagger as a complement to it rather than an alternative. It will work alongside that infrastructure and make your pipelines faster and less dependent on Jenkins-specific config files

cerulean mountain Aug 23, 2024, 8:47 PM

#

rigid wing dagger operator on k8s - my manager is conserned, that dagger is not safe runnin...

If you're already running the Jenkins operator, why does your manager think that Dagger represents a bigger threat compared to the Jenkins one?

tulip tapir Aug 25, 2024, 3:36 PM

#

managers in a nutshell

rigid wing Aug 26, 2024, 12:28 PM

#

Right 🙂 Well, the biggest concern is that it runs as a daemonset and with elevated privileges...

hasty briar Aug 26, 2024, 1:28 PM

#

rigid wing Right 🙂 Well, the biggest concern is that it runs as a daemonset and with eleva...

It would be good to understand how risk is being measured. Are there criteria that you use to measure acceptable risk in any piece of third-party software? E.G. SOC2, ISO27001, or perhaps signed, attested and published SBOMs?

amber narwhal Aug 26, 2024, 1:55 PM

#

@cerulean mountain do you think it would be possible to support container imports in your k3s module?

cerulean mountain Aug 26, 2024, 2:00 PM

#

amber narwhal <@336241811179962368> do you think it would be possible to support container imp...

👋 @amber narwhal. Imports as pre-bundled images when k3s starts?

#

I think it's tricky but doable. Mostly because there's a chicken-and-egg problem about k3s being up before actually calling k3s ctr images import. So the module will need to have some funky logic in its entrypoint script so it loads the images after k3s is effectively up. I guess the main limitation to implement this correctly is that the k3s command can't be executed remotely and needs to run in the same place where the k3s server is running

amber narwhal Aug 26, 2024, 2:28 PM

#

Hm...I just found the registry example in your module. I guess that's a good alternative.

amber narwhal Aug 26, 2024, 2:53 PM

#

Do you think that's a better approach to get container images built with Dagger installed into k3s @cerulean mountain ?

cerulean mountain Aug 26, 2024, 2:56 PM

#

amber narwhal Do you think that's a better approach to get container images built with Dagger ...

yes, I think so because you can:

1. Build your images
2. Push them to a Dagger registry service

^ all this can be efficiently cached

3. Start the k3s server using your custom registry
4. Use your k3s service as needed

#

that way you have a clear separation about what depends on what and have better caching IMO

amber narwhal Aug 26, 2024, 2:57 PM

#

cerulean mountain yes, I think so because you can: 1. Build your images 2. Push them to a Dagger...

Makes sense. I wonder if it would make sense to add a WithRegistryService endpoint to your k3s module to simplify the whole thing.

cerulean mountain Aug 26, 2024, 2:58 PM

#

amber narwhal Makes sense. I wonder if it would make sense to add a `WithRegistryService` endp...

yes! wanna send the PR!? I can take a look tomorrow otherwise

amber narwhal Aug 26, 2024, 2:59 PM

#

Let me see if I can get this working. I'll send a PR if I do.

torpid crescent Sep 12, 2024, 9:41 AM

#

hello! I'm testing 0.13 with a module of mine that is based on @cerulean mountain's K3s module:

func (m *Interlink) NewInterlink(
    ctx context.Context,
    manifests *dagger.Directory,
    // +optional
    kubeconfig *dagger.File,
    // +optional
    localRegistry *dagger.Service,
    // +optional
    localCluster *dagger.Service,
    // +optional
    // +default="dciangot/docker-plugin:v1"
    pluginImage string,
    // +optional
    pluginEndpoint *dagger.Service,
    // +optional
    pluginConfig *dagger.File,
) (*Interlink, error) {

    //K3s := dag.K3S(m.Name, K3SOpts{Image: "rancher/k3s:v1.28.1-k3s1"}).With(func(k *K3S) *K3S {
    K3s := dag.K3S(m.Name).With(func(k *dagger.K3S) *dagger.K3S {
        return k.WithContainer(
            k.Container().
                WithEnvVariable("BUST", time.Now().String()).
                WithDirectory("/manifests", manifests).
                WithExec([]string{"sh", "-c", `
cat <<EOF > /etc/rancher/k3s/registries.yaml
mirrors:
  "registry:5000":
    endpoint:
      - "http://registry:5000"
EOF`}).
                WithServiceBinding("registry", m.Registry).
                WithServiceBinding("plugin", pluginEndpoint),
        )
    })

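    // Start the k3s server and surface any startup error instead of dropping it.
    if _, err := K3s.Server().Start(ctx); err != nil {
        return nil, err
    }
    return m, nil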
}

weak cypress Sep 12, 2024, 10:37 AM

#

👋 What's the latest guidance around running dagger in k8s (permissions-wise)

    privileged: true
    capabilities:
      add:
        - ALL

Are ALL capabilities really required?
Or is there a narrower list I could set so the engine runs with just enough access?

Context: I'll be meeting with the team that runs K8s as a service for our company and they already expressed concerns about the wide capabilities required

daring wraith Sep 12, 2024, 11:28 AM

#

🤔

#

i'm not sure why we have the separate capabilities field to add ALL

#

is it not enough to have privileged: true

#

cc @low gorge @versed holly - i don't think we use anything extra over buildkit in terms of privileges, and i've only seen that in the context of setting privileged

#

but note that privileged is effectively running the pod as root - we don't support running dagger in "rootless" mode (yet, maybe one day, though it's not anywhere on the roadmap)

torpid crescent Sep 13, 2024, 1:22 PM

#

mmm, apparently the K3s module is now unable to resolve dagger services from inside a pod... that is weird because it was working in v0.11. For instance, if I start from this, I cannot get any pod running inside the k3s cluster to resolve "registry" https://github.com/marcosnils/daggerverse/blob/main/k3s/examples/go/main.go , any idea?

GitHub

daggerverse/k3s/examples/go/main.go at main · marcosnils/daggerverse

Personal collection of Dagger modules. Contribute to marcosnils/daggerverse development by creating an account on GitHub.

vocal meadow Sep 13, 2024, 4:10 PM

#

Hello, everyone 👋
I'm stumped here and any help is appreciated
I'm finishing the demo for KCD and i'm using @cerulean mountain k3s module to spin up a cluster with argo-workflows and argo-events installed, the workflow follows the example given in the docs (https://docs.dagger.io/integrations/argo-workflows/) and until here, so far so good. The problem is that dagger-engine sidecar container fails to start with the error in the image attached.
I'm running dagger in WSL2 on windows10.

Argo Workflows | Dagger

Dagger provides a programmable container engine that can be invoked from an Argo Workflow to run a Dagger pipeline. This allows you to benefit from Dagger's caching, debugging, and visualization features, whilst still keeping all of your existing Argo Workflows infrastructure.

cerulean mountain Sep 13, 2024, 5:34 PM

#

torpid crescent mmm, apparently the K3s module is now unable to resolve dagger services from ins...

👋 can check in a bit 🙏

torpid crescent Sep 13, 2024, 5:53 PM

#

cerulean mountain 👋 can check in a bit 🙏

thanks! there is also a skopeo "bug", due to the entrypoint being disabled by default. Just a heads-up; that was the easy part, and I can make a PR for it later.

cerulean mountain Sep 13, 2024, 6:01 PM

#

torpid crescent thanks! there is also a skopeo "bug", due to the entrypoint disabled by default,...

hmm are you sure pods within the cluster were able to resolve registry before? Not sure if that makes sense as service resolution works at the /etc/hosts level

#

and pods don't share the host /etc/hosts file 🤔

#

you might be able to resolve services by the service hostname, but I don't think the binding name works

torpid crescent Sep 13, 2024, 6:02 PM

#

for registry I can't tell, but for the other service I'm quite sure... I can retry the pipeline with dagger v0.11 and see

cerulean mountain Sep 13, 2024, 6:03 PM

#

torpid crescent registry I can't tell, but other service is quite sure... I can retry the pipeli...

other services should be the same 🤔

torpid crescent Sep 13, 2024, 6:04 PM

#

let me look:
https://github.com/interTwin-eu/interLink/blob/main/ci/main.go#L130
here I bind plugin

#

and the pod config looking for that bind is here: https://github.com/interTwin-eu/interLink/blob/main/ci/manifests/interlink-config.yaml#L11

#

and the latest integration-test of the repo is green... I'm looking in dagger cloud for the traces, one sec

cerulean mountain Sep 13, 2024, 6:08 PM

#

maybe it's a k3s change? Could you try printing the /etc/hosts file inside the interlink pod?

#

if you see plugin and registry there it's because k3s is setting those there somehow

#

you generally do that with HostAliases in k8s
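
The /etc/hosts check suggested above can be sketched as a small lookup over the file's contents. This is only an illustration: lookupHosts is a made-up helper, and the file content and IP address in main are placeholders, not output from a real pod.

```go
package main

import (
	"bufio"
	"fmt"
	"strings"
)

// lookupHosts (hypothetical helper) returns the IP that hosts-file
// content maps name to, if the name is listed at all.
func lookupHosts(content, name string) (string, bool) {
	sc := bufio.NewScanner(strings.NewReader(content))
	for sc.Scan() {
		line := sc.Text()
		if i := strings.IndexByte(line, '#'); i >= 0 {
			line = line[:i] // drop trailing comments
		}
		fields := strings.Fields(line)
		if len(fields) < 2 {
			continue // blank line or an IP with no names
		}
		for _, h := range fields[1:] {
			if h == name {
				return fields[0], true
			}
		}
	}
	return "", false
}

func main() {
	// Placeholder content; in practice this would be the pod's /etc/hosts.
	hosts := "127.0.0.1 localhost\n10.0.0.30 plugin # added via hostAliases\n"
	for _, name := range []string{"plugin", "registry"} {
		ip, ok := lookupHosts(hosts, name)
		fmt.Printf("%s: found=%v ip=%q\n", name, ok, ip)
	}
}
```

If a binding name such as plugin shows up here, something (for example a hostAliases entry in the pod spec) put it there, since pods do not share the host's /etc/hosts.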

torpid crescent Sep 13, 2024, 6:12 PM

#

yeah, in fact, the current behavior should be the correct one..

#

ahhhhhhh.... https://github.com/interTwin-eu/interLink/blob/main/ci/manifests/interlink.yaml#L31 here it is!!!!

#

that was the "trick", and I removed it in the current version... ok, now it makes sense, right? case closed

torpid crescent Sep 13, 2024, 6:16 PM

#

torpid crescent thanks! there is also a skopeo "bug", due to the entrypoint disabled by default,...

sorry for the noise. Since you are here, do you think this makes sense, or is it another late-Friday hallucination?

cerulean mountain Sep 13, 2024, 6:22 PM

#

torpid crescent that was the "trick", and I removed it in the current version... ok, now it make...

yes, that makes sense now

cerulean mountain Sep 13, 2024, 6:23 PM

#

torpid crescent sorry for the noise, since you are here, do you think this makes sense or is it ...

this makes sense also. I'll go ahead and fix the module 🙏

cerulean mountain Sep 13, 2024, 6:44 PM

#

cerulean mountain this makes sense also. I'll go ahead and fix the module 🙏

done @torpid crescent https://daggerverse.dev/mod/github.com/marcosnils/daggerverse/k3s@8360c62bf045a9d8f058853811fa738b727f4402

k3s :: Daggerverse

Runs a k3s server than can be accessed both locally and in your pipelines

amber narwhal Sep 16, 2024, 3:28 PM

#

@cerulean mountain is there a reason why your k3s module requires 0.12.4? If not, do you mind bumping it down to 0.12.0? (I've been stuck on 0.12.0 for two months now due to a regression)

cerulean mountain Sep 16, 2024, 5:44 PM

#

amber narwhal <@336241811179962368> is there a reason why your k3s module requires 0.12.4? If ...

will do now

quartz lantern Sep 24, 2024, 9:14 AM

#

👋 We're considering mounting /var/lib/docker in a PV to give us persistent caching across node restarts in k8s; I was just wondering if you had any thoughts on whether that was a good idea.

daring wraith Sep 24, 2024, 10:53 AM

#

/var/lib/docker? i think you probably want /var/lib/dagger if you're looking for dagger caching 😄

#

but yes, this totally will work - but you do need to make sure that you only have one dagger instance accessing it at a time (if you try and have multiple users of it, the subsequent users will fail out, since it locks the entire db)
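
To illustrate the "one instance at a time" point, here is a small sketch of how an exclusive, non-blocking file lock makes a second opener fail. This is not Dagger's actual locking code; the file name is hypothetical, and `fcntl.flock` is Unix-only.

```python
import fcntl
import os
import tempfile


def try_lock(path):
    """Open `path` and take an exclusive, non-blocking lock on it.

    Returns the open file object on success, or None if another holder
    already has the lock.
    """
    f = open(path, "a+")
    try:
        fcntl.flock(f, fcntl.LOCK_EX | fcntl.LOCK_NB)
    except BlockingIOError:
        f.close()
        return None
    return f


state_dir = tempfile.mkdtemp()
db_path = os.path.join(state_dir, "state.db")  # hypothetical file name

first = try_lock(db_path)   # the first "engine" gets the lock
second = try_lock(db_path)  # a second user of the same volume does not
print("first holds lock:", first is not None)
print("second holds lock:", second is not None)
```

In practice this means a PV holding /var/lib/dagger should be bound to a single engine pod at a time.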

quartz lantern Sep 24, 2024, 2:00 PM

#

That's the one! Thanks, Jed 🙂

wheat owl Sep 27, 2024, 1:55 PM

#

Hi!👋🏻 I'm trying to build a container using dagger from a local dir/repo, and then publish it to a container registry (k3d/k3s on Docker Desktop for Mac) accessible locally as 127.0.0.1:5000.
I already searched the docs but I'm unable to find relevant documentation for this use case...
Is it possible, and is there an easy way to do it? TIA!

versed holly Sep 27, 2024, 4:56 PM

#

Hi!👋🏻 I'm trying to build a container

wheat owl Sep 30, 2024, 3:33 PM

#

Hi again everyone 👋🏻, I'm trying to push an image to a local registry using Dagger, but I'm encountering a TLS error due to a certificate verification issue.
Here’s the error I get:

Function execution error: resolve: failed to export: failed to push git.localhost:8443/demo/my-nginx-1:latest: failed to do request: Head "https://git.localhost:8443/v2/demo/my-nginx-1/blobs/sha256:024f2d8883919b1b7a966d1383e87249ff31ddfac3f24a828d7b19c3e953fae9": tls: failed to verify certificate: x509: certificate is valid for 08711bf71c6310b05f686aa8698fa573.07f116773d9e9f9d0869a70a95762eab.traefik.default, not git.localhost

Pushing with Docker works. In Docker, I would typically solve a similar problem by configuring an insecure registry or by ignoring self-signed certificates.
I know it is possible to "Configure the Engine to use Custom Certificate Authorities" but I'd like an easier solution for a Kubernetes-based local dev environment I'm setting up.
If there's no other option, I don't mind creating a custom runner, but I would prefer not to have to dump the CA files and then add them to a volume for the custom runner.
Maybe this can be done with a specific ENV var, or with a directive in the dagger configuration files?
Thanks in advance!

worldly gate Oct 6, 2024, 5:25 PM

#

wheat owl Hi again everyone 👋🏻, I'm trying to push an image to a local registry using Da...

See here for example of insecure: https://discord.com/channels/707636530424053791/1271583698365579395

worldly gate Oct 6, 2024, 5:28 PM

#

worldly gate See here for example of insecure: https://discord.com/channels/70763653042405379...

Though I had to add an extra registry entry marking it insecure for non-localhost. Bottom line is you can do this via the BuildKit config in your engine.toml
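
A rough sketch of what such a registry entry could look like, rendered by a small helper. The `[registry."<host>"]` section with `http` and `insecure` keys follows the BuildKit `buildkitd.toml` format as I understand it; whether your engine version reads it from engine.toml in exactly this shape is an assumption to verify, and the second host is just an example.

```python
def insecure_registry_toml(host, plain_http=False):
    """Render a BuildKit-style registry section for `host`.

    `insecure = true` skips TLS certificate verification; `http = true`
    additionally talks plain HTTP instead of HTTPS.
    """
    lines = [f'[registry."{host}"]']
    if plain_http:
        lines.append("  http = true")
    lines.append("  insecure = true")
    return "\n".join(lines) + "\n"


# Registry with a certificate that does not match its hostname.
tls_entry = insecure_registry_toml("git.localhost:8443")
# Plain-HTTP local registry (example host).
http_entry = insecure_registry_toml("127.0.0.1:5000", plain_http=True)
print(tls_entry)
print(http_entry)
```

The engine has to be restarted with the new config for the entry to take effect.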

lyric breach Oct 8, 2024, 6:05 PM

#

Connecting to dagger engine times out after upgrading Argo Workflow to the latest version (v3.5.11), dagger engine version v0.13.3

lyric breach Oct 8, 2024, 6:07 PM

#

lyric breach Connecting to dagger engine times out after upgrading Argo Workflow to the lat...

I have attached the full manifest for reference

📎 message.txt

lyric breach Oct 9, 2024, 5:55 AM

#

I am actually confused on which engine is used, the sidecar's or one installed in the cluster with helm?

vocal meadow Oct 9, 2024, 8:36 AM

#

I am actually confused on which engine

south thorn Oct 10, 2024, 8:49 PM

#

https://openmeter.io/blog/supercharge-helm-chart-development-with-dagger

Supercharge Helm chart development with Dagger | OpenMeter

Use Dagger to achieve isolation and reproducibility with Helm, including running linters, tests, and pushing to a registry.

tardy garnet Dec 4, 2024, 9:38 AM

#

Hey folks Is there a way to run the dagger engine without privileged access in k8s?

weak cypress Dec 4, 2024, 9:47 AM

#

tardy garnet Hey folks Is there a way to run the dagger engine without privileged access in k...

I'm afraid there is not.
You can check this demo where @proud oyster expands a bit on the topic (after ~13:00): https://youtu.be/Sn1w51Vh0mM?t=759

YouTube

Dagger

Running Dagger in an Enterprise

Join Nipuna Perera, Director of Cloud Engineering at Fidelity Investments as he shares insights on integrating Dagger into enterprise workflows. From handling compliance and security challenges to building seamless CI/CD pipelines, see how Dagger transforms software delivery in complex environments.

Want to learn more or have questions? Join us...

▶ Play video

sacred osprey Jan 15, 2025, 9:36 PM

#

Good news, I'm moving my homelab to a colocation site and taking advantage to do a bit of refactoring.
I'm considering using Talos linux for my k8s, because @low gorge is a big fan, but looking at the https://www.talos.dev/v1.9/introduction/support-matrix/ , it seems like they are phasing out community support in the next version and going the Enterprise route.
Unless someone has a better idea, I'll probably boot with Fedora and run k3s.

Support Matrix

Table of supported Talos Linux versions and respective platforms.

low gorge Jan 16, 2025, 10:32 AM

#

sacred osprey Good news, I'm moving my homelab to a colocation site and taking advantage to do...

Hey!

My interpretation of that table is:

Community support for 1.8 ended on 2024-12-17, when 1.9 was released
Community support for 1.9 ends on 2025-04-15, when 1.10 will be released (this needs to be confirmed since 1.10 stable is not out yet)

In my experience, the Sidero Labs team will do the right thing if it's a genuine bug. Here is the last one that I reported which was backported to 1.8 https://github.com/siderolabs/extensions/pull/580, even though 1.8 is technically out of community support.

I personally run a few homelabs on different versions of Talos, the oldest one being v1.5.5, which is long overdue for an upgrade. Since I don't do upgrades in place, I am looking for those few hours when I can restore from backup on one of my newer homelab hardware. In practice, 1.5 has been rock solid for me, and while I don't expect to get any support, there hasn't been any need for it.

If you do decide to go down the k3s path, you should probably talk to this like-minded friend: https://github.com/tailscale/tailscale/issues/10814#issuecomment-2479977752

Be on the lookout for generic device plugin issues on k3s - relevant if you want to use Tailscale, GPUs or any hardware devices in containers.

sacred osprey Jan 16, 2025, 1:15 PM

#

low gorge Hey! My interpretation of that table is: - Community support for 1.8 ended on 2...

Oh! Your interpretation is plausible and more encouraging than mine, thank you!
Since I don't know the Sidero folks, I was not sure what to expect. Your endorsement for homelab use is just what I needed.
Since I intend to use it as a community resource, I want to do all the config as code.
I suppose a good start would be https://github.com/onedr0p/cluster-template
unless you have a better suggestion?

GitHub

GitHub - onedr0p/cluster-template: A template for deploying a Talos...

A template for deploying a Talos Kubernetes cluster including Flux for GitOps - onedr0p/cluster-template

heady dagger Jan 16, 2025, 2:24 PM

#

Sidero is a great product suite but niche.

low gorge Jan 16, 2025, 5:57 PM

#

sacred osprey Oh! Your interpretation is plausible and more encouraging than mine, thank you! ...

That looks like a great resource, this is the first time that I come across it - just starred it.

Here is an alternative that I am familiar with: https://github.com/mischavandenburg/homelab . If you keep pulling on that thread, you will find a wealth of information from Mischa.

While you may already know about https://makeitwork.tv/from-homelab-to-production/ , I share all the code from the talk with members. This is complimentary for loyal fans like yourself. Just subscribe and I'll take care of the rest 👍

heady dagger Jan 17, 2025, 1:19 PM

#

since we are on the topic of homelabs a friend of mine is making this and its pretty sweet, typescript and pulumi based https://github.com/QC-Labs/orange-lab

GitHub

GitHub - QC-Labs/orange-lab: Private infrastructure for cloud natives

Private infrastructure for cloud natives. Contribute to QC-Labs/orange-lab development by creating an account on GitHub.

sacred osprey Jan 17, 2025, 3:24 PM

#

low gorge That looks like a great resource, this is the first time that I come across it -...

TIL, thank you!
I have been thinking about doing a channel to show collaborative coding, on OnlyFans (for brand recognition...face reveal for paid subscribers, custom work, etc..)

hardy lynx Feb 12, 2025, 7:23 PM

#

Hey all! I just recently started looking into Dagger and I was wondering if there were features or modules that would allow dagger to execute similar to tilt https://tilt.dev/
I'm looking for hot reloads on file changes with automatic build and deployments for local development.
If not I think this would be a killer feature to add into dagger.

Tilt

Kubernetes for Prod, Tilt for Dev

amber narwhal Feb 12, 2025, 7:39 PM

#

hardy lynx Hey all! I just recently started looking into Dagger and I was wondering if ther...

There is an open issue tracking that: https://github.com/dagger/dagger/issues/6990

GitHub

✨ Add support for live development with running Dagger services · ...

What are you trying to do? I want to be able to make changes to code locally and see them reflected in my running Dagger services similar to how docker-compose and docker run --v works. In particul...

proud oyster Feb 13, 2025, 7:57 AM

#

We will build it I promise 🙂

@hardy lynx if you could note your interest in that issue 👆that would help prioritize it! thanks

hardy lynx Feb 13, 2025, 9:27 AM

#

proud oyster We will build it I promise 🙂 <@291719239814086656> if you could note your inte...

✨ amazing! I know this is a stretch and I will mention it on the issue, but do you think dagger would ever invest in a local web ui similar to tilt? I find this feature of tilt very useful as a dev/devops engineer. It would give visibility into dag status + logs for local services + buttons to trigger dag functions such as tests while hot reload is active. I like tilt features but I am not a fan of their domain specific language and it is ill suited for reuse in a ci pipeline.

proud oyster Feb 13, 2025, 11:55 PM

#

hardy lynx ✨ amazing! I know this is a stretch and I will mention it on the issue, but do y...

We don't have the cycles to develop a local UI (already lots of work developing a Cloud UI...) but I think it would make for a great community project. What's cool is that you could run the UI as a Dagger module. In other words you wouldn't have to run it alongside Dagger on the host system - you could run it as an app on Dagger 🙂

hallow lodge Feb 20, 2025, 4:40 PM

#

Hi everyone, I'm using Dagger to create ephemeral containers for integration tests. Now I have to test an application that is supposed to run inside a Kubernetes cluster (it needs to run virtctl to connect to a virtual machine created via Kubevirt), is there a way to create such an environment using Dagger?

#

To be more precise: I'd like to spin up a cluster, and inside of it a VM and a pod running my containerized application that I want to test

#

It might be completely out of scope wrt what Dagger can do so I figured I'd be better off asking before trying something impossible 😅

gritty coyote Feb 20, 2025, 4:42 PM

#

hallow lodge Hi everyone, I'm using Dagger to create ephemeral containers for integration tes...

I'm not sure there is a module to create the infra (could be wrong). I think most of the team uses k3s. Might be worth looking at Daggerverse for some potential matches: https://daggerverse.dev/search?q=kubernetes

kubernetes :: Daggerverse

Search Daggerverse modules.

gritty coyote Feb 20, 2025, 4:43 PM

#

hallow lodge It might be completely out of scope wrt what Dagger can do so I figured I'd be b...

When you write Dagger modules, it's your code - so the sky is the limit... it's more "what can't you do with Dagger"? 😁

#

It's usually more about deploying to k8s than it is provisioning... that being said... here is an example of using Dagger (without a container) to interact directly with the AWS SDK in Python... https://github.com/jasonmccallister/aws-dagger-example/blob/main/.dagger/src/aws_dagger_example/main.py#L94

If you can write a module/function that wraps the commands needed for virtctl, you can call it in your code before doing the rest of the container work that depends on the cluster being created

fringe wagon Feb 22, 2025, 3:21 PM

#

Ola!
I had an issue trying to deploy dagger engine using helm on our CI clusters. We’re using flux and I think there’s a bug with the version label when version contains invalid characters.
Anyway, I proposed https://github.com/dagger/dagger/pull/9679 which should address the issue.

GitHub

fix(helm): sanitize version label by b4nst · Pull Request #9679 · d...

The Helm app.kubernetes.io/version label is not sanitized. This can lead to incorrect template generation on some system updating the version.
For example with flux, we need to use an OCI registry ...
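
To make the label problem concrete, here is a minimal sketch of sanitizing a version string into a valid Kubernetes label value. It is not necessarily what the PR does; the example version string is made up, and the rules encoded are the general Kubernetes ones (alphanumerics, '-', '_', '.', alphanumeric at both ends, at most 63 characters).

```python
import re


def sanitize_label_value(value, max_len=63):
    """Coerce `value` into a valid Kubernetes label value.

    Label values may contain only alphanumerics, '-', '_' and '.',
    must start and end with an alphanumeric, and are at most 63 chars.
    """
    cleaned = re.sub(r"[^A-Za-z0-9._-]", "_", value)
    cleaned = cleaned[:max_len]
    # Strip leading/trailing characters that are not alphanumeric.
    return cleaned.strip("._-")


# A semver string with build metadata; '+' is not allowed in label values.
label = sanitize_label_value("0.16.2+sha.abc123")
print(label)
```

Helm charts commonly handle the same case in templates by replacing "+" with "_" before emitting the label.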

bitter idol Mar 12, 2025, 10:43 AM

#

Who's headed to Kubecon? 😄

daring wraith Mar 12, 2025, 10:48 AM

#

a few of us from dagger will be there, we've got a booth 😄 looking forward to seeing ya 👋

bitter idol Mar 12, 2025, 10:50 AM

#

Sweet! Thought I saw you guys on the sponsor list - do you know which booth you're at?

daring wraith Mar 12, 2025, 10:53 AM

#

oh i actually don't know off the top of my head! I think @south thorn might?

#

i do know we have a hack night that we're running on the tuesday: https://kccnceu2025.sched.com/event/1txLA/dagger-hack-night-hosted-by-dagger

KubeCon + CloudNativeCon Europe 2025: Dagger Hack Night Hosted by D...

View more about this event at KubeCon + CloudNativeCon Europe 2025

harsh creek Mar 12, 2025, 12:13 PM

#

This might actually belong here

south thorn Mar 12, 2025, 4:17 PM

#

bitter idol Sweet! Thought I saw you guys on the sponsor list - do you know which booth you'...

We will be at Platform Engineering Day (co-lo day before KubeCon). You'll see us at a table there.

At KubeCon, our booth is #N453.

And as Justin mentioned, we'd love to see you at the Hack Night too on April 1st! You don't need a KubeCon ticket to attend, so everyone is welcome!

Make sure to register to save your spot: https://lu.ma/hlx7s6ym

Dagger Hack Night: From Platform Engineering to Agent Engineering ·...

Dagger is an open-source runtime for composable workflows—perfect for AI agents and CI/CD automation alike. Whether you’re streamlining DevOps with modular,…

peak bolt Mar 22, 2025, 4:17 PM

#

is it possible to point my dagger CLI at the dagger engine running on a remote cluster?

peak bolt Mar 22, 2025, 4:28 PM

#

peak bolt is it possible to point my dagger CLI at the dagger engine running on a remote c...

to clarify, i want to run the dagger cli on my laptop.

shrewd hazel Mar 23, 2025, 6:22 AM

#

peak bolt to clarify, i want to run the dagger cli on my laptop.

AFAIK, this isn't possible, because the communication between the engine and runners is internal to the node they are both running on. Dagger doesn't have a service to connect to. But AFAIK, you should be able to just install Dagger CLI locally and run your modules locally. The idea being, you can run the tasks/ pipelines anywhere you would have a docker-like runtime running.

Just FYI (and humble bragging 😊), in my k8s setup, I'm running the CLI inside a [Coder workspace](https://coder.com/docs/user-guides/workspace-access) on the node with the engine for development of the CI pipeline. This allows me to set up things like webhooks and the like to control CI processes and be able to see them work "in action".

gritty coyote Mar 23, 2025, 10:49 PM

#

peak bolt is it possible to point my dagger CLI at the dagger engine running on a remote c...

I believe you are looking for this configuration? Setting _EXPERIMENTAL_DAGGER_RUNNER_HOST will allow you to run the engine remotely

https://docs.dagger.io/configuration/custom-runner/

Custom Runner | Dagger

A runner is the "backend" of Dagger where containers are actually executed.

shrewd hazel Mar 24, 2025, 6:30 AM

#

gritty coyote I believe you are looking for this configuration? Setting `_EXPERIMENTAL_DAGGER_...

@gritty coyote - @peak bolt mentioned a "remote cluster", which means k8s to me. I guess that needs clarification, but if Dagger is running in a k8s cluster, there is no service for it and no way to create a service for it, AFAIK (I'd love to be told I'm wrong). Without making a service available, there is no way to get access to the runner from outside the cluster. The only thing that points to me being wrong is the tcp option. But then, one would need to know how to present the port from the runner/ engine perspective and that is the piece of knowledge I'm missing to also help.

vale ore Mar 24, 2025, 10:32 AM

#

shrewd hazel <@418233653592719364> - <@1073852738967965706> mentioned a "remote cluster", whi...

You can def connect to remote resources, its a common pattern and lots of folks do it.

The main "gotcha" is you're responsible for securing the connection yourself.

So tcp is a decent choice if you are port forwarding via SSH or using something like tailscale.

In the same way that kubectl can connect to a remote k8s service, the dagger CLI can point to a pod using this form: kube-pod://<podname>?context=<context>&namespace=<namespace>&container=<container> - as long as your kubectl is already configured to know how to reach this service.
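
A small sketch of assembling that runner host string, assuming the `kube-pod://` form quoted above and the `_EXPERIMENTAL_DAGGER_RUNNER_HOST` variable mentioned earlier in the channel. The pod, context, and namespace names are placeholders.

```python
from urllib.parse import urlencode


def kube_pod_runner_host(pod, context=None, namespace=None, container=None):
    """Build a kube-pod:// runner host string in the form quoted above.

    Only the parameters that are set are added to the query string.
    """
    params = {
        "context": context,
        "namespace": namespace,
        "container": container,
    }
    query = urlencode({k: v for k, v in params.items() if v})
    return f"kube-pod://{pod}" + (f"?{query}" if query else "")


# Placeholder pod, context, and namespace names.
host = kube_pod_runner_host(
    "dagger-engine-abc12", context="my-cluster", namespace="dagger"
)
# Quoted so the shell does not interpret the '&' in the query string.
print(f"export _EXPERIMENTAL_DAGGER_RUNNER_HOST={host!r}")
```

With that variable exported, the local CLI goes through your kubeconfig to reach the engine pod, so access control stays whatever your cluster already enforces.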

peak bolt Mar 24, 2025, 12:19 PM

#

vale ore You can def connect to remote resources, its a common pattern and lots of folks ...

yes this is what i did and it worked, thanks!

shrewd hazel Mar 24, 2025, 12:51 PM

#

vale ore You can def connect to remote resources, its a common pattern and lots of folks ...

I'm confused. I'm looking at my Dagger engine pod in k8s. There are no ports opened on it. My "runner" pods connect via the socket connectivity over the node. How can I connect the CLI over TCP, if there is no port open on the engine pod?

vale ore Mar 24, 2025, 1:30 PM

#

shrewd hazel I'm confused. I'm looking at my Dagger engine pod in k8s. There are no ports ope...

I think you could open a port if you wanted to in the kubernetes config, but instead of doing that I think using the kube-pod: option with a properly configured kubeconfig file is the way to go

The overall point is that there is no dagger-specific way to connect to stuff securely, however you do it today you should choose the best corresponding option from this list: https://docs.dagger.io/configuration/custom-runner/#connection-interface

Custom Runner | Dagger

A runner is the "backend" of Dagger where containers are actually executed.

shrewd hazel Mar 24, 2025, 2:38 PM

#

vale ore I think you could open a port if you wanted to in the kubernetes config, but ins...

I'll have to take a look again. I didn't see anything I could adjust to add a port. And using Kubectl isn't a good solution IMHO, because it means every dev needing to just do devOps stuff needs the keys to the castle. 🙂

proud oyster Mar 24, 2025, 7:01 PM

#

It's fair to say that it's possible but experimental - because there are lots of different possible architectures, and lots of different preferences in the community. So we want to learn more before we declare a certain architecture better than others.

shrewd hazel Mar 25, 2025, 5:36 AM

#

I took a second look. There is a port setting for the engine in values.yaml. I set that and the engine pod now has an exposed port, which would allow me to create a service and expose to the outside world (if needed). I'll stick to the socket solution though, as it is working for us. At least now though, I know I can get the engine API exposed, should I need it, which wasn't clear to me before. Thanks all and sorry for my ignorance.

proud oyster Mar 27, 2025, 1:12 AM

#

shrewd hazel I took a second look. There is a port setting for the engine in `values.yaml`. I...

no problem at all. It's all experimental and undocumented so 100% normal to not be aware.

whole notch Apr 4, 2025, 6:36 AM

#

Hi. I have a project with multiple repos in BitBucket. I'm leveraging autoscaled BB CI runners on my EKS cluster to run CI jobs. Now, spinning a dagger engine in every CI job is a bit.. underwhelming. It's wasteful and does not give me the benefit of a dagger cache.

I'm looking into spinning up dagger engine on the same EKS using your helmchart as a daemonset. Instructions say I'm to configure CI runners to point to dagger engine pods. The problem is, my CI runners are auto-scaled and when a CI job is enqueued, there is no way to know on which node it will be scheduled. So if my CI definition yaml points to a pod name, there's a very good chance that pod will be running on a completely different node, in a completely different AZ.

Is there a way to pin dagger-client pods to dagger-engine daemonset pods running on the same node, no matter how many nodes are in the cluster?

EDIT: one of the reasons I'm concerned about this is because some projects have multi-gig codebases. And copying them to another server doesn't feel right, when there already IS a perfectly fine dagger engine running on the same server 🙂
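
One possible approach, sketched under assumptions: since a hostPath volume is node-local, a CI job pod that mounts the engine's socket directory from its own node always talks to the engine on that node, wherever it gets scheduled. The host directory and the socket file name (`engine.sock`) below are assumptions; check what your chart version actually creates on the node.

```python
import json


def client_pod_fragment(host_dir="/run/dagger", socket_name="engine.sock"):
    """Return the pod-spec pieces a CI job pod would need to reach the
    Dagger engine on the same node through a hostPath mount.

    `host_dir` and `socket_name` are assumed values, not confirmed ones.
    """
    mount_path = "/run/dagger"
    return {
        "volumes": [{"name": "dagger-socket", "hostPath": {"path": host_dir}}],
        "volumeMounts": [{"name": "dagger-socket", "mountPath": mount_path}],
        "env": [
            {
                "name": "_EXPERIMENTAL_DAGGER_RUNNER_HOST",
                "value": f"unix://{mount_path}/{socket_name}",
            }
        ],
    }


fragment = client_pod_fragment()
print(json.dumps(fragment, indent=2))
```

No pod name appears anywhere in this setup, which sidesteps the "which node will the job land on" problem for autoscaled runners.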

whole notch Apr 4, 2025, 7:00 AM

#

Hi. I have a project with multiple repos

fringe wagon Apr 24, 2025, 6:00 PM

#

Hey there, I think there's an issue either in the documentation or the Helm chart wrt host mount path.
In the helm chart, the mount is of the form:

/run/dagger-{{ include "dagger.fullname" . }}

which, by default would lead to /run/dagger-dagger.

However, in the doc for the GitLab CI runner, the example mentions host_path = "/run/dagger", which would lead to a corrupted mount. I don't know if it's worth an issue, but just wanted people to be aware.

cerulean mountain Apr 26, 2025, 1:46 PM

#

fringe wagon Hey there, I think there's an issue either in the documentation or the Helm char...

hey there! thanks for reporting. Seems like the helm chart needs fixing. We should open an issue 🙏

#

seems like after this, PR https://github.com/dagger/dagger/pull/9845 if you install the helm chart with the default instructions provided here: https://docs.dagger.io/ci/integrations/kubernetes/#example, then the defined volumes are mounted as follows:

        serviceAccountName: default
        terminationGracePeriodSeconds: 300
        volumes:
        - hostPath:
            path: /var/lib/dagger-dagger-dagger-helm
            type: ""
          name: varlibdagger
        - hostPath:
            path: /run/dagger-dagger-dagger-helm
            type: ""
          name: varrundagger
    updateStrategy:

which seems to me it's a bit odd since we have a good amount of dagger names in the path 😛

Kubernetes | Dagger

This section covers different strategies for deploying Dagger on a Kubernetes cluster.

#

seems like that should be fixed 🙏 cc @low gorge @versed holly

cerulean mountain Apr 26, 2025, 1:58 PM

#

cerulean mountain seems like that should be fixed 🙏 cc <@796825768600141844> <@62839208788007321...

https://github.com/dagger/dagger/issues/10277

GitHub

Default helm chart installation has an odd volume path · Issue #10...

When installing the official Dagger helm chart following the docs here https://docs.dagger.io/ci/integrations/kubernetes/#example, the resulting volume paths seem to contain the dagger word a bit t...

heady dagger Apr 27, 2025, 12:14 PM

#

cerulean mountain seems like after this, PR `https://github.com/dagger/dagger/pull/9845` if you in...

That is super cool. I was wondering how storage might be configured and if the dagger chart is compatible with a Longhorn storage layer.

half canyon May 1, 2025, 9:01 PM

#

cerulean mountain seems like after this, PR `https://github.com/dagger/dagger/pull/9845` if you in...

my result on Rancher is

Volumes:
  varlibdagger:
    Type:          HostPath (bare host directory volume)
    Path:          /var/lib/dagger-dagger-helm
    HostPathType:  
  varrundagger:
    Type:          HostPath (bare host directory volume)
    Path:          /run/dagger-dagger-helm
    HostPathType:

cerulean mountain May 1, 2025, 9:31 PM

#

half canyon my result on Rancher is ``` Volumes: varlibdagger: Type: HostPath...

strange that we're getting different results. Are you installing using the default helm install instructions?

#

        terminationGracePeriodSeconds: 300
        volumes:
        - hostPath:
            path: /var/lib/dagger-dagger-dagger-helm
            type: ""
          name: varlibdagger
        - hostPath:
            path: /run/dagger-dagger-dagger-helm
            type: ""
          name: varrundagger
    updateStrategy:

that's what I get

half canyon May 1, 2025, 10:15 PM

#

I’ll look at my testing output on RKE2 from the other day and see what I got there

half canyon May 2, 2025, 2:23 AM

#

got same results using docs instructions...are we prefixing with ${namespace}- or something?

#

I have some recommendations from ChatGPT...based on the fact that it installs differently through the Rancher UI compared to helm upgrade.

some options are to do one of:

1. Override fullname explicitly in values.yaml:

fullnameOverride: dagger-engine

or
2. Change your fullname helper to just .Release.Name
In _helpers.tpl:

{{- define "dagger.fullname" -}}
{{- .Release.Name | trunc 63 | trimSuffix "-" }}
{{- end }}

Seems we may want to also move toward calling our release dagger vs dagger-helm, but not sure who that would break at this point. Maybe later.

Will explore more tomorrow.

#

Also look at

path: /run/{{ include "dagger.fullname" . }}

and ensure it's not

path: /run/dagger-{{ include "dagger.fullname" . }}
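
A possible explanation for the differing paths, sketched in Python: this mimics the `fullname` helper that `helm create` generates, assuming the Dagger chart uses that standard helper and is named `dagger-helm`. The release names used below are guesses about how each install was done, not confirmed facts.

```python
def helm_fullname(release_name, chart_name, name_override="", fullname_override=""):
    """Mimic the `fullname` helper generated by `helm create`."""

    def clip(s):
        # Equivalent of `trunc 63 | trimSuffix "-"`.
        s = s[:63]
        return s[:-1] if s.endswith("-") else s

    if fullname_override:
        return clip(fullname_override)
    name = name_override or chart_name
    # If the release name already contains the chart name, use it alone.
    if name in release_name:
        return clip(release_name)
    return clip(f"{release_name}-{name}")


def run_path(release_name, chart_name, **overrides):
    """Host path in the form `/run/dagger-<fullname>` quoted above."""
    return "/run/dagger-" + helm_fullname(release_name, chart_name, **overrides)


print(run_path("dagger", "dagger-helm"))       # release named "dagger"
print(run_path("dagger-helm", "dagger-helm"))  # release named like the chart
print(run_path("dagger", "dagger-helm", fullname_override="engine"))
```

Under these assumptions, a release called `dagger` yields the triple-dagger path, while a release named after the chart yields the shorter one, so the release name chosen at install time would account for the difference.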

shrewd hazel May 2, 2025, 5:16 AM

#

cerulean mountain ``` terminationGracePeriodSeconds: 300 volumes: - hostPat...

To add to the strangeness. My Dagger container's volumes.

#

It was a stock install via Rancher UI many moons ago. I've been upgrading regularly though with no problems.

cerulean mountain May 2, 2025, 5:18 AM

#

half canyon I have some recommendations from ChatGPT...based on the fact that it installs di...

I've opened an issue with a small proposal for a fix.

#

https://github.com/dagger/dagger/issues/10277

fallow arrow May 2, 2025, 7:23 PM

#

Hey @cerulean mountain I am trying to use your k3s module within my company and running into a strange issue. I had to point it to our internal registry mirrors. To do that, I have to add /etc/rancher/k3s/registries.yaml so when the server starts up it picks it up. In your module, the folder /etc/rancher/k3s is a cache volume to persist the k3s.yaml (KUBECONFIG). So what I did was modify the module to set K3S_KUBECONFIG_OUTPUT to a different folder and set that as the cache. However, now I can't bring the server up without changing the cache name every time. The config it tries to use is stale (certs are wrong). It works if I change the --name every time. I can't figure out what's going on and was wondering if you ran into something similar.

cerulean mountain May 2, 2025, 7:35 PM

#

fallow arrow Hey <@336241811179962368> I am trying to use your k3s module within my company a...

hey @fallow arrow I was observing a similar thing today. Seems like something changed in k3s (using the latest image in the module 😬 ) and now when running the module with the same cluster twice, you get that certs issue.. I'm currently troubleshooting to find the last version where this was working and pinning the k3s image to that in the meantime

cerulean mountain May 2, 2025, 7:36 PM

#

cerulean mountain hey <@163822683799158784> I was observing a similar thing today. Seems like som...

it's even making our own dagger/dagger Helm chart tests fail

fallow arrow May 2, 2025, 7:41 PM

#

Oh! good to hear we are on the same page! 😄 sorry that it's breaking for you too

#

I could get it to work by removing the /var/lib/rancher cache. But that means it bootstraps a new cluster and resources every time

cerulean mountain May 2, 2025, 7:58 PM

#

fallow arrow I could get it to work by removing the `/var/lib/rancher` cache. But that means...

yep

#

it used to work where the same cluster could be started multiple times though

fallow arrow May 2, 2025, 8:00 PM

#

Right, I was seeing that behavior just a couple of days ago. It's a coincidence that I tested it today by moving the kubeconfig cache and it broke, so it made me think my change broke it

cerulean mountain May 2, 2025, 8:05 PM

#

ok @fallow arrow seems like v1.29.15-k3s1 works

#

I assume it's related to the fact that the IP of the service container changes on every run and the certificates become invalid, which makes sense. That will make it somewhat hard to get working in newer versions.

fallow arrow May 2, 2025, 8:09 PM

#

fwiw, I notice that local-path-provisioner is missing in the latest version on kube-system

cerulean mountain May 2, 2025, 8:10 PM

#

#

seems to be there in latest?

fallow arrow May 2, 2025, 8:12 PM

#

hmm I didn't see it.. another thing, every time I start the server it spins up a new node but isn't able to clean up old ones. Are you seeing the same?

#

pods are also stuck in Terminating

#

btw, I see the local-path-provisioner in latest, false alarm

cerulean mountain May 2, 2025, 8:14 PM

#

fallow arrow hmm I didn't see it.. another thing, every time I start the server it spins up a...

haven't checked that in the past tbh 😬 . Was that working before?

fallow arrow May 2, 2025, 8:14 PM

#

cerulean mountain haven't checked that in the past tbh 😬 . Was that working before?

it wasn't working for me since I started testing this. Like last week

#

works fine when I remove the /var/lib/rancher cache though. But then again that bootstraps everything.

#

latest is on v1.32 so that's 3 kube versions ahead of the working 1.29

cerulean mountain May 2, 2025, 8:17 PM

#

fallow arrow latest is on v1.32 so that's 3 kube versions ahead of the working 1.29

yep, I'm aware. Strange that it was working last week, since many versions were released between 1.29 and 1.32

#

ok, found a "good enough" workaround, I believe

#

if I remove the server/tls folder when calling Server, that will bootstrap the TLS certificates automatically. Thing is that the old node will still appear. I believe that's inevitable @fallow arrow since the previous state will still be stored in ETCD

#

I think that has always happened and I'd expect it to be like that. Eventually the node will become NotReady

fallow arrow May 2, 2025, 8:21 PM

#

so rm -rf server/tls before server start?

cerulean mountain May 2, 2025, 8:21 PM

#

fallow arrow so `rm -rf server/tls` before server start?

yep, about to send an update to the module now

fallow arrow May 2, 2025, 8:21 PM

#

what's the full path?

cerulean mountain May 2, 2025, 8:22 PM

#

/var/lib/rancher/k3s/server/tls

fallow arrow May 2, 2025, 8:28 PM

#

I think a cache bust is also needed

cerulean mountain May 2, 2025, 8:30 PM

#

fallow arrow I think a cache bust is also needed

yep, that's what I've added

#

publishing now

fallow arrow May 2, 2025, 8:30 PM

#

does your helm example still work?

cerulean mountain May 2, 2025, 8:31 PM

#

I think I was the one causing this issue in v0.1.9 since I've moved the /var/lib/rancher to a cache volume instead of a mounted temp

cerulean mountain May 2, 2025, 8:31 PM

#

fallow arrow does your helm example still work?

checking.. haven't tried that

fallow arrow May 2, 2025, 8:31 PM

#

Error: INSTALLATION FAILED: Kubernetes cluster unreachable

cerulean mountain May 2, 2025, 8:35 PM

#

seems to work here @fallow arrow

candid verge May 7, 2025, 10:24 AM

#

Hi, I'm running dagger with self hosted github runner pod.
The pod is composed of 2 containers:

github runner with a dagger cli
dind container

Sometimes I see very slow steps in dagger's "internal actions".

29  : loadPackage DONE [11.0s]
22  : go SDK: load runtime DONE [53.4s]
30  : loadPackage DONE [23.2s]

1   : with-source with-aws-creds --src=~/.aws/ with-kube-config --src=~/.kube/ with-remote-ci-config apply --env=dev --region=eu-west-1 --account=eustaging --stack=market_trends --tfPlan=./untracked_files/tfplan_eustaging
10  : │ load module
33  : │ │ inspecting module metadata
16  : │ │ initializing module DONE [1m16s]
18  : ModuleSource.asModule DONE [1m16s]
34  : Module.serve: Void
34  : Module.serve DONE [0.0s]

On some jobs the function execution takes 1min40-2min, but the whole job takes 5-6min.
On another call, where I send a notification to Slack, firing the notification takes about 1s but the whole dagger execution takes 35-40s.

Could this be due to running dagger in a dind container, and should I remove my dind container to use dagger directly with the host engine?
For the moment I haven't installed dagger as a daemonset because, when the dagger version changes for a test / rollout, each team will handle the upgrade by specifying the new agent label.

versed holly May 7, 2025, 12:37 PM

#

candid verge Hi, I'm running dagger with self hosted github runner pod. The pod is composed o...

Hey @candid verge!!

Running Dagger in dind on Kubernetes has caused some performance hits for us, mainly driven by overlayfs slowness. Have you tried running these steps locally? If so, what is the usual performance you see there?

candid verge May 7, 2025, 12:38 PM

#

versed holly Hey @candid verge!! Running Dagger in dind on Kubernetes has caused som...

yes, I ran it. I don't have a number in mind, but it was not that slow

#

so I should try replacing my dind container with the dagger one? (tbh I don't remember why I use dind instead of dagger directly)

versed holly May 7, 2025, 12:45 PM

#

So far running Dagger standalone, connecting via UDS and mounting /var/lib/dagger with xfs has given us the best performance improvements. We wrote about it here: https://dagger.io/blog/argo-cd-kubernetes. There is a section called "Nodes" where we briefly explain how we did it and the reason behind it.

You can try that out! Out of curiosity, are there any CPU limits on the runner container? What kind of hardware are you rocking there?
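
A rough sketch of what that setup could look like as a sidecar pod. This is only an illustration under assumptions, not the configuration from the blog post: the pod name, the runner image, and the `/mnt/dagger-xfs` host directory are hypothetical, and the socket path is the `engine.sock` location reported for recent engine versions later in this channel.

```yaml
# Sketch only: names, images, and the volume layout are assumptions.
apiVersion: v1
kind: Pod
metadata:
  name: runner-with-dagger            # hypothetical
spec:
  containers:
    - name: runner
      image: example.com/ci-runner:latest   # placeholder runner image with the dagger CLI
      env:
        # Point the CLI at the engine's unix domain socket (UDS).
        - name: _EXPERIMENTAL_DAGGER_RUNNER_HOST
          value: unix:///run/dagger/engine.sock
      volumeMounts:
        - name: dagger-socket
          mountPath: /run/dagger
    - name: dagger-engine
      image: registry.dagger.io/engine:v0.18.8
      securityContext:
        privileged: true              # the engine runs its own containers
      volumeMounts:
        - name: dagger-socket         # shared with the runner so it can see the socket
          mountPath: /run/dagger
        - name: dagger-state          # engine state and cache
          mountPath: /var/lib/dagger
  volumes:
    - name: dagger-socket
      emptyDir: {}
    - name: dagger-state
      hostPath:
        path: /mnt/dagger-xfs         # hypothetical xfs-formatted mount on the node
        type: Directory
```

With a hostPath like this, only one engine should use the state directory at a time, so it fits one runner pod per node or a per-pod volume.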

#

Happy to help with the setup 👍

candid verge May 7, 2025, 1:19 PM

#

versed holly So far running Dagger standalone, connecting via UDS and mounting `/var/lib/dagg...

there is no cpu limit, only a request. It's running on ec2 with these families of instances: "c6a.large", "c6a.xlarge", "c6a.2xlarge", "c5a.large", "c5a.xlarge", "c5a.2xlarge"

#

it's running in an eks cluster

#

We don't have a specific hardware setup. Actually it's the first time we are deploying dagger in our CI, with the goal of migrating to it.
I will try to set up dagger as a sidecar of the container + a volume with xfs + UDS

#

I'll keep you posted next week; I'll be off at the end of this week

#

thank you for all this information

versed holly May 7, 2025, 4:36 PM

#

candid verge thank you for all this information

No problem!! Happy to pair a bit on an audio channel if you need!

hasty briar May 7, 2025, 8:17 PM

#

I'm unsure where this one belongs, #kubernetes or #github , but here goes. I'm hosting GitHub Runners via ARC in AKS. All has been fine up to (and including) v0.16.1 using the Dagger Helm Chart, and hostPath mounts from the runner pods for /var/run/dagger.

With v0.18.6 (from v0.16.1) this is now failing. My runners are unable to connect to the Dagger Engine deployed to the Host. My pipelines endlessly error with:

! connection error: desc = "transport: Error while dialing: dial unix /run/dagger/engine.sock: connect: no such file or directory"
moby.buildkit.v1.Control/Info
moby.buildkit.v1.Control/Info ERROR [0.0s]

One oddity I have spotted is that the runner pods have /run/dagger/buildkitd.sock, but the Dagger Engine DaemonSet pods have /run/dagger/engine.sock

Even though my RunnerDeployment configuration is (with cuts for sensitivity) this:

apiVersion: actions.summerwind.dev/v1alpha1
kind: RunnerDeployment
metadata:
  name: dagger-runners
  namespace: runner-pool
spec:
  replicas: 2
  template:
    spec:
      organization: --snip--
      image: ghcr.io/--snip--:v0.61.0
      labels:
        - build
        - dagger-runner
      dockerEnabled: true
      dockerdWithinRunnerContainer: true
      securityContext:
        fsGroup: 1001
        fsGroupChangePolicy: OnRootMismatch
      containers:
        - name: runner
          volumeMounts:
            - name: dagger
              mountPath: /run/dagger
          env:
            - name: _EXPERIMENTAL_DAGGER_RUNNER_HOST
              value: unix:///run/dagger/engine.sock
      volumes:
        - name: dagger
          hostPath:
            path: /run/dagger

I'm not sure where the buildkitd.sock is coming from, and why the engine.sock is not available on the runner. Any pointers? 🥹

proud oyster May 7, 2025, 9:50 PM

#

@eternal kraken @versed holly 👆 at first glance this looks like a case of the engine changing its default socket path, and either 1) the latest helm chart still using the old path, or 2) latest helm chart using the correct path but needing some sort of manual migration from older helm chart?

cerulean mountain May 8, 2025, 3:38 AM

#

proud oyster @eternal kraken @versed holly 👆 at first glance this looks like ...

this: https://github.com/dagger/dagger/issues/10277

GitHub

Default helm chart installation has an odd volume path · Issue #10...

When installing the official Dagger helm chart following the docs here https://docs.dagger.io/ci/integrations/kubernetes/#example, the resulting volume paths seem to contain the dagger word a bit t...

cerulean mountain May 8, 2025, 3:39 AM

#

cerulean mountain this: https://github.com/dagger/dagger/issues/10277

waiting for Matias's and Gerhard's comment there.

candid verge May 19, 2025, 10:16 AM

#

Hi, I'm trying to deploy dagger in kubernetes as a sidecar of my github runner, but when I start my runner the dagger engine crashes, and it's not very clear what I'm missing.
I'm sharing my github runner manifest + pod logs + events.
(This is a first step; the second step will be to create a PVC with xfs, but before creating a webhook mutator I wanted to validate the setup.)

📎 pod.log 📎 pod_event.txt 📎 gh_runner.yaml

limber folio May 23, 2025, 1:35 PM

#

What permissions does the dagger cli need in order to talk to the dagger engine instance running in cluster?

versed holly May 23, 2025, 4:22 PM

#

limber folio What permissions does the dagger cli need in order to talk to the dagger engine ...

It doesn't require any special permissions. You have a few ways of connecting to the engine; the two common ones when running in a cluster are TCP and a unix domain socket. For the latter you would need to mount the correct socket. But in terms of permissions you are good!

limber folio May 23, 2025, 5:06 PM

#

Sorry, should have been more specific, I was using the kube-pod URL setting in the env var, which seemed to require a service account that can access the kube API. Thanks for getting back to me.

I'm trying to configure my gitlab runner to use it and am finding it to be quite slow but I can't tell where it is slow because the dagger cli was not giving any output, I've just added a -v and am now getting output for the job

limber folio May 23, 2025, 5:24 PM

#

🤔 -v only worked in outputting the log on one job

versed holly May 23, 2025, 6:44 PM

#

limber folio Sorry, should have been more specific, I was using the `kube-pod` URL setting in...

Ah, good to know 👍 . Does that mean you were able to make it work with the sa configured? Internally buildkit ends up doing kubectl exec so it needs create on pods/exec and get on pods (FYI: https://github.com/moby/buildkit/blob/0f85fe73978ed3d0b40d935438fa0a16eebbb0ae/client/connhelper/kubepod/kubepod.go#L21-L33)

Performance using a tunnel is usually not great even for simple tasks. Is the gitlab runner running on the same cluster as the engine?
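
For reference, the permissions mentioned above (`get` on pods, `create` on pods/exec) could be granted with something like the following. It is a minimal sketch: the `dagger` namespace and the `dagger-client` / `dagger-kube-pod` names are hypothetical, and the Role has to live in the namespace where the engine pods run.

```yaml
# Sketch only: namespace and resource names are assumptions.
apiVersion: v1
kind: ServiceAccount
metadata:
  name: dagger-client
  namespace: dagger
---
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: dagger-kube-pod
  namespace: dagger
rules:
  # kube-pod:// looks up the engine pod...
  - apiGroups: [""]
    resources: ["pods"]
    verbs: ["get"]
  # ...and then tunnels the connection through an exec session.
  - apiGroups: [""]
    resources: ["pods/exec"]
    verbs: ["create"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: dagger-kube-pod
  namespace: dagger
subjects:
  - kind: ServiceAccount
    name: dagger-client
    namespace: dagger
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: Role
  name: dagger-kube-pod
```

The CI job pod would then run with `serviceAccountName: dagger-client`. If the runner lives in another namespace, the subject's `namespace` should point there instead.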

limber folio May 23, 2025, 6:46 PM

#

Yeah, got it working with the properly configured sa and role. So would it be better to use tcp://<address:port> connection rather than kube-pod://<podname>?

#

The runner is on the same cluster as the dagger engine pods; they are set up in the daemonset from the helm chart

versed holly May 23, 2025, 6:52 PM

#

TCP would definitely be better. Nothing beats a unix domain socket though, so if they are running on the same host you could mount /var/run/buildkit and use unix:///var/run/buildkit/buildkitd.sock. That said, I think I misread your message

The heavy lifting is mostly done by the engine itself. If the pipeline is being slow then there's a few things worth checking. My first question would be: are you sending a dagger.Directory from the client to the engine? If so, I would first test using tcp:// (preferably unix://) instead

limber folio May 23, 2025, 6:54 PM

#

Yes, we are loading the directory, I'll see about mounting the socket

versed holly May 23, 2025, 6:55 PM

#

Nice, let me know if I can help! If you are using our helm chart, then doing this on the gitlab runner pod should be enough:

# container spec of the runner
env:
  - name: _EXPERIMENTAL_DAGGER_RUNNER_HOST
    value: unix:///var/run/buildkit/buildkitd.sock
volumeMounts:
  - name: varrundagger
    mountPath: /var/run/buildkit
# pod spec
volumes:
  - name: varrundagger
    hostPath:
      path: /var/run/dagger

limber folio May 23, 2025, 6:59 PM

#

On the pod or in the runner config.toml? I assume the latter

versed holly May 23, 2025, 7:00 PM

#

I'm not entirely sure how Gitlab runners are configured. However it is, the end result needs to be that there is a volume that mounts the host's /var/run/buildkit directory into the container of the runner

#

That way you have access to the socket the dagger engine opened

limber folio May 23, 2025, 7:14 PM

#

Just looking at the dagger pod, it seems to have /run/dagger-dagger-engine-dagger-helm and /var/lib/dagger... should I be using the run path?

versed holly May 23, 2025, 7:21 PM

#

You should. How did you install the helm chart? We'll be fixing that name soon (FYI: https://github.com/dagger/dagger/issues/10277)

limber folio May 23, 2025, 7:23 PM

#

Installed via the oci image with flux helm repo and release

#

And the socket is engine.sock

#

Job completed in 5 minutes, still no log output showing. Assuming the pods do not replicate their cache between each other?

#

Anyway, gonna leave it alone for now and check it again on Monday, thanks for the help @versed holly

limber folio May 24, 2025, 9:00 AM

#

couldn't help myself, took a look again this morning: removed the devbox setup, and ran registry.dagger.io/engine:v0.18.8 directly as the CI job image with the dagger call commands directly instead of via devbox run, and it's all a lot faster. Makes me wonder what devbox is doing that impacts things so much.

visual abyss Jun 11, 2025, 1:34 PM

#

Hi, I am running the Dagger engine inside a K3s cluster. The K3s pods can utilize the NVIDIA GPU, but the Dagger builds cannot.

I’m getting the following error when running:
dagger -m github.com/samalba/dagger-modules/nvidia-gpu call has-gpu

I deployed the engine using the Helm chart with the following overrides (see the photo).

Has anyone tested GPU utilization from the engine running inside Kubernetes?
Thanks in advance.

versed holly Jun 13, 2025, 4:47 PM

#

visual abyss Hi, I am running the Dagger engine inside a K3s cluster. The K3s pods can utiliz...

Hey @visual abyss!!

I haven't tested Dagger with Nvidia in a production setup yet. I think @cerulean mountain or @low gorge know somebody that has. If not I can put some time in next week and try to repro!

visual abyss Jun 13, 2025, 7:28 PM

#

versed holly Hey @visual abyss!! I haven't yet tested Dagger with Nvidia in a produc...

Thanks, @versed holly for the reply. Any help would be much appreciated.

harsh creek Jun 18, 2025, 12:53 PM

#

I am running Dagger in k8s using ephemeral-storage. I have set resource limits, but the Dagger engine does not seem to be aware of the virtual disk size and thus does not garbage collect, causing DiskPressure and eventual eviction. How do I make Dagger aware of the disk size? I have no desire to use a custom gc policy unless that is the only way to handle it

harsh comet Jun 27, 2025, 7:09 PM

#

We are running two Dagger Engines in two parallel running Argo Workflows/Steps. One of the two was unable to connect to the Dagger sidecar via socket. Is there a limitation?

harsh comet Jun 30, 2025, 9:12 PM

#

I've been trying for two days to include a docker registry certificate and cannot get it to work. It seems the only options are to build a custom engine docker image or to mount the certificate into the engine image. No other, simpler configuration seems to be available.

high ermine Jul 1, 2025, 1:50 PM

#

Hi, I'm trying to deploy dagger on a K8s cluster but I have an issue with the readiness probe. Even when I use the readiness probe of the official dagger engine image, it fails and I get the following Event: Error: start engine: no driver for scheme "" found

Used image: registry.dagger.io/engine:v0.18.2

Could anyone give any hint about how this can be fixed ?

daring wraith Jul 1, 2025, 2:06 PM

#

high ermine Hi, I do try to deploy dagger on a K8s cluster but I have an issues with readine...

have you set _EXPERIMENTAL_DAGGER_RUNNER_HOST anywhere?

high ermine Jul 1, 2025, 2:06 PM

#

daring wraith have you set `_EXPERIMENTAL_DAGGER_RUNNER_HOST` anywhere?

yes

daring wraith Jul 1, 2025, 2:07 PM

#

what have you set it to? 👀

high ermine Jul 1, 2025, 2:08 PM

#

# Environment variables
env:
  - name: _EXPERIMENTAL_DAGGER_RUNNER_HOST
    value: "kubernetes"

#

it's my first time using it, so most probably I'm doing it wrong

daring wraith Jul 1, 2025, 2:09 PM

#

yup, that's wrong. you need to use the format documented here: https://docs.dagger.io/configuration/custom-runner/#connection-interface

Custom Runner | Dagger

A runner is the "backend" of Dagger where containers are actually executed.
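
To illustrate why `"kubernetes"` fails with `no driver for scheme ""`: the value has to be a URL whose scheme selects the connection driver. The sketch below only uses schemes that come up in this channel; the Service name, port, pod name, and namespace are hypothetical placeholders.

```yaml
env:
  - name: _EXPERIMENTAL_DAGGER_RUNNER_HOST
    # Pick one value; the part before "://" tells the CLI how to connect.
    value: unix:///run/dagger/engine.sock                  # socket mounted into this pod
    # value: tcp://dagger-engine.dagger.svc:1234           # hypothetical Service and port
    # value: kube-pod://<podname>?namespace=<namespace>    # tunnels through the kube API
```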

high ermine Jul 1, 2025, 2:12 PM

#

Thank you

high ermine Jul 1, 2025, 8:18 PM

#

~~is there any other way to connect from another pod to dagger without using _EXPERIMENTAL_DAGGER_RUNNER_HOST ? Don't know, having dagger behind a service for example ?~~

#

~~I mean, inside of same K8s cluster~~

twilit dust Jul 5, 2025, 6:12 PM

#

Anyone running dagger in an openshift environment? Tried the instructions but I have not had success. I think due to SCC issues?

shrewd hazel Jul 6, 2025, 2:24 AM

#

@twilit dust - Yeah. Dagger needs root privileges to do its thing. So, SCC will definitely get in the way. Though, I believe you can get around it. You just have to set up a service account with wider permissions and assign it to Dagger to use.
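
One way the "service account with wider permissions" idea is commonly expressed on OpenShift is a RoleBinding to the privileged SCC. This is a sketch under assumptions: it presumes the cluster ships the `system:openshift:scc:privileged` ClusterRole (as recent OpenShift 4 releases do), and the `dagger` namespace and `dagger-engine` names are hypothetical.

```yaml
# Sketch only: names are assumptions; requires cluster-admin approval in practice.
apiVersion: v1
kind: ServiceAccount
metadata:
  name: dagger-engine
  namespace: dagger
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: dagger-engine-privileged-scc
  namespace: dagger
subjects:
  - kind: ServiceAccount
    name: dagger-engine
    namespace: dagger
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: system:openshift:scc:privileged   # lets pods of this SA use the privileged SCC
```

The engine pods would then set `serviceAccountName: dagger-engine`. Whether that is acceptable depends entirely on the cluster's security policy.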

twilit dust Jul 15, 2025, 1:50 AM

#

shrewd hazel @twilit dust - Yeah. Dagger [needs root privileges to do its thing](ht...

That's gonna be a showstopper: DoD regs / STIGs and all, plus airgap

shrewd hazel Jul 15, 2025, 3:27 AM

#

twilit dust Thats gonna be a show stopper, DoD Regs / stigs and all plus airgap and all

In the file is an AI answer (Gemini), but it might be helpful and would be interesting to see what the experts say about the answer's exactness/ truthfulness/ correctness. 🙂

📎 message.txt

wet linden Jul 17, 2025, 1:05 PM

#

Hey, I was going over the docs on how to set up k8s.

wet linden Jul 17, 2025, 1:46 PM

#

There are a couple of things I want to understand and ask:

1. Currently we run our CI in k8s with temporary nodes, i.e. the number of nodes increases or decreases based on load. Our current pipeline looks something like this: Git push -> Trigger GHA -> GHA controller creates a new pod to run the pipeline -> New pod builds and pushes ...... Now:
1.a: The 1st link says I am required to install the dagger CLI locally. With 0.18.12, is it still required?
1.b: Because of temp nodes, we won't have a predictable pipeline time, since nodes would be deleted. To prevent this, I was thinking of mounting an EFS to the nodes, where we can store the docker cache. I found this tool to mount it: https://github.com/kubernetes-sigs/aws-efs-csi-driver
2. My understanding is that the Dagger engine caches its pipeline in Docker's cache, so there is no separate path that needs to be cached.
3. Dagger would still need to be triggered by GitHub Actions. What will I need to change here? If the dagger CLI isn't installed, how will this work? There's a GitHub Action for Dagger, which will install the CLI I assume. But that CLI will be installed inside GitHub's pod, so how will it talk to Dagger's controller?

If anyone has done 3, please share the steps.

PS: I have little to no knowledge of k8s, compared to containers.

#

Also, what I said above, is this even possible?

proud oyster Jul 17, 2025, 2:02 PM

#

@wet linden the Dagger engine does not use the Docker cache (or any other feature of docker). It stores its cache in a local state directory.

If you run the Dagger engine in Docker or Kubernetes (pretty common), then that local state directory will be in a volume. It's up to you to manage that volume to balance data persistence, performance, reliability etc.

However you should be mindful of the following:

Dagger does not support concurrent writes to its state directory
Dagger is very IO-intensive, so if you mount its state directory from a remote source with poor IO latency, you will get poor performance

wet linden Jul 17, 2025, 2:29 PM

#

proud oyster @wet linden the Dagger engine does not use the Docker cache (or any ot...

Thanks. Then I guess mounting EFS, even for Dagger, doesn't make sense.

In my initial PoC, I ran a persistent node with the dagger CLI installed in a self-hosted GitHub Actions runner. But GHA became our bottleneck since it only supported 1 job at a time, while one dagger engine can run multiple pipelines at a time (as far as I know and based on some testing).

So, I know Dagger has experimental support (https://github.com/dagger/dagger/issues/9516 and/or https://docs.dagger.io/configuration/custom-runner/#connection-interface) for a remote engine, i.e. the dagger client/CLI can run in GHA pods and the engine can run on a persistent machine. This should ideally be the best solution. Right?

proud oyster Jul 17, 2025, 2:35 PM

#

wet linden Thanks. Then I guess mounting EFS to even to dagger doesn't make sense. In my i...

From an infra storage point of view, it's pretty similar to eg. running a database with its state in a docker or kubernetes volume

proud oyster Jul 17, 2025, 2:40 PM

#

wet linden Thanks. Then I guess mounting EFS to even to dagger doesn't make sense. In my i...

Yes, with the current version of the engine there are 2 well-tested architectures for a self-hosted Github Actions cluster on Kubernetes:

Run a dagger engine on each node of your CI cluster, using a daemonset. Then configure your CI runner to connect to its local node's dagger engine using a unix socket. This is the default configuration in our official helm chart, and in the docs.
Run a dagger engine on a separate machine, and connect to it remotely with _EXPERIMENTAL_DAGGER_RUNNER_HOST. We've labeled it experimental to reserve the right to break the protocol in future releases, but it works well. You can also run a cluster of engines, and load-balance across them, although that's slightly less charted territory. And blindly load-balancing a wide variety of dagger workloads tends to lower your cache hit rate (cache locality is strongest for successive runs of the same pipeline). One very promising architecture is to run dedicated engines for certain pipelines, and configure _EXPERIMENTAL_DAGGER_RUNNER_HOST so that successive runs of the same workflow are always routed to the same engine (or pool of engines).

#

Typically, the main constraint for any architecture is cache distribution.

We're working hard to decouple storage and compute in the engine, which will make everything much simpler. Soon!

glad locust Jul 17, 2025, 5:05 PM

#

For 3

Dagger would still need to be triggered by GitHub Actions. What will I need to change here. If dagger CLI isn't installed, how will this work? There's a GitHub actions for Dagger, which will install the CLI I assume. But that CLI will be installed inside GitHub's pod, so how will it talk to Dagger's controller?

That's correct, the dagger-for-github action (https://github.com/dagger/dagger-for-github) will install the CLI. Some more examples here: https://docs.dagger.io/ci/integrations/github-actions

The github actions pod will be able to use the dagger CLI to run your dagger functions, and those will be executed on the dagger engine specified by _EXPERIMENTAL_DAGGER_RUNNER_HOST like solomon mentioned. More info on that here https://docs.dagger.io/configuration/custom-runner/#connection-interface

wet linden Jul 18, 2025, 12:23 PM

#

Thanks both of you. Now I have a better understanding of how dagger works and what I need to do to reach a desired solution.

#

Just curious @proud oyster, doesn't decoupling the CLI and engine risk Dagger (Cloud's) business? From what I remember, one of the features Dagger Cloud offers is cross-region caching. I believe this experimental feature makes it possible, in a way, to replicate that?

#

@glad locust with this experimental flag, we will most likely be using the 6th option (IP address and port). But I don't see a port being exposed by the dagger engine in docker ps, nor is it mentioned anywhere in the docs.

proud oyster Jul 18, 2025, 12:32 PM

#

wet linden Just curious @proud oyster, decoupling CLI and engine doesn't risk the D...

We used to sell hosted distributed caching as an experimental service, but paused that, because of the engine limitations.

Yes, once we decouple storage and compute, we'll make it easier to e.g. use an S3 bucket for shared cache distribution. In theory you could say it hurts our business opportunities. But in practice, not really - storing engine cache on an S3 bucket should be the bare minimum. There is a lot more value that a commercial product can add beyond that.

#

Also: you mention decoupling dagger CLI and engine, but that's different and not what I mean by "decoupling compute and storage"

wet linden Jul 18, 2025, 12:43 PM

#

Hmmm, makes sense.

you mention decoupling dagger CLI and engine, but that's different and not what I mean by "decoupling compute and storage"

Yeah, I was reading https://github.com/dagger/dagger/issues/9516 and confused CLI/Engine with compute and storage.
So what the team is trying to achieve is: separation of responsibility between CLI and Engine and, within the engine, separation of compute and storage.

Great work man. I just feel like any org that I join and has a broken CI/CD, I ask them to replace it with Dagger. Like an elixir which heals everything XD.

glad locust Jul 18, 2025, 3:09 PM

#

wet linden @glad locust with this experimental flag, we will most likey be using t...

You can configure the engine to listen on a tcp port by passing the extra args --addr tcp://0.0.0.0:1234 to the engine container. However, if you're running the engine on the same node as your CI runners, either as a sidecar or daemonset, connecting over a unix socket is a more common approach achieved by creating a shared volume between the CI runner pod and engine pod for the socket
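
On Kubernetes, the same idea might be sketched as below. This is an assumption-laden illustration: it presumes the engine image passes container `args` through to the engine process (as the message above implies for extra args), port 1234 is just the example port from that message, and `<engine-address>` is a placeholder left for you to fill in.

```yaml
# Engine side: fragment of a pod template (sketch only).
containers:
  - name: dagger-engine
    image: registry.dagger.io/engine:v0.18.8
    args: ["--addr", "tcp://0.0.0.0:1234"]   # listen on TCP, all interfaces
    securityContext:
      privileged: true
    ports:
      - containerPort: 1234
---
# Client side: environment of the CI job that runs the dagger CLI.
env:
  - name: _EXPERIMENTAL_DAGGER_RUNNER_HOST
    value: tcp://<engine-address>:1234
```

A plain TCP listener like this should only be reachable from networks you trust.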

wet linden Jul 21, 2025, 5:33 AM

#

glad locust You can configure the engine to listen on a tcp port by passing the extra args `...

Hey Kyle,
If the current setup is: Remote Dagger Engine and GHA running in k8s pods.
Then to do a dagger call, I would need to do a partial clone of the repo in the k8s pod and run dagger on the remote machine. How will it do the build if all the source code that needs to be built resides on the k8s pod?

One hack I know around this: do only a partial clone of the repo in the pod, then clone the repo again in a function and pass the source code to the build function.

Is there a better way to do this?

glad locust Jul 21, 2025, 7:23 PM

#

wet linden Hey Kyle, If the current setup is: Remote Dagger Engine and GHA running in k8 po...

regardless of where it runs, dagger always has this client<->server relationship between the CLI and engine. Anything passed to a function through an argument, like your source Directory, is transferred to the engine at runtime and likewise anything exported from the engine is transferred to the client side.

You mention partial clone - if the entire source needed to build isn't cloned for the dagger call, you could optimize this interaction by passing the git ref you want to build as the argument rather than a local directory. For example dagger call build --source https://github.com/myorg/myproject@abcd1234 instead of dagger call build --source .
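
As a sketch of how that could look in a self-hosted GitHub Actions job: the workflow below passes the pushed commit as a git ref, so the runner pod never uploads a local checkout. Assumptions are labeled in the comments. The `dagger-runner` label, the engine address, and the `build` function with its `--source` argument are illustrative, the ref syntax simply mirrors the example in the message above, and the dagger CLI is assumed to be installed on the runner already.

```yaml
# Sketch only: labels, addresses, and function names are assumptions.
name: build
on: push
jobs:
  build:
    runs-on: [self-hosted, dagger-runner]
    env:
      # Hypothetical remote engine; a unix:// socket works the same way.
      _EXPERIMENTAL_DAGGER_RUNNER_HOST: tcp://dagger-engine.internal:1234
    steps:
      # No checkout step: the engine fetches the source from the git ref itself.
      - name: Build from the pushed commit
        run: dagger call build --source "https://github.com/${{ github.repository }}@${{ github.sha }}"
```

For a private repository the engine also needs credentials to fetch the ref, which this sketch does not cover.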

steep thorn Jul 25, 2025, 4:39 PM

#

Quick question, when a dagger container is run on k8s, is that container run as a pod or is the engine using the container runtime directly or is it more at the cgroup level?

proud oyster Jul 25, 2025, 5:03 PM

#

dagger engine runs as a privileged pod on k8s; then it runs its own containers itself by hitting the kernel directly

#

basically dagger can use k8s as a provisioner, but it doesn't rely on it as a runtime

#

same with docker, podman etc

steep thorn Jul 25, 2025, 5:08 PM

#

Okay cool, so would that be through the cgroups api then? Kind does something similar to get CRI-O running in containers

proud oyster Jul 25, 2025, 5:11 PM

#

steep thorn Okay cool, so would that be through the cgroups api then? Kind does something si...

yes cgroups, namespaces etc. As far as linux is concerned it's all the same

Dagger engine bundles the following components:

buildkit
containerd
runc
a bunch of glue

All integrated into a standalone container orchestrator and runtime

steep thorn Jul 25, 2025, 5:19 PM

#

Is the dagger runtime OCI compliant as a result?

fallow olive Jul 29, 2025, 8:30 AM

#

proud oyster dagger engine runs as a privileged pod on k8s; then it runs its own containers i...

Found this message through searching : )
I'm wondering what your stance is on this approach. Basically I love your product and have used it heavily for the last week.
The problem is that we run our CI on Jenkins through Kubernetes (each CI job runs on an ephemeral pod).
My org won't allow me to deploy a privileged pod, since they see it, very fairly, as a vulnerability/threat.
I see Dagger as a product that will help me write better CI/CD code from many perspectives. Hope I'm not being rude here, but I’d really appreciate any advice, documentation, or community insights that could help me communicate the value and security posture of Dagger in a k8s environment. For example, we currently build images with kaniko because of this : )

proud oyster Jul 29, 2025, 11:53 AM

#

steep thorn Is the dagger runtime OCI compliant as a result?

you mean as a drop-in replacement for oci runtimes such as runc? Or a possible cri backend? If that's what you mean, then no.

If you mean that Dagger is interoperable with the OCI image format and registry protocol, then yes 👍

cyan lily Jul 29, 2025, 3:46 PM

#

Based on our previous discussion, it seems that the Dagger Engine uses a container-in-container pattern.
I was wondering about the resource limits and requests defined in the pod spec — should the containers launched by Dagger respect those?
From what I’ve tested on my cluster, it doesn’t seem like they do.

proud oyster Jul 29, 2025, 5:10 PM

#

fallow olive Found this message through searching : ) I'm wondering what's your stand on this...

Hello! It's a fair question. Dagger is vertically integrated: it bundles its own container runtime, orchestrator and cache system. This is what makes its unique features possible.

You should take that vertical integration into account when securing Dagger.

Since Dagger is a system component, you should focus on securing it at the node level, not the pod level. If you consider your dagger workloads untrusted, then you should run them in a separate cluster, or in a segregated set of nodes in the same cluster.
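
A minimal sketch of the "segregated set of nodes" option, assuming the CI nodes have been labeled and tainted with `workload=ci` (a hypothetical key and value) so that only the engine and CI runners land on them. The namespace and resource names are also illustrative.

```yaml
# Sketch only: assumes nodes were prepared with
#   label: workload=ci    taint: workload=ci:NoSchedule
apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: dagger-engine
  namespace: dagger
spec:
  selector:
    matchLabels:
      app: dagger-engine
  template:
    metadata:
      labels:
        app: dagger-engine
    spec:
      nodeSelector:
        workload: ci            # schedule only onto the dedicated CI nodes
      tolerations:
        - key: workload
          operator: Equal
          value: ci
          effect: NoSchedule    # tolerate the taint that keeps other workloads off
      containers:
        - name: dagger-engine
          image: registry.dagger.io/engine:v0.18.8
          securityContext:
            privileged: true
```

The privileged engine then shares a kernel only with other CI workloads, which is the node-level boundary described above.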

fallow olive Jul 29, 2025, 6:32 PM

#

proud oyster Hello! It's a fair question. Dagger is vertically integrated: it bundles its own...

It makes sense to me now 🫠
thanks!

weak cypress Jul 29, 2025, 10:53 PM

#

For what it's worth: that's how we approached our internal security review (we had a very similar environment, Jenkins running in k8s with ephemeral worker pods). It helped a lot when Solomon put it in that context - you need to think of dagger as its own system (not another workload that you bundle into a k8s cluster, if that makes sense).

We have moved our CI workloads into their own k8s cluster, isolated from any other workloads, and run many privileged engines.

steep thorn Jul 30, 2025, 10:43 AM

#

proud oyster you mean as a drop-in replacement for oci runtimes such as runc? Or a possible c...

Thanks for the clarification!

nova dome Aug 11, 2025, 6:32 PM

#

Hey everyone. I'm currently using the k3s module to stand up a kubernetes cluster to do some testing.

However, part of my tests require that I use CAPD (Docker implementation of CAPI). Which requires the docker socket for creating "Machines" that are backed by Docker. It appears that there is no docker socket running in this env.

My question is, should I start using the kind cluster instead (which module, there are quite a few in the daggerverse?) Or should I try to get the k3s module to work with CAPD?

Thanks for any advice!

k3s :: Daggerverse

Runs a k3s server that can be accessed both locally and in your pipelines

nova dome Aug 11, 2025, 7:53 PM

#

nova dome Hey everyone. I'm currently using the [k3s module](https://daggerverse.dev/mod/g...

@cerulean mountain maybe you have an opinion? I'd basically need to mount the docker socket into the k3s env.

nova dome Aug 11, 2025, 7:54 PM

#

nova dome Hey everyone. I'm currently using the [k3s module](https://daggerverse.dev/mod/g...

When I do this using a Kind cluster, I usually have these defined in my Kind cluster config:

  extraMounts:
  - hostPath: /var/run/docker.sock
    containerPath: /var/run/docker.sock

nova dome Aug 11, 2025, 11:14 PM

#

nova dome Hey everyone. I'm currently using the [k3s module](https://daggerverse.dev/mod/g...

It appears that CAPD cannot connect to the cluster that it has created:

E0811 20:10:44.313611 1 cluster_accessor.go:262] "Connect failed" err="error creating HTTP client and mapper: cluster is not reachable: Get "https://x.x.x.x:6443/?timeout=5s\": context deadline exceeded" controller="clustercache" controllerGroup="cluster.x-k8s.io" controllerKind="Cluster" Cluster="default/test-cluster" namespace="default" name="test-cluster" reconcileID="d3e99ed5-a9fe-4e36-8295-c4e40e0a0b11"

So I'm not really sure what the right way is to get k3s to communicate with the Docker machines that end up being created on the host machine. I must be fundamentally missing a network link or something similar.

cerulean mountain Aug 12, 2025, 12:55 AM

#

nova dome Hey everyone. I'm currently using the [k3s module](https://daggerverse.dev/mod/g...

hey @nova dome, how about using the k3s cluster-api provider? https://github.com/k3s-io/cluster-api-k3s. Seems like that should "in theory" be easier to integrate with the k3s module

cerulean mountain Aug 12, 2025, 12:56 AM

#

nova dome When I do this using a Kind cluster, I usually have these defined in my Kind clu...

in this case it makes sense because kind uses docker behind the scenes to provision the cluster. K3s doesn't need it since everything is embedded within the k3s runtime

nova dome Aug 12, 2025, 3:22 AM

#

cerulean mountain hey @nova dome, how about using the k3s cluster-api provider? https:...

Thanks for the reply @cerulean mountain ! Let me investigate this path. I'll report back when I get something working.

#

@limber folio looks like we have something worth trying here.

nova dome Aug 12, 2025, 3:44 AM

#

So it looks like I'd need to modify the way I'm defining the DockerMachineTemplate (among other things)

Right now I'm using, for example:

apiVersion: infrastructure.cluster.x-k8s.io/v1beta1
kind: DockerMachineTemplate
metadata:
  name: quick-start-default-worker-machinetemplate
  namespace: test-cluster
spec:
  template:
    spec:
      extraMounts:
      - containerPath: /var/run/docker.sock
        hostPath: /var/run/docker.sock

In the k3s repo, I can install the provider and then use:

apiVersion: infrastructure.cluster.x-k8s.io/v1beta1
kind: DockerMachineTemplate
metadata:
  name: k3s-control-plane
spec:
  template:
    spec: {}

Which evidently does not need to mount the docker socket at all.

nova dome Aug 12, 2025, 3:41 PM

#

@cerulean mountain just a quick update.
I fired up the dagger container that starts the k3s cluster, and then followed the steps starting at Install Providers

Unfortunately, the same issue arises with the networking. The management cluster is still not able to communicate with the workload cluster:

2025-08-12T15:27:27Z    ERROR   Reconciler error        {"controller": "kthreescontrolplane", "controllerGroup": "controlplane.cluster.x-k8s.io", "controllerKind": "KThreesControlPlane", "KThreesControlPlane": {"name":"test1-control-plane","namespace":"default"}, "namespace": "default", "name": "test1-control-plane", "reconcileID": "de38f8fd-390c-4b62-8425-0ae829263f1f", "error": "failed to get API group resources: unable to retrieve the complete list of server APIs: v1: Get \"https://x.x.x.x:6443/api/v1?timeout=30s\": net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers)", "errorCauses": [{"error": "failed to get API group resources: unable to retrieve the complete list of server APIs: v1: Get \"https://x.x.x.x:6443/api/v1?timeout=30s\": net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers)"}]}

So somehow, we need to set up the routes properly when firing up the k3s container to begin with.

#

We're starting our service roughly like this:

func (m *MyProject) newKubernetesService(name string, opts ...KubernetesServiceOpts) (*KubernetesService, error) {
    k3s := dag.K3S(name)

    base := k3s.Container().
        WithExec([]string{"mkdir", "-p", "/var/lib/rancher/k3s/agent/images"})

    for _, o := range opts {
        for _, image := range o.LoadImages {
            id := m.randomName("name-", 8)
            imagePath := fmt.Sprintf("/var/lib/rancher/k3s/agent/images/%s.tar", id)
            base = base.WithMountedFile(imagePath, image.AsTarball())
        }
    }

    server := base.AsService(dagger.ContainerAsServiceOpts{
        Args: []string{
            "sh", "-c",
            "k3s server --cluster-init --bind-address $(ip route | grep src | awk '{print $NF}') --disable traefik --disable metrics-server --egress-selector-mode=disabled > /dev/null 2>&1",
        },
        InsecureRootCapabilities: true,
        UseEntrypoint:            true,
    })

    return &KubernetesService{
        Service:    server,
        KubeConfig: k3s.Config(),
    }, nil
}

And then later on:

// Execute the end-to-end tests
func (m *MyProject) EndToEndTest(ctx context.Context) *dagger.File {
    k8sOpts := KubernetesServiceOpts{
        LoadImages: []*dagger.Container{m.Container()},
    }

    k8s, err := m.newKubernetesService("e2e-test", k8sOpts)
    if err != nil {
        return nil
    }

    _, err = k8s.Start(ctx)
    if err != nil {
        return nil
    }
    // Testing code....
}
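
Editor's note: the `--bind-address` value in the k3s command above comes from `ip route | grep src | awk '{print $NF}'`, which prints the last field of every route line containing `src`. The following is only a sketch of what that selection amounts to, written with the Go standard library. The function name `srcAddrs` and the sample route table are hypothetical and are not part of the k3s module. Unlike the shell pipeline, it takes the token that follows `src` rather than the last field, so a line ending in something like `metric 100` would still yield an address.

```go
package main

import (
	"fmt"
	"net"
	"strings"
)

// srcAddrs returns the address that follows the "src" keyword on each
// line of `ip route` output. Lines without a valid IP after "src" are skipped.
func srcAddrs(ipRouteOutput string) []string {
	var addrs []string
	for _, line := range strings.Split(ipRouteOutput, "\n") {
		fields := strings.Fields(line)
		for i := 0; i+1 < len(fields); i++ {
			if fields[i] == "src" && net.ParseIP(fields[i+1]) != nil {
				addrs = append(addrs, fields[i+1])
				break
			}
		}
	}
	return addrs
}

func main() {
	// Hypothetical route table; real output depends on the container network.
	sample := "default via 10.87.0.1 dev eth0\n" +
		"10.87.0.0/16 dev eth0 proto kernel scope link src 10.87.0.5\n"
	fmt.Println(srcAddrs(sample))
}
```

If more than one address comes back, the shell substitution would expand to several words, which is worth ruling out when the API server ends up unreachable from other containers.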

cerulean mountain Aug 12, 2025, 4:12 PM

#

nova dome @cerulean mountain just a quick update. I fired up the dagger container that...

thx for the update. I have a bit of time this evening to check this out

#

if by any chance you have the time to create a public repo with your ongoing efforts that will make things easier for me

nova dome Aug 12, 2025, 8:30 PM

#

@cerulean mountain Got a minimal example running here: https://github.com/AcidLeroy/dagger-capd

GitHub

GitHub - AcidLeroy/dagger-capd: Demonstrate dagger with CAPD (clust...

Demonstrate dagger with CAPD (cluster api docker). Contribute to AcidLeroy/dagger-capd development by creating an account on GitHub.

heady dagger Aug 22, 2025, 2:37 AM

#

does anyone have the source on how to install daggerverse in a k8 cluster? I found a video that shows this but I can't find an implementation.

proud oyster Aug 22, 2025, 3:14 AM

#

heady dagger does anyone have the source on how to install daggerverse in a k8 cluster? I fou...

https://docs.dagger.io/ci/integrations/kubernetes

Kubernetes | Dagger

This section covers different strategies for deploying Dagger on a Kubernetes cluster.

south thorn Sep 22, 2025, 8:33 PM

#

analog coral Nov 24, 2025, 5:02 PM

#

I'm clearly confused by something related to the dagger engine deployed on kubernetes nodes. I've deployed the engine using the helm chart in the dagger repo, pretty much unchanged. We're using gitlab, and I've configured the runner to be privileged and mount the /var/run dir for access to the dagger socket. However, my dagger jobs don't seem to connect to the local engine and instead seem to want to try to connect to dagger cloud:

$ echo "Dagger Engine: ${_EXPERIMENTAL_DAGGER_RUNNER_HOST}" # collapsed multi-line command
Dagger Engine: unix:///run/dagger/engine.sock
1 : connect
1 : [0.0s] | cloud url=https://dagger.cloud/traces/setup
2 : ┆ starting engine
2 : ┆ starting engine DONE [0.0s]
3 : ┆ connecting to engine
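
Editor's note: one way to narrow down a hang at "connecting to engine" is to check, from inside the job container, whether the socket named by `_EXPERIMENTAL_DAGGER_RUNNER_HOST` accepts connections at all. The sketch below assumes only the `unix://` and `tcp://` forms and uses the Go standard library. The helper name `dialTarget` is hypothetical, and the check says nothing about whether the engine behind the socket is healthy.

```go
package main

import (
	"fmt"
	"net"
	"net/url"
	"os"
	"time"
)

// dialTarget converts a runner host such as unix:///run/dagger/engine.sock
// or tcp://10.0.0.1:1234 into the network and address expected by net.Dial.
func dialTarget(runnerHost string) (network, address string, err error) {
	u, err := url.Parse(runnerHost)
	if err != nil {
		return "", "", err
	}
	switch u.Scheme {
	case "unix":
		return "unix", u.Path, nil
	case "tcp":
		return "tcp", u.Host, nil
	default:
		return "", "", fmt.Errorf("unsupported scheme %q", u.Scheme)
	}
}

func main() {
	host := os.Getenv("_EXPERIMENTAL_DAGGER_RUNNER_HOST")
	network, address, err := dialTarget(host)
	if err != nil {
		fmt.Println("cannot interpret runner host:", err)
		return
	}
	// A short timeout keeps the check from hanging the CI job.
	conn, err := net.DialTimeout(network, address, 3*time.Second)
	if err != nil {
		fmt.Println("engine socket not reachable:", err)
		return
	}
	conn.Close()
	fmt.Println("engine socket accepts connections at", address)
}
```

A "no such file or directory" error here would suggest that the path in the variable does not match where the host directory is mounted inside the job container.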

analog coral Nov 24, 2025, 6:58 PM

#

Are image pull secrets filtered down to the engine that pulls the images? I've got the dagger engine deployed on kubernetes as a daemonset using the dagger helm chart. I set imagePullSecrets in the values.yaml and verified the pods have the pull secret in their config. All my dagger modules use custom base images, and I'm getting a 401 for every pull the dagger engine is trying to make.

I've verified the credentials in the pull secret are in fact valid.

#

Do I maybe need to add registry credentials inside the module (i.e., dag.Container().WithRegistryAuth(...).From(...))?
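
Editor's note: in Kubernetes, `imagePullSecrets` on a pod are used by the kubelet to pull that pod's own container images. They are not injected into the running container, so the engine process would not see them unless they are supplied some other way. If the credentials are to be passed explicitly, it helps to confirm which registry hosts and usernames the pull secret covers. The sketch below decodes a `.dockerconfigjson` payload with the Go standard library. The type `dockerConfig`, the function `registryUsers`, and the sample credentials are all hypothetical.

```go
package main

import (
	"encoding/base64"
	"encoding/json"
	"fmt"
	"strings"
)

// dockerConfig mirrors the parts of a .dockerconfigjson payload used here.
type dockerConfig struct {
	Auths map[string]struct {
		Username string `json:"username"`
		Auth     string `json:"auth"` // base64("username:password")
	} `json:"auths"`
}

// registryUsers maps each registry host in the payload to its username,
// falling back to the base64 "auth" field when "username" is absent.
func registryUsers(payload []byte) (map[string]string, error) {
	var cfg dockerConfig
	if err := json.Unmarshal(payload, &cfg); err != nil {
		return nil, err
	}
	users := make(map[string]string, len(cfg.Auths))
	for host, entry := range cfg.Auths {
		user := entry.Username
		if user == "" && entry.Auth != "" {
			raw, err := base64.StdEncoding.DecodeString(entry.Auth)
			if err != nil {
				return nil, fmt.Errorf("decode auth for %s: %w", host, err)
			}
			user, _, _ = strings.Cut(string(raw), ":")
		}
		users[host] = user
	}
	return users, nil
}

func main() {
	// Made-up credentials: "ci-bot:s3cret", base64-encoded.
	payload := []byte(`{"auths":{"registry.example.com":{"auth":"Y2ktYm90OnMzY3JldA=="}}}`)
	users, err := registryUsers(payload)
	if err != nil {
		fmt.Println("invalid payload:", err)
		return
	}
	fmt.Println(users)
}
```

The host and username recovered this way are the kind of values a call such as the `WithRegistryAuth` mentioned above would need, with the password supplied as a Dagger secret.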

zenith imp Nov 25, 2025, 9:47 PM

#

I'm curious why the helm chart and documentation reference a pod instead of just creating a Service with spec.internalTrafficPolicy set to "Local" and using that via tcp?

analog coral Dec 5, 2025, 3:31 PM

#

zenith imp I'm curious why the helm chart and documentation reference a pod instead of just...

I'm testing this configuration, and so far it seems like it should work w/o issue.

analog coral Dec 5, 2025, 5:01 PM

#

FYI, this was discussed in this thread:
https://discord.com/channels/707636530424053791/1446525232213922054

frank niche Dec 12, 2025, 11:27 AM

#

Help! How can I limit the memory a dagger-engine consumes? I have a Pod with a dagger-engine running that has memory and CPU limits, but they seem to be ignored. The engine starts to consume the whole node's memory until the kubelet node-checker starts to error out. Or is there any kind of upper limit one can impose on the dagger-engine and the container executions it starts?

left shuttle Dec 15, 2025, 6:45 AM

#

@proud oyster I'm reading https://docs.dagger.io/ci/integrations/kubernetes and trying to figure out whether there's a way yet to control scheduling or guarantee resources for each function that is going to be run. Or is that not yet available?

peak bolt Dec 21, 2025, 3:02 PM

#

@cerulean mountain I am trying to get your K3s go example working and wonder if you can help? I cloned your repo and I am running the example found here https://github.com/marcosnils/daggerverse/tree/main/k3s/examples/go

Maybe it is working and I am confused but I run this command and expect it to display the cluster info in my terminal

dagger call k-3-skubectl --args="cluster-info" stdout

However, it never returns. Here is a trace where I let it run for 5min before canceling.
https://dagger.cloud/cafe/traces/2fa2f2f9430c88ab013759db82c4c128?listen=c7af574b0c3ad054#d5829dcb1e6ed718

I am running podman on a Mac. It is probably something simple but if you could give me some suggestions of how to debug I would appreciate it, thanks

GitHub

daggerverse/k3s/examples/go at main · marcosnils/daggerverse

Personal collection of Dagger modules. Contribute to marcosnils/daggerverse development by creating an account on GitHub.

Dagger Cloud

Browse and visualize Dagger traces.