Docker Daemon dies due to healthcheck #52330

@andrebrait

Description

Background

I had this issue with 27.5.1 and now with 29.3.1 as well.
I've had a set of containers running in unRAID for a while now, but recently I noticed my services would go down randomly, often after a couple of hours. Running docker ps returned a message saying it could not connect to the socket and asking whether the daemon was running.

It had indeed died.

Checking the error logs didn't reveal much, but the debug logs gave me some hints: before the crash, I saw an unusual number of healthcheck commands being run, particularly against my CrowdSec container, though that does not seem to be entirely correlated.

The only thing I know is that it is connected to healthchecks.
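
To put a number on "an unusual number of healthcheck commands", one option is to tally container events per container. The sketch below assumes the input is the line-delimited JSON printed by docker events --format '{{json .}}', with the Type, Action and Actor.Attributes.name fields that the command emits; the function name tally_events is made up for illustration, and this is not how the counts in this report were obtained.

```python
import json
from collections import Counter
from typing import Iterable


def tally_events(lines: Iterable[str]) -> Counter:
    """Count container events per (container name, action kind)."""
    counts: Counter = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            event = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip anything that is not a JSON event
        if not isinstance(event, dict) or event.get("Type") != "container":
            continue
        # Actions such as "health_status: healthy" carry a detail after the
        # colon; keep only the kind before it so repeated probes group together.
        kind = str(event.get("Action", "")).split(":", 1)[0]
        attrs = (event.get("Actor") or {}).get("Attributes") or {}
        name = attrs.get("name", "<unknown>")
        counts[(name, kind)] += 1
    return counts
```

Feeding it the captured event stream and calling most_common(10) on the result would show which containers generate the most probe-related events in the window before a crash.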

Things I tried that seemed to have no influence on this:

  1. Using only bridge networks
  2. Using a macvlan network
  3. Changing backing filesystem (zfs, btrfs)
  4. Changing storage driver (zfs, btrfs, overlay2)
  5. Upgrading from unRAID 7.2.4 (Docker 27.5.1) to 7.3.0 (Docker 29.3.1)
  6. Not mounting the Docker socket to any containers
  7. Changing RAM - also tested with memtest86+ 8.00 with 16 passes
  8. Changing Motherboard
  9. Changing my boot device

Disabling the healthchecks has gotten rid of the issue 100%. It's now been running for about a week whereas previously I got at most 24h.
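
Since all healthchecks were disabled at once, a possible next step is to re-enable them a few at a time to find out whether one particular probe is involved. The sketch below is a rough helper for that, assuming its input is the JSON array printed by docker inspect for the running containers, where Config.Healthcheck.Test holds the probe command and Config.Healthcheck.Interval is in nanoseconds. The function name containers_with_healthcheck is hypothetical.

```python
import json
from typing import Any


def containers_with_healthcheck(inspect_json: str) -> list[dict[str, Any]]:
    """Return name, probe command and interval for containers with a healthcheck."""
    result = []
    for container in json.loads(inspect_json):
        health = (container.get("Config") or {}).get("Healthcheck") or {}
        test = health.get("Test") or []
        # No Test means no healthcheck is configured; ["NONE"] means it was
        # explicitly disabled (for example with --no-healthcheck).
        if not test or test[0] == "NONE":
            continue
        result.append({
            "name": container.get("Name", "").lstrip("/"),
            "test": test,
            # Interval is stored in nanoseconds; 0 means the default is used.
            "interval_s": health.get("Interval", 0) / 1e9,
        })
    return result
```

Sorting the result by interval_s would put the most frequently probed containers first, which seems like a reasonable order in which to re-enable and test them.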

Reproduce

  1. docker run my containers with healthcheck
  2. wait
  3. it crashes
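
A more self-contained variant of step 1 might use throwaway containers with a trivial, frequent healthcheck instead of the real services. The sketch below only builds the docker run command lines; the busybox image, the container names, the 1s interval and the helper names are all assumptions chosen for illustration, and it is not known whether such a setup triggers the crash.

```python
def repro_run_args(name: str, interval: str = "1s") -> list[str]:
    """Build a docker run command for one idle container with a frequent healthcheck."""
    return [
        "docker", "run", "-d", "--name", name,
        "--health-cmd", "true",          # probe that always succeeds
        "--health-interval", interval,   # much shorter than the 30s default
        "--health-timeout", "1s",
        "busybox", "sleep", "86400",     # keep the container alive for a day
    ]


def repro_commands(count: int, interval: str = "1s") -> list[list[str]]:
    """Build commands for `count` containers named hc-repro-0, hc-repro-1, ..."""
    return [repro_run_args(f"hc-repro-{i}", interval) for i in range(count)]
```

Each command list can be passed to subprocess.run(cmd, check=True) on a test host; the containers can be removed afterwards with docker rm -f.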

Expected behavior

Docker Daemon should not die because of a healthcheck, no matter how bad the healthcheck might be.

docker version

Client:
 Version:           29.3.1
 API version:       1.54
 Go version:        go1.25.8
 Git commit:        c2be9cc
 Built:             Wed Mar 25 16:12:39 2026
 OS/Arch:           linux/amd64
 Context:           default

Server: Docker Engine - Community
 Engine:
  Version:          29.3.1
  API version:      1.54 (minimum version 1.40)
  Go version:       go1.25.8
  Git commit:       f78c987a
  Built:            Wed Mar 25 16:14:56 2026
  OS/Arch:          linux/amd64
  Experimental:     false
 containerd:
  Version:          v2.2.2
  GitCommit:        301b2dac98f15c27117da5c8af12118a041a31d9
 runc:
  Version:          1.3.4
  GitCommit:        v1.3.4-0-gd6d73eb
 docker-init:
  Version:          0.19.0
  GitCommit:        de40ad0

docker info

Client:
 Version:    29.3.1
 Context:    default
 Debug Mode: false
 Plugins:
  buildx: Docker Buildx (Docker Inc.)
    Version:  v0.32.1
    Path:     /usr/libexec/docker/cli-plugins/docker-buildx
  compose: Docker Compose (Docker Inc.)
    Version:  v2.40.3
    Path:     /usr/lib/docker/cli-plugins/docker-compose

Server:
 Containers: 38
  Running: 38
  Paused: 0
  Stopped: 0
 Images: 32
 Server Version: 29.3.1
 Storage Driver: overlay2
  Backing Filesystem: zfs
  Supports d_type: true
  Using metacopy: false
  Native Overlay Diff: true
  userxattr: false
 Logging Driver: json-file
 Cgroup Driver: cgroupfs
 Cgroup Version: 2
 Plugins:
  Volume: local
  Network: bridge host ipvlan macvlan null overlay
  Log: awslogs fluentd gcplogs gelf journald json-file local splunk syslog
 CDI spec directories:
  /etc/cdi
  /var/run/cdi
 Swarm: inactive
 Runtimes: io.containerd.runc.v2 runc
 Default Runtime: runc
 Init Binary: docker-init
 containerd version: 301b2dac98f15c27117da5c8af12118a041a31d9
 runc version: v1.3.4-0-gd6d73eb
 init version: de40ad0
 Security Options:
  seccomp
   Profile: builtin
  cgroupns
 Kernel Version: 6.18.20-Unraid
 Operating System: Unraid OS 7.3 x86_64
 OSType: linux
 Architecture: x86_64
 CPUs: 16
 Total Memory: 62.58GiB
 Name: NAS
 ID: 7842bc14-0b75-4c17-af87-949347eb783b
 Docker Root Dir: /var/lib/docker
 Debug Mode: false
 Experimental: false
 Insecure Registries:
  ::1/128
  127.0.0.0/8
 Live Restore Enabled: false
 Product License: Community Engine
 Firewall Backend: iptables

WARNING: No swap limit support

Additional Info

Also posting here some of the logs from when the crashes happened and a redacted debug log:

The crashes don't all seem to be that related, but they all stopped when I turned off all healthchecks here.

Here are all the containers I'm running as of today. All images pulled just now:

IMAGE                                                            COMMAND                  CREATED        STATUS        
lscr.io/linuxserver/bazarr:latest                                "/init"                  4 hours ago    Up 4 hours    
ghcr.io/immich-app/immich-server:v2.7.2                          "tini -- /bin/bash -…"   4 hours ago    Up 4 hours    
ghcr.io/immich-app/immich-machine-learning:v2.7.2-openvino       "tini -- python -m i…"   4 hours ago    Up 4 hours    
amir20/dozzle:latest                                             "/dozzle"                4 hours ago    Up 4 hours    
ghcr.io/goauthentik/server:2026.2.2                              "dumb-init -- ak ser…"   10 hours ago   Up 10 hours   
ghcr.io/goauthentik/server:2026.2.2                              "dumb-init -- ak wor…"   10 hours ago   Up 10 hours   
lscr.io/linuxserver/code-server:latest                           "/init"                  29 hours ago   Up 29 hours   
lscr.io/linuxserver/plex                                         "/init"                  29 hours ago   Up 29 hours   
lscr.io/linuxserver/qbittorrent:latest                           "/init"                  42 hours ago   Up 42 hours   
lscr.io/linuxserver/radarr:latest                                "/init"                  42 hours ago   Up 42 hours   
lscr.io/linuxserver/prowlarr:latest                              "/init"                  42 hours ago   Up 42 hours   
stirlingtools/stirling-pdf                                       "tini -- /scripts/in…"   42 hours ago   Up 42 hours   
lscr.io/linuxserver/sonarr                                       "/init"                  3 days ago     Up 3 days     
lscr.io/linuxserver/nextcloud:33.0.2                             "/init"                  5 days ago     Up 5 days     
redis:8.6.2-alpine                                               "docker-entrypoint.s…"   5 days ago     Up 5 days     
redis:8.6.2-alpine                                               "docker-entrypoint.s…"   5 days ago     Up 5 days     
gotenberg/gotenberg:8.29.1                                       "/usr/bin/tini -- go…"   5 days ago     Up 5 days     
apache/tika:3.3.0.0                                              "/bin/sh -c 'exec ja…"   5 days ago     Up 5 days     
crowdsecurity/crowdsec:v1.7.7                                    "/bin/bash /docker_s…"   5 days ago     Up 9 hours    
lscr.io/linuxserver/socket-proxy:latest                          "/docker-entrypoint.…"   5 days ago     Up 5 days     
redis:8.6.2-alpine                                               "docker-entrypoint.s…"   5 days ago     Up 42 hours   
redis:8.6.2-alpine                                               "docker-entrypoint.s…"   5 days ago     Up 5 days     
collabora/code:25.04.9.4.1                                       "/start-collabora-on…"   5 days ago     Up 5 days     
traefik:3.6.12                                                   "/entrypoint.sh trae…"   10 days ago    Up 42 hours   
ghcr.io/seerr-team/seerr:latest                                  "docker-entrypoint.s…"   11 days ago    Up 6 days     
santiagosayshey/profilarr:latest                                 "/entrypoint.sh guni…"   11 days ago    Up 6 days     
lscr.io/linuxserver/lidarr:latest                                "/init"                  11 days ago    Up 6 days     
flaresolverr/flaresolverr                                        "/usr/bin/dumb-init …"   11 days ago    Up 6 days     
ghcr.io/paperless-ngx/paperless-ngx:2.20.13                      "/init"                  11 days ago    Up 6 days     
postgres:16.13-alpine                                            "docker-entrypoint.s…"   11 days ago    Up 6 days     
postgres:16.13-alpine                                            "docker-entrypoint.s…"   11 days ago    Up 6 days     
ghcr.io/immich-app/postgres:14-vectorchord0.4.3-pgvectors0.2.0   "/usr/local/bin/immi…"   11 days ago    Up 6 days     
valkey/valkey:9                                                  "docker-entrypoint.s…"   11 days ago    Up 6 days     
postgres:16.13-alpine                                            "docker-entrypoint.s…"   11 days ago    Up 6 days     
maxmindinc/geoipupdate:latest                                    "/usr/bin/entry.sh"      11 days ago    Up 6 days     
qmcgaw/gluetun:v3                                                "/gluetun-entrypoint"    11 days ago    Up 6 days     
ghcr.io/jarvis2f/telegram-files:latest                           "/sbin/tini -- /bin/…"   2 weeks ago    Up 6 days     
c4illin/convertx:latest                                          "bun run dist/src/in…"   2 weeks ago    Up 6 days     

Labels: kind/bug, status/0-triage
